CN103455524A - Method and device for displaying and acquiring entry information - Google Patents

Method and device for displaying and acquiring entry information Download PDF

Info

Publication number
CN103455524A
CN103455524A CN2012101838708A CN201210183870A CN103455524A CN 103455524 A CN103455524 A CN 103455524A CN 2012101838708 A CN2012101838708 A CN 2012101838708A CN 201210183870 A CN201210183870 A CN 201210183870A CN 103455524 A CN103455524 A CN 103455524A
Authority
CN
China
Prior art keywords
entry
entry information
classification
encyclopaedia
browsing pages
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2012101838708A
Other languages
Chinese (zh)
Other versions
CN103455524B (en
Inventor
王潇
周黄玲
苏雪峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Sogou Technology Development Co Ltd
Beijing Sogou Information Service Co Ltd
Original Assignee
Beijing Sogou Technology Development Co Ltd
Beijing Sogou Information Service Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Sogou Technology Development Co Ltd, Beijing Sogou Information Service Co Ltd filed Critical Beijing Sogou Technology Development Co Ltd
Priority to CN201210183870.8A priority Critical patent/CN103455524B/en
Publication of CN103455524A publication Critical patent/CN103455524A/en
Application granted granted Critical
Publication of CN103455524B publication Critical patent/CN103455524B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention provides a method and device for displaying and acquiring entry information. The displaying method specifically includes: transmitting page information of a currently browsing page; receiving entry information of an encyclopedic entry corresponding to the currently browsing page; when the entry information is more than one, analyzing and selecting the entry information, and then returning; displaying the entry information. By the method, the entry information related to the currently browsing page can be displayed automatically when a user browses the webpage with a browser, and information access efficiency of the browser can be increased.

Description

Represent and obtain the method and apparatus of entry information
Technical field
The application relates to networking technology area, particularly relates to a kind of method and apparatus that represents and obtain entry information.
Background technology
At present, universal along with internet, have every day the message such as a large amount of news, event by network by bamboo telegraph, people have reached unprecedented height for the propagation enthusiasm of various information and degree of share.Constantly accumulate and precipitate and pass on civilizedly for convenience of the mankind, a kind of mode that records the encyclopaedia entry by the electronics macropaedia is arisen at the historic moment.People can carry out combing and accumulation to existing knowledge or the intellectual achievement just formed, or rely on self Knowledge Capability the relevant knowledge theme of own domain of interest to be edited and perfect.Constantly perfect electronics macropaedia has not only carried out effective combing and preservation to knowledge hierarchy, and is conducive to carry out the retrieval of knowledge or consult.
For example, when user's open any browser is read one piece of news, if run into while containing strange or unknown vocabulary, need in the de-electromation macropaedia to be retrieved corresponding encyclopaedia lexical or textual analysis; Typically be retrieved as the retrieval of " search box+keyword " in prior art, usually, the user need to open the webpage of electronics macropaedia, the strange vocabulary of input in the search box of this webpage, and obtain and the corresponding encyclopaedia lexical or textual analysis of this strange vocabulary by navigate search results.Like this, while in one piece of news, thering is more than one strange vocabulary, when particularly strange vocabulary has multinomial entry information, need to repeatedly retrieve, and each entry information is analyzed to judgement, thus consumed ample resources, affected the message reference efficiency of browser.
In a word, need the urgent technical matters solved of those skilled in the art to be exactly: the message reference efficiency that how can improve browser.
Summary of the invention
The application's technical matters to be solved is to provide a kind of method and apparatus that represents and obtain entry information, automatically represent the entry information relevant to current browsing pages in the user uses the process of browser browsing page, can improve the message reference efficiency of browser.
In order to address the above problem, the application discloses a kind of method of obtaining entry information, comprising:
Analyze the page info of current browsing pages, obtain corresponding encyclopaedia entry;
Obtain corresponding entry information according to described encyclopaedia entry retrieval;
At described encyclopaedia entry, corresponding entry information is one and every entry information is analyzed when above, and selects a corresponding entry information;
Selected this entry information is returned.
Preferably, described every entry information is analyzed, and is selected the step of a corresponding entry information further to comprise:
According to the classification of described current browsing pages and/or active user's user profile classification, select an entry information from described entry information.
Preferably, described method also comprises:
Obtain the entry information category of every entry information of described encyclopaedia entry;
The described classification according to described current browsing pages and/or active user's user profile classification, from described entry information, select the step of an entry information to be specially, the entry information that the classification of selection entry information category and described current browsing pages and/or active user's user profile classification is mated most from the entry information more than one of described encyclopaedia entry.
Preferably, described method also comprises:
Analyze the page info of described current browsing pages, obtain the classification of corresponding current browsing pages.
Preferably, described method also comprises:
Analyze described active user's use historical information, obtain corresponding user profile classification.
Preferably, the page info of the described current browsing pages of described analysis, obtain the step of the classification of corresponding current browsing pages, comprising:
Analyze the URL(uniform resource locator) information of described current browsing pages, obtain the classification of corresponding current browsing pages; And/or
Analyze the crumbs of described current browsing pages, navigation obtains the classification of corresponding current browsing pages; And/or
Analyze encyclopaedia entry described in described current browsing pages respectively in the weight of each set classification, obtain total weight of each set classification of current browsing pages, and using the set classification of total weight maximum as the classification of current browsing pages.
Preferably, described use historical information comprises: active user's browser access historical record and/or input historical record.
Preferably, the described active user's of described analysis use historical information, obtain the step of corresponding user profile classification, comprising:
The page classification of the corresponding page in described active user's browser access historical record is obtained in analysis, and using frequency the highest page classification as active user's user profile classification; And/or
The described active user's of analytic statistics input historical record, obtain the vocabulary classification that described input historical record is corresponding, and using frequency the highest vocabulary classification as active user's user profile classification.
Preferably, the page info of described current browsing pages comprises the content of current browsing pages;
The page info of the current browsing pages of described analysis, obtain the step of corresponding encyclopaedia entry, comprising:
Content to described current browsing pages is carried out word segmentation processing, obtains corresponding entry;
The dictionary of described entry and server end storage is analyzed to coupling;
Using the equivalent bar that the match is successful as the encyclopaedia entry.
Preferably, described method also comprises:
Add up the frequency that each entry occurs at described current browsing pages;
General's equivalent bar that the match is successful is specially as the step of encyclopaedia entry, using the match is successful and the frequency of statistics surpasses the equivalent bar of set frequency threshold as the encyclopaedia entry.
On the other hand, disclosed herein as well is a kind of method that represents entry information, comprising:
The page info of current browsing pages is sent;
Receive the entry information of the encyclopaedia entry corresponding with described current browsing pages; At described encyclopaedia entry, corresponding entry information is one when above to described entry information, after every entry information analysis is selected, is returned;
Described entry information is represented.
Preferably, described when described entry information is represented, adopt to play the window shape formula described entry information is represented.
Preferably, described when described entry information is represented, the length of described entry information is estimated, represent described entry information in conjunction with estimation results.
On the other hand, disclosed herein as well is a kind of device that obtains entry information, comprising:
Encyclopaedia entry acquisition module, for analyzing the page info of current browsing pages, obtain corresponding encyclopaedia entry;
Encyclopaedia entry retrieval module, for obtaining corresponding entry information according to described encyclopaedia entry retrieval;
The Information Selection module, be one for the entry information corresponding at described encyclopaedia entry and every entry information analyzed when above, and select a corresponding entry information; And
Return to module, for selected this entry information is returned.
Preferably, described Information Selection module specifically for the classification according to described current browsing pages and/or active user's user profile classification, is selected an entry information from described entry information.
Preferably, described device also comprises:
Entry information category acquisition module, for the entry information category of every entry information of obtaining described encyclopaedia entry;
Described Information Selection module, an entry information of mating most specifically for the classification of selecting entry information category and described current browsing pages the entry information more than one from described encyclopaedia entry and/or active user's user profile classification.
On the other hand, disclosed herein as well is a kind of device that represents entry information, comprising:
Sending module, sent for the page info by current browsing pages;
Receiver module, for receiving the entry information of the encyclopaedia entry corresponding with described current browsing pages; At described encyclopaedia entry, corresponding entry information is one when above to described entry information, after every entry information analysis is selected, is returned; And
Represent module, for described entry information is represented.
Preferably, the described module that represents, represented with described entry information specifically for adopting bullet window shape formula pair.
Compared with prior art, the application has the following advantages:
The user is in the process of using the browser browsing page, in the time of can running into strange or unknown vocabulary etc., the application means some key vocabularies that occur in current browsing pages with the encyclopaedia entry, and automatically represents the entry information with the corresponding encyclopaedia entry of current browsing pages at browser client; For the user, it,, without opening search box and inputting keyword and retrieved, just can directly obtain corresponding entry information; Therefore, the application provide the page when showing current browsing pages in the entry information of corresponding encyclopaedia entry, the information content provided to the user has been provided, thereby has been improved the message reference efficiency of browser.
In addition, the application can also be corresponding at described encyclopaedia entry entry information be one when above, according to the classification of described current browsing pages and/or active user's user profile classification, select an entry information from the entry information more than one of described encyclopaedia entry; Described choosing can be selected and current browsing pages and/or the maximally related entry information of active user from the entry information more than one of encyclopaedia entry, therefore can use as far as possible little zone to represent an entry information of the demand of being close to the users most, represent entry information corresponding in polysemant to the user exactly, improve the message reference efficiency of browser.
The accompanying drawing explanation
Fig. 1 is a kind of process flow diagram that represents the embodiment of the method for entry information of the application;
Fig. 2 is a kind of process flow diagram that obtains the embodiment of the method 1 of entry information of the application;
Fig. 3 is a kind of process flow diagram that obtains the embodiment of the method 2 of entry information of the application;
Fig. 4 is a kind of structural drawing that represents the device embodiment of entry information of the application;
Fig. 5 is a kind of structural drawing that obtains the device embodiment of entry information of the application.
Embodiment
For above-mentioned purpose, the feature and advantage that make the application can become apparent more, below in conjunction with the drawings and specific embodiments, the application is described in further detail.
The embodiment of the present application is used in the process of browser browsing page the user, and current browsing pages and active user's demand is combined, and at browser client, automatically represents the entry information with the corresponding encyclopaedia entry of current browsing pages; For the user, it without opening search box and inputting keyword and retrieved, just can directly obtain this entry information strange or unknown vocabulary when running into the key vocabularies of the current browsing pages such as strange or unknown vocabulary; Therefore, the embodiment of the present application can represent entry information corresponding in polysemant to the user exactly, thereby can, when the information content provided to the user has been provided, improve the message reference efficiency of browser.
With reference to Fig. 1, show a kind of process flow diagram that represents the embodiment of the method for entry information of the application, specifically can comprise:
Step 101, browser client are sent to the browser server end by the page info of current browsing pages;
Step 102, browser client receive the entry information of the encyclopaedia entry corresponding with described current browsing pages; At described encyclopaedia entry, corresponding entry information is one when above to described entry information, after by the browser server end, every entry information analysis being selected, is returned;
Step 103, browser client are represented described entry information.
For example, when user's open any browser is read one piece of news, the browser client of the embodiment of the present application can represent the entry information of some key vocabularies in this headline or text automatically below current browsing pages, the key vocabularies here specifically can comprise in this news the main name that relates to, place name, event etc., and the entry information here specifically can comprise encyclopaedia lexical or textual analysis of key vocabularies etc.
The embodiment of the present application means some key vocabularies that occur in current browsing pages with the encyclopaedia entry, these key vocabularies are probably strange or unknown for the user, be also, the user probably exists to the details of these key vocabularies the demand of understanding fully its implication, therefore the application also represents corresponding entry information automatically using it as the encyclopaedia entry.
In the embodiment of the present application, entry information general reference can strengthen all information of user to the understanding of encyclopaedia entry, it can comprise explanatory information, a typical example is the encyclopaedia lexical or textual analysis, wherein, the encyclopaedia lexical or textual analysis can be mainly derived from professional encyclopaedia website or the website channel that has certain authoritative universality through the human-edited, as wikipedia, Baidupedia, the star in amusement circle of Sohu storehouse etc.The application represents in current browsing pages the lexical or textual analysis information with the corresponding vocabulary of encyclopaedia entry automatically, makes the user needn't retrieve the implication that just can understand this vocabulary.
In specific implementation, when the user opens a browsing pages, browser client can be using it as current browsing pages, and the page info of current browsing pages is sent to the browser server end, receive the browser server end that return with the entry information corresponding encyclopaedia entry of described current browsing pages, and represented.
In the embodiment of the present application, send to the page info of the current browsing pages of browser server end mainly can comprise: the URL(URL(uniform resource locator), Uniform Resource Locator) and/or the content of current browsing pages (for example title of news and text) etc., the application is not limited the page info of the concrete current browsing pages that sends to the browser server end.Browser client can send the URL of current browsing pages to the browser server end, the browser server end triggers the content that this URL obtains current browsing pages, and the content that also can directly send current browsing pages is carried out analytic statistics for browser server.
In specific implementation, the entry information with the corresponding encyclopaedia entry of described current browsing pages that the browser server end returns is generally XML (extend markup language, Extensible Markup Language) form.Difference in view of XML and HTML: XML is used for storing data, and its focus is the content of data.And HTML is designed to show data, its focus is the outward appearance of data, therefore in a kind of application example of the application, the process that described step 102 represents specifically can comprise: browser client is converted to the HTML(HTML (Hypertext Markup Language) by the entry information of XML form, Hypertext Markup Language) form, and the entry information of html format is embedded in to plug-in unit, utilize javascript by this plug-in unit, the asynchronous JavaScript of ajax(and XML, Asynchronous JavaScript and XML), the front end script technologies such as jquery load the entry information of html format, and represented.
About the position that represents of entry information, it can be the optional position of browser, for example, and top, below, left, right-hand etc.In order not affect and the content of current browsing pages that interference user is not browsed, in a preferred embodiment of the present application, the described position that represents can be the below of browser or right-hand.
About the form that represents of entry information, it can be form arbitrarily, as played window, bubble etc.In a preferred embodiment of the present application, the implementation procedure of described step 102 can be to adopt bullet window shape formula pair to be represented with the entry information of the corresponding encyclopaedia entry of described current browsing pages.This bullet window can be positioned at the optional position of browser; This bullet window can provide X button, but User the operation of this X button is closed, or this bullet window can be set up has certain life cycle (as 50 seconds), after life cycle finishes, this bullet window exits automatically.
Certainly; except representing form of bullet window, bubble; the application represents form and can also comprise Toolbars Panel, menu bar, toolbar, status bar etc.; everyly can represent for the user UI(user interface of entry information entry, user interface) all belong in the application's protection domain.
The length of the entry information of encyclopaedia entry differs, and having long has shortly, and in order not affect and the content of the current browsing pages that interference user is not browsed, browser client can be taked certain strategy.
In a preferred embodiment of the present application, when described browser client is represented described entry information, browser client can be estimated the length of described entry information, in conjunction with estimation results, represents described entry information.
Provide a kind of application example that the length of described entry information is estimated at this.Browser client can arrange an area upper limit threshold for representing regional area, like this, when in the length of described entry information, shared real area is less than or equal to the area upper limit threshold, just can the direct basis real area be represented, what now represent is the full content of described entry information; When in the length of described entry information, shared real area is greater than the area upper limit threshold, can only show the entry information that the area upper limit threshold can bear, for example, entry information according to vertical order intercepting area upper limit threshold, abandon other entry information, what now represent is the partial content of described entry information.Described area upper limit threshold can be set according to representing of the form regional area of representing that plays window, bubble.
In some cases, may only have one with the corresponding encyclopaedia entry of described current browsing pages, now can be directly the entry information of this encyclopaedia entry be represented.
In the other situation, with the corresponding encyclopaedia entry of described current browsing pages may be for more than one, in order not affect and the content of current browsing pages that interference user is not browsed, in a preferred embodiment of the present application, described encyclopaedia entry is more than one; Described method can also comprise: browser client pair is represented with the identification information of the corresponding above encyclopaedia entry of described current browsing pages.
Described identification information is mainly used in distinguishing different encyclopaedia entries, and also, the user sees the identification information represented, and just can know the information of wanting which encyclopaedia entry.Suppose that the encyclopaedia entry is name, corresponding identification information can comprise name and corresponding head portrait, supposes that the encyclopaedia entry is place name, and corresponding identification information can comprise place name and corresponding sign thumbnail, etc.The application is not limited the identification information of concrete encyclopaedia entry.
In a preferred embodiment of the present application, described method can further include:
Browser client is the selection information for the identification information of represented a described above encyclopaedia entry according to the user who receives, and represents the entry information of selected this encyclopaedia entry.
Suppose that the user has chosen or clicked the selection information of the identification information of certain encyclopaedia entry by mouse, keyboard or touch gestures, think that the user wants to check the entry information of selected this encyclopaedia entry, so it is represented.
In other embodiments, the process that the page info of current browsing pages is sent is not limited to browser client, and the browser service end also can be carried out this operation; The entity in like manner related in step 102 ~ step 103 also is not limited to browser client, and the browser server end also can be carried out corresponding operation, completes the described logical process of the application, and corresponding entry information is represented to the user.
With reference to Fig. 2, show a kind of process flow diagram that obtains the embodiment of the method 1 of entry information of the application, specifically can comprise:
Step 201, browser server end are analyzed the page info of current browsing pages, obtain corresponding encyclopaedia entry;
Step 202, browser server end obtain corresponding entry information according to described encyclopaedia entry retrieval;
Step 203, at described encyclopaedia entry, corresponding entry information is one the browser server end is analyzed every entry information when above, and selects a corresponding entry information;
Step 204, browser server end return to browser client by selected this entry information.
The embodiment of the present application means some key vocabularies that occur in current browsing pages with the encyclopaedia entry, these key vocabularies are probably strange or unknown for the user, be also, the user probably exists demand to the details of these key vocabularies, therefore the application also represents corresponding entry information automatically using it as the encyclopaedia entry.
In a preferred embodiment of the present application, the page info of described current browsing pages specifically can comprise the content of current browsing pages;
Described browser server end is analyzed the page info of current browsing pages, obtains the step of corresponding encyclopaedia entry, may further include:
Sub-step A1, browser server end carry out word segmentation processing to the content of described current browsing pages, obtain corresponding entry;
Sub-step A2, browser server end are analyzed coupling by the dictionary of the entry of described current browsing pages content and server end storage;
Sub-step A3, the browser server end general equivalent bar that the match is successful are as the encyclopaedia entry.
In the embodiment of the present application, the dictionary of server end storage can be used for storing a series of entry.In actual applications, can arrange and obtain described entry according to professional encyclopaedia website or the website channel that there is certain authoritative universality through the human-edited, as wikipedia, Baidupedia, the star in amusement circle of Sohu storehouse, electronics macropaedia etc.And the dictionary of server end storage can synchronously upgrade along with the variation of website channel.
In this preferred embodiment, if the entry of described current browsing pages content hits the dictionary of server end storage, the deducibility user exists demand to the details of this entry, so, using this entry as the encyclopaedia entry, and obtain corresponding entry information according to described encyclopaedia entry retrieval.
In specific implementation, the scheme that obtains corresponding entry information according to described encyclopaedia entry retrieval can have multiple.For example, can directly described encyclopaedia entry be inputed in the existing retrieval websites such as wikipedia, Baidupedia, the star in amusement circle of Sohu storehouse, electronics macropaedia and go inquiry; And for example, also can be captured rear arrangement according to the data of the existing retrieval websites such as wikipedia, Baidupedia, the star in amusement circle of Sohu storehouse, electronics macropaedia and be obtained a new encyclopaedia database, this new encyclopaedia database stores encyclopaedia entry and corresponding entry information, like this, described encyclopaedia entry is inputed to this new encyclopaedia database and inquired about, also can obtain corresponding entry information.In a word, the application is not limited the scheme that obtains corresponding entry information according to described encyclopaedia entry retrieval.
In another preferred embodiment of the present application, described browser server end is analyzed the page info of current browsing pages, and the step that obtains corresponding encyclopaedia entry can also comprise:
Add up the frequency that each entry occurs in described current browsing pages content;
The browser server end general equivalent bar that the match is successful can be specially as the step of encyclopaedia entry, using the match is successful and the frequency of statistics surpasses the equivalent bar of set frequency threshold as the encyclopaedia entry.
Interference for fear of the entry information of the application's encyclopaedia entry to the user, this preferred embodiment has increased encyclopaedia entry set threshold really, be also, hit the dictionary of server end storage at the entry of current browsing pages content, and, when the frequency that entry occurs in described current browsing pages content surpasses set frequency threshold, just infer that the user exists demand to the details of this entry.The set frequency threshold here can be arranged according to the actual requirements by those skilled in the art, and the application is not limited concrete set frequency threshold.
The length of the entry information of encyclopaedia entry differs, and having long has shortly, and in order not affect and the content of the current browsing pages that interference user is not browsed, the browser server end can be taked certain strategy.
For example, in a kind of application example of the application, the browser server end can be adjusted the entry information of encyclopaedia entry according to real needs, for example, in the entry information of encyclopaedia entry in short-term, can not do any adjustment, and, when the entry Chief Information Officer of encyclopaedia entry, therefrom win main contents until the length of entry information is no more than certain length threshold etc.The length threshold here can be arranged according to the actual requirements by those skilled in the art, and the application is not limited concrete length threshold.
It should be noted that, in specific implementation, at first the browser server end can be the XML form by described entry Information encapsulation.Then return to browser client.
Due to reasons such as polysemants, in some cases, described encyclopaedia entry may corresponding entry information more than one.For example, encyclopaedia entry " Sun Yue " may relate to " singer Sun Yue ", also likely relates to " sportsman Sun Yue "; And for example, encyclopaedia entry " apple " may relate to plant, company, internal film and foreign films etc.
In the situation that the corresponding entry information more than one of above-mentioned encyclopaedia entry, if the browser server end directly returns to browser client by the entry information more than one, browser client can, according to the strategy of self, represent all or part of content of the entry information more than one.The full content that represents undoubtedly the entry information more than one can cause and represent regional waste, and is easy to the content of impact and the current browsing pages browsed of interference user; But, the partial content that represents the entry information more than one, can there is the risk that can not meet consumers' demand, for example, the user is at the webpage of checking about the literature and art report, the entry information that has represented " sportsman Sun Yue " while wanting the entry information of " singer Sun Yue ", now the application automatically represents the entry content and just becomes valueless at all.
Therefore, in order to represent regional waste avoiding causing, and avoid under the prerequisite of content of impact and the current browsing pages browsed of interference user, automatically represent the entry information that the user wants, at described encyclopaedia entry, corresponding entry information is one when above to the present embodiment, according to the classification of described current browsing pages and/or active user's user profile classification, from the entry information more than one of described encyclopaedia entry, select an entry information to be shown to the user; Thereby use as far as possible little zone to represent an entry information of the demand of being close to the users most, can, when promoting user's experience, further strengthen the message reference efficiency of browser.
In a preferred embodiment of the present application, described browser server end is analyzed every entry information, and select the step of a corresponding entry information may further include: the browser server end, according to the classification of described current browsing pages and/or active user's user profile classification, is selected an entry information from described entry information.
In a preferred embodiment of the present application, described method can also comprise:
The browser server end is analyzed the page info of described current browsing pages, obtains the classification of corresponding current browsing pages.
The application can provide and obtain as follows other technical scheme of classes of pages:
Page classification obtain scheme 1,
Page classification is obtained scheme 1 and specifically can be comprised: the browser server end is analyzed the URL(uniform resource locator) information of described current browsing pages, obtains the classification of corresponding current browsing pages.
In actual applications, each large website is provided with an above channel categories usually, for example, news, physical culture, amusement, finance and economics, video, woman, science and technology, mobile phone, number, automobile, tourism, house property, forum, blog, game, microblogging, dress ornament, application etc.And the URL of channel categories has certain rule usually, also, the URL of same frequency classification has identical feature, and the URL of different frequency classification has difference.
Therefore, page classification is obtained the URL(uniform resource locator) information that scheme 1 can be utilized the described current browsing pages of URL law-analysing of each large website channel categories, the classification of corresponding current browsing pages is navigated to the fine granularity of channel categories.
Provide the URL rule of number of site channel categories at this, for example, in the URL of some website entertainment channel classification, can include " the Chinese spelling of yule(amusement) ", for example http:// yule.sohu.com/, in the URL of number of site entertainment channel classification, can include in addition " english abbreviation of ent(amusement entertainment) ", for example, http:// ent.sina.com.cn/, http://ent.163.com/etc..Like this, if include " yule " or " ent " in the URL of current browsing pages, can think that the page classification of current browsing pages is " amusement " classification.
Certainly, the URL rule of above-mentioned website entertainment channel classification is just as example, and it is not as the application's application restric-tion.
Page classification obtain scheme 2,
Page classification is obtained scheme 2 and specifically can be comprised: the browser server end is analyzed the crumbs of described current browsing pages, and navigation obtains the classification of corresponding current browsing pages.
Crumbs are the application modes of a kind of " historical record ", and purpose is to help the user to review incoming road, thereby it is a kind of navigate mode of linearity.Be mainly used to the interface element of expression content attaching relation, namely " Main classification > the one-level classification > secondary classification > reclassify > ... final content page " such mode.
About how analyzing the crumbs of described current browsing pages, navigation obtains the classification of corresponding current browsing pages, in a kind of application example of the application, can be after web crawlers captures the HTML content of current browsing pages, resolve described HTML content, the content that template or extraction according to each website contain an above symbol ' > ' is oriented the instation guidance bar, thereby obtains ' > ' locates corresponding word; Because this instation guidance bar is generally, the TOC level of current browsing pages in station described, therefore the page classification of current browsing pages can be oriented in corresponding keyword by respective classes.For example, at the instation guidance bar, be " Netease > sports channel > China Basketball > text " time, can determine that current browsing pages is for " physical culture " classification.
Page classification obtain scheme 3,
Page classification is obtained scheme 3 and specifically can be comprised: the browser server end is analyzed encyclopaedia entry described in described current browsing pages respectively in the weight of each set classification, obtain total weight of each set classification of current browsing pages, and using the set classification of total weight maximum as the page classification of current browsing pages.
In specific implementation, can preset a series of set classification (the application's set classification is mainly used in meaning the page classification under entry, in practice can be according to the channel categories of each large website), and obtain encyclopaedia entry described in current browsing pages in the weight of each set classification.Provide a kind of Weight Acquisition scheme at this, it is not as the application's application restric-tion certainly.
This Weight Acquisition scheme adopts the method for machine learning, presets the grounding collection, reaches the weight in each set classification by each entry sample of artificial mark, and obtains corresponding weight sorter according to this training set.Like this, can respectively each encyclopaedia entry in current browsing pages be inputed to the weight sorter, export the weight of each encyclopaedia entry in each set classification.
In actual applications, the scope of weight usually from 0 to 1, more level off to 1, shows that this encyclopaedia entry more tends to this set classification, otherwise show that this encyclopaedia entry more is not inclined to this set classification." NBA " very large in the Sport Class weight for example, and very little etc. in the weight of " military affairs " classification.Weight to each set classification of all encyclopaedia entries is sued for peace respectively, obtains total weight of each set classification of current browsing pages, the page classification that the set classification of selecting total weight maximum is current browsing pages.
Abovely three kinds of page classifications are obtained to scheme be described in detail, be appreciated that, those skilled in the art can be combined with above-mentioned several scheme as required, perhaps, use wherein any scheme, perhaps, use other scheme to obtain the page classification of current browsing pages, the present invention is not limited this.
In a preferred embodiment of the present application, described method can also comprise:
The browser server end is analyzed described active user's use historical information, obtains corresponding user profile classification.
In the embodiment of the present application, preferably, described use historical information specifically can comprise: active user's browser access historical record and/or input historical record.Wherein, described browser access historical record can be obtained by the browser log statistics, and described input historical record can be obtained by the input method client statistics.
The application can provide the technical scheme of obtaining as follows the user profile classification:
The user profile classification obtain scheme 1,
The user profile classification is obtained scheme 1 and specifically can be comprised: the page classification of the corresponding page in described active user's browser access historical record is obtained in the analysis of browser server end, and using frequency the highest page classification as active user's user profile classification.
Users ' individualized requirement is derived from user's hobby often, and for example, certain user has the hobby of star-pursuing, and be the video display fans, thus its online every day main be exactly dynamic in order to browse star both domestic and external and video display; And for example, certain user is football and basket ball fan, and its online every day is main is exactly dynamic in order to browse football both domestic and external and basketball; For another example, certain user is digital fan, and the major part that surfs the web its every day is the page of digital class.Therefore the user profile classification is obtained the browser access historical record of scheme 1 according to the active user, statistics active user's user profile classification, the user profile classification under this kind of situation is suitable with user's hobby.
In practice, the browser access historical record records the information such as user ID, page URL, access time usually, in specific implementation, can utilize above-mentioned three kinds of page classifications to obtain the page classification of the page in the browser access historical record that one or more in scheme obtain the active user, then, count the wherein page classification of occurrence number maximum (being also that frequency is the highest), as active user's user profile classification.Why selecting the page classification that frequency is the highest, illustrate that the active user relatively pays close attention to this page classification, is also that user profile classification under this kind of situation and user's hobby is suitable.
In specific implementation, can be limited by the browser access historical record to the active user that will add up according to time point.The active user's that for example, add up browser access historical record can be: all historical records from the open any browser interface to current browsing pages; And for example, the active user's that add up browser access historical record can be by those skilled in the art according to the actual demand setting for N() day over all historical records, etc.
In specific implementation, can also be limited by the browser access historical record to the active user that will add up according to quantity.For example, the quantity of the active user's that add up browser access historical record is 10 or 100, etc.Certainly, above-mentioned time point and quantity can be combined with, and the application is not limited this.
The user profile classification obtain scheme 2,
The user profile classification is obtained scheme 2 and specifically can be comprised: the described active user's of browser server end analytic statistics input historical record, be the input history in the analytic statistics browser interface, the input that comprises browser address bar is historical, the input history etc. of the controls such as search box, input frame in browser page, obtain the vocabulary classification that described input historical record is corresponding, and using frequency the highest vocabulary classification as active user's user profile classification.
At present, development along with internet and infotech, the present epoch have just like become the information age, most of working clans need to bend over one's desk working for a long time in the face of computer, and what they inputted on computers usually is the vocabulary relevant to occupation, for example, what the administrative assistant inputted is the vocabulary that office administration is relevant, what accounting was inputted is the vocabulary that finance are relevant, and what building designers inputted is the vocabulary that real estate, structure are relevant, and what the programmer inputted is computing machine, code dependent vocabulary etc.
The above-mentioned vocabulary relevant to occupation can be added up under specific formal applied environment, for example, under the specific applied environment such as word, excel, autocad, powerpoint, protel, technical forum, adds up.
Therefore, user's input historical record can reflect user's occupational information to a certain extent, and then can amplify out user's pair information relevant to occupational information and exist demand, and for example, the programmer pays close attention to information that computing machine is relevant etc.
Except the vocabulary relevant to occupation, user's input historical record can also reflect user's hobby information to a certain extent; The input historical record relevant to hobby can be added up under some specific unofficial applied environments, the program such as the instant messaging such as QQ, Fetion, and and for example the ends of the earth, water wood, cat such as flutter at the various amusement forum etc.If the user is interested in the constellation, it can input corresponding vocabulary under these unofficial applied environments, as " Libra ", " Taurus " etc.; If the user is interested in swimming, it can input corresponding vocabulary under these unofficial applied environments, as " breaststroke ", " treading " etc.; If the user is interested in the football, it can input corresponding vocabulary under these unofficial applied environments, as " Juventus ", " Chelsea " etc.
If the user has fixing occupational habit and/or hobby, can often input the vocabulary of specific vocabulary classification.Therefore, the user profile classification is obtained scheme 2 and is obtained the described vocabulary classification of described input historical record, and using frequency, the highest vocabulary classification is as active user's user profile classification.In specific implementation, can preset a series of vocabulary classification (classification under vocabulary, in practice can according to the specialized vocabulary classification of input method and/or preset the obtaining of channel categories of each large website).
Abovely two kinds of user profile classifications are obtained to scheme be described in detail, be appreciated that, those skilled in the art can be combined with above-mentioned several scheme as required, perhaps, use wherein any scheme, perhaps, use other scheme to obtain active user's user profile classification, the present invention is not limited this.
To sum up, the classification of described current browsing pages is mainly the sign of user's browsing content, and itself and user's content demand is closely related; Described active user's user profile classification can reflect user's hobby and/or occupational habit to a certain extent, and itself and user's individual demand is closely related; Therefore, at described encyclopaedia entry, corresponding entry information is one when above to the present embodiment, according to the classification of described current browsing pages and/or active user's user profile classification, select an entry information from the entry information more than one of described encyclopaedia entry; Described choosing can be selected and current browsing pages and/or the maximally related entry information of active user from the entry information more than one of encyclopaedia entry, therefore can use as far as possible little zone to represent an entry information of the demand of being close to the users most.
In a preferred embodiment of the present application, described method can also comprise:
The browser server end obtains the entry information category of every entry information of described encyclopaedia entry;
The described classification according to described current browsing pages and/or active user's user profile classification, from the entry information more than one of described encyclopaedia entry, select the step of an entry information to be specifically as follows, at described encyclopaedia entry, corresponding entry information is one when above, an entry information selecting the classification of entry information category and described current browsing pages and/or active user's user profile classification to mate most from the entry information more than one of described encyclopaedia entry.
Because every entry information of described encyclopaedia entry is comprised of word, therefore in practice, the principle that can adopt page classification to obtain scheme 3 is obtained the entry information category of every entry information of described encyclopaedia entry, and described acquisition process specifically can comprise:
Step B1, browser server end carry out word segmentation processing to a certain entry information of described encyclopaedia entry, obtain a series of word;
Step B2, analyze the weight of described each word at corresponding entry information category, weight is added and obtains total weight that each entry information category is corresponding, and using total weight when maximum corresponding entry information category as the entry information category of this entry information of described encyclopaedia entry.
In specific implementation, can preset a series of entry information category (classification under entry information, can carry out preset obtaining according to the channel categories of each large website in practice).
In practice, the classification of the entry information category of every entry information and described current browsing pages and/or active user's user profile classification can be mated, if the entry information category matching rate maximum that a certain entry information is corresponding, can think that the classification of this entry information category and described current browsing pages and/or active user's user profile classification mate most, select the entry information under this entry information category to return to browser client.
With reference to Fig. 3, show a kind of process flow diagram that obtains the embodiment of the method 2 of entry information of the application, specifically can comprise:
Step 301, browser server end are analyzed the page info of current browsing pages, obtain corresponding encyclopaedia entry;
Step 302, browser server end obtain corresponding entry information according to described encyclopaedia entry retrieval;
Step 303, when the corresponding entry information of described encyclopaedia entry, the browser server end by this entry information return to corresponding browser client;
Step 304, at described encyclopaedia entry, corresponding entry information is one when above, the browser server end, according to the classification of described current browsing pages and/or active user's user profile classification, is selected an entry information from the entry information more than one of described encyclopaedia entry;
Step 305, browser server end return to browser client by selected this entry information.
In the above-mentioned embodiment of the method for obtaining entry information, the description of each embodiment is all emphasized particularly on different fields, there is no the part described in detail in certain embodiment, can get final product referring to the associated description of other embodiment.
And, those skilled in the art are easy to expect: above-mentioned embodiment of the method 1-embodiment 2 combination in any application of obtaining entry information are all feasible, therefore the combination in any of obtaining above-mentioned between the embodiment of the method 1-embodiment 2 of entry information is all embodiment of the present invention, but this instructions has not just described in detail one by one at this as space is limited.
For making those skilled in the art the application better, below provide the application a kind of application process embodiment that obtains and represent entry information.
Application process embodiment 1,
The application scenarios of application process embodiment 1 is, the user is browsing a webpage relevant with " Sun Yue ": http://sports.163.com/12/0317/16/7SQFA0VM00052UUC.html, and the title of current browsing pages is " Sun Yue appears again to match for practicing and looks for state to claim not do the duplicate of Lin Shuhao "; Described application process embodiment 1 specifically can comprise:
Step R1, browser client are sent to the browser server end by the page info of current browsing pages;
Step R2, browser server end are analyzed the content of current browsing pages, carry out participle and word frequency statistics, dictionary coupling with the server end storage, judge the dictionary that the server end storage appears repeatedly and hit in " Sun Yue " in this current browsing pages, so using it as the encyclopaedia entry;
Step R3, browser server end obtain corresponding entry information according to this encyclopaedia entry retrieval; This entry information is encyclopaedia lexical or textual analysis http://baike.baidu.com/view/6886.htm, can find out, this encyclopaedia entry is a polysemant, corresponding Chinese popular songstress, China Basketball professional athlete, Lu xun academy of fine arts Chinese painting is the multinomial encyclopaedia lexical or textual analysis such as lecturer;
Step R4, according to the crumbs navigation information of this current browsing pages (Netease > sports channel > China Basketball text), identify this current browsing pages for " physical culture " classification;
Step R5, for the multinomial encyclopaedia lexical or textual analysis of correspondence, obtain in the weight of corresponding entry information category total weight that each entry information category is corresponding according to all words after participle, when total weight is maximum, corresponding entry information category is defined as corresponding entry information category;
As " Chinese popular songstress.Middle nineteen nineties China popular music circles ... " the encyclopaedia lexical or textual analysis; because the entries such as " singer ", " music " are very high at the weighted value of entry information category " amusement "; calculate this encyclopaedia lexical or textual analysis the highest at the weighted value of " amusement " classification, thereby be defined as " amusement " classification; As " Chinese Professional basket baller.National team: China.Chinese Man's Basketball Team number: 9..... ", because the entries such as " basketball ", " sportsman " are very high at the weighted value of entry information category " physical culture ", calculate this encyclopaedia lexical or textual analysis the highest at the weighted value of " physical culture " classification, thereby be defined as " physical culture " classification; As " Lu xun academy of fine arts Chinese painting is the lecturer.Be engaged in teaching and the creation that draw on Chinese figure painting ,Hua island ... .. " the encyclopaedia lexical or textual analysis; due to " fine arts "; entries such as " figure paintings " is very high at the weighted value of " art " classification, calculates this lexical or textual analysis the highest in the weight of entry information category " art ", thereby is defined as " art " classification.
Step R6, browser server end, according to the classification of described current browsing pages, select an encyclopaedia lexical or textual analysis from the multinomial encyclopaedia lexical or textual analysis of this encyclopaedia entry;
Determine that current browsing pages classification " physical culture " the entry information category " physical culture " corresponding with " Chinese Professional basket baller " is complementary most, select " Chinese Professional basket baller " item as encyclopaedia lexical or textual analysis to be sent.
Step R7, browser server end are encapsulated as the XML form by selected this encyclopaedia lexical or textual analysis, and return to browser client.
The example of the encyclopaedia lexical or textual analysis of the XML form after this provides a kind of encapsulation:
Figure BDA00001728841800201
Step R8, browser client receive the encyclopaedia lexical or textual analysis that is encapsulated as the XML form, the encyclopaedia lexical or textual analysis of XML form is converted to html format, and the encyclopaedia lexical or textual analysis of html format is embedded in to plug-in unit, by this plug-in unit, utilize the front end script technology to load the encyclopaedia lexical or textual analysis of html format, and represented.
In a word, application process embodiment 1 represents and maximally related that encyclopaedia lexical or textual analysis of the classification of the current browsing pages of user automatically, therefore can use as far as possible little zone to represent an entry information of the demand of being close to the users most.
Application process embodiment 2,
The application scenarios of application process embodiment 2 is, supposes that the user is a cuisines intelligent, browsed the webpage of a series of cuisines aspect, then put a page of introducing Baconic's chicken croquette; Described application process embodiment 2 specifically can comprise:
Step S1, browser client are sent to the browser server end by the page info of current browsing pages;
Step S2, browser server end are analyzed the content of current browsing pages, carry out participle and word frequency statistics, dictionary coupling with the server end storage, judge the dictionary that the server end storage appears repeatedly and hit in " Baconic " in this current browsing pages, so using it as the encyclopaedia entry;
It is encyclopaedia lexical or textual analysis http://baike.baidu.com/view/1102.htm that step S3, browser server end obtain this entry information of corresponding entry information according to this encyclopaedia entry retrieval, can find out, it is a polysemant, corresponding British philosopher, Ireland artist, the multinomial encyclopaedia lexical or textual analysis such as Baconic's meat products;
Step S4, analyze the access history record of active user at browser, be front 100 accessed web pages, can be by order to obtain one or more in scheme by above-mentioned three kinds of page classifications, obtaining the page classification of each webpage, occurrence number is maximum, be the user profile classification that page classification that frequency is the highest is the active user, the user profile classification that can determine the active user belongs to " cuisines " classification;
Step R5, for the multinomial encyclopaedia lexical or textual analysis of correspondence, obtain in the weight of corresponding entry information category total weight that each entry information category is corresponding according to all words after participle, when total weight is maximum, corresponding entry information category is defined as corresponding entry information category;
As " British philosopher.Fo Langxisi Baconic, the most important writer of Britain's the Renaissance ... " the encyclopaedia lexical or textual analysis; due to " philosophy "; entries such as " the Renaissances " is very high at the weighted value of entry information category " history "; calculate this encyclopaedia lexical or textual analysis the highest at the weighted value of " history " classification, thereby be defined as " history " classification; As " Irish artist.Be born in Dublin, Ireland ... .. " the encyclopaedia lexical or textual analysis; due to " artist "; entries such as " Art Museums " is very high at the weighted value of entry information category " art ", calculates this encyclopaedia lexical or textual analysis the highest at the weighted value of " art " classification, thereby is defined as " art " classification; As " Baconic's meat products.Original meaning is the sootiness brisket ... .. " the encyclopaedia lexical or textual analysis; due to " meat products "; entries such as " briskets " is very high at the weighted value of entry information category " cuisines ", calculates this encyclopaedia lexical or textual analysis the highest in the weight of " cuisines " classification, thereby is defined as " cuisines " classification.
Step R6, browser server end, according to active user's user profile classification, select an encyclopaedia lexical or textual analysis from the multinomial encyclopaedia lexical or textual analysis of this encyclopaedia entry;
User profile classification " cuisines " the entry information category " cuisines " corresponding with " Baconic's meat products " of determining the active user is complementary most, selects " Baconic's meat products " item as encyclopaedia lexical or textual analysis to be sent.
Step R7, browser server end are encapsulated as the XML form by selected this encyclopaedia lexical or textual analysis, and return to browser client.
The example of the encyclopaedia lexical or textual analysis of the XML form after this provides a kind of encapsulation:
Figure BDA00001728841800221
Step R8, browser client receive the encyclopaedia lexical or textual analysis that is encapsulated as the XML form, the encyclopaedia lexical or textual analysis of XML form is converted to html format, and the encyclopaedia lexical or textual analysis of html format is embedded in to plug-in unit, by this plug-in unit, utilize the front end script technology to load the encyclopaedia lexical or textual analysis of html format, and represented.
In a word, application process embodiment 2 represents and maximally related that encyclopaedia lexical or textual analysis of user profile classification automatically, therefore can use as far as possible little zone to represent an entry information of the demand of being close to the users most, represent entry information corresponding in polysemant to the user exactly.
In other embodiments, analyze the page info of current browsing pages, obtain corresponding encyclopaedia entry and be not limited to the browser server end, browser client also can be carried out this operation; The entity in like manner related in step 302 ~ step 305 also is not limited to the browser server end, and browser client also can be carried out corresponding operation, completes the described logical process of the application, and corresponding entry information is represented to the user.
With reference to Fig. 4, show a kind of structural drawing that represents the device embodiment of entry information of the application, specifically can comprise:
Sending module 401, be sent to the browser server end for the page info by current browsing pages;
Receiver module 402, for receiving the entry information of the encyclopaedia entry corresponding with described current browsing pages; At described encyclopaedia entry, corresponding entry information is one when above to described entry information, after by the browser server end, every entry information analysis being selected, is returned; And
Represent module 403, for described entry information is represented.
In a preferred embodiment of the present application, the described module 403 that represents, can be represented with described entry information specifically for adopting bullet window shape formula pair.
In another preferred embodiment of the present application, the described module 403 that represents, can be estimated the length of described entry information when described entry information is represented, and in conjunction with estimation results, represents described entry information.
For the device embodiment that represents entry information, because it is substantially similar to the embodiment of the method that represents entry information, so description is fairly simple, relevant part gets final product referring to the part explanation of the embodiment of the method that represents entry information.
With reference to Fig. 5, show a kind of structural drawing that obtains the device embodiment of entry information of the application, specifically can comprise:
Encyclopaedia entry acquisition module 501, for analyzing the page info of current browsing pages, obtain corresponding encyclopaedia entry;
Encyclopaedia entry retrieval module 502, for obtaining corresponding entry information according to described encyclopaedia entry retrieval;
Information Selection module 503, be one for the entry information corresponding at described encyclopaedia entry and every entry information analyzed when above, and select a corresponding entry information; And
Return to module 504, for selected this entry information is returned to browser client.
In a preferred embodiment of the present application, described Information Selection module 503, can, specifically for the classification according to described current browsing pages and/or active user's user profile classification, select an entry information from the entry information more than one of described encyclopaedia entry.
In another preferred embodiment of the present application, the described device that obtains entry information can also comprise:
Entry information category acquisition module, for the entry information category of every entry information of obtaining described encyclopaedia entry;
Described Information Selection module, an entry information can mating most specifically for the classification of selecting entry information category and described current browsing pages the entry information more than one from described encyclopaedia entry and/or active user's user profile classification.
In another preferred embodiment of the application, the described device that obtains entry information can also comprise:
Page classification acquisition module, for analyzing the page info of described current browsing pages, obtain the classification of corresponding current browsing pages.
In a preferred embodiment of the present application, the described device that obtains entry information can also comprise:
User profile classification acquisition module, for analyzing described active user's use historical information, obtain corresponding user profile classification.
In a preferred embodiment of the present application, described page classification acquisition module may further include:
The first page classification is obtained submodule, for analyzing the URL(uniform resource locator) information of described current browsing pages, obtains the classification of corresponding current browsing pages; And/or
The second page Noodles does not obtain submodule, and for analyzing the crumbs of described current browsing pages, navigation obtains the classification of corresponding current browsing pages; And/or
The 3rd page classification is obtained submodule, for analyzing encyclopaedia entry described in described current browsing pages in the weight of each set classification, obtain total weight of each set classification of current browsing pages, and using the set classification of total weight maximum as the page classification of current browsing pages.
In the embodiment of the present application, preferably, described use historical information specifically can comprise: browser access historical record and/or input historical record.
In a preferred embodiment of the present application, described user profile classification acquisition module may further include:
The first user information category obtains submodule, obtains the page classification of described active user's the browser access historical record page for analysis, and using frequency the highest page classification as active user's user profile classification; And/or
The second user profile classification is obtained submodule, for the described active user's of analytic statistics input historical record, obtains the described vocabulary classification of described input historical record, and using frequency the highest vocabulary classification as active user's user profile classification.
In a preferred embodiment of the present application, the page info of described current browsing pages can comprise the content of current browsing pages;
Described encyclopaedia entry acquisition module may further include:
The participle submodule, carry out word segmentation processing for the content to described current browsing pages, obtains corresponding entry;
Matched sub-block, analyze coupling for the dictionary of the entry by described current browsing pages content and server end storage;
Determine submodule, for equivalent bar that will the match is successful as the encyclopaedia entry.
In a preferred embodiment of the present application, described encyclopaedia entry extraction module can also comprise:
The statistics submodule, the frequency occurred in described current browsing pages content for adding up each entry;
Described definite submodule, can surpass the equivalent bar of set frequency threshold as the encyclopaedia entry specifically for the frequency by the match is successful and add up.
For the device embodiment that obtains entry information, because it is substantially similar to the embodiment of the method for obtaining entry information, so description is fairly simple, relevant part gets final product referring to the part explanation of the embodiment of the method for obtaining entry information.
Those skilled in the art should understand, the application's embodiment can be provided as method, system or computer program.Therefore, the application can adopt complete hardware implementation example, implement software example or in conjunction with the form of the embodiment of software and hardware aspect fully.And the application can adopt the form that wherein includes the upper computer program of implementing of computer-usable storage medium (including but not limited to magnetic disk memory, CD-ROM, optical memory etc.) of computer usable program code one or more.
The application describes with reference to process flow diagram and/or the block scheme of method, equipment (system) and computer program according to the embodiment of the present application.Should understand can be in computer program instructions realization flow figure and/or block scheme each flow process and/or the flow process in square frame and process flow diagram and/or block scheme and/or the combination of square frame.Can provide these computer program instructions to the processor of multi-purpose computer, special purpose computer, Embedded Processor or other programmable data processing device to produce a machine, make the instruction of carrying out by the processor of computing machine or other programmable data processing device produce for realizing the device in the function of flow process of process flow diagram or a plurality of flow process and/or square frame of block scheme or a plurality of square frame appointments.
These computer program instructions also can be stored in energy vectoring computer or the computer-readable memory of other programmable data processing device with ad hoc fashion work, make the instruction be stored in this computer-readable memory produce the manufacture that comprises command device, this command device is realized the function of appointment in flow process of process flow diagram or a plurality of flow process and/or square frame of block scheme or a plurality of square frame.
These computer program instructions also can be loaded on computing machine or other programmable data processing device, make and carry out the sequence of operations step to produce computer implemented processing on computing machine or other programmable devices, thereby the instruction of carrying out on computing machine or other programmable devices is provided for realizing the step of the function of appointment in flow process of process flow diagram or a plurality of flow process and/or square frame of block scheme or a plurality of square frame.
Although described the application's preferred embodiment, once those skilled in the art obtain the basic creative concept of cicada, can make other change and modification to these embodiment.So claims are intended to all changes and the modification that are interpreted as comprising preferred embodiment and fall into the application's scope.
Each embodiment in this instructions all adopts the mode of going forward one by one to describe, and what each embodiment stressed is and the difference of other embodiment that between each embodiment, identical similar part is mutually referring to getting final product.
Above a kind of method and apparatus that represents and obtain entry information that the application is provided, be described in detail, applied specific case herein the application's principle and embodiment are set forth, the explanation of above embodiment is just for helping to understand the application's method and core concept thereof; Simultaneously, for one of ordinary skill in the art, the thought according to the application, all will change in specific embodiments and applications, and in sum, this description should not be construed as the restriction to the application.

Claims (18)

1. a method of obtaining entry information, is characterized in that, comprising:
Analyze the page info of current browsing pages, obtain corresponding encyclopaedia entry;
Obtain corresponding entry information according to described encyclopaedia entry retrieval;
At described encyclopaedia entry, corresponding entry information is one and every entry information is analyzed when above, and selects a corresponding entry information;
Selected this entry information is returned.
2. the method for claim 1, is characterized in that, described every entry information analyzed, and selected the step of a corresponding entry information further to comprise:
According to the classification of described current browsing pages and/or active user's user profile classification, select an entry information from described entry information.
3. method as claimed in claim 2, is characterized in that, also comprises:
Obtain the entry information category of every entry information of described encyclopaedia entry;
The described classification according to described current browsing pages and/or active user's user profile classification, from described entry information, select the step of an entry information to be specially, the entry information that the classification of selection entry information category and described current browsing pages and/or active user's user profile classification is mated most from the entry information more than one of described encyclopaedia entry.
4. method as claimed in claim 1 or 2, is characterized in that, also comprises:
Analyze the page info of described current browsing pages, obtain the classification of corresponding current browsing pages.
5. method as claimed in claim 1 or 2, is characterized in that, also comprises:
Analyze described active user's use historical information, obtain corresponding user profile classification.
6. method as claimed in claim 4, is characterized in that, the page info of the described current browsing pages of described analysis obtains the step of the classification of corresponding current browsing pages, comprising:
Analyze the URL(uniform resource locator) information of described current browsing pages, obtain the classification of corresponding current browsing pages; And/or
Analyze the crumbs of described current browsing pages, navigation obtains the classification of corresponding current browsing pages; And/or
Analyze encyclopaedia entry described in described current browsing pages respectively in the weight of each set classification, obtain total weight of each set classification of current browsing pages, and using the set classification of total weight maximum as the classification of current browsing pages.
7. method as claimed in claim 5, is characterized in that, described use historical information comprises: active user's browser access historical record and/or input historical record.
8. method as claimed in claim 7, is characterized in that, the described active user's of described analysis use historical information obtains the step of corresponding user profile classification, comprising:
The page classification of the corresponding page in described active user's browser access historical record is obtained in analysis, and using frequency the highest page classification as active user's user profile classification; And/or
The described active user's of analytic statistics input historical record, obtain the vocabulary classification that described input historical record is corresponding, and using frequency the highest vocabulary classification as active user's user profile classification.
9. the method for claim 1, is characterized in that, the page info of described current browsing pages comprises the content of current browsing pages;
The page info of the current browsing pages of described analysis, obtain the step of corresponding encyclopaedia entry, comprising:
Content to described current browsing pages is carried out word segmentation processing, obtains corresponding entry;
The dictionary of described entry and server end storage is analyzed to coupling;
Using the equivalent bar that the match is successful as the encyclopaedia entry.
10. method as claimed in claim 9, is characterized in that, also comprises:
Add up the frequency that each entry occurs at described current browsing pages;
General's equivalent bar that the match is successful is specially as the step of encyclopaedia entry, using the match is successful and the frequency of statistics surpasses the equivalent bar of set frequency threshold as the encyclopaedia entry.
11. a method that represents entry information, is characterized in that, comprising:
The page info of current browsing pages is sent;
Receive the entry information of the encyclopaedia entry corresponding with described current browsing pages; At described encyclopaedia entry, corresponding entry information is one when above to described entry information, after every entry information analysis is selected, is returned;
Described entry information is represented.
12. method as claimed in claim 11, is characterized in that, described when described entry information is represented, and adopts to play the window shape formula described entry information is represented.
13. method as claimed in claim 11, is characterized in that, described when described entry information is represented, and the length of described entry information is estimated, and in conjunction with estimation results, represents described entry information.
14. a device that obtains entry information, is characterized in that, comprising:
Encyclopaedia entry acquisition module, for analyzing the page info of current browsing pages, obtain corresponding encyclopaedia entry;
Encyclopaedia entry retrieval module, for obtaining corresponding entry information according to described encyclopaedia entry retrieval;
The Information Selection module, be one for the entry information corresponding at described encyclopaedia entry and every entry information analyzed when above, and select a corresponding entry information; And
Return to module, for selected this entry information is returned.
15. device as claimed in claim 14, is characterized in that, described Information Selection module, specifically for the classification according to described current browsing pages and/or active user's user profile classification, is selected an entry information from described entry information.
16. device as claimed in claim 15, is characterized in that, also comprises:
Entry information category acquisition module, for the entry information category of every entry information of obtaining described encyclopaedia entry;
Described Information Selection module, an entry information of mating most specifically for the classification of selecting entry information category and described current browsing pages the entry information more than one from described encyclopaedia entry and/or active user's user profile classification.
17. a device that represents entry information, is characterized in that, comprising:
Sending module, sent for the page info by current browsing pages;
Receiver module, for receiving the entry information of the encyclopaedia entry corresponding with described current browsing pages; At described encyclopaedia entry, corresponding entry information is one when above to described entry information, after every entry information analysis is selected, is returned; And
Represent module, for described entry information is represented.
18. device as claimed in claim 17, is characterized in that, the described module that represents is represented with described entry information specifically for adopting bullet window shape formula pair.
CN201210183870.8A 2012-06-05 2012-06-05 Method and device for displaying and acquiring entry information Active CN103455524B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210183870.8A CN103455524B (en) 2012-06-05 2012-06-05 Method and device for displaying and acquiring entry information

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210183870.8A CN103455524B (en) 2012-06-05 2012-06-05 Method and device for displaying and acquiring entry information

Publications (2)

Publication Number Publication Date
CN103455524A true CN103455524A (en) 2013-12-18
CN103455524B CN103455524B (en) 2021-06-22

Family

ID=49737902

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210183870.8A Active CN103455524B (en) 2012-06-05 2012-06-05 Method and device for displaying and acquiring entry information

Country Status (1)

Country Link
CN (1) CN103455524B (en)

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103823868A (en) * 2014-02-26 2014-05-28 中国科学院计算技术研究所 Event recognition method and event relation extraction method oriented to on-line encyclopedia
CN104102739A (en) * 2014-07-28 2014-10-15 百度在线网络技术(北京)有限公司 Entity library expansion method and device
CN104537080A (en) * 2014-12-31 2015-04-22 北京畅游天下网络技术有限公司 Information recommendation method and system
CN104951450A (en) * 2014-03-26 2015-09-30 国际商业机器公司 Information processing method and system
CN105095441A (en) * 2015-07-23 2015-11-25 百度在线网络技术(北京)有限公司 Information acquisition method and device
CN106202041A (en) * 2016-07-01 2016-12-07 北京奇虎科技有限公司 A kind of method and apparatus of the entity alignment problem solved in knowledge mapping
CN106454426A (en) * 2016-10-27 2017-02-22 四川长虹电器股份有限公司 Method for identifying analog channel of intelligent television
CN106802921A (en) * 2016-12-19 2017-06-06 福建天泉教育科技有限公司 Entry exhibiting method and represent system
CN106997363A (en) * 2016-01-26 2017-08-01 华为技术有限公司 A kind of data processing method and equipment
CN107679043A (en) * 2017-09-22 2018-02-09 广州阿里巴巴文学信息技术有限公司 Data processing method, device and terminal device
CN107885888A (en) * 2017-12-11 2018-04-06 北京百度网讯科技有限公司 Information processing method and device, terminal device and computer-readable recording medium
CN108427508A (en) * 2017-02-15 2018-08-21 北京搜狗科技发展有限公司 Input method and device, the method and apparatus for establishing LAN dictionary
CN109002292A (en) * 2018-06-11 2018-12-14 广州环通信息技术有限公司 A kind of bullet frame realization method and system based on webpage ejection layer
CN109271615A (en) * 2017-07-13 2019-01-25 北京搜狗科技发展有限公司 Entry processing method, device and machine readable media
CN110209814A (en) * 2019-05-23 2019-09-06 西安交通大学 A method of knowledget opic is extracted from encyclopaedic knowledge website using field modeling
CN111666018A (en) * 2020-06-08 2020-09-15 上海连尚网络科技有限公司 Reading content processing method, electronic device and medium
CN113127641A (en) * 2021-04-23 2021-07-16 北京字节跳动网络技术有限公司 Encyclopedic entry display method, encyclopedic entry display device, encyclopedic entry display equipment, encyclopedic entry display medium and program product

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101216842A (en) * 2008-01-07 2008-07-09 华为技术有限公司 Method for obtaining page key words and page information processing apparatus
KR20080086217A (en) * 2007-03-22 2008-09-25 주식회사 시공미디어 Information providing method/system for mobile phone
CN101827320A (en) * 2010-02-04 2010-09-08 重庆索伦互联网信息服务有限公司 3G network-based method for transmitting encyclopedic data to mobile terminal
CN101976246A (en) * 2010-09-30 2011-02-16 互动在线(北京)科技有限公司 Classification retrieval method for encyclopedia entries
CN102129454A (en) * 2011-03-08 2011-07-20 国网信息通信有限公司 Method and system for processing encyclopaedia data based on cloud storage
CN102314456A (en) * 2010-06-30 2012-01-11 百度在线网络技术(北京)有限公司 Web page move search method and system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20080086217A (en) * 2007-03-22 2008-09-25 주식회사 시공미디어 Information providing method/system for mobile phone
CN101216842A (en) * 2008-01-07 2008-07-09 华为技术有限公司 Method for obtaining page key words and page information processing apparatus
CN101827320A (en) * 2010-02-04 2010-09-08 重庆索伦互联网信息服务有限公司 3G network-based method for transmitting encyclopedic data to mobile terminal
CN102314456A (en) * 2010-06-30 2012-01-11 百度在线网络技术(北京)有限公司 Web page move search method and system
CN101976246A (en) * 2010-09-30 2011-02-16 互动在线(北京)科技有限公司 Classification retrieval method for encyclopedia entries
CN102129454A (en) * 2011-03-08 2011-07-20 国网信息通信有限公司 Method and system for processing encyclopaedia data based on cloud storage

Cited By (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103823868B (en) * 2014-02-26 2017-05-03 中国科学院计算技术研究所 Event recognition method and event relation extraction method oriented to on-line encyclopedia
CN103823868A (en) * 2014-02-26 2014-05-28 中国科学院计算技术研究所 Event recognition method and event relation extraction method oriented to on-line encyclopedia
US10229335B2 (en) 2014-03-26 2019-03-12 International Business Machines Corporation Displaying the meaning of selected text
CN104951450A (en) * 2014-03-26 2015-09-30 国际商业机器公司 Information processing method and system
CN104102739B (en) * 2014-07-28 2018-03-06 百度在线网络技术(北京)有限公司 A kind of method and device for expanding entity storehouse
CN104102739A (en) * 2014-07-28 2014-10-15 百度在线网络技术(北京)有限公司 Entity library expansion method and device
CN104537080A (en) * 2014-12-31 2015-04-22 北京畅游天下网络技术有限公司 Information recommendation method and system
CN104537080B (en) * 2014-12-31 2018-08-07 北京畅游天下网络技术有限公司 Information recommends method and system
WO2017012234A1 (en) * 2015-07-23 2017-01-26 百度在线网络技术(北京)有限公司 Information acquiring method and apparatus, device, and computer storage medium
CN105095441A (en) * 2015-07-23 2015-11-25 百度在线网络技术(北京)有限公司 Information acquisition method and device
CN106997363A (en) * 2016-01-26 2017-08-01 华为技术有限公司 A kind of data processing method and equipment
CN106202041B (en) * 2016-07-01 2019-07-09 北京奇虎科技有限公司 A kind of method and apparatus of entity alignment problem in solution knowledge mapping
CN106202041A (en) * 2016-07-01 2016-12-07 北京奇虎科技有限公司 A kind of method and apparatus of the entity alignment problem solved in knowledge mapping
CN106454426A (en) * 2016-10-27 2017-02-22 四川长虹电器股份有限公司 Method for identifying analog channel of intelligent television
CN106454426B (en) * 2016-10-27 2019-04-30 四川长虹电器股份有限公司 A kind of method of identification intelligent TV analog channel
CN106802921A (en) * 2016-12-19 2017-06-06 福建天泉教育科技有限公司 Entry exhibiting method and represent system
CN108427508A (en) * 2017-02-15 2018-08-21 北京搜狗科技发展有限公司 Input method and device, the method and apparatus for establishing LAN dictionary
CN108427508B (en) * 2017-02-15 2024-01-19 北京搜狗科技发展有限公司 Input method and device, and method and device for establishing local area network word stock
CN109271615A (en) * 2017-07-13 2019-01-25 北京搜狗科技发展有限公司 Entry processing method, device and machine readable media
CN109271615B (en) * 2017-07-13 2023-10-31 北京搜狗科技发展有限公司 Entry processing method, apparatus and machine readable medium
CN107679043A (en) * 2017-09-22 2018-02-09 广州阿里巴巴文学信息技术有限公司 Data processing method, device and terminal device
CN107885888A (en) * 2017-12-11 2018-04-06 北京百度网讯科技有限公司 Information processing method and device, terminal device and computer-readable recording medium
CN109002292A (en) * 2018-06-11 2018-12-14 广州环通信息技术有限公司 A kind of bullet frame realization method and system based on webpage ejection layer
CN110209814A (en) * 2019-05-23 2019-09-06 西安交通大学 A method of knowledget opic is extracted from encyclopaedic knowledge website using field modeling
CN110209814B (en) * 2019-05-23 2021-02-02 西安交通大学 Method for extracting knowledge topic from encyclopedic knowledge website by utilizing domain modeling
CN111666018A (en) * 2020-06-08 2020-09-15 上海连尚网络科技有限公司 Reading content processing method, electronic device and medium
CN113127641A (en) * 2021-04-23 2021-07-16 北京字节跳动网络技术有限公司 Encyclopedic entry display method, encyclopedic entry display device, encyclopedic entry display equipment, encyclopedic entry display medium and program product

Also Published As

Publication number Publication date
CN103455524B (en) 2021-06-22

Similar Documents

Publication Publication Date Title
CN103455524A (en) Method and device for displaying and acquiring entry information
CN107609152B (en) Method and apparatus for expanding query expressions
US8600979B2 (en) Infinite browse
CN102708174B (en) Method and device for displaying rich media information in browser
US7519588B2 (en) Keyword characterization and application
US9582503B2 (en) Interactive addition of semantic concepts to a document
CN103631794B (en) A kind of method, apparatus and equipment for being ranked up to search result
CN107480158A (en) The method and system of the matching of content item and image is assessed based on similarity score
US8874586B1 (en) Authority management for electronic searches
US20110125759A1 (en) Method and system to contextualize information being displayed to a user
JP2017157192A (en) Method of matching between image and content item based on key word
US10402479B2 (en) Method, server, browser, and system for recommending text information
CN104598556A (en) Search method and search device
JP2015191655A (en) Method and apparatus for generating recommendation page
JP5340491B2 (en) Related word registration device, information processing device, related word registration method, program for related word registration device, recording medium, and related word registration system
CN105045864B (en) A kind of digitalization resource personalized recommendation method
CN104503988B (en) searching method and device
CN103678325A (en) Method and device for providing browsing page corresponding to initial page
CN105022775A (en) Apparatus and method for structuring web page access history
CN104090757A (en) Method and device for displaying rich media information in browser
CN107491465A (en) For searching for the method and apparatus and data handling system of content
CN104090923A (en) Method and device for displaying rich media information in browser
CN103745380A (en) Advertisement delivery method and apparatus
CN109033441A (en) A kind of method for pushing and device based on big data analysis
CN103955480A (en) Method and equipment for determining target object information corresponding to user

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant