WO2012122718A1 - 一种浏览器预读方法及其系统 - Google Patents

一种浏览器预读方法及其系统 Download PDF

Info

Publication number
WO2012122718A1
WO2012122718A1 PCT/CN2011/071915 CN2011071915W WO2012122718A1 WO 2012122718 A1 WO2012122718 A1 WO 2012122718A1 CN 2011071915 W CN2011071915 W CN 2011071915W WO 2012122718 A1 WO2012122718 A1 WO 2012122718A1
Authority
WO
WIPO (PCT)
Prior art keywords
group
webpage
browsing record
personal
record feature
Prior art date
Application number
PCT/CN2011/071915
Other languages
English (en)
French (fr)
Inventor
梁捷
江蔚然
Original Assignee
广州市动景计算机科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 广州市动景计算机科技有限公司 filed Critical 广州市动景计算机科技有限公司
Priority to US13/381,342 priority Critical patent/US8527585B2/en
Priority to PCT/CN2011/071915 priority patent/WO2012122718A1/zh
Priority to CN201180003569.8A priority patent/CN102804735B/zh
Publication of WO2012122718A1 publication Critical patent/WO2012122718A1/zh
Priority to US13/965,069 priority patent/US9094478B2/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/957Browsing optimisation, e.g. caching or content distillation

Definitions

  • the present invention relates to the field of browser related technologies, and in particular, to a browser pre-reading method and system thereof.
  • the server predicts which files need to be pre-loaded based on the user history browsing behavior and webpage layout in the client, according to the pre-loaded when the user performs webpage browsing.
  • the file implements the read-ahead function of the web page.
  • a browser read-ahead method comprising:
  • the browser client submits the first webpage access request to the target server and uploads the personal browsing record feature of the first webpage;
  • the transit server forms a read-ahead strategy according to the received personal browsing record feature of the first webpage and the saved at least one group browsing record feature of the first webpage;
  • the transit server obtains the webpage from the target server according to the pre-reading strategy and sends the webpage to the browser client cache.
  • the relay server forms a read-ahead strategy according to the received personal browsing record feature of the first webpage and the saved at least one group browsing record feature of the first webpage, including: one on the first webpage or The plurality of page elements are sorted according to the pre-reading preference score, and the transit server obtains the link content as the pre-read content according to the link address included in the pre-read preference score of the page element of the top K, wherein K is a natural number greater than or equal to 1.
  • the pre-read preference score is calculated according to the following preference rules:
  • Pre-read preference score of page element personal preference coefficient ⁇ personal browsing record feature weight + group preference coefficient ⁇ group browsing record feature weight;
  • Each page element sets a personal preference coefficient according to a personal browsing record feature, and each page element sets a group preference coefficient according to the group browsing record feature, and presets a personal browsing record feature weight corresponding to the personal browsing record feature and one or more The group browsing record feature features corresponding to the group browsing record feature weight.
  • the personal browsing record feature is a personal access frequency of one or more page elements including a link address on the first webpage
  • the group browsing record feature is one of the relay server to the first webpage. Or group access frequency for multiple page elements.
  • the sum of the personal browsing record feature weight and one or more group browsing record feature weights is one.
  • the personal preference coefficient is a personal access frequency of a page element
  • the group preference coefficient is a group access frequency of a page element
  • the pre-reading strategy further includes: the personal preference coefficient is set according to a personal access frequency order of page elements of the first webpage, and the group preference coefficient is based on a group access frequency of the page element of the first webpage. The order is set in the appropriate order.
  • the pre-reading strategy further includes: sorting one or more page elements on the first webpage according to a pre-read preference score, and the relay server assigns the pre-read preference score to a page element of the top K name.
  • the content of the link obtained by the included link address is rearranged and merged, and the merged content is read-aheaded.
  • the method includes a first group browsing record feature, and the group access frequency of the first group browsing record feature to one or more page elements on the first web page is determined by:
  • the transit server performs statistical analysis on the first webpage. If the page element including the link address includes a key point, the frequency of the historical access key is queried as the group access frequency of the page element including the key point, and the key point is determined by historical statistics. .
  • the browser client also uploads one or more personal identity features associated with the user identity
  • the relay server also retaining one or more community identity features associated with the user community identity
  • the method also includes the second group browsing record feature, the group access frequency of the second group browsing record feature for the one or more page elements on the first web page being determined by:
  • the transit server performs statistical analysis on the first webpage. If the page element including the link address includes a key point, the frequency of the historical access key of the group identity feature corresponding to the personal identity feature is queried as a group access of the page element including the key point. Frequency, the key points are determined by historical statistics.
  • the group access frequency of one or more page elements on the first webpage is determined by:
  • the relay server performs statistical analysis on the first webpage. If the page element including the link address includes a key point, the frequency of accessing the key point when the history visits the first webpage is used as the group access frequency of the page element including the key point.
  • the key point is a keyword or a key map.
  • the method includes a third group browsing record feature, and the group access frequency of the third group browsing record feature to one or more page elements on the first web page is determined by:
  • the relay server performs statistical analysis on the first webpage, and queries the historical frequency of jumping to the second webpage after accessing the first webpage;
  • the page element associated with the second webpage is determined according to the link address included in the page element on the first webpage, and the group access frequency of the page element is obtained.
  • the browser client also uploads one or more personal identity features associated with the user identity
  • the relay server also retaining one or more community identity features associated with the user community identity
  • the method also includes a fourth group browsing record feature, the group access frequency of the fourth group browsing record feature for the one or more page elements on the first web page being determined by:
  • the transit server performs statistical analysis on the first webpage, and queries the historical frequency of the group identity feature corresponding to the personal identity feature to jump to the third webpage after accessing the first webpage;
  • the page element associated with the third webpage is determined according to the link address included in the page element on the first webpage, and the group access frequency of the page element is obtained.
  • the browser client is a mobile communication device terminal.
  • a browser read-ahead system comprising:
  • a personal browsing record feature uploading module configured to be used by the browser client to submit a first webpage access request to the target server and upload a personal browsing record feature of the first webpage
  • a pre-reading policy forming module configured to form a read-ahead policy by the relay server according to the received personal browsing record feature of the first webpage and the saved at least one group browsing record feature of the first webpage;
  • the pre-reading policy forming module forms a pre-reading policy according to the received personal browsing record feature of the first webpage and the saved at least one group browsing record feature of the first webpage, including: on the first webpage
  • the one or more page elements are sorted according to the pre-read preference score, and the transit server obtains the link content as the pre-read content according to the link address included in the pre-read preference score of the page element of the top K, wherein K is greater than or equal to The natural number of 1.
  • the pre-read preference score is calculated according to the following preference rules:
  • Pre-read preference score of page element personal preference coefficient ⁇ personal browsing record feature weight + group preference coefficient ⁇ group browsing record feature weight;
  • Each page element sets a personal preference coefficient according to a personal browsing record feature, and each page element sets a group preference coefficient according to the group browsing record feature, and presets a personal browsing record feature weight corresponding to the personal browsing record feature and one or more The group browsing record feature features corresponding to the group browsing record feature weight.
  • the personal browsing record feature is a personal access frequency of one or more page elements including a link address on the first webpage
  • the group browsing record feature is one of the relay server to the first webpage. Or group access frequency for multiple page elements.
  • the sum of the personal browsing record feature weight and one or more group browsing record feature weights is one.
  • the personal preference coefficient is a personal access frequency of a page element
  • the group preference coefficient is a group access frequency of a page element
  • the pre-reading strategy further includes: the personal preference coefficient is set according to a personal access frequency order of page elements of the first webpage, and the group preference coefficient is based on a group access frequency of the page element of the first webpage. The order is set in the appropriate order.
  • the pre-reading strategy further includes: sorting one or more page elements on the first webpage according to a pre-read preference score, and the relay server assigns the pre-read preference score to a page element of the top K name.
  • the content of the link obtained by the included link address is rearranged and merged, and the merged content is read-aheaded.
  • the pre-read policy forming module further includes a first group browsing record feature module for recording a first group browsing record feature, the first group browsing record feature to one or more pages on the first webpage
  • the group access frequency of an element is determined by:
  • the first group browsing record feature module performs statistical analysis on the first webpage. If the page element including the link address includes a key point, the frequency of the historical access key is queried as the group access frequency of the page element including the key point, the key Points are determined by historical statistics.
  • the system further includes a personal identity uploading module configured to be uploaded by the browser client for uploading one or more personal identity features associated with the user identity, configured to be saved on the relay server for saving a group identity feature saving module of one or more group identity features associated with a user group identity, the read-ahead policy forming module further comprising a second group browsing record feature module for recording a second group browsing record feature, the second group
  • the frequency of browsing the record characteristics to the group access to one or more page elements on the first web page is determined by:
  • the second group browsing record feature module performs statistical analysis on the first webpage. If the page element including the link address includes a key point, the frequency of the historical access key point of the group identity feature corresponding to the personal identity feature is queried as the key point. The group access frequency of the page elements, which are determined by historical statistics.
  • the group access frequency of the second group browsing record feature module to one or more page elements on the first web page is determined by:
  • the second group browsing record feature module performs statistical analysis on the first webpage. If the page element including the link address includes a key point, the frequency of accessing the key point when the first webpage is accessed by the query history is used as the group access of the page element including the key point. frequency.
  • the key point is a keyword or a key map.
  • the pre-read policy forming module includes a third group browsing record feature module for recording a third group browsing record feature, and the third group browsing record feature is for a group of one or more page elements on the first web page.
  • the frequency of access is determined by:
  • the third group browsing record feature module performs statistical analysis on the first webpage, and queries the historical frequency of jumping to the second webpage after accessing the first webpage;
  • the page element associated with the second webpage is determined according to the link address included in the page element on the first webpage, and the group access frequency of the page element is obtained.
  • the system further includes a personal identity feature uploading module disposed on the browser client for uploading one or more personal identity features associated with the user identity, configured to be stored on the relay server for saving the user community a group identity feature saving module of one or more group identity features associated with the identity, the read-ahead policy forming module further comprising a fourth group browsing record feature, the fourth group browsing record feature for one or more of the first webpage
  • the frequency of group visits for page elements is determined by:
  • the fourth group browsing record feature module performs statistical analysis on the first webpage, and queries the historical frequency of the group identity feature corresponding to the personal identity feature to jump to the third webpage after accessing the first webpage;
  • the page element associated with the third webpage is determined according to the link address included in the page element on the first webpage, and the group access frequency of the page element is obtained.
  • the browser client is a mobile communication device terminal.
  • the invention performs webpage pre-reading by combining the access habits and preferences of individual users with the access history of a large number of users, and performs weights and preference coefficients for different webpage page elements.
  • the calculation and analysis obtains the page that the user is most likely to click, which makes the pre-reading more accurate, and the pre-reading success rate is greatly improved.
  • the page is downloaded when idle, the user basically does not have to wait, which can save the user time very well.
  • the invention is applied to the pre-reading of various webpages, which greatly improves the user experience of the mobile browser.
  • FIG. 1 is a system frame diagram of an embodiment of the present invention
  • Figure 2 is a flow chart of the first embodiment of the present invention.
  • Figure 3 is a flow chart of a second embodiment of the present invention.
  • FIG. 4 is a structural diagram of a system according to an embodiment of the present invention.
  • Figure 5 is an example of a key figure.
  • FIG. 1 is a block diagram of an embodiment of the present invention, including a mobile phone browser client 1 accessing a target server 3 through a relay server 2, and the relay server 2 is also connected to a mass user history access behavior statistics server 4.
  • Step S110 the browser client submits a first webpage access request to the target server and uploads a personal browsing record feature of the first webpage;
  • Step S120 The relay server forms a read-ahead strategy according to the received personal browsing record feature of the first webpage and the saved at least one group browsing record feature of the first webpage;
  • Step S130 The relay server goes to the target server to obtain the webpage according to the pre-reading policy and sends the webpage to the browser client cache.
  • the pre-reading strategy in step S120 is implemented as follows:
  • the page elements that can trigger the user's click behavior after the webpage is read-ahead include three types: URL, text with URL, and image with URL.
  • the function of the transit server is to calculate the statistical result of the user's historical access behavior data according to the habit of the individual user to access the webpage, and to give the page element most likely to be clicked by the user, and push it to the browser client.
  • the personal browsing record feature in step S120 includes a personal access frequency of one or more page elements including a link address on the first webpage, the group browsing record feature being one or more of the relay server to the first webpage
  • the group access frequency of the page element, and the pre-reading strategy determines the pre-read content according to the personal access frequency and the group access frequency, and if the page element not including the link address on the first webpage returns the unreadable content and exits.
  • the pre-reading strategy includes: sorting one or more page elements on the first webpage according to the pre-reading preference score, and the transit server obtains the link content according to the pre-reading preference score as the link address included in the page element of the top K name as a pre-preview Reading the content, K is a natural number greater than or equal to 1, each page element sets a personal preference coefficient according to the personal access frequency, and each page element sets a group preference coefficient according to the group access frequency, and presets a correspondence corresponding to the personal browsing record feature.
  • the personal browsing record feature weight and the group browsing record feature weight corresponding to one or more group browsing record features, the pre-reading preference score is calculated according to the following preference rules:
  • Pre-read preference score of page element personal preference coefficient ⁇ personal browsing record feature weight + group preference coefficient ⁇ group browsing record feature weight;
  • the personal preference coefficient is based on individual user habits, which are the most frequently accessed page elements after accessing the webpage, for example, three, respectively the most frequently visited URL, the text with the URL and the image with the URL, the preference coefficient According to the user's habits, for example, according to the user's habits, he visits the webpage to click on the image up to 400 times, clicks on the text and the URL is less than 300 times, then the page element 1 (corresponding to the text with a URL preference coefficient of 0.3) ), page element 2 (corresponding to a URL preference coefficient of 0.3), page element 3 (corresponding to a picture preference coefficient with a URL of 0.4).
  • the personal browsing record feature weight is 0.7, and for other page elements including the link URL, the preference coefficient can be set to zero.
  • the group browsing record feature 1 is a statistical analysis of the first webpage by the relay server. If the page element including the link address includes a keyword, the massive user history access behavior statistics server is queried, and the historical access is obtained.
  • the frequency of the keyword is the group access frequency of the page element including the keyword, wherein the keyword is determined by historical statistics by the massive user history access behavior statistics server.
  • page element 1 For example, 3 page elements, page element 1 includes the keyword 'next page', page element 2 includes 'next chapter', and page element 4 includes 'news', then page element 1 is set to clicks 60000 times (preference)
  • the coefficient is 0.6
  • the number of clicks of page element 2 is 30,000
  • the preference coefficient is 0.3
  • the number of clicks of page element 4 is 10,000 (the preference coefficient is 0.1).
  • the group browsing record feature weight 1 is 0.2.
  • the group browsing record feature 2 queries a massive user history access behavior statistics server based on a key map, which refers to a hyperlink based on a site that is most directional in a certain webpage, and uses a picture URL to mark, in a certain In a webpage of a website, it is possible to pre-read the key map as a finger, or an arrow, which is usually displayed in the form of a picture, as shown in FIG. 5, the most accurate determination server from the web page is determined by the massive user history access behavior statistics server.
  • a key map which refers to a hyperlink based on a site that is most directional in a certain webpage
  • a picture URL to mark, in a certain In a webpage of a website, it is possible to pre-read the key map as a finger, or an arrow, which is usually displayed in the form of a picture, as shown in FIG. 5, the most accurate determination server from the web page is determined by the massive user history access behavior statistics server.
  • the frequently accessed pre-read key map is that the number of clicks of page element 3 is 80,000 times (the preference coefficient is 0.8), the number of clicks of page element 5 is 20,000 times (the preference coefficient is 0.2), and the group browsing record feature weight 2 Is 0.1.
  • k is set according to experience, and may be any number between 3 and tens.
  • the sum of the preference coefficient n in a personal browsing record feature or a group browsing record feature is 1, the personal browsing record feature and one or
  • the sum of the weights of the plurality of group browsing record features is also 1, and how to allocate the actual operation needs to be set according to the empirical value.
  • the characteristics of the group browsing record that are considered in the above-mentioned pre-reading strategy are as shown in Table 1, and may specifically include:
  • the global-based keyword that is, the querying the massive user historical access behavior statistics server, obtains the frequency of the historical access to the keyword as the group access frequency, wherein the keyword is determined by the historical statistics of the massive user historical access behavior statistics server;
  • a key map based on the domain name that is, querying a massive user historical access behavior statistics server, and obtaining the frequency of historical access to the key map under the same domain name as the group access frequency;
  • the link picture based on the historical operation habits of the user of a certain webpage that is, the query historical server of the massive user history access behavior, obtains the frequency of accessing the same link text by the same user history as the group access frequency.
  • Table 1 group browsing record feature table
  • the above-mentioned pre-reading strategy can also directly adopt the access frequency of the page element, and also adopt the above example:
  • the group browsing record feature weight needs to be set relatively small, the personal browsing record feature weight is set to 0.997, and the first group browsing record feature weight is set to 0.002, and the second group browsing record feature weight is 0.001.
  • the browser client submits a webpage access request to the target server through the relay server, and uploads the personal browsing record feature of the client using the browser client to access the first webpage and one or more personal identity features associated with the user identity;
  • the transit server accesses the personal browsing history feature of the first webpage, the personal identity feature, and one or more group browsing record characteristics and group identity features of the first webpage saved by the client using the browser client uploaded by the browser client. Determining pre-reading content according to a pre-reading strategy;
  • the S340 browser client stores the pre-read content in the cache.
  • the identity characteristics associated with the identity of the customer and the identity of the customer such as: gender, job type, education and other related personal identity characteristics;
  • the group identity is counted by the massive user history access behavior statistics server, and the identity characteristics associated with all customer identities, such as gender, job type, education, and so on.
  • the pre-reading strategy is basically the same as the first embodiment, except that the preference coefficient and the group browsing record feature weight are classified based on the group identity feature. Examples are as follows:
  • page element 1 corresponding to the text preference coefficient with URL is 0.3
  • page element 2 The corresponding URL preference coefficient is 0.3
  • page element 3 corresponding to the picture preference coefficient with URL is 0.4
  • the personal browsing record feature weight is 0.7, and for other page elements including the link URL, the preference coefficient can be set to zero.
  • the gender of the user is male
  • the job type is programmer
  • the master's degree is:
  • the transit server performs statistical analysis on the first webpage, and if the page element including the link address includes a keyword, queries the massive user historical access behavior statistics server, and obtains the frequency of historical access to the keyword as the group access of the page element including the keyword.
  • the frequency, in which the keyword is determined by the historical statistics of the massive user history access behavior statistics server.
  • page element 1 includes the keyword 'next page'
  • page element 2 includes 'next chapter'
  • page element 4 includes 'news'.
  • the page element 1 corresponding coefficient of gender is 0.6
  • the page element 2 has a preference coefficient of 0.3
  • the page element 4 has a preference coefficient of 0.1.
  • the group browsing record feature weight 1 is 0.25.
  • the page element 1 corresponding coefficient of the work type is programmer, the preference coefficient is 0.3, the page element 2 has a preference coefficient of 0.5, and the page element 4 has a preference coefficient of 0.2.
  • the group browsing record feature weight 2 is 0.04.
  • the page element 1 corresponding to the master's degree has a preference coefficient of 0.8
  • the page element 2 has a preference coefficient of 0.1
  • the page element 4 has a preference coefficient of 0.1.
  • the group browsing record feature weight 3 is 0.01.
  • the transit server obtains the sub-pages of the top k (k is generally a natural number less than or equal to four), and may perform the merge rearrangement process on the obtained sub-pages with similar URLs and then send the cache to the mobile terminal.
  • the mobile terminal When the user clicks on the above keyword or the most frequently accessed page link on the current browsing page, the mobile terminal directly retrieves the pre-read page in the cache for display.
  • Figure 4 is a block diagram showing an embodiment of the present invention.
  • the browser pre-reading system 400 includes a mobile browser client 410 connected to the relay server 420, wherein the mobile browser client 410 is provided with a personal browsing record feature database 411 for saving the customer's personal browsing history features and for submitting to the target server
  • the historical user access behavior statistics module 421 is configured on the transit server 420 for storing one or more group browsing record features of the plurality of users. As shown in FIG. 1 , the historical user access behavior statistics module 421 is used in the embodiment. Massive user history access behavior statistics server 4 implementation;
  • the transit server 420 is further provided with a read-ahead policy forming module 422 according to the received personal browsing record feature of the first webpage and the saved at least one group browsing record feature of the first webpage, and a pre-reading policy according to the pre-reading strategy.
  • the target server acquires the webpage and sends it to the browser client cached read-ahead file reading module 423;
  • the browser client 410 also includes a read-ahead cache module 413 for storing pre-read content returned by the pre-read file reading module into the cache.

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Information Transfer Between Computers (AREA)

Description

一种浏览器预读方法及其系统
技术领域
本发明涉及浏览器相关技术领域,特别是一种浏览器预读方法及其系统。
背景技术
关于网页的预读,现有技术普遍存在如下做法:服务器基于客户端中的用户历史浏览行为和网页排版预测哪些文件需要被预装载,当用户进行网页浏览的时候根据所述预装载的文件实现网页的预读功能。
US7284035 (B2) 号专利申请中揭露的技术方案为:决定网页的某些特定子页被用户获取,如果决定了就优先预读这些被确定的子页。对子页的偏好通过分析用户之前访问的网页来产生。所述学习用户喜好还包括某个用户对某一网页的子页的访问历史,过去的天数会被考虑并且在本页中有多少个子页也会被考虑进去。例如,当一个用户每天早上访问同一个新闻网站并总是阅读政治,计算机,旅游和阅读栏目的文章的时候,根据本发明当新闻网页被访问时,这些喜好将会被决定。即那些与政治,计算机,旅游和阅读栏目的文章将被比其他栏目更加优先的被加载入浏览器的缓存。
可见,现有技术中可以实现对某个特定用户的历史访问行为分析进而根据其喜好来进行网页预读。这种预读只能在该用户访问过某门户网站/主题网站且经常访问该网站的前提下,预读成功率才会很高,如果当前访问的页面是没有访问过的或者是不具有喜好参数的网站则无法实现预读,可见使用现有的预读方案总体成功率很低,且应用范围很有限。
并且,现有技术中虽然可以实现特定子页的预读,但是用户仍需要进行子页的翻页操作才能从客户端的缓存中获得这些子页。
发明内容
本发明的发明目的在于提供一种浏览器预读方法,以解决现有的预读技术未能精确预读的技术问题。
为了实现本发明的第一个方面目的,采用的技术方案如下:
一种浏览器预读方法,其特征在于,所述方法包括:
浏览器客户端向目标服务器提交第一网页访问请求并上传第一网页的个人浏览记录特征;
中转服务器根据接收到的第一网页的个人浏览记录特征以及保存的对第一网页的至少一个群体浏览记录特征形成预读策略;
中转服务器根据所述预读策略去目标服务器获取网页并发送给浏览器客户端缓存。
作为一种优选方案,所述中转服务器根据接收到的第一网页的个人浏览记录特征以及保存的对第一网页的至少一个群体浏览记录特征形成预读策略包括:对第一网页上的一个或多个页面元素按照预读偏好分值排序,中转服务器根据预读偏好分值为前K名的页面元素所包括的链接地址获取链接内容作为预读内容,其中,K为大于或等于1的自然数。
作为进一步的优选方案,所述预读偏好分值按照如下偏好规则计算:
页面元素的预读偏好分值=个人偏好系数×个人浏览记录特征权重+群体偏好系数×群体浏览记录特征权重;
每个页面元素根据个人浏览记录特征设定个人偏好系数,每个页面元素根据群体浏览记录特征设定群体偏好系数,预先设定与个人浏览记录特征对应的个人浏览记录特征权重以及与一个或多个群体浏览记录特征对应的群体浏览记录特征权重。
作为进一步的优选方案,所述个人浏览记录特征为第一网页上的一个或多个包括有链接地址的页面元素的个人访问频率,所述群体浏览记录特征为中转服务器对第一网页上的一个或多个页面元素的群体访问频率。
作为再进一步的优选方案,所述个人浏览记录特征权重与一个或多个群体浏览记录特征权重的总和为1。
作为进一步的优选方案,所述个人偏好系数是页面元素的个人访问频率,所述群体偏好系数是页面元素的群体访问频率。
作为再进一步的优选方案,所述预读策略还包括:个人偏好系数根据第一网页的页面元素的个人访问频率顺序设定相应的顺序,群体偏好系数根据第一网页的页面元素的群体访问频率顺序设定相应的顺序。
作为再进一步的优选方案,所述预读策略还包括:对第一网页上的一个或多个页面元素按照预读偏好分值排序,中转服务器对预读偏好分值为前K名的页面元素所包括的链接地址获取的链接内容进行重排合并,重排合并后的内容为预读内容。
作为进一步的优选方案,所述方法包括第一群体浏览记录特征,第一群体浏览记录特征对第一网页上的一个或多个页面元素的群体访问频率通过以下方式确定:
中转服务器对第一网页进行统计分析,如果包括链接地址的页面元素中包括关键点,则查询历史访问关键点的频率作为包括关键点的页面元素的群体访问频率,所述关键点通过历史统计确定。
作为进一步的优选方案,浏览器客户端还上传包括与用户身份相关联的一个或多个个人身份特征,所述中转服务器还保存与用户群体身份相关联的一个或多个群体身份特征,所述方法还包括第二群体浏览记录特征,第二群体浏览记录特征对第一网页上的一个或多个页面元素的群体访问频率通过以下方式确定:
中转服务器对第一网页进行统计分析,如果包括链接地址的页面元素中包括关键点,查询与个人身份特征所对应的群体身份特征的历史访问关键点的频率作为包括关键点的页面元素的群体访问频率,所述关键点通过历史统计确定。
作为再进一步的优选方案,所述第一网页上的一个或多个页面元素的群体访问频率通过以下方式确定:
中转服务器对第一网页进行统计分析,如果包括链接地址的页面元素中包括关键点,则查询历史访问第一网页时访问关键点的频率作为包括关键点的页面元素的群体访问频率。
作为更进一步的优选方案,所述关键点为关键字或关键图。
作为进一步的优选方案,所述方法包括第三群体浏览记录特征,第三群体浏览记录特征对第一网页上的一个或多个页面元素的群体访问频率通过以下方式确定:
中转服务器对第一网页进行统计分析,查询对第一网页访问后跳转到第二网页的历史频率;
根据第一网页上的页面元素所包括的链接地址,确定与第二网页关联的页面元素,得到页面元素的群体访问频率。
作为进一步的优选方案,浏览器客户端还上传包括与用户身份相关联的一个或多个个人身份特征,所述中转服务器还保存与用户群体身份相关联的一个或多个群体身份特征,所述方法还包括第四群体浏览记录特征,第四群体浏览记录特征对第一网页上的一个或多个页面元素的群体访问频率通过以下方式确定:
中转服务器对第一网页进行统计分析,查询与个人身份特征所对应的群体身份特征对第一网页访问后跳转到第三网页的历史频率;
根据第一网页上的页面元素所包括的链接地址,确定与第三网页关联的页面元素,得到页面元素的群体访问频率。
作为一种优选方案,所述浏览器客户端为移动通讯设备终端。
为了实现本发明的第二个发明目的,采用的技术方案如下:
一种浏览器预读系统,所述系统包括:
设置在浏览器客户端用于向目标服务器提交第一网页访问请求并上传第一网页的个人浏览记录特征的个人浏览记录特征上传模块;
设置在中转服务器根据接收到的第一网页的个人浏览记录特征以及保存的对第一网页的至少一个群体浏览记录特征形成预读策略的预读策略形成模块;
设置在中转服务器根据所述预读策略去目标服务器获取网页并发送给浏览器客户端缓存的预读文件读取模块。
作为一种优选方案,所述预读策略形成模块根据接收到的第一网页的个人浏览记录特征以及保存的对第一网页的至少一个群体浏览记录特征形成预读策略包括:对第一网页上的一个或多个页面元素按照预读偏好分值排序,中转服务器根据预读偏好分值为前K名的页面元素所包括的链接地址获取链接内容作为预读内容,其中,K为大于或等于1的自然数。
作为进一步的优选方案,所述预读偏好分值按照如下偏好规则计算:
页面元素的预读偏好分值=个人偏好系数×个人浏览记录特征权重+群体偏好系数×群体浏览记录特征权重;
每个页面元素根据个人浏览记录特征设定个人偏好系数,每个页面元素根据群体浏览记录特征设定群体偏好系数,预先设定与个人浏览记录特征对应的个人浏览记录特征权重以及与一个或多个群体浏览记录特征对应的群体浏览记录特征权重。
作为进一步的优选方案,所述个人浏览记录特征为第一网页上的一个或多个包括有链接地址的页面元素的个人访问频率,所述群体浏览记录特征为中转服务器对第一网页上的一个或多个页面元素的群体访问频率。
作为再进一步的优选方案,所述个人浏览记录特征权重与一个或多个群体浏览记录特征权重的总和为1。
作为再进一步的优选方案,所述个人偏好系数是页面元素的个人访问频率,所述群体偏好系数是页面元素的群体访问频率。
作为再进一步的优选方案,所述预读策略还包括:个人偏好系数根据第一网页的页面元素的个人访问频率顺序设定相应的顺序,群体偏好系数根据第一网页的页面元素的群体访问频率顺序设定相应的顺序。
作为再进一步的优选方案,所述预读策略还包括:对第一网页上的一个或多个页面元素按照预读偏好分值排序,中转服务器对预读偏好分值为前K名的页面元素所包括的链接地址获取的链接内容进行重排合并,重排合并后的内容为预读内容。
作为进一步的优选方案,所述预读策略形成模块还包括用于记录第一群体浏览记录特征的第一群体浏览记录特征模块,第一群体浏览记录特征对第一网页上的一个或多个页面元素的群体访问频率通过以下方式确定:
第一群体浏览记录特征模块对第一网页进行统计分析,如果包括链接地址的页面元素中包括关键点,则查询历史访问关键点的频率作为包括关键点的页面元素的群体访问频率,所述关键点通过历史统计确定。
作为进一步的优选方案,所述系统还包括设置在浏览器客户端用于上传包括与用户身份相关联的一个或多个个人身份特征的个人身份特征上传模块,设置在中转服务器上用于保存与用户群体身份相关联的一个或多个群体身份特征的群体身份特征保存模块,所述预读策略形成模块还包括用于记录第二群体浏览记录特征的第二群体浏览记录特征模块,第二群体浏览记录特征对第一网页上的一个或多个页面元素的群体访问频率通过以下方式确定:
第二群体浏览记录特征模块对第一网页进行统计分析,如果包括链接地址的页面元素中包括关键点,查询与个人身份特征所对应的群体身份特征的历史访问关键点的频率作为包括关键点的页面元素的群体访问频率,所述关键点通过历史统计确定。
作为再进一步的优选方案,第二群体浏览记录特征模块对第一网页上的一个或多个页面元素的群体访问频率通过以下方式确定:
第二群体浏览记录特征模块对第一网页进行统计分析,如果包括链接地址的页面元素中包括关键点,则查询历史访问第一网页时访问关键点的频率作为包括关键点的页面元素的群体访问频率。
作为再进一步的优选方案,所述关键点为关键字或关键图。
作为进一步的优选方案,预读策略形成模块包括用于记录第三群体浏览记录特征的第三群体浏览记录特征模块,第三群体浏览记录特征对第一网页上的一个或多个页面元素的群体访问频率通过以下方式确定:
第三群体浏览记录特征模块对第一网页进行统计分析,查询对第一网页访问后跳转到第二网页的历史频率;
根据第一网页上的页面元素所包括的链接地址,确定与第二网页关联的页面元素,得到页面元素的群体访问频率。
作为进一步的优选方案,系统还包括设置在浏览器客户端用于上传包括与用户身份相关联的一个或多个个人身份特征的个人身份特征上传模块,设置在中转服务器上用于保存与用户群体身份相关联的一个或多个群体身份特征的群体身份特征保存模块,所述预读策略形成模块还包括第四群体浏览记录特征,第四群体浏览记录特征对第一网页上的一个或多个页面元素的群体访问频率通过以下方式确定:
第四群体浏览记录特征模块对第一网页进行统计分析,查询与个人身份特征所对应的群体身份特征对第一网页访问后跳转到第三网页的历史频率;
根据第一网页上的页面元素所包括的链接地址,确定与第三网页关联的页面元素,得到页面元素的群体访问频率。
作为一种优选方案,所述浏览器客户端为移动通讯设备终端。
本发明通过对个人用户的访问习惯和偏好结合海量用户的访问历史进行网页预读,并针对不同的网页页面元素进行权重和 偏好系数的 计算分析得到用户最有可能点击的页面,使得预读更加精确,预读成功率大幅提高,利用空闲时下载页面,用户基本不用等待,可以很好的节省用户时间。
本发明应用在各类网页的预读上,很大程度提高了移动浏览器用户的使用感受。
附图说明
图1 为本发明实施例的系统框架图;
图2为本发明第一实施例的流程图;
图3为本发明第二实施例的流程图;
图4为本发明实施例的系统结构图;
图5为关键图的一种例子。
具体实施方式
下面结合附图和具体实施例对本发明做进一步详细的说明。
如图1所示为本发明实施例的框架图,包括手机浏览器客户端1通过中转服务器2访问目标服务器3,中转服务器2还与 海量用户历史访问行为统计服务器4连接。
具体方法包括:
步骤S110,浏览器客户端向目标服务器提交第一网页访问请求并上传第一网页的个人浏览记录特征;
步骤S120,中转服务器根据接收到的第一网页的个人浏览记录特征以及保存的对第一网页的至少一个群体浏览记录特征形成预读策略;
步骤S130,中转服务器根据所述预读策略去目标服务器获取网页并发送给浏览器客户端缓存。
其中,步骤S120中的预读策略是这样实现的:
首先,我们分析一下网页中的网页元素,网页预读之后能触发用户点击行为的页面元素包括三种:URL,带URL的文字,带URL的图片。
中转服务器的作用是根据个人用户访问网页的习惯结合海量用户历史访问行为数据统计结果进行计算,给出最有可能被该用户点击的页面元素,推送给浏览器客户端。
因此步骤S120中的个人浏览记录特征包括第一网页上的一个或多个包括有链接地址的页面元素的个人访问频率,所述群体浏览记录特征为中转服务器对第一网页上的一个或多个页面元素的群体访问频率,而所述预读策略根据个人访问频率和群体访问频率确定预读内容,如果第一网页上没有包括链接地址的页面元素则返回无法读取预读内容并退出。
第一实施例给出如下权重算法:
预读策略包括:对第一网页上的一个或多个页面元素按照预读偏好分值排序,中转服务器根据预读偏好分值为前K名的页面元素所包括的链接地址获取链接内容作为预读内容,K为大于或等于1的自然数,每个页面元素根据个人访问频率设定个人偏好系数,每个页面元素根据群体访问频率设定群体偏好系数,预先设定与个人浏览记录特征对应的个人浏览记录特征权重以及与一个或多个群体浏览记录特征对应的群体浏览记录特征权重,所述预读偏好分值按照如下偏好规则计算:
页面元素的预读偏好分值=个人偏好系数×个人浏览记录特征权重+群体偏好系数×群体浏览记录特征权重;
如果第一网页中没有包括链接地址的页面元素,则返回无法读取预读内容信息。
其中个人偏好系数是基于个人用户习惯,在访问该网页之后最常访问的页面元素是哪几个,例如3个,分别是最常访问的URL,带URL的文字及带URL的图片,偏好系数根据用户习惯进行设置,例如根据用户的习惯,他访问网页点击图片的次数最多达到400次,点击文字和URL的次数少一点是300次,则页面元素1(对应带URL的文字偏好系数为0.3),页面元素2(对应URL偏好系数为0.3),页面元素3(对应带URL的图片偏好系数为0.4)。该个人浏览记录特征权重为0.7,对于其他的包括链接URL的页面元素,其偏好系数可以设置为0。
群体浏览记录特征可以有多个:
在第一个实施例中,群体浏览记录特征1为中转服务器对第一网页进行统计分析,如果包括链接地址的页面元素中包括关键词,则查询海量用户历史访问行为统计服务器,获得历史访问该关键词的频率作为包括关键词的页面元素的群体访问频率,其中关键词是由海量用户历史访问行为统计服务器通过历史统计确定。
例如3个页面元素,页面元素1包括关键词'下一页'、页面元素2包括'下一章'而页面元素4包括'新闻',则设定页面元素1的点击次数是60000次(偏好系数为0.6),页面元素2的点击次数是30000次(偏好系数为0.3),页面元素4的点击次数是10000次(偏好系数为0.1)。该群体浏览记录特征权重1为0.2。
群体浏览记录特征2基于关键图查询海量用户历史访问行为统计服务器,所述预读关键图是指基于站点的在某一网页中指向性最明确的一个超链接,使用图片URL来标示,在某一网站的网页中,有可能预读关键图是一个手指,也有可能是一个箭头,它们通常是以图片的形式展现,如图5所示,通过海量用户历史访问行为统计服务器从网页中确定最常访问预读关键图即为页面元素3的的点击次数是80000次(其偏好系数为0.8)、页面元素5的点击次数是20000次(其偏好系数为0.2),该群体浏览记录特征权重2为0.1。
那么页面元素的预读可能性即页面元素的预读偏好分值f=个人偏好系数×个人浏览记录特征权重+群体偏好系数×群体浏览记录特征权重
即f1=0.3×0.7+0.6×0.2=0.33;
f2=0.3 ×0.7+0.3×0.2=0.27;
f3=0.4 ×0.7+0.8×0.1=0.36;
f4=0.1 ×0.2=0.02;
f5=0.2 ×0.1=0.02。
对上述5个页面元素根据预读偏好分值f进行排序,则得到f3>f1> f2>f4=f5。
如果该页面是没有被访问过的则该网页上的所有页面元素的个人偏好系数均为0,则上述的计算为:
即f1=0×0.7+0.6×0.2=0.12;
f2=0 ×0.7+0.3×0.2=0.06;
f3=0 ×0.7+0.8×0.1=0.08;
f4=0.1 ×0.2=0.02;
f5=0.2 ×0.1=0.02。
对上述5个页面元素根据预读偏好分值f进行排序,则得到f1>f3>f2>f4=f5。
需要说明的是,k根据经验设置,可以是3到几十之间的任意数,偏好系数n在一个个人浏览记录特征或者一个群体浏览记录特征中的总和是1,个人浏览记录特征与一个或多个群体浏览记录特征的权重总和也是1,具体如何分配实际操作中需要根据经验值设置。
在上述的预读策略中被考量的群体浏览记录特征如表1所示,具体可以包括:
1 、基于全局的关键词,即查询海量用户历史访问行为统计服务器,获得历史访问该关键词的频率作为群体访问频率,其中关键词是由海量用户历史访问行为统计服务器通过历史统计确定;
2 、基于域名的关键词,即查询海量用户历史访问行为统计服务器,获取在同一域名下历史访问该关键词的频率作为群体访问频率;
3 、基于域名的关键图,即查询海量用户历史访问行为统计服务器,获取在同一域名下历史访问该关键图的频率作为群体访问频率;
4 、基于某网页的用户历史操作习惯的链接文字,即查询海量用户历史访问行为统计服务器,获取同一用户历史访问同一链接文字的频率作为群体访问频率;
5 、基于某网页的用户历史操作习惯的链接图片,即查询海量用户历史访问行为统计服务器,获取同一用户历史访问同一链接文字的频率作为群体访问频率。
基于全局的关键词
基于域名的关键词
基于域名的关键图
基于某网页的用户历史操作习惯的链接文字
基于某网页的用户历史操作习惯的链接图片
表1群体浏览记录特征表
上述的预读策略也可以直接采用页面元素的访问频率,同样采取上述的例子:
由于直接采用访问频率计算,因此群体浏览记录特征权重需要设置得比较小,个人浏览记录特征权重设置为0.997,而第一群体浏览记录特征权重设置为0.002,而第二群体浏览记录特征权重为0.001,则:
即f1=300×0.997+60000×0.002= 419.1 ;
f2=300 ×0.997+30000×0.002= 359.1 ;
f3=400 ×0.997+80000×0.001= 478.8 ;
f4=10000 ×0.002=20;
f5=20000 ×0.001=20。
对上述5个页面元素根据预读偏好分值f进行排序,则得到f3>f1>f2>f4=f5。
本发明的第二个实施例的具体步骤如下:
S310 浏览器客户端通过中转服务器向目标服务器提交网页访问请求,同时上传客户使用浏览器客户端访问第一网页的个人浏览记录特征及与用户身份相关联的一个或多个个人身份特征;
S320 中转服务器根据浏览器客户端上传的客户使用浏览器客户端访问第一网页的个人浏览记录特征、个人身份特征和中转服务器保存的对第一网页的一个或多个群体浏览记录特征及群体身份特征,根据预读策略确定预读内容;
S330 返回预读内容给浏览器客户端;
S340 浏览器客户端把预读内容存入缓存。
其中,个人身份特征与客户身份相关联的身份特征,如:性别,工作类型,学历等相关个人身份特征;
群体身份特征由海量用户历史访问行为统计服务器统计,为所有客户身份相关联的身份特征,如:性别,工作类型,学历等等。
其预读策略与第一实施例基本相同,所不同处,仅在于偏好系数及群体浏览记录特征权重基于群体身份特征进行分类。举例如下:
例如根据用户的习惯,他访问网页点击图片的次数最多达到400次,点击文字和URL的次数少一点是300次,则页面元素1(对应带URL的文字偏好系数为0.3),页面元素2(对应URL偏好系数为0.3),页面元素3(对应带URL的图片偏好系数为0.4)。该个人浏览记录特征权重为0.7,对于其他的包括链接URL的页面元素,其偏好系数可以设置为0。
同时,该用户的性别为男性,工作类型为程序员,学历为硕士,则:
群体浏览记录特征可以有多个:
中转服务器对第一网页进行统计分析,如果包括链接地址的页面元素中包括关键词,则查询海量用户历史访问行为统计服务器,获得历史访问该关键词的频率作为包括关键词的页面元素的群体访问频率,其中关键词是由海量用户历史访问行为统计服务器通过历史统计确定。
例如3个页面元素,页面元素1包括关键词'下一页'、页面元素2包括'下一章'而页面元素4包括'新闻'。
对应性别为男性的页面元素1偏好系数为0.6,页面元素2的偏好系数为0.3,页面元素4的偏好系数为0.1。该群体浏览记录特征权重1为0.25。
对应工作类型为程序员的页面元素1偏好系数为0.3,页面元素2的偏好系数为0.5,页面元素4的偏好系数为0.2。该群体浏览记录特征权重2为0.04。
对应学历为硕士的页面元素1偏好系数为0.8,页面元素2的偏好系数为0.1,页面元素4的偏好系数为0.1。该群体浏览记录特征权重3为0.01。
即f1=0.3×0.7+0.6×0.25+0.3×0.04= 0.372 ;
f2=0.3 ×0.7+0.3×0.25+0.5×0.04= 0.305 ;
f3=0.4 ×0.7=0.28;
f4=0.1 ×0.25+0.2×0.04+0.1×0.01= 0.034 ;
对上述5个页面元素根据预读偏好分值f进行排序,则得到f1>f2> f3>f4。
中转服务器获取排名靠前k(k一般为小于等于四的自然数)的子页面,并可以对获取到的具有相似URL的子页面进行合并重排处理后再发送到移动终端的缓存。当用户点击当前浏览页面上的上述关键字或最常访问页面链接的时候,移动终端直接调取缓存中的预读页面进行显示。
在有些网页的预读中,例如网页小说连载,可以采用如下的策略,其中,预读关键字的优先级从左到右,依次降低:
~ 下页~[下页]~下一页~[下一页]~下页|~>>下页~>>下页|~下一张~[下一张]~[->]~>~[>]~[->>]~>>~[>>]~下章~[下章]~下一章~[下一章]~下节~[下节]~',中转服务器判断网页上关键字的优先级,选取优先级最高的关键字,将该关键字链接指向的网页的子页及该子页的同一关键字链接指向的子页等都保存下来,例如一篇新闻的首页只有新闻的摘要内容,该页面的最高优先级关键字是'下页',该新闻的正文一共有5页,且每一页的下方都有关键字'下页',则中转服务器会将该新闻的第2页到第5都作为首页的子页进行预读, 但组合重排的子页数一般不超过四层,优选地对2-3层的子页进行合并重排。
如图4所示为本发明一个实施例的结构图。
浏览器预读系统400包括手机浏览器客户端410与中转服务器420连接,其中手机浏览器客户端410设置有用于保存客户的个人浏览记录特征的个人浏览记录特征数据库411及用于向目标服务器提交第一网页访问请求并上传第一网页的个人浏览记录特征的个人浏览记录特征上传模块412;
中转服务器420上设置用于保存的多个用户的一个或多个群体浏览记录特征的历史用户访问行为统计模块421,如图1所示,在本实施例中该历史用户访问行为统计模块421采用 海量用户历史访问行为统计服务器4实现 ;
中转服务器420上还设置有根据接收到的第一网页的个人浏览记录特征以及保存的对第一网页的至少一个群体浏览记录特征形成预读策略的预读策略形成模块422及根据预读策略去目标服务器获取网页并发送给浏览器客户端缓存的预读文件读取模块423;
浏览器客户端410还包括有用于把预读文件读取模块返回的预读内容存入缓存的预读缓存模块413。

Claims (30)

  1. 一种浏览器预读方法,其特征在于,所述方法包括:
    浏览器客户端向目标服务器提交第一网页访问请求并上传第一网页的个人浏览记录特征;
    中转服务器根据接收到的第一网页的个人浏览记录特征以及保存的对第一网页的至少一个群体浏览记录特征形成预读策略;
    中转服务器根据所述预读策略去目标服务器获取网页并发送给浏览器客户端缓存。
  2. 根据权利要求1所述的预读方法,其特征在于,所述中转服务器根据接收到的第一网页的个人浏览记录特征以及保存的对第一网页的至少一个群体浏览记录特征形成预读策略包括:对第一网页上的一个或多个页面元素按照预读偏好分值排序,中转服务器根据预读偏好分值为前K名的页面元素所包括的链接地址获取链接内容作为预读内容,其中K为大于或等于1的自然数。
  3. 根据权利要求2所述的预读方法,其特征在于,所述预读偏好分值按照如下偏好规则计算:
    页面元素的预读偏好分值=个人偏好系数×个人浏览记录特征权重+群体偏好系数×群体浏览记录特征权重;
    每个页面元素根据个人浏览记录特征设定个人偏好系数,每个页面元素根据群体浏览记录特征设定群体偏好系数,预先设定与个人浏览记录特征对应的个人浏览记录特征权重以及与一个或多个群体浏览记录特征对应的群体浏览记录特征权重。
  4. 根据权利要求2所述的预读方法,其特征在于,所述个人浏览记录特征为第一网页上的一个或多个包括有链接地址的页面元素的个人访问频率,所述群体浏览记录特征为中转服务器对第一网页上的一个或多个页面元素的群体访问频率。
  5. 根据权利要求4所述的预读方法,其特征在于,所述个人浏览记录特征权重与一个或多个群体浏览记录特征权重的总和为1。
  6. 根据权利要求2所述的预读方法,其特征在于,所述个人偏好系数是页面元素的个人访问频率,所述群体偏好系数是页面元素的群体访问频率。
  7. 根据权利要求4所述的预读方法,其特征在于,所述预读策略还包括:个人偏好系数根据第一网页的页面元素的个人访问频率顺序设定相应的顺序,群体偏好系数根据第一网页的页面元素的群体访问频率顺序设定相应的顺序。
  8. 根据权利要求4所述的预读方法,其特征在于,所述预读策略还包括:对第一网页上的一个或多个页面元素按照预读偏好分值排序,中转服务器对预读偏好分值为前K名的页面元素所包括的链接地址获取的链接内容进行重排合并,重排合并后的内容为预读内容。
  9. 根据权利要求2所述的预读方法,其特征在于,所述方法包括第一群体浏览记录特征,第一群体浏览记录特征对第一网页上的一个或多个页面元素的群体访问频率通过以下方式确定:
    中转服务器对第一网页进行统计分析,如果包括链接地址的页面元素中包括关键点,则查询历史访问关键点的频率作为包括关键点的页面元素的群体访问频率,所述关键点通过历史统计确定。
  10. 根据权利要求2所述的预读方法,其特征在于,浏览器客户端还上传包括与用户身份相关联的一个或多个个人身份特征,所述中转服务器还保存与用户群体身份相关联的一个或多个群体身份特征,所述方法还包括第二群体浏览记录特征,第二群体浏览记录特征对第一网页上的一个或多个页面元素的群体访问频率通过以下方式确定:
    中转服务器对第一网页进行统计分析,如果包括链接地址的页面元素中包括关键点,查询与个人身份特征所对应的群体身份特征的历史访问关键点的频率作为包括关键点的页面元素的群体访问频率,所述关键点通过历史统计确定。
  11. 根据权利要求9或10所述的预读方法,其特征在于,所述第一网页上的一个或多个页面元素的群体访问频率通过以下方式确定:
    中转服务器对第一网页进行统计分析,如果包括链接地址的页面元素中包括关键点,则查询历史访问第一网页时访问关键点的频率作为包括关键点的页面元素的群体访问频率。
  12. 根据权利要求11所述的预读方法,其特征在于,所述关键点为关键字或关键图。
  13. 根据权利要求2所述的预读方法,其特征在于,所述方法包括第三群体浏览记录特征,第三群体浏览记录特征对第一网页上的一个或多个页面元素的群体访问频率通过以下方式确定:
    中转服务器对第一网页进行统计分析,查询对第一网页访问后跳转到第二网页的历史频率;
    根据第一网页上的页面元素所包括的链接地址,确定与第二网页关联的页面元素,得到页面元素的群体访问频率。
  14. 根据权利要求2所述的预读方法,其特征在于,浏览器客户端还上传包括与用户身份相关联的一个或多个个人身份特征,所述中转服务器还保存与用户群体身份相关联的一个或多个群体身份特征,所述方法还包括第四群体浏览记录特征,第四群体浏览记录特征对第一网页上的一个或多个页面元素的群体访问频率通过以下方式确定:
    中转服务器对第一网页进行统计分析,查询与个人身份特征所对应的群体身份特征对第一网页访问后跳转到第三网页的历史频率;
    根据第一网页上的页面元素所包括的链接地址,确定与第三网页关联的页面元素,得到页面元素的群体访问频率。
  15. 根据权利要求1所述的预读方法,其特征在于,所述浏览器客户端为移动通讯设备终端。
  16. 一种浏览器预读系统,其特征在于,所述系统包括:
    设置在浏览器客户端用于向目标服务器提交第一网页访问请求并上传第一网页的个人浏览记录特征的个人浏览记录特征上传模块;
    设置在中转服务器根据接收到的第一网页的个人浏览记录特征以及保存的对第一网页的至少一个群体浏览记录特征形成预读策略的预读策略形成模块;
    设置在中转服务器根据所述预读策略去目标服务器获取网页并发送给浏览器客户端缓存的预读文件读取模块。
  17. 根据权利要求16所述的预读系统,其特征在于,所述预读策略形成模块根据接收到的第一网页的个人浏览记录特征以及保存的对第一网页的至少一个群体浏览记录特征形成预读策略包括:对第一网页上的一个或多个页面元素按照预读偏好分值排序,中转服务器根据预读偏好分值为前K名的页面元素所包括的链接地址获取链接内容作为预读内容,其中K为大于或等于1的自然数。
  18. 根据权利要求17所述的预读系统,其特征在于,所述预读偏好分值按照如下偏好规则计算:
    页面元素的预读偏好分值=个人偏好系数×个人浏览记录特征权重+群体偏好系数×群体浏览记录特征权重;
    每个页面元素根据个人浏览记录特征设定个人偏好系数,每个页面元素根据群体浏览记录特征设定群体偏好系数,预先设定与个人浏览记录特征对应的个人浏览记录特征权重以及与一个或多个群体浏览记录特征对应的群体浏览记录特征权重。
  19. 根据权利要求17所述的预读系统,其特征在于,所述个人浏览记录特征为第一网页上的一个或多个包括有链接地址的页面元素的个人访问频率,所述群体浏览记录特征为中转服务器对第一网页上的一个或多个页面元素的群体访问频率。
  20. 根据权利要求19所述的预读系统,其特征在于,所述个人浏览记录特征权重与一个或多个群体浏览记录特征权重的总和为1。
  21. 根据权利要求17所述的预读系统,其特征在于,所述个人偏好系数是页面元素的个人访问频率,所述群体偏好系数是页面元素的群体访问频率。
  22. 根据权利要求19所述的预读系统,其特征在于,所述预读策略还包括:个人偏好系数根据第一网页的页面元素的个人访问频率顺序设定相应的顺序,群体偏好系数根据第一网页的页面元素的群体访问频率顺序设定相应的顺序。
  23. 根据权利要求19所述的预读系统,其特征在于,所述预读策略还包括:对第一网页上的一个或多个页面元素按照预读偏好分值排序,中转服务器对预读偏好分值为前K名的页面元素所包括的链接地址获取的链接内容进行重排合并,重排合并后的内容为预读内容。
  24. 根据权利要求17所述的预读系统,其特征在于,所述预读策略形成模块还包括用于记录第一群体浏览记录特征的第一群体浏览记录特征模块,第一群体浏览记录特征对第一网页上的一个或多个页面元素的群体访问频率通过以下方式确定:
    第一群体浏览记录特征模块对第一网页进行统计分析,如果包括链接地址的页面元素中包括关键点,则查询历史访问关键点的频率作为包括关键点的页面元素的群体访问频率,所述关键点通过历史统计确定。
  25. 根据权利要求17所述的预读系统,其特征在于,所述系统还包括设置在浏览器客户端用于上传包括与用户身份相关联的一个或多个个人身份特征的个人身份特征上传模块,设置在中转服务器上用于保存与用户群体身份相关联的一个或多个群体身份特征的群体身份特征保存模块,所述预读策略形成模块还包括用于记录第二群体浏览记录特征的第二群体浏览记录特征模块,第二群体浏览记录特征对第一网页上的一个或多个页面元素的群体访问频率通过以下方式确定:
    第二群体浏览记录特征模块对第一网页进行统计分析,如果包括链接地址的页面元素中包括关键点,查询与个人身份特征所对应的群体身份特征的历史访问关键点的频率作为包括关键点的页面元素的群体访问频率,所述关键点通过历史统计确定。
  26. 根据权利要求24或25所述的预读系统,其特征在于,第二群体浏览记录特征模块对第一网页上的一个或多个页面元素的群体访问频率通过以下方式确定:
    第二群体浏览记录特征模块对第一网页进行统计分析,如果包括链接地址的页面元素中包括关键点,则查询历史访问第一网页时访问关键点的频率作为包括关键点的页面元素的群体访问频率。
  27. 根据权利要求26所述的预读系统,其特征在于,所述关键点为关键字或关键图。
  28. 根据权利要求17所述的预读系统,其特征在于,预读策略形成模块包括用于记录第三群体浏览记录特征的第三群体浏览记录特征模块,第三群体浏览记录特征对第一网页上的一个或多个页面元素的群体访问频率通过以下方式确定:
    第三群体浏览记录特征模块对第一网页进行统计分析,查询对第一网页访问后跳转到第二网页的历史频率;
    根据第一网页上的页面元素所包括的链接地址,确定与第二网页关联的页面元素,得到页面元素的群体访问频率。
  29. 根据权利要求17所述的预读系统,其特征在于,系统还包括设置在浏览器客户端用于上传包括与用户身份相关联的一个或多个个人身份特征的个人身份特征上传模块,设置在中转服务器上用于保存与用户群体身份相关联的一个或多个群体身份特征的群体身份特征保存模块,所述预读策略形成模块还包括第四群体浏览记录特征,第四群体浏览记录特征对第一网页上的一个或多个页面元素的群体访问频率通过以下方式确定:
    第四群体浏览记录特征模块对第一网页进行统计分析,查询与个人身份特征所对应的群体身份特征对第一网页访问后跳转到第三网页的历史频率;
    根据第一网页上的页面元素所包括的链接地址,确定与第三网页关联的页面元素,得到页面元素的群体访问频率。
  30. 根据权利要求16所述的预读系统,其特征在于,所述浏览器客户端为移动通讯设备终端。
PCT/CN2011/071915 2011-03-17 2011-03-17 一种浏览器预读方法及其系统 WO2012122718A1 (zh)

Priority Applications (4)

Application Number Priority Date Filing Date Title
US13/381,342 US8527585B2 (en) 2011-03-17 2011-03-17 Prereading method and system for web browser
PCT/CN2011/071915 WO2012122718A1 (zh) 2011-03-17 2011-03-17 一种浏览器预读方法及其系统
CN201180003569.8A CN102804735B (zh) 2011-03-17 2011-03-17 一种浏览器预读方法及其系统
US13/965,069 US9094478B2 (en) 2011-03-17 2013-08-12 Prereading method and system for web browser

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2011/071915 WO2012122718A1 (zh) 2011-03-17 2011-03-17 一种浏览器预读方法及其系统

Related Child Applications (2)

Application Number Title Priority Date Filing Date
US13/381,342 A-371-Of-International US8527585B2 (en) 2011-03-17 2011-03-17 Prereading method and system for web browser
US13/965,069 Continuation US9094478B2 (en) 2011-03-17 2013-08-12 Prereading method and system for web browser

Publications (1)

Publication Number Publication Date
WO2012122718A1 true WO2012122718A1 (zh) 2012-09-20

Family

ID=46829348

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2011/071915 WO2012122718A1 (zh) 2011-03-17 2011-03-17 一种浏览器预读方法及其系统

Country Status (3)

Country Link
US (2) US8527585B2 (zh)
CN (1) CN102804735B (zh)
WO (1) WO2012122718A1 (zh)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108121748A (zh) * 2016-11-30 2018-06-05 腾讯科技(北京)有限公司 提供媒体推送的方法和装置
CN110298004A (zh) * 2019-06-06 2019-10-01 阿里巴巴集团控股有限公司 一种目标对象的缓存管理方法、系统、装置及电子设备
CN110855778A (zh) * 2019-11-13 2020-02-28 首都经济贸易大学 一种基于网络嗅探器的数据加载方法、装置和系统
CN111556116A (zh) * 2020-04-22 2020-08-18 深圳市元征科技股份有限公司 一种培训媒体推送方法、培训媒体的显示方法及相关装置

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8661328B2 (en) * 2011-12-15 2014-02-25 Sap Portals Israel Ltd Managing web content on a mobile communication device
US9311294B2 (en) * 2013-03-15 2016-04-12 International Business Machines Corporation Enhanced answers in DeepQA system according to user preferences
WO2014155663A1 (ja) * 2013-03-29 2014-10-02 楽天株式会社 データ提供装置、データ提供方法及びデータ提供プログラム
CN103955507B (zh) * 2014-04-25 2017-08-11 中国联合网络通信集团有限公司 基于用户感知的预警方法及装置
CN105589914B (zh) * 2015-07-20 2018-07-06 广州市动景计算机科技有限公司 一种网页页面的预读取方法、装置及智能终端设备
CN107870912A (zh) * 2016-09-22 2018-04-03 广州市动景计算机科技有限公司 文章质量评分方法、设备、客户端、服务器及可编程设备
CN108228640A (zh) * 2016-12-21 2018-06-29 天脉聚源(北京)科技有限公司 一种互联网商城网页加载的方法和系统
CN108345481B (zh) * 2017-01-22 2023-04-18 腾讯科技(深圳)有限公司 一种页面显示方法、装置、客户端及服务器
US11803993B2 (en) * 2017-02-27 2023-10-31 Disney Enterprises, Inc. Multiplane animation system
CN109918150A (zh) * 2019-03-11 2019-06-21 北京城市网邻信息技术有限公司 一种基于用户行为的个性化页面预加载方法及系统
CN110276042A (zh) * 2019-06-30 2019-09-24 浪潮卓数大数据产业发展有限公司 一种基于机器学习的智能Web代理缓存系统及方法
CN110516180B (zh) * 2019-08-21 2023-04-07 北京拉勾网络技术有限公司 一种内容显示方法、装置和计算设备
CN110781426B (zh) * 2019-10-30 2022-04-19 河南广播电视大学 一种空间动态分配的浏览器及其分配方法
WO2021184174A1 (zh) * 2020-03-17 2021-09-23 深圳市欢太科技有限公司 信息推送方法、装置、电子设备及存储介质
CN112104714A (zh) * 2020-08-31 2020-12-18 广州携龙商务服务有限公司 一种基于用户互动元素权重精准推送方法
CN116955873B (zh) * 2023-09-18 2023-12-08 北京佰信蓝图科技股份有限公司 一种海量动态面状矢量数据在浏览器上快速显示方法

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020078165A1 (en) * 2000-12-14 2002-06-20 International Business Machines Corporation System and method for prefetching portions of a web page based on learned preferences
US20020103778A1 (en) * 2000-12-06 2002-08-01 Epicrealm Inc. Method and system for adaptive prefetching
CN101189608A (zh) * 2005-03-31 2008-05-28 谷歌公司 用于分析用户的Web历史的系统和方法
US20080208789A1 (en) * 2007-02-23 2008-08-28 Microsoft Corporation HTTP acceleration by prediction and pre-fetching

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6721780B1 (en) * 1999-11-09 2004-04-13 Fireclick, Inc. Predictive pre-download of network objects
WO2008040004A2 (en) * 2006-09-28 2008-04-03 Sfgt Inc. Apparatuses, methods, and systems for code triggered information querying and serving
CN101777068B (zh) * 2009-12-31 2015-07-22 优视科技有限公司 一种用于移动通讯设备终端的网页页面预读及整合浏览系统及其应用方法

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020103778A1 (en) * 2000-12-06 2002-08-01 Epicrealm Inc. Method and system for adaptive prefetching
US20020078165A1 (en) * 2000-12-14 2002-06-20 International Business Machines Corporation System and method for prefetching portions of a web page based on learned preferences
CN101189608A (zh) * 2005-03-31 2008-05-28 谷歌公司 用于分析用户的Web历史的系统和方法
US20080208789A1 (en) * 2007-02-23 2008-08-28 Microsoft Corporation HTTP acceleration by prediction and pre-fetching

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108121748A (zh) * 2016-11-30 2018-06-05 腾讯科技(北京)有限公司 提供媒体推送的方法和装置
CN108121748B (zh) * 2016-11-30 2020-08-28 腾讯科技(北京)有限公司 提供媒体推送的方法和装置
CN110298004A (zh) * 2019-06-06 2019-10-01 阿里巴巴集团控股有限公司 一种目标对象的缓存管理方法、系统、装置及电子设备
CN110298004B (zh) * 2019-06-06 2023-03-28 创新先进技术有限公司 一种目标对象的缓存管理方法、系统、装置及电子设备
CN110855778A (zh) * 2019-11-13 2020-02-28 首都经济贸易大学 一种基于网络嗅探器的数据加载方法、装置和系统
CN111556116A (zh) * 2020-04-22 2020-08-18 深圳市元征科技股份有限公司 一种培训媒体推送方法、培训媒体的显示方法及相关装置
CN111556116B (zh) * 2020-04-22 2023-05-30 深圳市元征科技股份有限公司 一种培训媒体推送方法、培训媒体的显示方法及相关装置

Also Published As

Publication number Publication date
CN102804735A (zh) 2012-11-28
CN102804735B (zh) 2014-03-12
US20120239732A1 (en) 2012-09-20
US8527585B2 (en) 2013-09-03
US9094478B2 (en) 2015-07-28
US20140047009A1 (en) 2014-02-13

Similar Documents

Publication Publication Date Title
WO2012122718A1 (zh) 一种浏览器预读方法及其系统
WO2015158297A1 (en) Method, apparatus, and system for controlling delivery task in social networking platform
WO2013139239A1 (en) Method for recommending users in social network and the system thereof
WO2019061613A1 (zh) 贷款资质筛选方法、装置及计算机可读存储介质
WO2019061612A1 (zh) 贷款产品推广方法、装置及计算机可读存储介质
WO2019061614A1 (zh) 贷款产品匹配方法、装置及计算机可读存储介质
WO2017143692A1 (zh) 智能电视及其语音控制方法
WO2018107610A1 (zh) 业务数据处理方法、系统、设备及计算机可读存储介质
WO2017054592A1 (zh) 一种界面显示的方法及终端
WO2018166314A1 (zh) 额度审批方法、装置、设备以及计算机可读存储介质
WO2015120808A1 (en) Method and system for security protection of account information
WO2015035777A1 (zh) 一种移动终端的软件升级方法和系统
WO2015144089A1 (en) Application recommending method and apparatus
WO2018233301A1 (zh) 产品推荐方法、装置、设备以及计算机可读存储介质
WO2015131803A1 (en) Application recommending method and system
WO2013170662A1 (zh) 添加好友信息的方法及装置、计算机存储介质
WO2019000801A1 (zh) 数据同步方法、装置、设备及计算机可读存储介质
WO2018201699A1 (zh) 客户评价方法、装置、设备以及计算机可读存储介质
WO2011007935A1 (ko) 홈페이지 통합 서비스 제공 시스템 및 방법
WO2019177182A1 (ko) 속성 정보 분석을 통한 멀티미디어 컨텐츠 검색장치 및 검색방법
WO2019041851A1 (zh) 家电售后咨询方法、电子设备和计算机可读存储介质
WO2020022819A1 (en) Communication via simulated user
WO2019100531A1 (zh) 数字签名生成、验证方法及其设备和存储介质
WO2019169814A1 (zh) 自动生成中文注释的方法、装置、设备及存储介质
WO2018188196A1 (zh) 一种数据版本控制方法、数据版本控制器、设备及计算机可读存储介质

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 201180003569.8

Country of ref document: CN

WWE Wipo information: entry into national phase

Ref document number: 13381342

Country of ref document: US

WWE Wipo information: entry into national phase

Ref document number: 422/MUMNP/2012

Country of ref document: IN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11860947

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 11860947

Country of ref document: EP

Kind code of ref document: A1