CN103914490B - Pages operating method and system - Google Patents

Publication number
CN103914490B
Authority
CN
Grant status
Grant
Prior art keywords
page
number
correlation coefficient
web
relating
Application number
CN 201310006595
Other languages
Chinese (zh)
Other versions
CN103914490A (en )
Inventor
黄申
韩军
Original Assignee
北京京东尚科信息技术有限公司

Abstract

The present invention discloses a web page operating method and system. The method comprises the steps of: extracting subject content from a web page; obtaining from a database each keyword associated with the subject content, and generating for each keyword a page containing the keyword's click count, purchase count, search count and display count; calculating, for each page, the correlation coefficient of the page's keyword based on the keyword's click count, purchase count, search count and display count; selecting, in descending order of correlation coefficient, a number of keywords equal to a preset value as display keywords; and combining the pages containing the display keywords as the display content of the web page. The invention also provides a system that uses the method. By judging the relevance between page content and subject content, the invention selects the best-matching page content, which makes it easier for users to obtain more valuable information associated with the subject content.

Description

Web page operating method and system

Technical Field

[0001] The present invention relates to a web page operating method and system, and in particular to a web page operating method and system for the Internet.

Background

[0002] The Internet has become a popular technology in the computer field. Its spread lets people overcome limits of space and geography and share information resources conveniently. As an information service, the Internet has developed rapidly since its birth and has become a huge repository storing a great deal of valuable information, so people can search it for all kinds of content that interests them.

[0003] However, precisely because the volume of data on the Internet is so large, the page content associated with the subject content of a requested page is very plentiful, often tens of thousands or even hundreds of thousands of items. A considerable part of this page content, although related to the subject content, has nothing to do with what the requested page actually asks for. Collecting and returning such irrelevant page content wastes a great deal of the user's time and scatters the user's attention.

Summary of the Invention

[0004] The technical problem to be solved by the present invention is to overcome the defect in the prior art that the subject content of a requested page is associated with a large amount of page content containing useless information. The invention provides a web page operating method and system that selects the page content best matching the subject content by judging the relevance between the subject content and the page content that contains it, thereby improving the efficiency of subject-content association.

[0005] The present invention solves the above technical problem through the following technical solutions:

[0006] The present invention provides a web page operating method, characterized in that the method comprises the following steps:

[0007] S1. Extract subject content from a web page;

[0008] The invention uses existing web page parsing techniques to parse and extract the subject content from a web page that contains it. In the invention, the subject content refers to the content of a user operation such as clicking a web page, looking up a web page, or searching; through these operations the invention can return to the user data such as the result of the operation or intermediate values of the process.

[0009] S2. Obtain from a database each keyword associated with the subject content, and generate for each keyword a page containing the keyword's click count, purchase count, search count and display count;

[0010] The invention determines the data to return by comparing the degrees of association of the keywords associated with the subject content. All keywords associated with the subject content are set in advance using existing association techniques, so the mechanism that associates subject content with keywords is not described in detail here. Likewise, the click count, purchase count, search count and display count of a keyword are preset parameter data of that keyword, so how these parameters are obtained is not described in detail either.

[0011] A page is a component of a web page in Internet technology, so the structure of a page and its relationship to the web page are not described in detail here. Once the keywords are determined, the invention uses an existing page parsing method to generate, for each keyword, a page containing the above parameters.

[0012] S3. For each page, calculate the correlation coefficient of the page's keyword based on the keyword's click count, purchase count, search count and display count;

[0013] S4. In descending order of correlation coefficient, select a number of keywords equal to a preset value as display keywords;

[0014] Because the number of keywords associated with the subject content can be very large, for example one million, the invention uses the correlation coefficient to find the group of keywords most closely associated with the subject content and uses them as the returned data.

[0015] S5. Combine the pages containing the display keywords as the display content of the web page.

[0016] The display content is content shown graphically or otherwise in an existing web page, so the specific display mechanism is not described in detail here.
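
Steps S3 to S5 can be sketched as follows. This is a minimal illustration in Python, not the patented implementation: the `KeywordPage` type, its field names and the `select_display_pages` function are hypothetical, and the scoring function is passed in as a parameter because the text leaves the exact formula to the preferred variants.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List


@dataclass(frozen=True)
class KeywordPage:
    """Per-keyword page from step S2 (type and field names are illustrative)."""
    keyword: str
    clicks: int
    purchases: int
    searches: int
    displays: int


def select_display_pages(
    pages: Iterable[KeywordPage],
    score: Callable[[KeywordPage], float],
    preset_count: int,
) -> List[KeywordPage]:
    """Steps S3-S5: score every page, then keep the `preset_count` best."""
    if preset_count < 0:
        raise ValueError("preset_count must be non-negative")
    # S3 + S4: rank by correlation coefficient, highest first.
    # sorted() is stable, so pages with equal scores keep their input order.
    ranked = sorted(pages, key=score, reverse=True)
    # S5: the returned pages together form the display content.
    return ranked[:preset_count]
```

A caller would supply whichever correlation formula is in use as `score`; a full sort is the simplest choice here, although for very large keyword sets a partial selection such as `heapq.nlargest` may be preferable.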

[0017] Preferably, step S3 is:

[0018] Calculate the correlation coefficient of each page's keyword by the following formula:

[0019]

Figure CN103914490BD00071

[0020] where Rs(T,K) is the correlation coefficient, Search(T,K) is the search count, Click(T,K) is the click count, Sale(T,K) is the purchase count, Show(T,K) is the display count, W1, W2 and W3 are each greater than or equal to zero, and W1 + W2 + W3 = 1.

[0021] The invention uses weights to adjust how much the search count, click count and purchase count influence the degree of association between the keyword and the subject content. In other words, this calculation determines the relevance between the subject content and each page.
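
The formula itself appears only as an image (Figure CN103914490BD00071) and is not reproduced in this text. The sketch below therefore rests on an assumption made purely for illustration: that the search, click and purchase counts are combined with the weights W1, W2 and W3 and normalised by the display count. The actual formula may differ. Only the weight constraints (each weight greater than or equal to zero, summing to 1) are taken from the text; the function names are hypothetical.

```python
import math


def check_weights(w1: float, w2: float, w3: float) -> None:
    """Enforce the stated constraints: W1, W2, W3 >= 0 and W1 + W2 + W3 = 1."""
    if min(w1, w2, w3) < 0:
        raise ValueError("weights must be greater than or equal to zero")
    if not math.isclose(w1 + w2 + w3, 1.0, abs_tol=1e-9):
        raise ValueError("weights must sum to 1")


def assumed_rs(search: int, click: int, sale: int, show: int,
               w1: float, w2: float, w3: float) -> float:
    """ASSUMED form of Rs(T,K); the real formula is in an image not shown here."""
    check_weights(w1, w2, w3)
    if show <= 0:
        # Assumption: a keyword that was never displayed gets a score of 0.
        return 0.0
    return (w1 * search + w2 * click + w3 * sale) / show
```

Whatever the real formula is, raising W3 relative to W1 and W2 would make purchases count for more than searches or clicks, which is the adjustment the paragraph above describes.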

[0022] Preferably, step S3 is:

[0023] Calculate the correlation coefficient of each page's keyword by the following formulas:

[0024]

Figure CN103914490BD00072

[0025] Rs(T,K) = (1 - i/n) × Rm + (i/n) × Ri(T,K)

[0026] where Rs(T,K) is the correlation coefficient, Ri(T,K) is the system correlation coefficient, Rm is the intervention coefficient, Search(T,K) is the search count, Click(T,K) is the click count, Sale(T,K) is the purchase count, Show(T,K) is the display count, W1, W2 and W3 are each greater than or equal to zero, W1 + W2 + W3 = 1, n is the length of the intervention period, and i is the current running time, where 0 < i < n.

[0027] To allow the page relevance to be corrected and checked, the invention adjusts its calculation by adding manual-intervention parameters; Rm and n can therefore be set freely by a person skilled in the art. Because i keeps increasing as running time passes, the influence of the manual-intervention parameter on the calculated page relevance becomes smaller and smaller. By changing n, a person skilled in the art can control how long correction and checking last. When i = n, the page relevance can be recalculated, or manual intervention can be dropped entirely and the system correlation coefficient used directly as the correlation coefficient.
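
The blending formula in paragraph [0025] is a linear interpolation between the intervention coefficient Rm and the system correlation coefficient Ri. A minimal Python sketch, assuming `i` and `n` are expressed in the same time unit (the function name is hypothetical, and the endpoints i = 0 and i = n are accepted here because the text discusses the i = n case):

```python
def blended_rs(ri: float, rm: float, i: float, n: float) -> float:
    """Rs(T,K) = (1 - i/n) * Rm + (i/n) * Ri(T,K).

    ri: system correlation coefficient Ri(T,K)
    rm: manually chosen intervention coefficient Rm
    i:  current running time
    n:  length of the intervention period
    """
    if n <= 0:
        raise ValueError("n must be positive")
    if not 0 <= i <= n:
        raise ValueError("i must lie between 0 and n")
    share = i / n  # fraction of the intervention period that has elapsed
    return (1 - share) * rm + share * ri
```

At the start of the period the result is dominated by Rm; halfway through, Rm and Ri contribute equally; at i = n the result equals Ri, which matches the statement that manual intervention can then be dropped.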

[0028] Preferably, the method further comprises a step S6 after step S5, where step S6 comprises:

[0029] S61. For each topic in the database, generate a first topic page containing the topic's click count and purchase count within a specific time period;

[0030] S62. For each first topic page, calculate a first correlation coefficient of the page's topic based on the page's click count and purchase count;

[0031] S63. In descending order of first correlation coefficient, select a number of topics equal to a first preset value as first display topics;

[0032] S64. Combine the topic pages containing the first display topics as the display content of the web page.

[0033] The topics are recorded in the database in advance. A topic is likewise an index, commonly used in the prior art, that characterizes a class of keywords, so its function and structure are not described here. The specific time period can be set freely by the user.
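
Step S61 amounts to counting clicks and purchases per topic inside a time window. The Python sketch below shows one way to do that; the event tuple layout, the `"click"` and `"purchase"` labels and the half-open window are assumptions for illustration, since the text does not say how the underlying records are stored.

```python
from collections import defaultdict
from datetime import datetime
from typing import Dict, Iterable, Tuple

# Hypothetical record layout: (topic, kind, timestamp), kind in {"click", "purchase"}.
Event = Tuple[str, str, datetime]


def topic_counts(events: Iterable[Event], start: datetime,
                 end: datetime) -> Dict[str, Dict[str, int]]:
    """Count clicks and purchases per topic with start <= timestamp < end."""
    counts: Dict[str, Dict[str, int]] = defaultdict(
        lambda: {"click": 0, "purchase": 0}
    )
    for topic, kind, when in events:
        if start <= when < end and kind in ("click", "purchase"):
            counts[topic][kind] += 1
    return dict(counts)
```

The resulting per-topic counts are the inputs that step S62 would turn into the first correlation coefficient; the formula for that coefficient is not given in the text, so it is left out here.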

[0034] Preferably, the method further comprises a step S7 after step S5, where step S7 comprises:

[0035] S71. Extract user identity data from the web page;

[0036] S72. Extract from the database each topic associated with the user identity data, and generate for each topic a second topic page containing the topic's click count and purchase count;

[0037] S73. For each second topic page, calculate a second correlation coefficient of the page's topic based on the page's click count and purchase count;

[0038] S74. In descending order of second correlation coefficient, select a number of topics equal to a second preset value as second display topics;

[0039] S75. Combine the topic pages containing the second display topics as the display content of the web page.

[0040] The invention also treats the user information data of the current web page as a factor in topic-page association, in order to determine the topic pages most closely associated with the current web page. The user information data is data contained in the web page code during user activity such as operating the web page, so it is not described in detail here.

[0041] Preferably, the method further comprises a step S8 after step S5, where step S8 comprises:

[0042] S81. Extract from the database each topic associated with the subject content, and generate for each topic a third topic page containing the topic's click count and purchase count;

[0043] S82. For each third topic page, calculate a third correlation coefficient of the page's topic based on the page's click count and purchase count;

[0044] S83. In descending order of third correlation coefficient, select a number of topics equal to a third preset value as third display topics;

[0045] S84. Combine the topic pages containing the third display topics as the display content of the web page.

[0046] The invention further determines the topic pages most closely associated with the current web page from the degree of association between the topic pages and the subject content of the web page.

[0047] The first, second and third preset values can each be set freely.

[0048] Preferably, the method further comprises a step S9 after step S5, where step S9 comprises:

[0049] S91. For each topic in the database, generate a fourth topic page containing the topic's click count and purchase count within a sample time period;

[0050] S92. Calculate a fourth correlation coefficient of the topic based on the click count and purchase count of the fourth topic page;

[0051] S93. Extract user identity data from the web page;

[0052] S94. Extract from the database each topic associated with the user identity data, and generate for each topic a fifth topic page containing the topic's click count and purchase count;

[0053] S95. Calculate a fifth correlation coefficient of the topic based on the click count and purchase count of the fifth topic page;

[0054] S96. Extract from the database each topic associated with the subject content, and generate for each topic a sixth topic page containing the topic's click count and purchase count;

[0055] S97. Calculate a sixth correlation coefficient of the topic based on the click count and purchase count of the sixth topic page;

[0056] S98. For each topic in the database, calculate a total correlation coefficient of the topic based on its fourth, fifth and sixth correlation coefficients;

[0057] In descending order of total correlation coefficient, select a number of topics equal to a fourth preset value as fourth display topics;

[0058] S99. Combine the topic pages containing the fourth display topics as the display content of the web page.

[0059] The invention calculates several different relevance measures between the subject content and every topic stored in the database, and judges the degree of association between a topic and the subject content by combining the topic's individual relevance measures. For a topic on which one of the relevance calculations was not performed, the corresponding correlation coefficient defaults to a value, such as 0, that has no effect on the total relevance.

[0060] Preferably, step S98 is:

[0061] Calculate the total correlation coefficient of the topic by the following formula:

[0062] Rz(T) = V1 × H(T) + V2 × P(T) + V3 × A(T,T')

[0063] where Rz(T) is the total correlation coefficient, H(T) is the fourth correlation coefficient, P(T) is the fifth correlation coefficient, A(T,T') is the sixth correlation coefficient, V1, V2 and V3 are each greater than or equal to zero, and V1 + V2 + V3 = 1.

[0064] The invention uses weights to adjust how much each of these coefficients influences the degree of association between the topic and the subject content. In other words, this calculation determines the relevance between the subject content and each topic page.
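
The weighted sum in paragraph [0062], together with the default of 0 for a coefficient that was not calculated (paragraph [0059]), can be sketched in Python as follows. The function name and the use of `None` to mark a missing coefficient are illustrative choices, not part of the patent text.

```python
import math
from typing import Optional


def total_correlation(h: Optional[float], p: Optional[float],
                      a: Optional[float],
                      v1: float, v2: float, v3: float) -> float:
    """Rz(T) = V1 * H(T) + V2 * P(T) + V3 * A(T,T').

    h, p, a are the fourth, fifth and sixth correlation coefficients.
    None marks a coefficient that was not calculated; it is treated as 0
    so that it does not affect the total.
    """
    if min(v1, v2, v3) < 0:
        raise ValueError("V1, V2 and V3 must be greater than or equal to zero")
    if not math.isclose(v1 + v2 + v3, 1.0, abs_tol=1e-9):
        raise ValueError("V1 + V2 + V3 must equal 1")

    def value(x: Optional[float]) -> float:
        return 0.0 if x is None else x

    return v1 * value(h) + v2 * value(p) + v3 * value(a)
```

For example, a topic that has no user-specific coefficient (`p` is `None`) is still ranked on its overall popularity and on its association with the subject content.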

[0065] The present invention also provides a web page operating system, characterized in that the system comprises a web server and a plurality of clients, wherein the web server comprises a database, a page generation module, a relevance calculation module and a web page generation module;

[0066] The web server extracts subject content from a web page obtained from the client, and obtains from the database each keyword associated with the subject content;

[0067] The page generation module generates, for each keyword, a page containing the keyword's click count, purchase count, search count and display count;

[0068] The relevance calculation module calculates, for each page, the correlation coefficient of the page's keyword based on the keyword's click count, purchase count, search count and display count;

[0069] The web server selects, in descending order of correlation coefficient, a number of keywords equal to a preset value as display keywords;

[0070] The web page generation module combines the pages containing the display keywords as the display content of the web page and sends the page to the client.

[0071] For ease of description, the web server is described as divided by function into separate modules; when the invention is implemented, the functions of the modules may be realized in one or more pieces of software and/or hardware.

[0072] Preferably, the relevance calculation module calculates the correlation coefficient of each page's keyword by the following formula:

[0073]

Figure CN103914490BD00091

[0074] where Rs(T,K) is the correlation coefficient, Search(T,K) is the search count, Click(T,K) is the click count, Sale(T,K) is the purchase count, Show(T,K) is the display count, W1, W2 and W3 are each greater than or equal to zero, and W1 + W2 + W3 = 1.

[0075] Preferably, the relevance calculation module calculates the correlation coefficient of each page's keyword by the following formulas:

[0076]

Figure CN103914490BD00101

[0077] Rs(T,K) = (1 - i/n) × Rm + (i/n) × Ri(T,K)

[0078] where Rs(T,K) is the correlation coefficient, Ri(T,K) is the system correlation coefficient, Rm is the intervention coefficient, Search(T,K) is the search count, Click(T,K) is the click count, Sale(T,K) is the purchase count, Show(T,K) is the display count, W1, W2 and W3 are each greater than or equal to zero, W1 + W2 + W3 = 1, n is the length of the intervention period, and i is the current running time, where 0 < i < n.

[0079] Preferably, the page generation module of the web server further generates, for each topic in the database, a first topic page containing the topic's click count and purchase count within a specific time period;

[0080] The relevance calculation module calculates, for each first topic page, a first correlation coefficient of the page's topic based on the page's click count and purchase count;

[0081] The web server selects, in descending order of first correlation coefficient, a number of topics equal to a first preset value as first display topics;

[0082] The web page generation module further combines the topic pages containing the first display topics as the display content of the web page.

[0083] Preferably, the web server further extracts user identity data from the web page obtained from the client;

[0084] The page generation module further extracts from the database each topic associated with the user identity data, and generates for each topic a second topic page containing the topic's click count and purchase count;

[0085] The relevance calculation module calculates, for each second topic page, a second correlation coefficient of the page's topic based on the page's click count and purchase count;

[0086] The web server selects, in descending order of second correlation coefficient, a number of topics equal to a second preset value as second display topics;

[0087] The web page generation module further combines the topic pages containing the second display topics as the display content of the web page.

[0088] Preferably, the page generation module of the web server further extracts from the database each topic associated with the subject content, and generates for each topic a third topic page containing the topic's click count and purchase count;

[0089] The relevance calculation module calculates, for each third topic page, a third correlation coefficient of the page's topic based on the page's click count and purchase count;

[0090] The web server selects, in descending order of third correlation coefficient, a number of topics equal to a third preset value as third display topics;

[0091] The web page generation module further combines the topic pages containing the third display topics as the display content of the web page.

[0092] On the basis of common knowledge in the art, the above preferred conditions can be combined in any way to obtain the preferred examples of the invention.

[0093] The positive effects of the invention are:

[0094] The web page operating method and system of the invention select the page content best matching the subject content by judging the relevance between the subject content and the page content that contains it. This improves the efficiency of subject-content association and makes it easier for users to obtain more valuable information associated with the subject content.

Brief Description of the Drawings

[0095] FIG. 1 is a schematic structural diagram of the web page operating system of Embodiment 1 of the invention.

[0096] FIG. 2 is a flowchart of the web page operating method of Embodiment 1 of the invention.

[0097] FIG. 3 is a flowchart of the web page operating method of Embodiment 2 of the invention.

[0098] FIG. 4 is a flowchart of the web page operating method of Embodiment 3 of the invention.

[0099] FIG. 5 is a flowchart of the web page operating method of Embodiment 4 of the invention.

[0100] FIG. 6 is a flowchart of the web page operating method of Embodiment 5 of the invention.

Detailed Description

[0101] The invention selects the page content best matching the subject content by judging the relevance between the subject content and the page content that contains it, so that matching page content can be picked quickly out of a massive amount of page content and users can obtain more valuable information. The invention is further described below by way of embodiments, but is not thereby limited to the scope of those embodiments.

[0102] Embodiment 1

[0103] As shown in FIG. 1, the web page operating system of this embodiment comprises a web server 1 and a plurality of clients 2. In this embodiment every client provides the web server with web pages containing subject content, so the number of clients 2 may be arbitrary.

[0104] The web server 1 comprises a database 11, a page generation module 12, a relevance calculation module 13 and a web page generation module 14.

[0105] In this embodiment a communication link is established between the web server 1 and each client 2, so the web server 1 can communicate with each client 2 individually. Establishing the link and the manner of communication are common means in existing network communication technology and are not described in detail here.

[0106] In this embodiment the web server 1 extracts subject content from a web page obtained from the client 2 over the communication link, and obtains from the database 11 each keyword associated with the subject content. The database 11 holds all keywords and the data related to them. The specific data held in the database 11 can be adjusted freely according to the practical needs of the technician's work; this embodiment does not limit the specific data contained in the database 11.

[0107] The page generation module 12 generates, for each keyword, a page containing the keyword's click count, purchase count, search count and display count. That is, this embodiment gathers the data associated with the keyword, as found in the database 11, into one page, so the page contains the content relating to the keyword.

[0108] The relevance calculation module 13 calculates, for each page, the correlation coefficient of the page's keyword based on the keyword's click count, purchase count, search count and display count.

[0109] 具体地说,所述相关度计算模块13通过下式1)计算每个页面的关键词的相关系数, [0109] Specifically, the correlation calculation module 13 calculates the degree of keywords for each page by the following formula 1) correlation coefficient,

[0110] [0110]

Figure CN103914490BD00121

[0111] In formula 1), Rs(T, K) is the correlation coefficient, Search(T, K) is the search count, Click(T, K) is the click count, Sale(T, K) is the purchase count, and Show(T, K) is the display count. W1, W2 and W3 are each greater than or equal to zero, and W1 + W2 + W3 = 1.

[0112] In this embodiment, the weighting can be tuned by adjusting the values of W1, W2 and W3, which in turn adjusts how strongly the search count, click count and purchase count influence the degree of association between the keyword and the subject content. In other words, evaluating formula 1) determines the relevance between the subject content and each page.
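As a rough illustration of how W1, W2 and W3 trade off the three signals, the following Python sketch checks the weight constraints stated above and computes a weighted score. Formula 1) is given only as a figure and is not reproduced in this text, so dividing the weighted counts by the display count Show(T, K) is an assumption made purely for illustration, as are the function and parameter names.

```python
def keyword_relevance(search, click, sale, show, w1, w2, w3):
    """Weighted relevance score for one keyword page (illustrative only).

    Assumption: the weighted counts are normalised by the display count
    `show`; the patent's formula 1) is a figure and may differ.
    """
    if min(w1, w2, w3) < 0:
        raise ValueError("weights must be greater than or equal to zero")
    if abs(w1 + w2 + w3 - 1.0) > 1e-9:
        raise ValueError("weights must sum to 1")
    if show <= 0:
        return 0.0  # a keyword that was never displayed has no usable signal
    return (w1 * search + w2 * click + w3 * sale) / show


# The same counts score differently under different weightings.
stats = dict(search=40, click=20, sale=5, show=100)
balanced = keyword_relevance(**stats, w1=0.4, w2=0.4, w3=0.2)
sales_heavy = keyword_relevance(**stats, w1=0.1, w2=0.1, w3=0.8)
```

Rejecting invalid weights early keeps scores comparable across keywords, since every score is then a convex combination of the three normalised counts.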

[0113] Specifically, the pseudocode for the above implementation in this embodiment is as follows:

Figure CN103914490BD00122

[0116] Another way of calculating the correlation coefficient in this embodiment is as follows:

[0117] The relevance calculation module 13 calculates the correlation coefficient of each page's keyword by formula 2) and formula 3):

[0118]

Figure CN103914490BD00123

[0119] Rs(T, K) = (1 - i/n) × Rm + (i/n) × Ri(T, K)    3)

[0120] In formulas 2) and 3), Rs(T, K) is the correlation coefficient, Ri(T, K) is the system correlation coefficient, Rm is the intervention coefficient, Search(T, K) is the search count, Click(T, K) is the click count, Sale(T, K) is the purchase count, and Show(T, K) is the display count. W1, W2 and W3 are each greater than or equal to zero, and W1 + W2 + W3 = 1. n is the length of the intervention period and i is the current running time, where 0 ≤ i ≤ n.

[0121] With this way of calculating the correlation coefficient, the page relevance can be corrected and tested. Specifically, a manually set parameter is introduced to adjust the page relevance calculation, so Rm and n may be set freely by those skilled in the art. Because i grows as the running time elapses, the influence of the manual parameter on the calculated page relevance becomes steadily smaller. By changing n, those skilled in the art can control how long the correction and testing lasts. When i = n, they can either restart the above page relevance calculation or drop manual intervention entirely, that is, use the first implementation of this embodiment and take the system correlation coefficient directly as the correlation coefficient.
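The time-dependent blend in formula 3) can be sketched in Python as below. The formula itself comes from the text; the function name, argument names and sample values are hypothetical and serve only to show how the manual value fades out as i approaches n.

```python
def blended_relevance(r_system, r_manual, i, n):
    """Formula 3): Rs = (1 - i/n) * Rm + (i/n) * Ri, for 0 <= i <= n."""
    if n <= 0:
        raise ValueError("intervention period n must be positive")
    if not 0 <= i <= n:
        raise ValueError("running time i must satisfy 0 <= i <= n")
    t = i / n  # fraction of the intervention period that has elapsed
    return (1 - t) * r_manual + t * r_system


# Hypothetical values: manual coefficient 0.9, system coefficient 0.2.
start = blended_relevance(r_system=0.2, r_manual=0.9, i=0, n=10)    # manual value only
middle = blended_relevance(r_system=0.2, r_manual=0.9, i=5, n=10)   # equal mix
end = blended_relevance(r_system=0.2, r_manual=0.9, i=10, n=10)     # system value only
```

At i = 0 the result equals Rm and at i = n it equals Ri, which matches the statement that manual intervention can be dropped once the period ends.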

[0122] Specifically, the pseudocode for this implementation in this embodiment is as follows:

[0123]

Figure CN103914490BD00131

[0124] The web server 1 selects, in descending order of correlation coefficient, a number of keywords equal to a preset value N as display keywords. Specifically, in this embodiment the web server 1 sorts the pages in descending order of their correlation coefficients and, starting from the page with the largest correlation coefficient, selects a number of pages equal to the preset value N.
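The selection of the N highest-scoring keywords can be sketched in Python as follows. The function name and the sample coefficients are hypothetical; `heapq.nlargest` is used here simply as one convenient way to take the top N without sorting the full collection.

```python
import heapq


def select_display_keywords(coefficients, n):
    """Return the n keywords with the highest correlation coefficients,
    ordered from highest to lowest."""
    if n < 0:
        raise ValueError("preset value N must not be negative")
    # Iterating a dict yields its keys; the coefficient is the sort key.
    return heapq.nlargest(n, coefficients, key=coefficients.get)


# Hypothetical coefficients for four keywords.
coefficients = {"laptop": 0.31, "mouse": 0.12, "keyboard": 0.27, "monitor": 0.05}
display_keywords = select_display_keywords(coefficients, 2)
```

If N exceeds the number of available keywords, all keywords are returned in descending order.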

[0125] The web page generation module 14 then combines the pages containing the display keywords into the display content of the web page and sends it to the client 2. The client 2 thus obtains the page content that best matches the subject content.

[0126] Thus, as shown in FIG. 2, the workflow of the web page operating system of this embodiment is as follows:

[0127] Step 101: extract the subject content from the web page of the client 2.

[0128] Step 102: obtain from the database 11 every keyword associated with the subject content.

[0129] Step 103: the page generation module 12 generates, for each keyword, a page containing the keyword's click count, purchase count, search count and display count.

[0130] Step 104: the relevance calculation module 13 calculates the correlation coefficient of each page's keyword based on the keyword's click count, purchase count, search count and display count in that page.

[0131] The relevance calculation module 13 may calculate the correlation coefficient either by formula 1) or by formulas 2) and 3).

[0132] Step 105: the web server 1 selects, in descending order of correlation coefficient, a number of keywords equal to the preset value N as display keywords.

[0133] Step 106: the web page generation module 14 combines the pages containing the display keywords into the display content of the web page.
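Steps 102 to 106 can be sketched end to end in Python as below, assuming the subject content has already been extracted (step 101). The in-memory dictionary standing in for the database 11, the function name, and the placeholder scoring function are all hypothetical; the scoring function is passed in so that either calculation method could be plugged in.

```python
def build_display_content(topic, keyword_db, score, n):
    """Sketch of steps 102-106 for an already extracted subject content."""
    # Step 102: keywords associated with the subject content.
    keywords = keyword_db.get(topic, {})
    # Step 103: one "page" per keyword holding its four counters.
    pages = {kw: dict(stats) for kw, stats in keywords.items()}
    # Step 104: correlation coefficient of each page's keyword.
    scored = {kw: score(page) for kw, page in pages.items()}
    # Step 105: the n keywords with the highest coefficients, highest first.
    chosen = sorted(scored, key=scored.get, reverse=True)[:n]
    # Step 106: combine the chosen pages into the display content.
    return [{"keyword": kw, **pages[kw]} for kw in chosen]


# Hypothetical data and a placeholder score (not the patent's formula).
keyword_db = {
    "camera": {
        "lens": {"click": 30, "sale": 6, "search": 50, "show": 200},
        "tripod": {"click": 10, "sale": 1, "search": 20, "show": 100},
        "bag": {"click": 4, "sale": 0, "search": 8, "show": 80},
    }
}
click_rate = lambda page: page["click"] / page["show"] if page["show"] else 0.0
content = build_display_content("camera", keyword_db, click_rate, 2)
```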

[0134] Embodiment 2

[0135] Building on Embodiment 1, this embodiment further calculates relevance for the selected pages, so that the selected pages match the subject content more closely.

[0136] Specifically, the page generation module 12 of this embodiment also generates, for each topic in the database 11, a first topic page containing the topic's click count and purchase count within a specific time period T. This embodiment uses not only all the keywords in the database 11 and the data content related to them, but also all the topics and their related data content.

[0137] The start point and length of the specific time period T can be set arbitrarily, for example the topic's click count and purchase count in the period from 2012-12-21 23:00 to 2012-12-22 1:00. The topic-related data content held in the database 11 may likewise be adjusted freely according to the practical needs of the practitioner, and this embodiment likewise does not limit the specific data content contained in the database 11.
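Counting clicks and purchases per topic within the time period T can be sketched in Python as below. The event tuple layout, the function name and the choice of a half-open interval (start included, end excluded) are assumptions for illustration; the example window is the one mentioned above.

```python
from collections import Counter
from datetime import datetime


def counts_in_window(events, start, end):
    """Count clicks and purchases per topic with start <= timestamp < end.

    `events` is an iterable of (topic, kind, timestamp) tuples, where
    kind is "click" or "purchase" (a hypothetical event layout).
    """
    clicks, purchases = Counter(), Counter()
    for topic, kind, ts in events:
        if not (start <= ts < end):
            continue  # event falls outside the time period T
        if kind == "click":
            clicks[topic] += 1
        elif kind == "purchase":
            purchases[topic] += 1
    return clicks, purchases


start = datetime(2012, 12, 21, 23, 0)
end = datetime(2012, 12, 22, 1, 0)
events = [
    ("phones", "click", datetime(2012, 12, 21, 23, 30)),
    ("phones", "purchase", datetime(2012, 12, 22, 0, 15)),
    ("phones", "click", datetime(2012, 12, 22, 2, 0)),    # after the window
    ("books", "click", datetime(2012, 12, 21, 22, 59)),   # before the window
]
clicks, purchases = counts_in_window(events, start, end)
```

Comparing full timestamps rather than clock times lets the window span midnight without special handling.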

[0138] The relevance calculation module 13 calculates a first correlation coefficient for the topic of each first topic page based on that page's click count and purchase count. The first correlation coefficient may be calculated by any existing correlation coefficient method, and this embodiment does not limit the specific calculation.

[0139] The web server 1 selects, in descending order of first correlation coefficient, a number of topics equal to a first preset value N1 as first display topics. The topics are selected in the same way as in Embodiment 1, so the details are not repeated here.

[0140] The web page generation module 14 also combines the topic pages containing the first display topics into the display content of the web page.

[0141] Thus, as shown in FIG. 3, the workflow of the web page operating system of this embodiment is as follows:

[0142] Step 201: extract the subject content from the web page of the client 2.

[0143] Step 202: obtain from the database 11 every keyword associated with the subject content.

[0144] Step 203: the page generation module 12 generates, for each keyword, a page containing the keyword's click count, purchase count, search count and display count.

[0145] Step 204: the relevance calculation module 13 calculates the correlation coefficient of each page's keyword based on the keyword's click count, purchase count, search count and display count in that page.

[0146] The relevance calculation module 13 may calculate the correlation coefficient either by formula 1) of Embodiment 1 or by formulas 2) and 3) of Embodiment 1.

[0147] Step 205: the web server 1 selects, in descending order of correlation coefficient, a number of keywords equal to the preset value N as display keywords.

[0148] Step 206: the web page generation module 14 combines the pages containing the display keywords into the display content of the web page.

[0149] Step 207: the page generation module 12 generates, for each topic in the database 11, a first topic page containing the topic's click count and purchase count within the specific time period T.

[0150] Step 208: the relevance calculation module 13 calculates the first correlation coefficient of the topic of each first topic page based on that page's click count and purchase count.

[0151] Step 209: the web server 1 selects, in descending order of first correlation coefficient, a number of topics equal to the first preset value N1 as first display topics.

[0152] Step 210: the web page generation module 14 combines the topic pages containing the first display topics into the display content of the web page.

[0153] Embodiment 3

[0154] This embodiment likewise builds on Embodiment 1 and further calculates relevance for the selected pages, so that the selected pages match the subject content more closely.

[0155] Specifically, the web server 1 also extracts user identity data from the web page obtained from the client 2.

[0156] The page generation module 12 also extracts from the database 11 every topic associated with the user identity data and generates, for each such topic, a second topic page containing the topic's click count and purchase count. In this embodiment, the topic-related data in the database 11 include at least user identity data in addition to the data content described in Embodiment 2.

[0157] The relevance calculation module 13 calculates a second correlation coefficient for the topic of each second topic page based on that page's click count and purchase count. As before, the second correlation coefficient may be calculated by any existing correlation coefficient method, and this embodiment does not limit the specific calculation.

[0158] The web server 1 selects, in descending order of second correlation coefficient, a number of topics equal to a second preset value N2 as second display topics. The topics are selected in the same way as in Embodiment 1, so the details are again not repeated here.

[0159] The web page generation module 14 also combines the topic pages containing the second display topics into the display content of the web page.

[0160] Thus, as shown in FIG. 4, the workflow of the web page operating system of this embodiment is as follows:

[0161] Step 301: extract the subject content from the web page of the client 2.

[0162] Step 302: obtain from the database 11 every keyword associated with the subject content.

[0163] Step 303: the page generation module 12 generates, for each keyword, a page containing the keyword's click count, purchase count, search count and display count.

[0164] Step 304: the relevance calculation module 13 calculates the correlation coefficient of each page's keyword based on the keyword's click count, purchase count, search count and display count in that page.

[0165] The relevance calculation module 13 may calculate the correlation coefficient either by formula 1) of Embodiment 1 or by formulas 2) and 3) of Embodiment 1.

[0166] Step 305: the web server 1 selects, in descending order of correlation coefficient, a number of keywords equal to the preset value N as display keywords.

[0167] Step 306: the web page generation module 14 combines the pages containing the display keywords into the display content of the web page.

[0168] Step 307: the web server 1 extracts user identity data from the web page.

[0169] Step 308: the web server 1 extracts from the database 11 every topic associated with the user identity data.

[0170] Step 309: the page generation module 12 generates, for each topic, a second topic page containing the topic's click count and purchase count.

[0171] Step 310: the relevance calculation module 13 calculates the second correlation coefficient of the topic of each second topic page based on that page's click count and purchase count.

[0172] Step 311: the web server 1 selects, in descending order of second correlation coefficient, a number of topics equal to the second preset value N2 as second display topics.

[0173] Step 312: the web server 1 combines the topic pages containing the second display topics into the display content of the web page.

[0174] Embodiment 4

[0175] This embodiment likewise builds on Embodiment 1 and further calculates relevance for the selected pages, so that the selected pages match the subject content more closely.

[0176] Specifically, the page generation module 12 of the web server 1 also extracts from the database 11 every topic associated with the subject content and generates, for each such topic, a third topic page containing the topic's click count and purchase count.

[0177] The relevance calculation module 13 calculates a third correlation coefficient for the topic of each third topic page based on that page's click count and purchase count. As before, the third correlation coefficient may be calculated by any existing correlation coefficient method, and this embodiment does not limit the specific calculation.

[0178] The web server 1 selects, in descending order of third correlation coefficient, a number of topics equal to a third preset value N3 as third display topics. The topics are selected in the same way as in Embodiment 1, so the details are again not repeated here.

[0179] The web page generation module 14 also combines the topic pages containing the third display topics into the display content of the web page.

[0180] Thus, as shown in FIG. 5, the workflow of the web page operating system of this embodiment is as follows:

[0181] Step 401: extract the subject content from the web page of the client 2.

[0182] Step 402: obtain from the database 11 every keyword associated with the subject content.

[0183] Step 403: the page generation module 12 generates, for each keyword, a page containing the keyword's click count, purchase count, search count and display count.

[0184] Step 404: the relevance calculation module 13 calculates the correlation coefficient of each page's keyword based on the keyword's click count, purchase count, search count and display count in that page.

[0185] The relevance calculation module 13 may calculate the correlation coefficient either by formula 1) of Embodiment 1 or by formulas 2) and 3) of Embodiment 1.

[0186] Step 405: the web server 1 selects, in descending order of correlation coefficient, a number of keywords equal to the preset value N as display keywords.

[0187] Step 406: the web page generation module 14 combines the pages containing the display keywords into the display content of the web page.

[0188] Step 407: the web server 1 extracts from the database 11 every topic associated with the subject content.

[0189] Step 408: the page generation module 12 generates, for each topic, a third topic page containing the topic's click count and purchase count.

[0190] Step 409: the relevance calculation module 13 calculates the third correlation coefficient of the topic of each third topic page based on that page's click count and purchase count.

[0191] Step 410: the web server 1 selects, in descending order of third correlation coefficient, a number of topics equal to the third preset value N3 as third display topics.

[0192] Step 411: the web page generation module 14 combines the topic pages containing the third display topics into the display content of the web page.

[0193] Embodiment 5

[0194] This embodiment integrates Embodiments 2 to 4 and further refines the relevance calculation for the selected pages, so that the selected pages match the subject content more closely.

[0195] Specifically, the functions of the components and modules of the web page operating system in this embodiment are the same as in Embodiments 2 to 4 and are therefore not described in detail here. The fourth correlation coefficient is the first correlation coefficient of Embodiment 2, the fifth correlation coefficient is the second correlation coefficient of Embodiment 3, and the sixth correlation coefficient is the third correlation coefficient of Embodiment 4. The fourth topic page corresponds to the first topic page of Embodiment 2, the fifth topic page to the second topic page of Embodiment 3, and the sixth topic page to the third topic page of Embodiment 4. Because this embodiment includes the functions of the above embodiments, its remaining parameters correspond to the parameters of those embodiments.

[0196] In addition, the relevance calculation module 13 of this embodiment further calculates the total correlation coefficient of each topic from the above correlation coefficients by formula 4):

[0197] Rz(T) = V1 × H(T) + V2 × P(T) + V3 × A(T, T')    4)

[0198] where Rz(T) is the total correlation coefficient, H(T) is the fourth correlation coefficient, P(T) is the fifth correlation coefficient, and A(T, T') is the sixth correlation coefficient. V1, V2 and V3 are each greater than or equal to zero, and V1 + V2 + V3 = 1.
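Formula 4) and the subsequent ranking of topics by total correlation coefficient can be sketched in Python as follows. The weighted sum is taken from the text; the function names, the tuple layout for the three per-topic coefficients, and the sample values are hypothetical.

```python
def total_relevance(h, p, a, v1, v2, v3):
    """Formula 4): Rz(T) = V1 * H(T) + V2 * P(T) + V3 * A(T, T')."""
    if min(v1, v2, v3) < 0:
        raise ValueError("weights must be greater than or equal to zero")
    if abs(v1 + v2 + v3 - 1.0) > 1e-9:
        raise ValueError("weights must sum to 1")
    return v1 * h + v2 * p + v3 * a


def rank_topics(per_topic, v1, v2, v3, n4):
    """Return the n4 topics with the highest total correlation coefficient."""
    totals = {
        topic: total_relevance(h, p, a, v1, v2, v3)
        for topic, (h, p, a) in per_topic.items()
    }
    return sorted(totals, key=totals.get, reverse=True)[:n4]


# Hypothetical (fourth, fifth, sixth) coefficients per topic.
per_topic = {
    "phones": (0.6, 0.2, 0.4),
    "books": (0.1, 0.9, 0.3),
    "toys": (0.2, 0.1, 0.1),
}
top_topics = rank_topics(per_topic, v1=0.5, v2=0.3, v3=0.2, n4=2)
```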

[0199] Specifically, the pseudocode for calculating the total correlation coefficient in this embodiment is as follows:

[0200]

Figure CN103914490BD00171

[0201]

Figure CN103914490BD00181

[0202] Thus, as shown in FIG. 6, the workflow of the web page operating system of this embodiment is as follows:

[0203] Step 501: extract the subject content from the web page of the client 2.

[0204] Step 502: obtain from the database 11 every keyword associated with the subject content.

[0205] Step 503: the page generation module 12 generates, for each keyword, a page containing the keyword's click count, purchase count, search count and display count.

[0206] Step 504: the relevance calculation module 13 calculates the correlation coefficient of each page's keyword based on the keyword's click count, purchase count, search count and display count in that page.

[0207] The relevance calculation module 13 may calculate the correlation coefficient either by formula 1) of Embodiment 1 or by formulas 2) and 3) of Embodiment 1.

[0208] Step 505: the web server 1 selects, in descending order of correlation coefficient, a number of keywords equal to the preset value N as display keywords.

[0209] Step 506: the web page generation module 14 combines the pages containing the display keywords into the display content of the web page.

[0210] Step 507: the page generation module 12 generates, for each topic in the database 11, a fourth topic page containing the topic's click count and purchase count within a sample time period.

[0211] Step 508: the relevance calculation module 13 calculates the fourth correlation coefficient of the topic based on the click count and purchase count of the fourth topic page.

[0212] Step 509: the web server 1 extracts user identity data from the web page.

[0213] Step 510: the web server 1 extracts from the database every topic associated with the user identity data.

[0214] Step 511: the page generation module 12 generates, for each topic, a fifth topic page containing the topic's click count and purchase count.

[0215] Step 512: the relevance calculation module 13 calculates the fifth correlation coefficient of the topic based on the click count and purchase count of the fifth topic page.

[0216] Step 513: the web server 1 extracts from the database every topic associated with the subject content.

[0217] Step 514: the relevance calculation module 13 generates, for each topic, a sixth topic page containing the topic's click count and purchase count.

[0218] Step 515: the relevance calculation module 13 calculates the sixth correlation coefficient of the topic based on the click count and purchase count of the sixth topic page.

[0219] Step 516: the relevance calculation module 13 also calculates the total correlation coefficient of each topic in the database based on that topic's fourth, fifth and sixth correlation coefficients.

[0220] Step 517: the web server 1 selects, in descending order of total correlation coefficient, a number of topics equal to a fourth preset value N4 as fourth display topics.

[0221] Step 518: the web page generation module 14 combines the topic pages containing the fourth display topics into the display content of the web page.

[0222] 通过以上的具体实施方式的描述可知,本领域的技术人员可以清楚地了解到本申请可借助软件加必需的通用硬件平台的方式来实现。 [0222] By the above described specific embodiments can be seen, those skilled in the art can understand that the present application may be implemented by software plus a necessary universal hardware platform. 基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品可以存储在存储介质中,如R0M/RAM (只读存储器/随机存取存储器)、磁碟、光盘等,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例或者实施例的某些部分所述的方法。 Based on such understanding, the technical solutions of the present application or the nature of the part contributing to the prior art may be embodied in a software product, which computer software product may be stored in a storage medium, such as a R0M / RAM (read only memory / random access memory), a magnetic disk, an optical disk, and include several instructions which execute various embodiments of the present application certain embodiments or portions to enable a computer device (may be a personal computer, a server, or a network device) the method of claim.

[0223] 本说明书中的各个实施例均采用递进的方式描述,各个实施例之间相同相似的部分互相参见即可,每个实施例重点说明的都是与其他实施例的不同之处。 [0223] In the present specification, various embodiments are described in a progressive manner, similar portions of the same between the various embodiments refer to each other, are different from the embodiment and the other embodiments described each embodiment focus. 尤其,对于系统实施例而言,由于其基本相似于方法实施例,所以描述的比较简单,相关之处参见方法实施例的部分说明即可。 In particular, for embodiments of the system, since they are substantially similar to the method embodiments, the description is relatively simple, some embodiments of the methods see relevant point can be described.

[0224] 本申请可用于众多通用或专用的计算系统环境或配置中。 [0224] The present application can be used in numerous general purpose or special purpose computing system environments or configurations. 例如:个人计算机、服务器计算机、手持设备或便携式设备、平板型设备、多处理器系统、基于微处理器的系统、置顶盒、可编程的消费电子设备、网络PC (个人电脑)、小型计机、大型计算机、包括以上任何系统或设备的分布式计算环境等等。 For example: personal computers, server computers, handheld or portable devices, tablet devices, multiprocessor systems, microprocessor-based systems, set top boxes, programmable consumer electronics, network PC (personal computer), a small machine gauge , mainframe computers, including any of the above systems or devices, distributed computing environments and the like.

[0225] 本申请可以在由计算机执行的计算机可执行指令的一般上下文中描述,例如程序模块。 [0225] The present application may be described in the general context of computer-executable instructions, executed by a computer, such as program modules. 一般地,程序模块包括执行特定任务或实现特定抽象数据类型的例程、程序、对象、组件、数据结构等等。 Generally, program modules include performing particular tasks or implement particular abstract data types routines, programs, objects, components, data structures, and the like. 也可以在分布式计算环境中实践本申请,在这些分布式计算环境中,由通过通信网络而被连接的远程处理设备来执行任务。 The present application may also be practiced in distributed computing environments, in which a distributed computing environment, tasks are performed by remote processing devices that are linked through a communications network. 在分布式计算环境中,程序模块可以位于包括存储设备在内的本地和远程计算机存储介质中。 In a distributed computing environment, program modules may be located in both local and remote computer storage media including memory storage devices in.

[0226] Although specific embodiments of the present invention have been described above, those skilled in the art will understand that these are merely illustrative and that the scope of protection of the present invention is defined by the appended claims. Those skilled in the art may make various changes or modifications to these embodiments without departing from the principles and spirit of the present invention, and all such changes and modifications fall within the scope of protection of the present invention.

Claims (14)

  1. A web page operating method, characterized in that the method comprises the following steps:
    S1, extracting subject content from a web page;
    S2, obtaining from a database each keyword associated with the subject content, and generating for each keyword a page containing the keyword's click count, purchase count, search count, and display count;
    S3, calculating, for each page, a correlation coefficient of the page's keyword based on the keyword's click count, purchase count, search count, and display count in that page;
    S4, selecting, in descending order of correlation coefficient, a number of keywords equal to a preset value as display keywords;
    S5, combining the pages containing the display keywords as the display content of the web page.
  2. The web page operating method of claim 1, characterized in that step S3 is: calculating the correlation coefficient of each page's keyword by the following formula:
    Figure CN103914490BC00021
    where Rs(T, K) is the correlation coefficient, Search(T, K) is the search count, Click(T, K) is the click count, Sale(T, K) is the purchase count, Show(T, K) is the display count, W1, W2, and W3 are each greater than or equal to zero, and W1 + W2 + W3 = 1.
  3. The web page operating method of claim 1, characterized in that step S3 is: calculating the correlation coefficient of each page's keyword by the following formula:
    Figure CN103914490BC00022
    where Rs(T, K) is the correlation coefficient, Ri(T, K) is the system correlation coefficient, Rm is the intervention coefficient, Search(T, K) is the search count, Click(T, K) is the click count, Sale(T, K) is the purchase count, Show(T, K) is the display count, W1, W2, and W3 are each greater than or equal to zero, W1 + W2 + W3 = 1, n is the length of the intervention period, and i is the current running time, where 0 < i < n.
  4. The web page operating method of claim 1, characterized in that the method further comprises a step S6 after step S5, wherein step S6 comprises:
    S61, generating, for each topic in the database, a first topic page containing the topic's click count and purchase count within a specific time period;
    S62, calculating a first correlation coefficient of the topic of each first topic page based on that page's click count and purchase count;
    S63, selecting, in descending order of first correlation coefficient, a number of topics equal to a first preset value as first display topics;
    S64, combining the topic pages containing the first display topics as the display content of the web page.
  5. The web page operating method of claim 1, characterized in that the method further comprises a step S7 after step S5, wherein step S7 comprises:
    S71, extracting user identity data from the web page;
    S72, extracting from the database each topic associated with the user identity data, and generating for each topic a second topic page containing the topic's click count and purchase count;
    S73, calculating a second correlation coefficient of the topic of each second topic page based on that page's click count and purchase count;
    S74, selecting, in descending order of second correlation coefficient, a number of topics equal to a second preset value as second display topics;
    S75, combining the topic pages containing the second display topics as the display content of the web page.
  6. The web page operating method of claim 1, characterized in that the method further comprises a step S8 after step S5, wherein step S8 comprises:
    S81, extracting from the database each topic associated with the subject content, and generating for each topic a third topic page containing the topic's click count and purchase count;
    S82, calculating a third correlation coefficient of the topic of each third topic page based on that page's click count and purchase count;
    S83, selecting, in descending order of third correlation coefficient, a number of topics equal to a third preset value as third display topics;
    S84, combining the topic pages containing the third display topics as the display content of the web page.
  7. The web page operating method of claim 1, characterized in that the method further comprises a step S9 after step S5, wherein step S9 comprises:
    S91, generating, for each topic in the database, a fourth topic page containing the topic's click count and purchase count within a sample time period;
    S92, calculating a fourth correlation coefficient of the topic of the fourth topic page based on that page's click count and purchase count;
    S93, extracting user identity data from the web page;
    S94, extracting from the database each topic associated with the user identity data, and generating for each such topic a fifth topic page containing the topic's click count and purchase count;
    S95, calculating a fifth correlation coefficient of the topic of the fifth topic page based on that page's click count and purchase count;
    S96, extracting from the database each topic associated with the subject content, and generating for each such topic a sixth topic page containing the topic's click count and purchase count;
    S97, calculating a sixth correlation coefficient of the topic of the sixth topic page based on that page's click count and purchase count;
    S98, calculating, for each topic in the database, a total correlation coefficient of the topic based on its fourth, fifth, and sixth correlation coefficients; and selecting, in descending order of total correlation coefficient, a number of topics equal to a fourth preset value as fourth display topics;
    S99, combining the topic pages containing the fourth display topics as the display content of the web page.
  8. The web page operating method of claim 7, characterized in that step S98 is: calculating the total correlation coefficient of the topic by the following formula:
    Rz(T) = V1 × H(T) + V2 × P(T) + V3 × A(T, T')
    where Rz(T) is the total correlation coefficient, H(T) is the fourth correlation coefficient, P(T) is the fifth correlation coefficient, A(T, T') is the sixth correlation coefficient, V1, V2, and V3 are each greater than or equal to zero, and V1 + V2 + V3 = 1.
  9. A web page operating system, characterized in that the system comprises a web server and a plurality of clients, wherein the web server comprises a database, a page generation module, a correlation calculation module, and a web page generation module;
    the web server extracts subject content from a web page obtained from the client, and obtains from the database each keyword associated with the subject content;
    the page generation module generates for each keyword a page containing the keyword's click count, purchase count, search count, and display count;
    the correlation calculation module calculates, for each page, a correlation coefficient of the page's keyword based on the keyword's click count, purchase count, search count, and display count in that page;
    the web server selects, in descending order of correlation coefficient, a number of keywords equal to a preset value as display keywords;
    the web page generation module combines the pages containing the display keywords as the display content of the web page and sends the page to the client.
  10. The web page operating system of claim 9, characterized in that the correlation calculation module calculates the correlation coefficient of each page's keyword by the following formula:
    Figure CN103914490BC00041
    where Rs(T, K) is the correlation coefficient, Search(T, K) is the search count, Click(T, K) is the click count, Sale(T, K) is the purchase count, Show(T, K) is the display count, W1, W2, and W3 are each greater than or equal to zero, and W1 + W2 + W3 = 1.
  11. The web page operating system of claim 9, characterized in that the correlation calculation module calculates the correlation coefficient of each page's keyword by the following formula:
    Figure CN103914490BC00042
    where Rs(T, K) is the correlation coefficient, Ri(T, K) is the system correlation coefficient, Rm is the intervention coefficient, Search(T, K) is the search count, Click(T, K) is the click count, Sale(T, K) is the purchase count, Show(T, K) is the display count, W1, W2, and W3 are each greater than or equal to zero, W1 + W2 + W3 = 1, n is the length of the intervention period, and i is the current running time, where 0 < i < n.
  12. The web page operating system of claim 9, characterized in that the page generation module of the web server further generates, for each topic in the database, a first topic page containing the topic's click count and purchase count within a specific time period;
    the correlation calculation module calculates a first correlation coefficient of the topic of each first topic page based on that page's click count and purchase count;
    the web server selects, in descending order of first correlation coefficient, a number of topics equal to a first preset value as first display topics;
    the web page generation module further combines the topic pages containing the first display topics as the display content of the web page.
  13. The web page operating system of claim 9, characterized in that the web server further extracts user identity data from the web page obtained from the client;
    the page generation module further extracts from the database each topic associated with the user identity data, and generates for each topic a second topic page containing the topic's click count and purchase count;
    the correlation calculation module calculates a second correlation coefficient of the topic of each second topic page based on that page's click count and purchase count;
    the web server selects, in descending order of second correlation coefficient, a number of topics equal to a second preset value as second display topics;
    the web page generation module further combines the topic pages containing the second display topics as the display content of the web page.
  14. The web page operating system of claim 9, characterized in that the page generation module of the web server further extracts from the database each topic associated with the subject content, and generates for each topic a third topic page containing the topic's click count and purchase count;
    the correlation calculation module calculates a third correlation coefficient of the topic of each third topic page based on that page's click count and purchase count;
    the web server selects, in descending order of third correlation coefficient, a number of topics equal to a third preset value as third display topics;
    the web page generation module further combines the topic pages containing the third display topics as the display content of the web page.
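
The weighted total correlation coefficient of claim 8 and the "select a preset number in descending order" step that recurs throughout the claims can be illustrated with a minimal sketch. This is only an illustration under stated assumptions, not the patented implementation: the function names `total_correlation` and `select_top`, the dictionary layout, and the sample numbers are all hypothetical. The keyword formulas of claims 2, 3, 10, and 11 are available only as figure images, so the sketch does not attempt to reproduce them.

```python
from typing import Dict, List


def total_correlation(h: float, p: float, a: float,
                      v1: float, v2: float, v3: float) -> float:
    """Rz(T) = V1*H(T) + V2*P(T) + V3*A(T, T') as written in claim 8.

    h, p, a stand for the fourth, fifth, and sixth correlation
    coefficients of one topic (hypothetical parameter names).
    """
    # Claim 8 requires non-negative weights that sum to 1.
    if min(v1, v2, v3) < 0 or abs(v1 + v2 + v3 - 1.0) > 1e-9:
        raise ValueError("weights must be non-negative and sum to 1")
    return v1 * h + v2 * p + v3 * a


def select_top(scores: Dict[str, float], preset_count: int) -> List[str]:
    """Return the preset_count names with the highest coefficients, highest first."""
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return [name for name, _ in ranked[:preset_count]]


# Hypothetical per-topic (H, P, A) coefficients, for illustration only.
topics = {
    "phones": (0.8, 0.2, 0.5),
    "laptops": (0.4, 0.9, 0.1),
    "cameras": (0.1, 0.1, 0.9),
}
totals = {name: total_correlation(h, p, a, 0.5, 0.3, 0.2)
          for name, (h, p, a) in topics.items()}
display_topics = select_top(totals, preset_count=2)
```

The same `select_top` helper would apply equally to keyword coefficients in steps S4, S63, S74, and S83, since each of those steps ranks by a coefficient and keeps a preset number of entries.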
CN 201310006595 2013-01-08 2013-01-08 Pages operating method and system CN103914490B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 201310006595 CN103914490B (en) 2013-01-08 2013-01-08 Pages operating method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 201310006595 CN103914490B (en) 2013-01-08 2013-01-08 Pages operating method and system

Publications (2)

Publication Number Publication Date
CN103914490A (en) 2014-07-09
CN103914490B (en) 2018-06-12

Family

ID=51040181

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 201310006595 CN103914490B (en) 2013-01-08 2013-01-08 Pages operating method and system

Country Status (1)

Country Link
CN (1) CN103914490B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105760393A (en) * 2014-12-17 2016-07-13 纽海信息技术(上海)有限公司 Webpage display method and system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101178728A (en) * 2007-11-21 2008-05-14 北京搜狗科技发展有限公司 Web side navigation method and system
CN101315623A (en) * 2007-05-29 2008-12-03 阿里巴巴集团控股有限公司 A text-theme recommended method and apparatus
CN101551806A (en) * 2008-04-03 2009-10-07 北京搜狗科技发展有限公司 Personalized website navigation method and system
CN102609869A (en) * 2012-02-03 2012-07-25 纽海信息技术(上海)有限公司 Commodity purchasing system and method

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8229942B1 (en) * 2007-04-17 2012-07-24 Google Inc. Identifying negative keywords associated with advertisements

Also Published As

Publication number Publication date Type
CN103914490A (en) 2014-07-09 application

Legal Events

Date Code Title Description
C06 Publication
C10 Entry into substantive examination
C41 Transfer of patent application or patent right or utility model
GR01