CN108093013B - Webpage data calculation method and server - Google Patents
Webpage data calculation method and server Download PDFInfo
- Publication number
- CN108093013B CN108093013B CN201611042516.8A CN201611042516A CN108093013B CN 108093013 B CN108093013 B CN 108093013B CN 201611042516 A CN201611042516 A CN 201611042516A CN 108093013 B CN108093013 B CN 108093013B
- Authority
- CN
- China
- Prior art keywords
- page
- target
- exit
- server
- session
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/14—Session management
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/958—Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Information Transfer Between Computers (AREA)
Abstract
Description
技术领域technical field
本发明涉及计算机应用领域,尤其涉及一种网页数据计算方法及服务器。The invention relates to the field of computer applications, in particular to a web page data calculation method and a server.
背景技术Background technique
在网页分析中,网页退出率可以评价网站某页面的用户体验效果,被广泛应用在网站优化、网站分析领域中。In web page analysis, the page exit rate can evaluate the user experience effect of a certain page of a website, and is widely used in the fields of website optimization and website analysis.
现有网页退出率,往往被定义为用户退出网站的次数除以用户进入浏览网站的次数的百分比。Existing page exit rate, often defined as the percentage of times users exit a website divided by the number of times users enter a website.
但是现有技术对网页退出率的计算比较简单,计算的过程忽略了很多影响用户退出行为的因素。例如页面性质,同一网站上不同页面具有不同重要级别,比如订单页比联系方式页重要,用户访问了部分重要页面后,相应的退出的可能性会增高,现有技术的计算方式忽略了这些因素,计算出来的退出率并不能合理的评价用户在页面中的退出行为。However, the calculation of the exit rate of the webpage in the prior art is relatively simple, and the calculation process ignores many factors that affect the user's exit behavior. For example, the nature of pages, different pages on the same website have different levels of importance. For example, the order page is more important than the contact page. After the user visits some important pages, the corresponding possibility of exiting will increase. The calculation method of the existing technology ignores these factors. , the calculated exit rate cannot reasonably evaluate the user's exit behavior on the page.
发明内容SUMMARY OF THE INVENTION
本发明实施例提供了一种网页数据计算方法及服务器,用于准确评价用户在页面中的退出行为,为网页建设提供更有效的数据。Embodiments of the present invention provide a web page data calculation method and server, which are used to accurately evaluate a user's exit behavior in a web page and provide more effective data for web page construction.
本发明实施例提供了一种网页数据计算方法,包括:An embodiment of the present invention provides a web page data calculation method, including:
服务器获取网站对应的预置数量的目标会话,每个目标会话包含一个或多个目标页面;The server obtains a preset number of target sessions corresponding to the website, and each target session includes one or more target pages;
所述服务器根据页面浏览顺序及页面重要程度确定所述目标会话中每个目标页面对应的退出权重;The server determines the exit weight corresponding to each target page in the target session according to the page browsing sequence and the page importance;
所述服务器根据所述退出权重计算所述目标页面的退出率。The server calculates the exit rate of the target page according to the exit weight.
可选地,所述服务器根据页面浏览顺序及页面重要程度确定所述预置数量的目标会话中每个目标页面对应的退出权重包括:Optionally, the server determines the exit weight corresponding to each target page in the preset number of target sessions according to the page browsing order and page importance, including:
针对每个目标会话,所述服务器通过如下方式计算每个目标页面在该目标会话中对应的退出权重:For each target session, the server calculates the corresponding exit weight of each target page in the target session in the following manner:
(1-a)X-Y(1-b)Y;(1-a) XY (1-b) Y ;
其中,所述a为第一退出系数,所述第一退出系数为预设的一般页面对应的退出几率系数,所述一般页面为重要程度为一般的页面,所述b为第二退出系数,所述第二退出系数为预设的重要页面退出几率系数,所述重要页面为重要程度为重要的页面,所述X为该目标会话中浏览到该目标页面时对应的总页面浏览量,所述Y为所述总页面浏览量中重要页面的浏览量,所述X-Y为所述总页面流量中一般页面的浏览量。Wherein, the a is the first exit coefficient, the first exit coefficient is the exit probability coefficient corresponding to the preset general page, the general page is the page whose importance is general, and the b is the second exit coefficient, The second exit coefficient is a preset important page exit probability coefficient, the important page is a page with an important degree of importance, and the X is the total page views corresponding to the target page in the target session, so The Y is the page views of important pages in the total page views, and the X-Y is the page views of general pages in the total page traffic.
可选地,所述服务器根据所述退出权重计算所述目标页面的退出率包括:Optionally, calculating, by the server, the exit rate of the target page according to the exit weight includes:
所述服务器通过如下方式计算退出率GA:The server calculates the exit rate GA by:
GA=M/N*100%;GA=M/N*100%;
其中,所述M为在所述目标会话中作为退出页的目标页面对应的退出权重之和,所述N为所述目标会话中每个目标页面对应的退出权重之和。Wherein, the M is the sum of the logout weights corresponding to the target pages in the target session as the logout pages, and the N is the sum of the logout weights corresponding to each target page in the target session.
可选地,所述服务器获取预置数量的目标会话之前包括:Optionally, before the server acquires a preset number of target sessions, it includes:
所述服务器将所述网站中的每个页面按照网页内容的重要程度分为一般页面及重要页面。The server divides each page in the website into a general page and an important page according to the importance of the content of the web page.
可选地,所述服务器获取预置数量的目标会话包括:Optionally, the server acquiring a preset number of target sessions includes:
所述服务器从所述网站对应的历史会话记录中获取预置数量的目标会话。The server acquires a preset number of target sessions from historical session records corresponding to the website.
本发明实施例还提供了一种服务器,包括:The embodiment of the present invention also provides a server, including:
获取模块,用于获取网站对应的预置数量的目标会话,每个目标会话包含一个或多个目标页面;an acquisition module, used to acquire a preset number of target sessions corresponding to the website, each target session contains one or more target pages;
确定模块,用于根据页面浏览顺序及页面重要程度确定所述目标会话中每个目标页面对应的退出权重;A determination module, configured to determine the exit weight corresponding to each target page in the target session according to the page browsing order and page importance;
计算模块,用于根据所述退出权重计算所述目标页面的退出率。A calculation module, configured to calculate the exit rate of the target page according to the exit weight.
可选地,所述确定模块包括:Optionally, the determining module includes:
第一计算单元,用于针对每个目标会话,通过如下方式计算每个目标页面在该目标会话中对应的退出权重:The first calculation unit is configured to, for each target session, calculate the exit weight corresponding to each target page in the target session in the following manner:
(1-a)X-Y(1-b)Y;(1-a) XY (1-b) Y ;
其中,所述a为第一退出系数,所述第一退出系数为预设的一般页面对应的退出几率系数,所述一般页面为重要程度为一般的页面,所述b为第二退出系数,所述第二退出系数为预设的重要页面退出几率系数,所述重要页面为重要程度为重要的页面,所述X为该目标会话中浏览到该目标页面时对应的总页面浏览量,所述Y为所述总页面浏览量中重要页面的浏览量,所述X-Y为所述总页面流量中一般页面的浏览量。Wherein, the a is the first exit coefficient, the first exit coefficient is the exit probability coefficient corresponding to the preset general page, the general page is the page whose importance is general, and the b is the second exit coefficient, The second exit coefficient is a preset important page exit probability coefficient, the important page is a page with an important degree of importance, and the X is the total page views corresponding to the target page in the target session, so The Y is the page views of important pages in the total page views, and the X-Y is the page views of general pages in the total page traffic.
可选地,所述计算模块包括:Optionally, the computing module includes:
第二计算单元,用于通过如下方式计算退出率GA:The second calculation unit is used to calculate the exit rate GA in the following manner:
GA=M/N*100%;GA=M/N*100%;
其中,所述M为在所述目标会话中作为退出页的目标页面对应的退出权重之和,所述N为所述目标会话中每个目标页面对应的退出权重之和。Wherein, the M is the sum of the logout weights corresponding to the target pages in the target session as the logout pages, and the N is the sum of the logout weights corresponding to each target page in the target session.
可选地,所述服务器还包括:Optionally, the server further includes:
划分模块,用于将所述网站中的每个页面按照网页内容的重要程度分为一般页面及重要页面。The dividing module is used for dividing each page in the website into a general page and an important page according to the importance of the content of the web page.
可选地,所述获取模块包括:Optionally, the obtaining module includes:
获取单元,用于从所述网站对应的历史会话记录中获取预置数量的目标会话。an acquiring unit, configured to acquire a preset number of target sessions from the historical session records corresponding to the website.
从以上技术方案可以看出,本发明实施例具有以下优点:As can be seen from the above technical solutions, the embodiments of the present invention have the following advantages:
本发明实施例中,服务器获取预置数量的包含有目标页面的目标会话,根据目标页面在目标会话中被浏览的顺序及目标页面的重要程度,确定目标页面退出的可能性,即退出权重,再根据该退出权重计算出这个目标页面的退出率。本方案能够结合页面自身的重要程度,以及页面在会话中的访问深度计算页面的退出率,从而能够更准确的评价用户在页面中的退出行为,为网页建设提供更有效的数据。In the embodiment of the present invention, the server obtains a preset number of target sessions containing target pages, and determines the possibility of the target page exiting, that is, the exit weight, according to the order in which the target pages are browsed in the target session and the importance of the target pages. Then calculate the exit rate of this target page according to the exit weight. This solution can calculate the exit rate of the page based on the importance of the page itself and the access depth of the page in the session, so as to more accurately evaluate the user's exit behavior on the page and provide more effective data for web page construction.
附图说明Description of drawings
图1为本发明实施例中网页数据计算方法的一个实施例流程图;1 is a flowchart of an embodiment of a method for calculating web page data in an embodiment of the present invention;
图2为本发明实施例中服务器的一个实施例示意图;FIG. 2 is a schematic diagram of an embodiment of a server in an embodiment of the present invention;
图3为本发明实施例中服务器的另一实施例示意图。FIG. 3 is a schematic diagram of another embodiment of a server in an embodiment of the present invention.
具体实施方式Detailed ways
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, but not all of the embodiments.
本发明的说明书和权利要求书及上述附图中的术语“第一”、“第二”、“第三”“第四”等(如果存在)是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便这里描述的本发明的实施例例如能够以除了在这里图示或描述的那些以外的顺序实施。此外,术语“包括”和“具有”以及他们的任何变形,意图在于覆盖不排他的包含,例如,包含了一系列步骤或单元的过程、方法、系统、产品或设备不必限于清楚地列出的那些步骤或单元,而是可包括没有清楚地列出的或对于这些过程、方法、产品或设备固有的其它步骤或单元。The terms "first", "second", "third", "fourth", etc. (if present) in the description and claims of the present invention and the above-mentioned drawings are used to distinguish similar objects and are not necessarily used to Describe a particular order or sequence. It is to be understood that the data so used are interchangeable under appropriate circumstances such that the embodiments of the invention described herein can, for example, be practiced in sequences other than those illustrated or described herein. Furthermore, the terms "comprising" and "having" and any variations thereof, are intended to cover non-exclusive inclusion, for example, a process, method, system, product or device comprising a series of steps or units is not necessarily limited to those expressly listed Rather, those steps or units may include other steps or units not expressly listed or inherent to these processes, methods, products or devices.
本发明实施例提供了一种网页数据计算方法及服务器,用于准确的评价用户在页面中的退出行为,为网页建设提供更有效的数据。Embodiments of the present invention provide a web page data calculation method and server, which are used to accurately evaluate a user's exit behavior on a web page and provide more effective data for web page construction.
下面先介绍本发明实施例中的网页数据计算方法,请参阅图1,本发明实施例中网页数据计算方法的一个实施例包括:The following first introduces the web page data calculation method in the embodiment of the present invention. Please refer to FIG. 1. An embodiment of the web page data calculation method in the embodiment of the present invention includes:
101、服务器获取网站对应的预置数量的目标会话;101. The server obtains a preset number of target sessions corresponding to the website;
当服务器想要知道该目标页面的退出率时,服务器获取该网站对应的预置数量的目标会话,其中,会话指的是用户开一个浏览器,访问某一个网站,在这个网站点击多个超链接,即浏览这个网站的多个页面,访问服务器的多个网站资源,然后关闭浏览器,整个过程称之为一个会话。而本发明实施例中目标会话指的是包含一个或多个目标页面的会话,即目标会话的过程中用户至少访问了一次目标页面。When the server wants to know the exit rate of the target page, the server obtains a preset number of target sessions corresponding to the website, where a session means that a user opens a browser, visits a certain website, and clicks multiple hypervisors on this website. Linking, that is, browsing multiple pages of this website, accessing multiple website resources on the server, and then closing the browser, the whole process is called a session. In the embodiment of the present invention, a target session refers to a session including one or more target pages, that is, a user visits the target page at least once during the target session.
102、服务器根据页面浏览顺序及页面重要程度确定目标会话中每个目标页面对应的退出权重;102. The server determines the exit weight corresponding to each target page in the target session according to the page browsing sequence and the page importance;
服务器获取网站对应的预置数量的目标会话后,针对每一个目标会话,根据该目标会话中各个页面的浏览顺序以及各个页面的重要程度,确定该目标会话中每个目标页面对应的退出权重。After acquiring the preset number of target sessions corresponding to the website, the server determines, for each target session, the exit weight corresponding to each target page in the target session according to the browsing order of each page in the target session and the importance of each page.
103、服务器根据该退出权重计算目标页面的退出率。103. The server calculates the exit rate of the target page according to the exit weight.
服务器确定预置数量的目标会话中每个目标页面对应的退出权重后,根据该退出权重计算目标页面的退出率。After determining the exit weight corresponding to each target page in the preset number of target sessions, the server calculates the exit rate of the target page according to the exit weight.
本发明实施例中,服务器获取预置数量的包含有目标页面的目标会话,根据目标页面在目标会话中被浏览的顺序及目标页面的重要程度,确定目标页面退出的可能性,即退出权重,再根据该退出权重计算出这个目标页面的退出率。本方案能够结合页面自身的重要程度,以及页面在会话中的访问深度计算页面的退出率,从而能够更准确的评价用户在页面中的退出行为,为网页建设提供更有效的数据。In the embodiment of the present invention, the server obtains a preset number of target sessions containing target pages, and determines the possibility of the target page exiting, that is, the exit weight, according to the order in which the target pages are browsed in the target session and the importance of the target pages. Then calculate the exit rate of this target page according to the exit weight. This solution can calculate the exit rate of the page based on the importance of the page itself and the access depth of the page in the session, so as to more accurately evaluate the user's exit behavior on the page and provide more effective data for web page construction.
基于上述图1对应的实施例,在本发明实施例提供的网页数据计算方法的另一实施例中,服务器按照网页重要程度将该网站中的网页划分为一般页面和重要页面两种,服务器可以通过如下方式确定目标会话中每个目标页面对应的退出权重:Based on the above-mentioned embodiment corresponding to FIG. 1 , in another embodiment of the webpage data calculation method provided by the embodiment of the present invention, the server divides the webpages in the website into general pages and important pages according to the importance of the webpages. The server may Determine the exit weight corresponding to each target page in the target session as follows:
服务器针对每个目标会话,通过如下方式计算每个目标页面在该目标会话中对应的退出权重:For each target session, the server calculates the corresponding exit weight of each target page in the target session in the following way:
(1-a)X-Y(1-b)Y;(1-a) XY (1-b) Y ;
其中,a为第一退出系数,第一退出系数为预设的一般页面对应的退出几率系数,一般页面为重要程度为一般的页面;b为第二退出系数,第二退出系数为预设的重要页面退出几率系数,重要页面为重要程度为重要的页面;X为该目标会话中浏览到该目标页面时对应的总页面浏览量;Y为该总页面浏览量中重要页面的浏览量;X-Y为该总页面流量中一般页面的浏览量。Among them, a is a first exit coefficient, the first exit coefficient is a preset exit probability coefficient corresponding to a general page, and a general page is a page with a general degree of importance; b is a second exit coefficient, and the second exit coefficient is preset. The coefficient of exit probability of important pages, important pages are pages with an important degree of importance; X is the total page views corresponding to the target page in the target session; Y is the page views of important pages in the total page views; X-Y It is the number of pageviews of general pages in the total page traffic.
需要说明的是,本发明实施例中a和b在0到1之间,且a小于b。这样的设定,是因为一般页面的重要程度低于重要页面,那么用户在浏览完一般页面后,获得的信息比较不重要,很可能还会继续访问其他页面,以获得更重要的信息。相反地,用户在浏览完重要页面后,很可能已经获得了想要知道的信息,从而退出网站。因此一般页面的退出几率小于重要页面的退出几率。It should be noted that, in the embodiment of the present invention, a and b are between 0 and 1, and a is smaller than b. This setting is because the importance of the general page is lower than that of the important page, so the information obtained by the user after browsing the general page is less important, and it is likely to continue to visit other pages to obtain more important information. On the contrary, after browsing important pages, users may have obtained the information they want to know and exit the website. Therefore, the exit probability of general pages is less than that of important pages.
还需要说明的是,本发明实施例中X为该目标会话中浏览到该目标页面时对应的总页面浏览量,指的是该目标会话中访客浏览到该目标页面时,所浏览过的页面的总数,能够反映访客访问的深度。It should also be noted that, in the embodiment of the present invention, X is the total page views corresponding to the target page in the target session, which refers to the pages browsed by the visitor when the target page is browsed in the target session. The total number of , which can reflect the depth of visitor visits.
本发明实施例提供了一种服务器确定目标会话中每个目标页面对应的退出权重的方式,提高了方案的可实现性。The embodiment of the present invention provides a way for the server to determine the exit weight corresponding to each target page in the target session, which improves the practicability of the solution.
基于上述多个实施例中任意一个实施例,在本发明实施例提供的网页数据计算方法的另一实施例中,服务器可以通过如下方式计算目标页面的退出率GA:Based on any one of the foregoing embodiments, in another embodiment of the webpage data calculation method provided by the embodiment of the present invention, the server may calculate the exit rate GA of the target page in the following manner:
GA=M/N*100%;GA=M/N*100%;
其中,M为在目标会话中作为退出页的目标页面对应的退出权重之和,N为目标会话中每个目标页面对应的退出权重之和。Wherein, M is the sum of the logout weights corresponding to the target pages serving as the logout pages in the target session, and N is the sum of the logout weights corresponding to each target page in the target session.
本发明实施例提供了一种服务器计算退出率的具体方式,提高了方案的可实现性。The embodiment of the present invention provides a specific way for the server to calculate the withdrawal rate, which improves the practicability of the solution.
基于上述多个实施例中任意一个实施例,在本发明实施例提供的网页数据计算方法的另一实施例中,服务器在获取预置数量的目标会话之前可以执行如下步骤:Based on any one of the foregoing embodiments, in another embodiment of the web page data calculation method provided by the embodiment of the present invention, the server may perform the following steps before acquiring a preset number of target sessions:
服务器将网站中的每个页面按照页面内容的重要程度分为一般页面及重要页面。The server divides each page in the website into a general page and an important page according to the importance of the page content.
应理解,除了页面内容的重要程度,服务器还可以按照页面属性等其他特征将页面划分为一般页面和重要页面。It should be understood that, in addition to the importance of page content, the server may also divide pages into general pages and important pages according to other features such as page attributes.
本发明实施例提供了多种服务器划分一般页面及重要页面的方式,提高了方案的灵活性。The embodiments of the present invention provide a variety of ways for the server to divide general pages and important pages, thereby improving the flexibility of the solution.
基于上述多个实施例中任意一个实施例,在本发明实施例提供的网页数据计算方法的另一实施例中,服务器可以通过如下方式获取预置数量的目标会话:Based on any one of the foregoing embodiments, in another embodiment of the web page data calculation method provided by the embodiment of the present invention, the server may acquire a preset number of target sessions in the following manner:
服务器从网站对应的历史会话记录中获取预置数量的目标会话。The server obtains a preset number of target sessions from the historical session records corresponding to the website.
应理解,除了从历史会话记录中获取目标会话,服务器还可以从正在进行的网站对应的会话中获取目标会话,还可以通过其他途径获取目标会话,具体此处不作限定。It should be understood that, in addition to acquiring the target session from historical session records, the server may also acquire the target session from the session corresponding to the ongoing website, and may also acquire the target session through other means, which is not specifically limited here.
本发明实施例提供了多种服务器获取目标会话的方式,提高了方案的灵活性。The embodiments of the present invention provide multiple ways for the server to acquire the target session, which improves the flexibility of the solution.
为了便于理解,下面以一具体应用场景对本发明实施例中的网页数据计算方法进行说明:For ease of understanding, the web page data calculation method in the embodiment of the present invention is described below with a specific application scenario:
网站A包含有页面W、F、N、E和D,服务器根据网页内容的重要程度将W和N划分为重要页面,将D、E和F划分为一般页面。服务器设定一般页面对应的退出几率系数(第一退出系数)为0.2,重要页面对应的退出几率系数(第二退出系数)为0.3。现在服务器想要知道页面D(目标页面)的退出率时多少,首先服务器从历史会话中获取四个(预置数量)会话,分别为:Website A includes pages W, F, N, E, and D. The server divides W and N into important pages and D, E, and F into general pages according to the importance of the content of the webpage. The server sets the exit probability coefficient (first exit coefficient) corresponding to general pages to 0.2, and the exit probability coefficient (second exit coefficient) corresponding to important pages to 0.3. Now the server wants to know the exit rate of page D (target page). First, the server obtains four (preset number) sessions from historical sessions, which are:
会话一:W—F—N—F—W;Session 1: W—F—N—F—W;
会话二:N—W—E—W;Session 2: N—W—E—W;
会话三:F—N—N—D—E—W;Session 3: F-N-N-D-E-W;
会话四:N—W—E。Session Four: N-W-E.
为了便于描述,下面将会话一种第一次浏览的W页面称为W1,第二次浏览的W页面称为W2,会话二种第一次浏览的W页面称为W3,会话二中第二次浏览的W页面称为W4,会话三中浏览的W页面称为W5,会话四种浏览的W页面称为W6。其中W1、W4和W5分别为会话一、二和三的退出页面For the convenience of description, the W page browsed for the first time in session one is called W1, the W page browsed for the second time is called W2, the W page browsed for the first time in session two is called W3, and the second W page in session two is called W3. The W page browsed for the second time is called W4, the W page browsed in the third session is called W5, and the W page browsed in the fourth session is called W6. where W1, W4 and W5 are the exit pages of session one, two and three respectively
对于W1,在会话一中浏览到W1时对应的总页面浏览量为1,即X=1;W1为重要页面,故Y=1,Y-X=0。则W1对应的退出权重为:For W1, the total page views corresponding to W1 in session 1 is 1, that is, X=1; W1 is an important page, so Y=1, Y-X=0. Then the exit weight corresponding to W1 is:
(1-a)X-Y(1-b)Y=(1-0.2)0(1-0.3)1=0.7;(1-a) XY (1-b) Y = (1-0.2) 0 (1-0.3) 1 = 0.7;
对于W2,在会话一中浏览到W2时对应的总页面浏览量为5,即X=5;其中重要页面的浏览量为3(包括两次W以及一次N),故Y=3,Y-X=2。则W2对应的退出权重为:For W2, the total page views corresponding to W2 in session 1 is 5, that is, X=5; the page views of important pages are 3 (including two W and one N), so Y=3, Y-X= 2. Then the exit weight corresponding to W2 is:
(1-a)X-Y(1-b)Y=(1-0.2)2(1-0.3)3=0.21952;(1-a) XY (1-b) Y = (1-0.2) 2 (1-0.3) 3 = 0.21952;
对于W3,在会话二中浏览到W3时对应的总页面浏览量为2,即X=2;其中重要页面的浏览量为2(包括一次W以及一次N),故Y=2,Y-X=0。则W3对应的退出权重为:For W3, the total page views corresponding to W3 in session 2 is 2, that is, X=2; the page views of important pages are 2 (including one W and one N), so Y=2, Y-X=0 . Then the exit weight corresponding to W3 is:
(1-a)X-Y(1-b)Y=(1-0.2)0(1-0.3)2=0.49;(1-a) XY (1-b) Y = (1-0.2) 0 (1-0.3) 2 = 0.49;
对于W4,在会话二中浏览到W4时对应的总页面浏览量为4,即X=4;其中重要页面的浏览量为3(包括两次W以及一次N),故Y=3,Y-X=1。则W4对应的退出权重为:For W4, the total page views corresponding to W4 in session 2 is 4, that is, X=4; the page views of important pages are 3 (including two W and one N), so Y=3, Y-X= 1. Then the exit weight corresponding to W4 is:
(1-a)X-Y(1-b)Y=(1-0.2)1(1-0.3)3=0.2744;(1-a) XY (1-b) Y = (1-0.2) 1 (1-0.3) 3 = 0.2744;
对于W5,在会话三中浏览到W5时对应的总页面浏览量为6,即X=6;其中重要页面的浏览量为3(包括一次W以及两次N),故Y=3,Y-X=3。则W5对应的退出权重为:For W5, the total page views corresponding to W5 in session 3 is 6, that is, X=6; the page views of important pages are 3 (including one W and two N), so Y=3, Y-X= 3. Then the exit weight corresponding to W5 is:
(1-a)X-Y(1-b)Y=(1-0.2)3(1-0.3)3=0.175616;(1-a) XY (1-b) Y = (1-0.2) 3 (1-0.3) 3 = 0.175616;
对于W6,在会话四中浏览到W6时对应的总页面浏览量为2,即X=2;其中重要页面的浏览量为2(包括一次W以及一次N),故Y=2,Y-X=0。则W5对应的退出权重为:For W6, the total page views corresponding to W6 in session 4 is 2, that is, X=2; the page views of important pages are 2 (including one W and one N), so Y=2, Y-X=0 . Then the exit weight corresponding to W5 is:
(1-a)X-Y(1-b)Y=(1-0.2)0(1-0.3)2=0.49。(1-a) XY (1-b) Y = (1-0.2) 0 (1-0.3) 2 =0.49.
为了便于理解,请参阅下表1:For ease of understanding, please refer to Table 1 below:
服务器计算出会话一至会话四(目标会话)中各个W页面(即W1至W6)对应的退出权重之后,计算会话一至会话四中作为退出页的W(即W2、W4和W5)对应的退出权重之和M=0.21952+0.2744+0.175616=0.669536,会话一至会话四中每个W页面(即W1至W6)对应的退出权重之和N=0.7+0.21952+0.49+0.2744+0.175616+0.49=2.349536,最后计算页面W的退出率GA=M/N*100%=0.669536/2.349536*100%=28.50%。After the server calculates the exit weights corresponding to the W pages (ie W1 to W6) in the sessions 1 to 4 (target sessions), it calculates the exit weights corresponding to the W (ie W2, W4 and W5) that are the exit pages in the sessions 1 to 4 The sum M=0.21952+0.2744+0.175616=0.669536, the sum of the exit weights corresponding to each W page (ie, W1 to W6) in sessions one to four sessions N=0.7+0.21952+0.49+0.2744+0.175616+0.49=2.349536, and finally Calculate the exit rate of page W GA=M/N*100%=0.669536/2.349536*100%=28.50%.
以会话二为例,第一次访问到页面W(即W3)时,访客获取了一定信息量,故这里的W页面的退出权重均为0.49,而再后面会话二再访问了页面E和页面W(即W4),那么这次页面W访问后,访客的信息获取比之前更多了,那么访客退出的可能性也会更高。不应该是原退出率那样,将这两者的退出可能性都认为是1。Taking session 2 as an example, when visiting page W (ie W3) for the first time, the visitor obtained a certain amount of information, so the exit weight of page W here is 0.49, and then session 2 visited page E and page again. W (ie W4), then after the page W visits this time, the visitor obtains more information than before, so the possibility of the visitor exiting will be higher. It should not be the same as the original exit rate, and the exit probability of both should be regarded as 1.
以会话三和会话四为例,页面W之前访问的页面数也会有所影响。在W页面之前,会话四明显只获取了N和W的信息,而会话三则获取了F、N、D、E和W的信息,明显会话三的信息量要大很多,那么当会话三达到W页面时,其退出的概率也会增大,故退出权重应该减少。Taking session 3 and session 4 as examples, the number of pages visited before page W will also have an impact. Before the W page, session 4 obviously only obtained the information of N and W, while session 3 obtained the information of F, N, D, E and W. Obviously, the amount of information of session 3 is much larger, then when session 3 reaches When the page is W, the probability of its exit will also increase, so the exit weight should be reduced.
上面介绍了本发明实施例中的的网页数据计算方法,下面对本发明实施例中的服务器进行介绍,请参阅图2,本发明实施例中服务器的一个实施例包括:The web page data calculation method in the embodiment of the present invention is described above, and the server in the embodiment of the present invention is introduced below. Referring to FIG. 2, an embodiment of the server in the embodiment of the present invention includes:
获取模块201,用于获取网站对应的预置数量的目标会话,每个目标会话包含一个或多个目标页面;an
确定模块202,用于根据页面浏览顺序及页面重要程度确定目标会话中每个目标页面对应的退出权重;A
计算模块203,用于根据退出权重计算目标页面的退出率。The
本发明实施例中,获取模块201获取预置数量的包含有目标页面的目标会话,确定模块202根据目标页面在目标会话中被浏览的顺序及目标页面的重要程度,确定目标页面退出的可能性,即退出权重,计算模块203再根据该退出权重计算出这个目标页面的退出率。本方案能够结合页面自身的重要程度,以及页面在会话中的访问深度计算页面的退出率,从而能够更准确的评价用户在页面中的退出行为,为网页建设提供更有效的数据。In this embodiment of the present invention, the obtaining
为了便于理解,下面对本发明实施例中的服务器进行详细描述,请参阅图3,本发明实施例中服务器的另一实施例包括:For ease of understanding, the server in the embodiment of the present invention is described in detail below, and please refer to FIG. 3 . Another embodiment of the server in the embodiment of the present invention includes:
获取模块301,用于获取网站对应的预置数量的目标会话,每个目标会话包含一个或多个目标页面;an obtaining
确定模块302,用于根据页面浏览顺序及页面重要程度确定目标会话中每个目标页面对应的退出权重;A
计算模块303,用于根据退出权重计算目标页面的退出率;A
其中,确定模块302包括:Wherein, the determining
第一计算单元3021,用于针对每个目标会话,通过如下方式计算每个目标页面在该目标会话中对应的退出权重:The
(1-a)X-Y(1-b)Y;(1-a) XY (1-b) Y ;
其中,a为第一退出系数,第一退出系数为预设的一般页面对应的退出几率系数,一般页面为重要程度为一般的页面,b为第二退出系数,第二退出系数为预设的重要页面退出几率系数,重要页面为重要程度为重要的页面,X为该目标会话中浏览到该目标页面时对应的总页面浏览量,Y为总页面浏览量中重要页面的浏览量,X-Y为总页面流量中一般页面的浏览量。Among them, a is a first exit coefficient, the first exit coefficient is a preset exit probability coefficient corresponding to a general page, a general page is a page whose importance is general, b is a second exit coefficient, and the second exit coefficient is a preset The exit probability coefficient of important pages, important pages are pages with an important degree of importance, X is the total page views corresponding to the target page in the target session, Y is the page views of important pages in the total page views, and X-Y is The number of page views in general out of total page traffic.
计算模块303包括:The
第二计算单元3031,用于通过如下方式计算退出率GA:The second calculation unit 3031 is used to calculate the withdrawal rate GA in the following manner:
GA=M/N*100%;GA=M/N*100%;
其中,M为在目标会话中作为退出页的目标页面对应的退出权重之和,N为目标会话中每个目标页面对应的退出权重之和。Wherein, M is the sum of the logout weights corresponding to the target pages serving as the logout pages in the target session, and N is the sum of the logout weights corresponding to each target page in the target session.
可选地,在本发明实施例中,服务器还可以包括:Optionally, in this embodiment of the present invention, the server may further include:
划分模块304,用于将该网站中的每个页面按照网页内容的重要程度分为一般页面及重要页面。The
可选地,在本发明实施例中,获取模块301可以包括:Optionally, in this embodiment of the present invention, the obtaining
获取单元3011,用于从该网站对应的历史会话记录中获取预置数量的目标会话。The obtaining
本发明实施例中,获取模块301获取预置数量的包含有目标页面的目标会话,确定模块302根据目标页面在目标会话中被浏览的顺序及目标页面的重要程度,确定目标页面退出的可能性,即退出权重,计算模块303再根据该退出权重计算出这个目标页面的退出率。本方案能够结合页面自身的重要程度,以及页面在会话中的访问深度计算页面的退出率,从而能够更准确的评价用户在页面中的退出行为,为网页建设提供更有效的数据。In the embodiment of the present invention, the obtaining
其次,本发明实施例提供了一种确定模块302确定退出权重的具体方式,提高了方案的可实现性。Secondly, the embodiment of the present invention provides a specific way for the
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统,装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that, for the convenience and brevity of description, the specific working process of the system, device and unit described above may refer to the corresponding process in the foregoing method embodiments, which will not be repeated here.
在本申请所提供的几个实施例中,应该理解到,所揭露的系统,装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed system, apparatus and method may be implemented in other manners. For example, the apparatus embodiments described above are only illustrative. For example, the division of the units is only a logical function division. In actual implementation, there may be other division methods. For example, multiple units or components may be combined or Can be integrated into another system, or some features can be ignored, or not implemented. On the other hand, the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of devices or units, and may be in electrical, mechanical or other forms.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this embodiment.
另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit. The above-mentioned integrated units may be implemented in the form of hardware, or may be implemented in the form of software functional units.
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(英文全称:Read-OnlyMemory,英文缩写:ROM)、随机存取存储器(英文全称:Random AccessMemory,英文缩写:RAM)、磁碟或者光盘等各种可以存储程序代码的介质。The integrated unit, if implemented in the form of a software functional unit and sold or used as an independent product, may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention is essentially or the part that contributes to the prior art, or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium , including several instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the methods described in the various embodiments of the present invention. The aforementioned storage medium includes: U disk, mobile hard disk, read-only memory (full English name: Read-Only Memory, English abbreviation: ROM), random access memory (English full name: Random Access Memory, English abbreviation: RAM), disk or Various media that can store program codes, such as optical discs.
以上所述,以上实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的精神和范围。As mentioned above, the above embodiments are only used to illustrate the technical solutions of the present invention, but not to limit them; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand: The technical solutions described in the embodiments are modified, or some technical features thereof are equivalently replaced; and these modifications or replacements do not make the essence of the corresponding technical solutions depart from the spirit and scope of the technical solutions of the embodiments of the present invention.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611042516.8A CN108093013B (en) | 2016-11-23 | 2016-11-23 | Webpage data calculation method and server |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611042516.8A CN108093013B (en) | 2016-11-23 | 2016-11-23 | Webpage data calculation method and server |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108093013A CN108093013A (en) | 2018-05-29 |
CN108093013B true CN108093013B (en) | 2020-06-16 |
Family
ID=62171013
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201611042516.8A Expired - Fee Related CN108093013B (en) | 2016-11-23 | 2016-11-23 | Webpage data calculation method and server |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108093013B (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110489684A (en) * | 2019-08-16 | 2019-11-22 | 腾讯科技(深圳)有限公司 | For showing method, unit and the storage medium of browser page |
CN112036666B (en) * | 2020-09-29 | 2024-03-22 | 中移(杭州)信息技术有限公司 | Binding flow evaluation method, device, server and storage medium |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102364468A (en) * | 2011-09-29 | 2012-02-29 | 北京亿赞普网络技术有限公司 | User network behavior analysis method, device and system |
US20140280699A1 (en) * | 2013-03-13 | 2014-09-18 | General Instrument Corporation | Method and apparatus for enabling discovery and communications between unrelated browser sessions |
CN104408143A (en) * | 2014-12-01 | 2015-03-11 | 北京国双科技有限公司 | Webpage data monitoring method and device |
CN105468729A (en) * | 2015-11-23 | 2016-04-06 | 深圳大粤网络视界有限公司 | Internet mobile vertical search engine |
-
2016
- 2016-11-23 CN CN201611042516.8A patent/CN108093013B/en not_active Expired - Fee Related
Non-Patent Citations (1)
Title |
---|
网站数据分析中的误区探讨;马达;《SILICON VALLEY》;20131130;全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN108093013A (en) | 2018-05-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10747771B2 (en) | Method and apparatus for determining hot event | |
US20140310691A1 (en) | Method and device for testing multiple versions | |
JP6266656B2 (en) | System and method for resizing an image | |
US20160070691A1 (en) | Method and system for auto-populating electronic forms | |
CN106469376B (en) | Risk control method and equipment | |
JP5438087B2 (en) | Advertisement distribution device | |
JP7119124B2 (en) | Action indicator for search behavior output element | |
US20100082637A1 (en) | Web Page and Web Site Importance Estimation Using Aggregate Browsing History | |
JP6689955B2 (en) | Machine learning based identification of broken network connections | |
US20130198240A1 (en) | Social Network Analysis | |
US9866454B2 (en) | Generating anonymous data from web data | |
US20160173637A1 (en) | Cached Data Detection | |
RU2634218C2 (en) | Method for determining sequence of web browsing and server used | |
CN104361092A (en) | Searching method and device | |
CN104090908B (en) | Count mean residence time, the method and apparatus of web site contents popularization of page group | |
Ghasemisharif et al. | Speedreader: Reader mode made fast and private | |
AU2019200084A1 (en) | Automatically generating meaningful user segments | |
CN108093013B (en) | Webpage data calculation method and server | |
CN103049497A (en) | Method and device for website navigation | |
US20230114228A1 (en) | Method for arbitrating encrypted electronic transactions among intermediary and authoring users only when an interaction occurs between authoring and candidate users who was exposed by the intermediary user to data published by authoring user | |
CN104951476B (en) | Method and device for determining link level in website | |
WO2018054352A1 (en) | Item set determination method, apparatus, processing device, and storage medium | |
JP5781242B2 (en) | Web tracking prevention | |
CN103971326B (en) | Personalized caching method and device for map tiles | |
CN103685198A (en) | Method and device for interaction of user data |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information |
Address after: 100080 No. 401, 4th Floor, Haitai Building, 229 North Fourth Ring Road, Haidian District, Beijing Applicant after: BEIJING GRIDSUM TECHNOLOGY Co.,Ltd. Address before: 100086 Cuigong Hotel, 76 Zhichun Road, Shuangyushu District, Haidian District, Beijing Applicant before: BEIJING GRIDSUM TECHNOLOGY Co.,Ltd. |
|
CB02 | Change of applicant information | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20200616 |
|
CF01 | Termination of patent right due to non-payment of annual fee |