CN105989002A - Webpage data query method and device, and method and device for establishing webpage jump path database - Google Patents

Webpage data query method and device, and method and device for establishing webpage jump path database Download PDF

Info

Publication number
CN105989002A
CN105989002A CN201510041278.8A CN201510041278A CN105989002A CN 105989002 A CN105989002 A CN 105989002A CN 201510041278 A CN201510041278 A CN 201510041278A CN 105989002 A CN105989002 A CN 105989002A
Authority
CN
China
Prior art keywords
webpage
node
path
information
redirects
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510041278.8A
Other languages
Chinese (zh)
Inventor
陈东
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201510041278.8A priority Critical patent/CN105989002A/en
Publication of CN105989002A publication Critical patent/CN105989002A/en
Pending legal-status Critical Current

Links

Abstract

The embodiment of the invention discloses a webpage data query method and device, and a method and device for establishing webpage jump path database. The webpage data query method includes: providing a first database, wherein the first database saves pre-acquired statistical information of webpage jump paths, the webpage jump paths use visited web pages as nodes, and relevant nodes are connected in series according to source information included in a websites of the visited web pages, the statistical data of the webpage jump paths include the frequency of occurrence of each webpage jump path, an initial node of each webpage jump path, and the type of the initial node as a flow source; receiving a query request associated with the type of the flow source; and providing a query result according to the information saved in the first database. The webpage data query method can provide a basis for webpage operation workers who can master the flow condition of the webpage from a whole, and can improve the releasing efficiency of resources.

Description

Web data inquiry, set up webpage and redirect the method and device of routing database
Technical field
The application relates to webpage flow technical field of information processing, particularly relates to web data inquiry, sets up Webpage redirects the method and device of routing database.
Background technology
Along with becoming increasingly popular of computer network and developing rapidly of correlation technique, website and webpage quantity The hugest, for webpage provider, it is thus achieved that user's flowing of access as much as possible, is it The target pursued.To this end, in various webpages, hyperlinks between Web pages technology is nearly ubiquitous, and, press According to the difference of link path, the hyperlink in webpage can be generally divided into internal links, external linkage etc., In a word, Hyperlink technology so that complicated annexation can be set up between webpage and webpage, For same target web, user typically can be conducted interviews by number of ways.
Such as, most basic approach can be the network address that the direct address field at browser inputs webpage.Or, Webpage provider can add concrete webpage in the position such as homepage of internal web site by the form of internal links Link, so, user can be conducted interviews by this internal links.Such as, at e-commerce platform The homepage of website can add the link of the various shops page, or the link etc. of each concrete business object page Deng.Again or, web page address can also be added to outside other by external linkage by webpage provider In the page of website, such as, for certain shop page in certain e-commerce platform, can be in some door classes Webpage (such as, the webpage etc. of news portal website) in throw in its link, user access door class net During Ye, it is possible to by the way of clicking on this link, enter into this shop page, or, also may be used To throw in its link in the webpage of some navigation type, user is after opening this navigation type webpage, by point Hit the link of correspondence, equally enter into this shop page, etc..
In a word, multiple access approach makes a webpage can obtain the access stream of user in several ways Amount, but, for webpage provider, often also need to webpage flow is analyzed, in order to net The input mode etc. of page link is adjusted, and to optimize user's flowing of access of its webpage further, improves money The input efficiency in source.But, the web page interlinkage situation in actual application is intricate, therefore, how to provide The flow information of webpage so that web page provision can quickly understand the traffic conditions of webpage, and then to its chain Connect input direction etc. effectively to adjust, become and solve the technical problem that in the urgent need to those skilled in the art.
Summary of the invention
This application provides the method and device that webpage traffic statistics is provided, it is possible to for webpage operation personnel The traffic conditions grasping webpage on the whole provides foundation, and then can enter its link input direction etc. accordingly Row is effective to be adjusted, to improve the input efficiency of resource.
This application provides following scheme:
A kind of web data querying method, including:
First data base is provided, described first data base preserves the webpage collected in advance and redirects path Statistical information;Wherein, described webpage redirects path using accessed webpage as node, and according to accessed net The source-information comprised in page network address, connects relevant node;Described webpage redirects the statistics in path Data include that each bar webpage redirects the occurrence number in path, each webpage redirects the start node in path and described The type that start node is affiliated when as traffic source;
Receive the inquiry request relevant to traffic source type;
According to the information preserved in described first data base, it is provided that Query Result.
A kind of set up the method that webpage redirects routing database, including:
Collect the information relevant to the web page access of preset website;
Using accessed webpage as node, and according to the source-information comprised in accessed webpage network address, by phase The node closed is connected, and generates a plurality of webpage and redirects path, and adds up each bar webpage and redirect the appearance in path Number of times;Wherein, the start node in path is redirected about each webpage, according to what the network address of start node comprised Domain-name information, determines the type that described start node is affiliated when as traffic source;
Path and occurrence number thereof, and the traffic source type of each start node is redirected according to each bar webpage Information, generates the first data base.
A kind of web data querying method, including:
Second data base is provided, described second data base preserves the webpage collected in advance and redirects path Statistical information;Wherein, described webpage redirects path using accessed webpage as node, and according to accessed net The source-information comprised in page network address, connects relevant node;
Receiving the 3rd inquiry request, described 3rd inquiry request is for checking source and the whereabouts of named web page Details;
Inquire about described second data base, determine that each article of the 3rd target web comprising described named web page redirects road Footpath;
Redirect location in path according to described named web page at each article of the 3rd target web, determine described Named web page one jumping or multi-hop source Nodes, one jump or multi-hop whereabouts node and each hop node between jumping Transfer the registration of Party membership, etc. from one unit to another;
According to described one jump or multi-hop source Nodes, one jump or multi-hop whereabouts node and each hop node between Redirect relation, return described source and whereabouts details.
A kind of set up the method that webpage redirects routing database, including:
Collect the information relevant to the web page access of preset website;
Using accessed webpage as node, and according to the source-information comprised in accessed webpage network address, by phase The node closed is connected, and generates a plurality of webpage and redirects path, and adds up each bar webpage and redirect the appearance in path Number of times;
Redirect path and occurrence number thereof according to each bar webpage, generate the second data base.
A kind of web data querying method, including:
3rd data base is provided, described 3rd data base preserves the webpage collected in advance and redirects path Statistical information;Wherein, described webpage redirects path using accessed webpage as node, and according to accessed net The source-information comprised in page network address, connects relevant node;Described webpage redirects the statistics in path Information includes: the preset feature that each node is had;
Receive the inquiry request relevant to nodal properties;
According to the information preserved in described 3rd data base, it is provided that Query Result.
A kind of set up the method that webpage redirects routing database, including:
Collect the information relevant to the web page access of preset website;
Using accessed webpage as node, and according to the source-information comprised in accessed webpage network address, by phase The node closed is connected, and generates a plurality of webpage and redirects path, and adds up each bar webpage and redirect the appearance in path Number of times;Wherein, redirect each node on path about each webpage, according to the attribute information comprised in network address, Determine the preset feature that each node is had;
Path and occurrence number thereof, and the characteristic information that each node is had is redirected according to each bar webpage, Generate the 3rd data base.
A kind of web data inquiry unit, including:
First data base provides unit, for providing the first data base, preserves pre-in described first data base The webpage first collected redirects the statistical information in path;Wherein, described webpage redirects path with accessed webpage As node, and according to the source-information comprised in accessed webpage network address, relevant node is connected; Described webpage redirects the statistical data in path and includes that each bar webpage redirects the occurrence number in path, each webpage redirects The start node in path and the type affiliated when as traffic source of described start node;
Type queries request reception unit, for receiving the inquiry request relevant to traffic source type;
Type queries result provides unit, for according to the information preserved in described first data base, it is provided that look into Ask result.
A kind of webpage of setting up redirects the device of routing database, including:
First collector unit, for collecting the information relevant to the web page access of preset website;
First statistic unit, is used for accessed webpage as node, and wraps according in accessed webpage network address The source-information contained, connects relevant node, generates a plurality of webpage and redirects path, and adds up each bar Webpage redirects the occurrence number in path;Wherein, redirect the start node in path about each webpage, according to initial The domain-name information comprised in the network address of node, determines the class that described start node is affiliated when as traffic source Type;
First signal generating unit, for redirecting path and occurrence number thereof according to each bar webpage, and each initiates The traffic source type information of node, generates the first data base.
A kind of web data inquiry unit, including:
Second data base provides unit, for providing the second data base, preserves pre-in described second data base The webpage first collected redirects the statistical information in path;Wherein, described webpage redirects path with accessed webpage As node, and according to the source-information comprised in accessed webpage network address, relevant node is connected;
Source whereabouts inquiry request receives unit, for receiving the 3rd inquiry request, described 3rd inquiry request For checking source and the whereabouts details of named web page;
Data base querying unit, is used for inquiring about described second data base, determines and comprise each of described named web page Article the 3rd target web redirects path;
Redirect relation determination unit, for redirecting path according to described named web page at each article of the 3rd target web Middle location, determines a jumping or multi-hop source Nodes, a jumping or the multi-hop whereabouts joint of described named web page Point and each hop node between redirect relation;
Return unit, for according to described one jump or multi-hop source Nodes, one jump or multi-hop whereabouts node and Redirect relation between each hop node, return described source and whereabouts details.
A kind of webpage of setting up redirects the device of routing database, including:
Second collector unit, for collecting the information relevant to the web page access of preset website;
Second statistic unit, is used for accessed webpage as node, and wraps according in accessed webpage network address The source-information contained, connects relevant node, generates a plurality of webpage and redirects path, and adds up each bar Webpage redirects the occurrence number in path;
Second signal generating unit, for redirecting path and occurrence number thereof according to each bar webpage, generates the second data Storehouse.
A kind of web data inquiry unit, including:
3rd data base provides unit, for providing the 3rd data base, preserves pre-in described 3rd data base The webpage first collected redirects the statistical information in path;Wherein, described webpage redirects path with accessed webpage As node, and according to the source-information comprised in accessed webpage network address, relevant node is connected; Described webpage redirects the statistical information in path and includes: the preset feature that each node is had;
Characteristic inquiry request receives unit, for receiving the inquiry request relevant to nodal properties;
Characteristic Query Result provides unit, for according to the information preserved in described 3rd data base, it is provided that look into Ask result.
A kind of webpage of setting up redirects the device of routing database, including:
3rd collector unit, for collecting the information relevant to the web page access of preset website;
3rd statistic unit, is used for accessed webpage as node, and wraps according in accessed webpage network address The source-information contained, connects relevant node, generates a plurality of webpage and redirects path, and adds up each bar Webpage redirects the occurrence number in path;Wherein, each node on path is redirected about each webpage, according to network address In the attribute information that comprises, determine the preset feature that each node is had;
3rd signal generating unit, for redirecting path and occurrence number thereof, and each node according to each bar webpage The characteristic information being had, generates the 3rd data base
The specific embodiment provided according to the application, this application discloses techniques below effect:
Pass through the embodiment of the present application, it is possible to the daily record produced during accessing webpage according to user, set up webpage Redirect routing information data base, and the data on each paths and/or node added up and real-time update, In the process, it is possible to receive query flows statistical information request, and according in data base record number According to, it is provided that concrete traffic statistics result.As such, it is possible to provide the stream of certain named web page on the whole The amount information such as statistics, thus grasp the traffic conditions of webpage on the whole for webpage operation personnel and provide foundation, And then can accordingly its link input direction etc. effectively be adjusted, in order to more effectively utilize network Resource, it is to avoid the wasting of resources or underutilization.
Certainly, the arbitrary product implementing the application it is not absolutely required to reach all the above advantage simultaneously.
Accompanying drawing explanation
In order to be illustrated more clearly that the embodiment of the present application or technical scheme of the prior art, below will be to enforcement In example, the required accompanying drawing used is briefly described, it should be apparent that, the accompanying drawing in describing below is only Some embodiments of the application, for those of ordinary skill in the art, are not paying creative work Under premise, it is also possible to obtain other accompanying drawing according to these accompanying drawings.
Fig. 1 is the flow chart of the first method that the embodiment of the present application provides;
Fig. 2 is the flow chart of the second method that the embodiment of the present application provides;
Fig. 3-1 is the schematic diagram of the first view that the embodiment of the present application provides;
Fig. 3-2 is the schematic diagram of the second view that the embodiment of the present application provides;
Fig. 4 is the flow chart of the third method that the embodiment of the present application provides;
Fig. 5 is the flow chart of the fourth method that the embodiment of the present application provides;
Fig. 6-1 is the schematic diagram of the three-view diagram that the embodiment of the present application provides;
Fig. 6-2 is the schematic diagram of the 4th view that the embodiment of the present application provides;
Fig. 7 is the flow chart of the 5th method that the embodiment of the present application provides;
Fig. 8 is the flow chart of the 6th method that the embodiment of the present application provides;
Fig. 9 is the schematic diagram of the 5th view that the embodiment of the present application provides;
Figure 10 is the schematic diagram of the first device that the embodiment of the present application provides;
Figure 11 is the schematic diagram of the second device that the embodiment of the present application provides;
Figure 12 is the schematic diagram of the 3rd device that the embodiment of the present application provides;
Figure 13 is the schematic diagram of the 4th device that the embodiment of the present application provides;
Figure 14 is the schematic diagram of the 5th device that the embodiment of the present application provides;
Figure 15 is the schematic diagram of the 6th device that the embodiment of the present application provides.
Detailed description of the invention
Below in conjunction with the accompanying drawing in the embodiment of the present application, the technical scheme in the embodiment of the present application is carried out clearly Chu, be fully described by, it is clear that described embodiment be only some embodiments of the present application rather than Whole embodiments.Based on the embodiment in the application, those of ordinary skill in the art obtained all its His embodiment, broadly falls into the scope of the application protection.
In the embodiment of the present application, access relevant data for the ease of query webpage, can be first to each The web page access daily record of user is collected, and by this log information, generates webpage and redirects routing information Data base, the webpage that in this data base, record has various operation based on user to generate redirects path, and respectively Statistical data on paths and node, and then can be according to this data base, it is provided that comprehensively traffic statistics Information.Below concrete implementation mode is described in detail.
Embodiment one
In this embodiment one, the information relevant to traffic source type, this traffic source class can be inquired about The information that type is relevant can be that the traffic conditions in various sources to a webpage on the whole is added up, or Person, it is also possible to add up the traffic source of certain type and flow to situation to each webpage.Such as, certain webpage is altogether Have A, B, C tri-source, then it is how many for can counting these three source flow respectively respectively, enters And, webpage operation personnel etc. can be in this, as reference, input strategy of adjustment web page interlinkage etc..
To this end, see Fig. 1, the embodiment of the present application one provide firstly one and sets up webpage and redirect path data The method in storehouse, the method may comprise steps of:
S101: collect the information relevant to the web page access of preset website;
Redirect routing information data base to set up webpage, the web page access information of users can be carried out Collect.Certainly, in the embodiment of the present application, the access letter relevant to the webpage of preset website can only be collected Breath, for example, it is assumed that the page that preset website is " Taobao " and the page of " sky cat ", then can only receive Collect the access information to " Taobao " and " sky cat " page that the two website is relevant.For example, it is assumed that certain After user opens browser, first pass through browser and open certain navigation page, in navigation page, click on certain The link of portal website (such as, Sina, Sohu etc.), in the homepage of this portal website, click is correlated with It is linked into a certain webpage of Taobao website, afterwards the webpage of this Taobao has been closed, then collected user's During access information, can only collect user and enter this this information of Taobao's page from the homepage of this portal website, And can no longer carry out record about from the information of navigation page portal entry Website page.
When being specifically collected, therefrom can be carried out by the history access log of each user of server lookup The extraction of web page access information and collection.Or, it is also possible to actively submitted to by client.Such as, visitor The web page access situation of user can be monitored by family end, when monitoring the webpage that have accessed preset website, Then relevant access information being uploaded onto the server, so, collecting that server can be more real-time is relevant Web page access information.
S102: using accessed webpage as node, and according to the source-information comprised in accessed webpage network address, Relevant node is connected, generates a plurality of webpage and redirect path, and add up each bar webpage and redirect path Occurrence number;Wherein, redirect the start node in path about each webpage, wrap according in the network address of start node The domain-name information contained, determines the type that described start node is affiliated when as traffic source;
Specifically after the access information collecting user, a plurality of webpage can be generated and redirect path, Mei Tiaolu Can be made up of multiple nodes on footpath, the corresponding concrete webpage of each node, and it is possible to according to quilt Access the source-information comprised in webpage network address, the node that each is relevant is together in series, thus generate and redirect Path, and also the information such as every occurrence number redirecting path can be counted.It should be noted that In the embodiment of the present application, the access information of user can be dynamic collection, along with the user collected accesses Increasing of information, the information preserved in this data base is also in real-time update.It addition, jump generating each bar webpage During turning path, it is also possible to the statistical data on each paths and/or node is recorded and updates. It should be noted that about statistical information, can be to add up in certain period of time, such as, can Add up with every day, from the initial time such as the 0 of every day proceed by webpage redirect path generation and The statistics of various information, and can be with real-time update.After 24 hours, the data of statistics are zeroed out, Re-use the data genaration path newly collected and add up various data, by that analogy.Certainly, for mistake The statistical result in measurement period is gone to preserve, in order to carry out the comparison etc. of data.
Wherein, specifically when generating each bar webpage and redirecting path, can wrap according to the network address of accessed webpage etc. The information contained is carried out.Such as, if certain webpage A be the link by clicking in another webpage B it After open, then in the refer field of the network address such as URL of this webpage A, typically can carry about webpage The URL of B.So, if certain access information is that certain webpage A is accessed, then by analyzing this webpage The URL of A, then can know that upper one of this webpage comes from webpage B, so, jumps generating webpage When turning path, it is possible to using webpage A and webpage B as a node, and the two node it Between set up its series relationship, form the path of a webpage B to webpage A.Afterwards, further according to other Access information, it is also possible to the upstream and downstream in this path is extended, in a word, in this way, it is possible to Set up the upstream and downstream series relationship between multiple node, and one or many path trees can be set up accordingly.
About the statistical data of each paths, the number of times etc. that each paths occurs mainly can be included.Such as, During generation webpage redirects path, if certain paths has existed in data base, then can be right The occurrence number in this path carries out adding a process.
It addition, as it was noted above, the refer field in http request the most also carries traffic source information, Therefore, after generating each bar and redirecting path, it is also possible to determine the start node of every paths, and root According to information such as the domain names comprised in the URL of start node, it may be determined that go out the flow belonging to start node The type information in source.Such as, this type information can include that search engine, website promotion (can also divide For promoting in station or station is outer promotes), internal links etc..Such as, for from external website guiding Flow, in the access http request of this flow, http header has field refer and specifies Source station address;If other websites of right and wrong guide the flow of coming, in the access http request of this flow Http header is empty, therefore, it can distinguish the type in source.
Such as, certain path is A > B > C, and wherein, A is the start node in this path, the most now, The http request of webpage B can be analyzed, it is assumed that in the http request of B webpage, http header The URL that comprises of refer field be: www.bing.com, at this point it is possible to prove that the flow in this path comes From outside station, and according to the domain name of webpage A, this domain name is included in preset search engine domain name In list, hence, it can be determined that the traffic source type going out this path is search engine.
S103: redirect path and occurrence number thereof according to each bar webpage, and the flow of each start node comes Source Type information, generates the first data base.
Determining that each bar webpage redirects path and occurrence number thereof, and the traffic source of each start node After type information, it is possible to be saved in the first data base.That is, each in the first data base Data Entry can include following information: webpage redirects each node that path includes, webpage redirects path and goes out Existing number of times, start node mark and start node are as traffic source type affiliated during traffic source. Such as, when implementing, the structure of the first data base can be as shown in the following Table 1:
Table 1
Webpage redirects path Occurrence number Start node Traffic source type
A—>B—>C n1 A Search engine
C—>B—>D n2 C Guide in standing
D—>E—>F n3 D Promote outside standing
After generating above-mentioned first data base, it is possible to provide the user relevant to traffic source type Inquiry service.Now, seeing Fig. 2, the embodiment of the present application one additionally provides a kind of web data querying method, The method specifically may comprise steps of:
S201: the first data base is provided, preserves, in described first data base, the webpage collected in advance and redirect The statistical information in path;Wherein, described webpage redirects path using accessed webpage as node, and according to quilt Access the source-information comprised in webpage network address, relevant node is connected;Described webpage redirects path Statistical data include each bar webpage redirect the occurrence number in path, each webpage redirect the start node in path with And the type that described start node is affiliated when as traffic source;
Set up the process by the agency of the most of the first data base, repeat no more here.
S202: receive the inquiry request relevant to traffic source type;
Wherein, the inquiry request relevant to traffic source type can have multiple, and such as, one of which is permissible Being the first inquiry request, this first inquiry request is for inquiring about the traffic source type information of named web page.This Time, specifically when providing the traffic source type information of named web page, can first inquire about the first data base, Determine that each bar first object webpage comprising named web page redirects path, it is then determined that each bar first object net Page redirects the traffic source type of start node in path, and each bar first object webpage redirects going out of path Occurrence number, and based on this type, each source page is sorted out, under same traffic source type First object webpage redirect the occurrence number in path and collect, determine each traffic source type correspondence respectively First object webpage redirect total occurrence number in path, and then can be by each traffic source type and right The total occurrence number answered, returns as traffic source information.So, webpage operation personnel just can know it The webpage specified has how many flows to come from search engine, has how many flows to come from website promotion, etc..
The person of sending of inquiry request can be the operation personnel etc. of certain website, may comprise multiple in a website Webpage, it can select it to need the webpage paid close attention to, send concrete look into relevant to traffic source type Ask request.Select for the ease of user, drop-down list etc. can be provided in the user interface, select for user The webpage that can check, or user can also be allowed to pass through the network address etc. at the input frame input webpage specified Mode searches for its webpage needing to check flow information.
For example, it is assumed that certain webpage A is the webpage that user specifies, then in order to provide the stream about this webpage A Amount source type information, first can take out all paths including this webpage A, example from data base As wherein one has 100, every paths correspondence can occur the statistical datas such as number of times;Assume wherein There are 20 paths, using this webpage A as start node, then can originate as directly inputting address visit The flow asked is defined as 20;Remain in 80 paths, wherein have the start node of 40 paths to broadly fall into Promote in standing, have 30 to belong to search engine class, promote for station is outer for other 10.Then may finally determine The flow that this webpage A is the most corresponding under above-mentioned four kinds of source type, and then concrete information can be carried Supply requesting party.
It addition, the request relevant to traffic source type can also is that the second request, this second request can be used Flow whereabouts information in the traffic source to node each in specified sites inquiring about specified type.Such as, referring to In the case of fixed concrete website (such as " sky cat "), some type of traffic source can be inquired about (such as, Stand outer popularization) flow to situation to this website each webpage interior.When implementing, can first inquire about first Data base, determines that each bar the second target web using described specified type as start node redirects path, so After the second target web including same node point under described specified sites redirected the occurrence number in path enter Row collects, determine the traffic source of described specified type under described specified sites each node flow to number of times, Finally according to the traffic source of specified type to the number of times that flows to of node each under specified sites, return Query Result.
S203: according to the information preserved in described first data base, it is provided that Query Result.
Specifically when returning Query Result, concrete form can have multiple, for example, it is possible to directly with word Form be shown, or, so that provide result more directly perceived, it is also possible to according to target web The flow information in corresponding all kinds source, generates overall traffic source view, shows in the way of view Traffic source information, such as Fig. 3-1.Or the traffic source according to target type is arrived at a station the flow direction of interior each webpage Information, the flow direction generating all types of flow attempts, and is then shown this view, equally with the side of view Formula displaying flows to information, as shown in figure 3-2.Wherein n1 to n5 represents the number of times of the flow direction, Ye Jiliu respectively Amount.
Embodiment two
In embodiment two, it is also possible to detailed source and the whereabouts information of certain target web are provided, that is, Its flow is respectively from which node (referred to as source Nodes), after flowing through this node, which has flowed to again A little nodes (referred to as whereabouts node), etc..Wherein, either source Nodes or whereabouts node, all may be used To be multi-hop.When implementing, this embodiment two provide firstly one and sets up webpage and redirect routing database Method, see Fig. 4, the method may comprise steps of:
S401: collect the information relevant to the web page access of preset website;
S402: using accessed webpage as node, and according to the source-information comprised in accessed webpage network address, Relevant node is connected, generates a plurality of webpage and redirect path, and add up each bar webpage and redirect path Occurrence number;
S403: redirect path and occurrence number thereof according to each bar webpage, generate the second data base.
The second data base generated in this embodiment two is compared with the first data base generated in embodiment one, no Be with part, it is only necessary to preserve in the second data base each bar webpage redirect path and each occur time Number, therefore, the structure of the second data base can be as shown in the following Table 2:
Table 2
Webpage redirects path Occurrence number
A—>B—>C n1
C—>B—>D n2
D—>E—>F n3
Collection and webpage about the information of access redirect the generation in path, occurrence number statistics etc., Ke Yican See the introduction in embodiment one, repeat no more here.
After generating this second data base, may be used for inquiring about the detailed derivation whereabouts information of certain webpage.Specifically , this embodiment two additionally provides a kind of web data querying method, sees Fig. 5, and the method can include Following steps:
S501: the second data base is provided, preserves, in described second data base, the webpage collected in advance and redirect The statistical information in path;Wherein, described webpage redirects path using accessed webpage as node, and according to quilt Access the source-information comprised in webpage network address, relevant node is connected;
S502: receive the 3rd inquiry request, described 3rd inquiry request for check the source of named web page with And whereabouts details;
S503: inquire about described second data base, determines each article of the 3rd target web comprising described named web page Redirect path;
When implementing, after receiving the 3rd concrete inquiry request, can be first according to named web page Each the 3rd target web at place redirects path, and follow-up concrete source whereabouts information just can be according to these 3rd target web redirects path and obtains.
S504: redirect location in path, really at each article of the 3rd target web according to described named web page One jumping of fixed described named web page or multi-hop source Nodes, one jump or multi-hop whereabouts node and each hop node it Between redirect relation;
All include on path that multiple node, each node are gone here and there according to the relation of redirecting owing to every webpage redirects Connection, therefore, after determining that each the 3rd target web redirects path, it is possible to exist according to named web page Location in each bar destination path, determines jumping or a plurality of source Nodes, a Yi Jiyi of named web page Jump or multi-hop whereabouts node, then jump or a plurality of source Nodes according to described one, and one jumps or multi-hop whereabouts Node, returns the source whereabouts details of this named web page.For example, it is assumed that certain the 3rd target web is jumped Turning path is: A > B > C > D, and named web page to be checked is C, then B is a jumping source of C Node, A is the two jumping source Nodes of C, and D is a jumping whereabouts node of C, by that analogy.
S505: according to a described jumping or multi-hop source Nodes, a jumping or multi-hop whereabouts node and each hop node Between redirect relation, return described source and whereabouts details.
Redirect in path owing to same webpage possibly be present at a plurality of webpage, therefore, it can to comprise this net The path that redirects of page is taken out, and then collects, and knows the concrete source whereabouts situation of this webpage, such as figure Shown in 6-1.Such as, the node that certain named web page is corresponding is node D, the path at this node place include with Lower four:
A—>C—>D—>F
A—>D
B—>D
B—>D—>E—>G
Then a jumping source Nodes of this node D includes node C, A, B, and two jump source Nodes includes node A, one jumps whereabouts node includes node E, F, and two jump whereabouts node includes node G
When implementing, so that user obtains more intuitive information, it is also possible to corresponding with named web page Centered by node, according to jumping or the multi-hop source Nodes obtained, and one jumps or multi-hop whereabouts node, raw The source becoming this named web page is removed direction view and returns.Such as, for previous example, the corresponding view generated Can be as in fig. 6-2.
Certainly, in actual applications, this node source is not limited to the double bounce shown in Fig. 3 with whereabouts, but Can launch, the most deployable one jump, two jump, three jump to end.
Embodiment three
In actual applications, also some webpage, due to have certain characteristic (such as, the industry classification page, The shop page, brand page, business object details page, experiment page etc.), it may be necessary to obtain and refer to Determine web page joint traffic conditions under certain characteristic, in order to understand this named web page node based on this characteristic Traffic conditions.Or, it is also possible to it should be understood that the webpage of certain characteristic flow to situation, etc..To this end, In the embodiment of the present application three, additionally provide another kind and set up the method that webpage redirects routing database, see figure 7, the method may comprise steps of:
S701: collect the information relevant to the web page access of preset website;
S702: using accessed webpage as node, and according to the source-information comprised in accessed webpage network address, Relevant node is connected, generates a plurality of webpage and redirect path, and add up each bar webpage and redirect path Occurrence number;Wherein, redirect each node on path about each webpage, according to the attribute letter comprised in network address Breath, determines the preset feature that each node is had;
Wherein, redirect the statistical of the generating mode occurrence number in path about each bar webpage, permissible As described in embodiment one.And about the characteristic information of node, due in the information such as the general URL at webpage Concrete characteristic information can be carried, therefore, it can the URL by analyzing webpage, determine that each node is No have certain feature, if it has, concrete property information corresponding for node then can recorded the 3rd number According in storehouse.
S703: redirect path and occurrence number thereof, and the characteristic that each node is had according to each bar webpage Information, generates the 3rd data base.
It is to say, in the 3rd data base in addition to including that webpage redirects path and occurrence number information thereof, The characteristic information that each node is had can also be preserved, it is of course possible to be not that each node has characteristic, Therefore, so-called " characteristic information that node is had " can include two layers of meaning, first, if having Characteristic, second, if it has, so have which kind of concrete characteristic.Such as, when implementing, the 3rd number Structure according to storehouse can be as shown in the following Table 3:
Table 3
Wherein, the Article 3 webpage in above-mentioned table 3 redirects in path, does not has to record the spy relevant to node F Property information, then mean that this node F does not have specific characteristic.
In a word, based on above-mentioned 3rd data base, the traffic statistics relevant to nodal properties can be inquired about. Concrete, seeing Fig. 8, this embodiment three additionally provides a kind of web data querying method, and the method is concrete May comprise steps of:
S801: the 3rd data base is provided, preserves, in described 3rd data base, the webpage collected in advance and redirect The statistical information in path;Wherein, described webpage redirects path using accessed webpage as node, and according to quilt Access the source-information comprised in webpage network address, relevant node is connected;Described webpage redirects path Statistical information include: the preset feature that each node is had;
S802: receive the inquiry request relevant to nodal properties;
S803: according to the information preserved in described 3rd data base, it is provided that Query Result.
Wherein, the most relevant to nodal properties inquiry request can include many aspects, such as, wherein In the case of one, this inquiry request can be the 4th inquiry request, and the 4th inquiry request is used for checking each joint Point flows to the flow information of specified characteristic node.Such as, for each webpage under sky cat website, Ke Yicha Ask from these webpages to have experimental features webpage flow to information.
Concrete, it is possible to first inquiry the 3rd data base, determine and include each article of specified characteristic node the Four target webs redirect path, then redirect path according to each article of the 4th target web, determine and flow to described finger Determine source Nodes and the occurrence number of each source Nodes of property node, finally according to source Nodes and each The occurrence number of source Nodes, returns Query Result.For example, it is assumed that the node-flow in needing to check certain website To the flow of brand page, then can extract each paths with brand page property node from data base, For example, it is assumed that have 100, every paths can corresponding be determined this with brand page characteristic The source Nodes of node, such as, has node B, C, D etc., and then can be according to the appearance of each paths Number of times, collects the occurrence number of each source Nodes and correspondence, determines each source Nodes stream To the flow of brand page, that is, node B, C, D etc. flow to the flow of brand page respectively.
It addition, the inquiry request relevant to nodal properties can also is that the 5th inquiry request, the 5th inquiry please Ask for checking the specified characteristic node flow information to node each in specified sites, at this point it is possible to first look into Ask the 3rd data base, determine that each article of the 4th target web including described specified characteristic node redirects path; Then, redirect path according to each article of the 4th target web, determine whereabouts node that specified characteristic node flows to And the occurrence number of each whereabouts node, the most just can going out according to described whereabouts node and each whereabouts node Occurrence number, returns Query Result.
For the Query Result in embodiment three, equally use the mode of view to provide concrete flow Statistical information, to improve the readability of information.Such as, the node in certain website is flowed to the stream of brand page Amount information, can be as shown in Figure 9.
In a word, pass through the embodiment of the present application, it is possible to the daily record produced during accessing webpage according to user, build Vertical webpage redirects routing information data base, and adds up the data on each paths and/or node and real Shi Gengxin, in the process, it is possible to receive the request of query flows statistical information, and according in data base The data of record, it is provided that concrete traffic statistics result.As such, it is possible to provide certain to specify on the whole The information such as the traffic statistics of webpage, thus grasp the traffic conditions of webpage on the whole for webpage operation personnel and carry Supply foundation, and then can accordingly its link input direction etc. effectively have been adjusted, in order to more effectively Utilize Internet resources, it is to avoid the wasting of resources or underutilization.
Corresponding with the web data querying method that the embodiment of the present application one provides, the embodiment of the present application also provides for A kind of web data inquiry unit, sees Figure 10, and this device specifically may include that
First data base provides unit 1001, for providing the first data base, protects in described first data base There is the webpage collected in advance and redirects the statistical information in path;Wherein, described webpage redirects path with interviewed Ask that relevant node, as node, and according to the source-information comprised in accessed webpage network address, is entered by webpage Row series connection;Described webpage redirects the statistical data in path and includes that each bar webpage redirects the occurrence number in path, each Webpage redirects the start node in path and the type that described start node is affiliated when as traffic source;
Type queries request reception unit 1002, for receiving the inquiry request relevant to traffic source type;
Type queries result provides unit 1003, is used for according to the information preserved in described first data base, Query Result is provided.
Wherein, described type queries request reception unit 1002 specifically may include that
First inquiry request receives subelement, and for receiving the first inquiry request, described first inquiry request is used Traffic source type information in inquiry named web page;
Accordingly, described type queries result provides unit 1003 may include that
First inquiry subelement, is used for inquiring about described first data base, determines and comprise each of described named web page Bar first object webpage redirects path;
Type determination unit is corresponding for determining that each bar first object webpage redirects start node in path Type, and each bar first object webpage redirects the occurrence number in path;
First collects subelement, for the first object webpage with same type start node is redirected path Occurrence number collect, determine that all types of first object webpage corresponding respectively redirects total appearance in path Number of times;
First returns subelement, is used for according to each type described and described total occurrence number of correspondence, really Fixed described traffic source information also returns.
Or, described type queries request reception unit 1002 includes:
Second inquiry request receives subelement, and for receiving the second inquiry request, described second inquiry request is used Flow whereabouts information in the traffic source to node each in specified sites inquiring about specified type;
Accordingly, described type queries result provides unit 1003 may include that
Second inquiry subelement, is used for inquiring about described first data base, determines using described specified type as rising Each bar second target web of beginning node redirects path;
Second collects subelement, for including the second target web of same node point under described specified sites The occurrence number redirecting path collects, and determines that the traffic source of described specified type is to described specified sites Under each node flow to number of times;
Second returns subelement, each under the traffic source according to described specified type to described specified sites Node flow to number of times, return Query Result.
To redirect the method for routing database corresponding for the webpage of setting up provided with the embodiment of the present application one, the application Embodiment additionally provides a kind of webpage of setting up and redirects the device of routing database, sees Figure 11, and this device has Body may include that
First collector unit 1101, for collecting the information relevant to the web page access of preset website;
First statistic unit 1102, is used for accessed webpage as node, and according to accessed webpage net The source-information comprised in location, connects relevant node, generates a plurality of webpage and redirects path, and unites Count each bar webpage and redirect the occurrence number in path;Wherein, the start node in path, root are redirected about each webpage According to the domain-name information comprised in the network address of start node, determine that described start node is as traffic source time institute The type belonged to;
First signal generating unit 1103, for redirecting path and occurrence number thereof according to each bar webpage, and respectively The traffic source type information of individual start node, generates the first data base.
Corresponding with the web data querying method that the embodiment of the present application two provides, the embodiment of the present application also provides for A kind of web data inquiry unit, sees Figure 12, and this device specifically may include that
Second data base provides unit 1201, for providing the second data base, protects in described second data base There is the webpage collected in advance and redirects the statistical information in path;Wherein, described webpage redirects path with interviewed Ask that relevant node, as node, and according to the source-information comprised in accessed webpage network address, is entered by webpage Row series connection;
Source whereabouts inquiry request receives unit 1202, and for receiving the 3rd inquiry request, the described 3rd looks into Request of asking is for checking source and the whereabouts details of named web page;
Data base querying unit 1203, is used for inquiring about described second data base, determines and comprise described appointment net Each article of the 3rd target web of page redirects path;
Redirect relation determination unit 1204, for jumping at each article of the 3rd target web according to described named web page Turn location in path, determine a jumping or multi-hop source Nodes, jumping or a multi-hop of described named web page Relation is redirected between whereabouts node and each hop node;
Return unit 1205, for jumping according to described one or multi-hop source Nodes, a jumping or multi-hop whereabouts joint The relation that redirects between point and each hop node, returns described source and whereabouts details.
Wherein, described return unit 1205 specifically may be used for:
Centered by the node that described named web page is corresponding, jump according to described one or multi-hop source Nodes, a jumping Or redirecting relation between multi-hop whereabouts node and each hop node, the source whereabouts generating this named web page is closed It is view and returns.
To redirect the method for routing database corresponding for the webpage of setting up provided with the embodiment of the present application two, the application Embodiment additionally provides a kind of webpage of setting up and redirects the device of routing database, sees Figure 13, and this device has Body may include that
Second collector unit 1301, for collecting the information relevant to the web page access of preset website;
Second statistic unit 1302, is used for accessed webpage as node, and according to accessed webpage net The source-information comprised in location, connects relevant node, generates a plurality of webpage and redirects path, and unites Count each bar webpage and redirect the occurrence number in path;
Second signal generating unit 1303, for redirecting path and occurrence number thereof according to each bar webpage, generates the Two data bases.
Corresponding with the web data querying method that the embodiment of the present application three provides, the embodiment of the present application also provides for A kind of web data inquiry unit, sees Figure 14, and this device specifically may include that
3rd data base provides unit 1401, for providing the 3rd data base, protects in described 3rd data base There is the webpage collected in advance and redirects the statistical information in path;Wherein, described webpage redirects path with interviewed Ask that relevant node, as node, and according to the source-information comprised in accessed webpage network address, is entered by webpage Row series connection;Described webpage redirects the statistical information in path and includes: the preset feature that each node is had;
Characteristic inquiry request receives unit 1402, for receiving the inquiry request relevant to nodal properties;
Characteristic Query Result provides unit 1403, is used for according to the information preserved in described 3rd data base, Query Result is provided.
Wherein, described characteristic inquiry request reception unit 1402 specifically may include that
4th inquiry request receives subelement, and for receiving the 4th inquiry request, described 4th inquiry request is used In checking that each node flows to the flow information of specified characteristic node;
Accordingly, described characteristic Query Result provides unit 1403 may include that
3rd inquiry subelement, is used for inquiring about described 3rd data base, determines and includes described specified characteristic joint Each article of the 4th target web of point redirects path;
3rd collects subelement, for redirecting path according to each article of the 4th target web, determines and flows to described finger Determine source Nodes and the occurrence number of each source Nodes of property node;
3rd returns subelement, for according to described source Nodes and the occurrence number of each source Nodes, returns Return Query Result.
Or, described characteristic inquiry request receives unit 1402 and includes:
5th inquiry request receives subelement, and for receiving the 5th inquiry request, described 5th inquiry request is used In checking the specified characteristic node flow information to node each in specified sites;
Described characteristic Query Result provides unit 1403 may include that
4th inquiry subelement, is used for inquiring about described 3rd data base, determines and includes described specified characteristic joint Each article of the 4th target web of point redirects path;
4th collects subelement, for redirecting path according to each article of the 4th target web, determines described appointment spy Property node flow to whereabouts node and the occurrence number of each whereabouts node;
4th returns subelement, for according to described whereabouts node and the occurrence number of each whereabouts node, returns Return Query Result.
To redirect the method for routing database corresponding for the webpage of setting up provided with the embodiment of the present application two, the application Embodiment additionally provides a kind of webpage of setting up and redirects the device of routing database, sees Figure 15, and this device has Body may include that
3rd collector unit 1501, for collecting the information relevant to the web page access of preset website;
3rd statistic unit 1502, is used for accessed webpage as node, and according to accessed webpage net The source-information comprised in location, connects relevant node, generates a plurality of webpage and redirects path, and unites Count each bar webpage and redirect the occurrence number in path;Wherein, each node on path, root are redirected about each webpage According to the attribute information comprised in network address, determine the preset feature that each node is had;
3rd signal generating unit 1503, for redirecting path and occurrence number thereof according to each bar webpage, and respectively The characteristic information that individual node is had, generates the 3rd data base.
Pass through the embodiment of the present application, it is possible to the daily record produced during accessing webpage according to user, set up webpage Redirect routing information data base, and the data on each paths and/or node added up and real-time update, In the process, it is possible to receive query flows statistical information request, and according in data base record number According to, it is provided that concrete traffic statistics result.As such, it is possible to provide the stream of certain named web page on the whole The amount information such as statistics, thus grasp the traffic conditions of webpage on the whole for webpage operation personnel and provide foundation, And then can accordingly its link input direction etc. effectively be adjusted, in order to more effectively utilize network Resource, it is to avoid the wasting of resources or underutilization.
As seen through the above description of the embodiments, those skilled in the art is it can be understood that arrive this Application can add the mode of required general hardware platform by software and realize.Based on such understanding, this Shen The part that prior art is contributed by technical scheme please the most in other words can be with the shape of software product Formula embodies, and this computer software product can be stored in storage medium, as ROM/RAM, magnetic disc, CD etc., including some instructions with so that computer equipment (can be personal computer, server, Or the network equipment etc.) perform each embodiment of the application or the method described in some part of embodiment.
Each embodiment in this specification all uses the mode gone forward one by one to describe, phase homophase between each embodiment As part see mutually, what each embodiment stressed is the difference with other embodiments. For system or system embodiment, owing to it is substantially similar to embodiment of the method, so describing Obtaining fairly simple, relevant part sees the part of embodiment of the method and illustrates.System described above and System embodiment is only schematically, and the wherein said unit that illustrates as separating component can be or also Can not be physically separate, the parts shown as unit can be or may not be physical location, I.e. may be located at a place, or can also be distributed on multiple NE.Can be according to actual need Select some or all of module therein to realize the purpose of the present embodiment scheme.Ordinary skill Personnel, in the case of not paying creative work, are i.e. appreciated that and implement.
Above to web data inquiry provided herein, set up webpage redirect routing database method and Device, is described in detail, and principle and the embodiment of the application are entered by specific case used herein Having gone elaboration, the explanation of above example is only intended to help and understands the present processes and core concept thereof; Simultaneously for one of ordinary skill in the art, according to the thought of the application, in detailed description of the invention and should All will change with in scope.In sum, this specification content should not be construed as the limit to the application System.

Claims (22)

1. a web data querying method, it is characterised in that including:
First data base is provided, described first data base preserves the webpage collected in advance and redirects path Statistical information;Wherein, described webpage redirects path using accessed webpage as node, and according to accessed net The source-information comprised in page network address, connects relevant node;Described webpage redirects the statistics in path Data include that each bar webpage redirects the occurrence number in path, each webpage redirects the start node in path and described The type that start node is affiliated when as traffic source;
Receive the inquiry request relevant to traffic source type;
According to the information preserved in described first data base, it is provided that Query Result.
Method the most according to claim 1, it is characterised in that described reception and traffic source type Relevant inquiry request, including:
Receiving the first inquiry request, described first inquiry request is for inquiring about the traffic source type of named web page Information;
Described according to the information preserved in described first data base, it is provided that Query Result, including:
Inquire about described first data base, determine that each bar first object webpage comprising described named web page redirects road Footpath;
Determine that each bar first object webpage redirects the type that start node in path is corresponding, and each bar the first mesh Mark webpage redirects the occurrence number in path;
The occurrence number that the first object webpage with same type start node redirects path collects, Determine that all types of first object webpage corresponding respectively redirects total occurrence number in path;
According to each type described and described total occurrence number of correspondence, determine described traffic source information also Return.
Method the most according to claim 1, it is characterised in that receive relevant to traffic source type Inquiry request, including:
Receiving the second inquiry request, described second inquiry request is for inquiring about the traffic source of specified type to referring to Determine the flow whereabouts information of each node in website;
Described according to the information preserved in described first data base, it is provided that Query Result, including:
Inquire about described first data base, determine each bar the second target using described specified type as start node Webpage redirects path;
The second target web including same node point under described specified sites is redirected the occurrence number in path Collect, determine the traffic source of described specified type under described specified sites each node flow to number of times;
Traffic source according to described specified type under described specified sites each node flow to number of times, return Query Result.
4. set up the method that webpage redirects routing database for one kind, it is characterised in that including:
Collect the information relevant to the web page access of preset website;
Using accessed webpage as node, and according to the source-information comprised in accessed webpage network address, by phase The node closed is connected, and generates a plurality of webpage and redirects path, and adds up each bar webpage and redirect the appearance in path Number of times;Wherein, the start node in path is redirected about each webpage, according to what the network address of start node comprised Domain-name information, determines the type that described start node is affiliated when as traffic source;
Path and occurrence number thereof, and the traffic source type of each start node is redirected according to each bar webpage Information, generates the first data base.
5. a web data querying method, it is characterised in that including:
Second data base is provided, described second data base preserves the webpage collected in advance and redirects path Statistical information;Wherein, described webpage redirects path using accessed webpage as node, and according to accessed net The source-information comprised in page network address, connects relevant node;
Receiving the 3rd inquiry request, described 3rd inquiry request is for checking source and the whereabouts of named web page Details;
Inquire about described second data base, determine that each article of the 3rd target web comprising described named web page redirects road Footpath;
Redirect location in path according to described named web page at each article of the 3rd target web, determine described Named web page one jumping or multi-hop source Nodes, one jump or multi-hop whereabouts node and each hop node between jumping Transfer the registration of Party membership, etc. from one unit to another;
According to described one jump or multi-hop source Nodes, one jump or multi-hop whereabouts node and each hop node between Redirect relation, return described source and whereabouts details.
Method the most according to claim 5, it is characterised in that described according to described jumping or a multi-hop Source Nodes, one jump or multi-hop whereabouts node and each hop node between redirect relation, return to described source And whereabouts details, including:
Centered by the node that described named web page is corresponding, jump according to described one or multi-hop source Nodes, a jumping Or redirecting relation between multi-hop whereabouts node and each hop node, the source whereabouts generating this named web page is closed It is view and returns.
7. set up the method that webpage redirects routing database for one kind, it is characterised in that including:
Collect the information relevant to the web page access of preset website;
Using accessed webpage as node, and according to the source-information comprised in accessed webpage network address, by phase The node closed is connected, and generates a plurality of webpage and redirects path, and adds up each bar webpage and redirect the appearance in path Number of times;
Redirect path and occurrence number thereof according to each bar webpage, generate the second data base.
8. a web data querying method, it is characterised in that including:
3rd data base is provided, described 3rd data base preserves the webpage collected in advance and redirects path Statistical information;Wherein, described webpage redirects path using accessed webpage as node, and according to accessed net The source-information comprised in page network address, connects relevant node;Described webpage redirects the statistics in path Information includes: the preset feature that each node is had;
Receive the inquiry request relevant to nodal properties;
According to the information preserved in described 3rd data base, it is provided that Query Result.
Method the most according to claim 8, it is characterised in that described reception is relevant to nodal properties Inquiry request, including:
Receiving the 4th inquiry request, described 4th inquiry request is used for checking that each node flows to specified characteristic node Flow information;
Described according to the information preserved in described 3rd data base, it is provided that Query Result, including:
Inquire about described 3rd data base, determine each article of the 4th target web including described specified characteristic node Redirect path;
Redirect path according to each article of the 4th target web, determine the source Nodes flowing to described specified characteristic node And the occurrence number of each source Nodes;
According to described source Nodes and the occurrence number of each source Nodes, return Query Result.
Method the most according to claim 8, it is characterised in that described reception is relevant to nodal properties Inquiry request, including:
Receiving the 5th inquiry request, described 5th inquiry request is used for checking that specified characteristic node is to specified sites The flow information of interior each node;
Described according to the information preserved in described 3rd data base, it is provided that Query Result, including:
Inquire about described 3rd data base, determine each article of the 4th target web including described specified characteristic node Redirect path;
Redirect path according to each article of the 4th target web, determine the whereabouts node that described specified characteristic node flows to And the occurrence number of each whereabouts node;
According to described whereabouts node and the occurrence number of each whereabouts node, return Query Result.
Set up the method that webpage redirects routing database for 11. 1 kinds, it is characterised in that including:
Collect the information relevant to the web page access of preset website;
Using accessed webpage as node, and according to the source-information comprised in accessed webpage network address, by phase The node closed is connected, and generates a plurality of webpage and redirects path, and adds up each bar webpage and redirect the appearance in path Number of times;Wherein, redirect each node on path about each webpage, according to the attribute information comprised in network address, Determine the preset feature that each node is had;
Path and occurrence number thereof, and the characteristic information that each node is had is redirected according to each bar webpage, Generate the 3rd data base.
12. 1 kinds of web data inquiry units, it is characterised in that including:
First data base provides unit, for providing the first data base, preserves pre-in described first data base The webpage first collected redirects the statistical information in path;Wherein, described webpage redirects path with accessed webpage As node, and according to the source-information comprised in accessed webpage network address, relevant node is connected; Described webpage redirects the statistical data in path and includes that each bar webpage redirects the occurrence number in path, each webpage redirects The start node in path and the type affiliated when as traffic source of described start node;
Type queries request reception unit, for receiving the inquiry request relevant to traffic source type;
Type queries result provides unit, for according to the information preserved in described first data base, it is provided that look into Ask result.
13. devices according to claim 12, it is characterised in that the request of described type queries receives Unit includes:
First inquiry request receives subelement, and for receiving the first inquiry request, described first inquiry request is used Traffic source type information in inquiry named web page;
Described type queries result provides unit to include:
First inquiry subelement, is used for inquiring about described first data base, determines and comprise each of described named web page Bar first object webpage redirects path;
Type determination unit is corresponding for determining that each bar first object webpage redirects start node in path Type, and each bar first object webpage redirects the occurrence number in path;
First collects subelement, for the first object webpage with same type start node is redirected path Occurrence number collect, determine that all types of first object webpage corresponding respectively redirects total appearance in path Number of times;
First returns subelement, is used for according to each type described and described total occurrence number of correspondence, really Fixed described traffic source information also returns.
14. devices according to claim 12, it is characterised in that the request of described type queries receives Unit includes:
Second inquiry request receives subelement, and for receiving the second inquiry request, described second inquiry request is used Flow whereabouts information in the traffic source to node each in specified sites inquiring about specified type;
Described type queries result provides unit to include:
Second inquiry subelement, is used for inquiring about described first data base, determines using described specified type as rising Each bar second target web of beginning node redirects path;
Second collects subelement, for including the second target web of same node point under described specified sites The occurrence number redirecting path collects, and determines that the traffic source of described specified type is to described specified sites Under each node flow to number of times;
Second returns subelement, each under the traffic source according to described specified type to described specified sites Node flow to number of times, return Query Result.
Set up webpage for 15. 1 kinds and redirect the device of routing database, it is characterised in that including:
First collector unit, for collecting the information relevant to the web page access of preset website;
First statistic unit, is used for accessed webpage as node, and wraps according in accessed webpage network address The source-information contained, connects relevant node, generates a plurality of webpage and redirects path, and adds up each bar Webpage redirects the occurrence number in path;Wherein, redirect the start node in path about each webpage, according to initial The domain-name information comprised in the network address of node, determines the class that described start node is affiliated when as traffic source Type;
First signal generating unit, for redirecting path and occurrence number thereof according to each bar webpage, and each initiates The traffic source type information of node, generates the first data base.
16. 1 kinds of web data inquiry units, it is characterised in that including:
Second data base provides unit, for providing the second data base, preserves pre-in described second data base The webpage first collected redirects the statistical information in path;Wherein, described webpage redirects path with accessed webpage As node, and according to the source-information comprised in accessed webpage network address, relevant node is connected;
Source whereabouts inquiry request receives unit, for receiving the 3rd inquiry request, described 3rd inquiry request For checking source and the whereabouts details of named web page;
Data base querying unit, is used for inquiring about described second data base, determines and comprise each of described named web page Article the 3rd target web redirects path;
Redirect relation determination unit, for redirecting path according to described named web page at each article of the 3rd target web Middle location, determines a jumping or multi-hop source Nodes, a jumping or the multi-hop whereabouts joint of described named web page Point and each hop node between redirect relation;
Return unit, for according to described one jump or multi-hop source Nodes, one jump or multi-hop whereabouts node and Redirect relation between each hop node, return described source and whereabouts details.
17. devices according to claim 16, it is characterised in that described return unit specifically for:
Centered by the node that described named web page is corresponding, jump according to described one or multi-hop source Nodes, a jumping Or redirecting relation between multi-hop whereabouts node and each hop node, the source whereabouts generating this named web page is closed It is view and returns.
Set up webpage for 18. 1 kinds and redirect the device of routing database, it is characterised in that including:
Second collector unit, for collecting the information relevant to the web page access of preset website;
Second statistic unit, is used for accessed webpage as node, and wraps according in accessed webpage network address The source-information contained, connects relevant node, generates a plurality of webpage and redirects path, and adds up each bar Webpage redirects the occurrence number in path;
Second signal generating unit, for redirecting path and occurrence number thereof according to each bar webpage, generates the second data Storehouse.
19. 1 kinds of web data inquiry units, it is characterised in that including:
3rd data base provides unit, for providing the 3rd data base, preserves pre-in described 3rd data base The webpage first collected redirects the statistical information in path;Wherein, described webpage redirects path with accessed webpage As node, and according to the source-information comprised in accessed webpage network address, relevant node is connected; Described webpage redirects the statistical information in path and includes: the preset feature that each node is had;
Characteristic inquiry request receives unit, for receiving the inquiry request relevant to nodal properties;
Characteristic Query Result provides unit, for according to the information preserved in described 3rd data base, it is provided that look into Ask result.
20. devices according to claim 19, it is characterised in that described characteristic inquiry request receives Unit includes:
4th inquiry request receives subelement, and for receiving the 4th inquiry request, described 4th inquiry request is used In checking that each node flows to the flow information of specified characteristic node;
Described characteristic Query Result provides unit to include:
3rd inquiry subelement, is used for inquiring about described 3rd data base, determines and includes described specified characteristic joint Each article of the 4th target web of point redirects path;
3rd collects subelement, for redirecting path according to each article of the 4th target web, determines and flows to described finger Determine source Nodes and the occurrence number of each source Nodes of property node;
3rd returns subelement, for according to described source Nodes and the occurrence number of each source Nodes, returns Return Query Result.
21. devices according to claim 19, it is characterised in that described characteristic inquiry request receives Unit includes:
5th inquiry request receives subelement, and for receiving the 5th inquiry request, described 5th inquiry request is used In checking the specified characteristic node flow information to node each in specified sites;
Described characteristic Query Result provides unit to include:
4th inquiry subelement, is used for inquiring about described 3rd data base, determines and includes described specified characteristic joint Each article of the 4th target web of point redirects path;
4th collects subelement, for redirecting path according to each article of the 4th target web, determines described appointment spy Property node flow to whereabouts node and the occurrence number of each whereabouts node;
4th returns subelement, for according to described whereabouts node and the occurrence number of each whereabouts node, returns Return Query Result.
Set up webpage for 22. 1 kinds and redirect the device of routing database, it is characterised in that including:
3rd collector unit, for collecting the information relevant to the web page access of preset website;
3rd statistic unit, is used for accessed webpage as node, and wraps according in accessed webpage network address The source-information contained, connects relevant node, generates a plurality of webpage and redirects path, and adds up each bar Webpage redirects the occurrence number in path;Wherein, each node on path is redirected about each webpage, according to network address In the attribute information that comprises, determine the preset feature that each node is had;
3rd signal generating unit, for redirecting path and occurrence number thereof, and each node according to each bar webpage The characteristic information being had, generates the 3rd data base.
CN201510041278.8A 2015-01-27 2015-01-27 Webpage data query method and device, and method and device for establishing webpage jump path database Pending CN105989002A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510041278.8A CN105989002A (en) 2015-01-27 2015-01-27 Webpage data query method and device, and method and device for establishing webpage jump path database

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510041278.8A CN105989002A (en) 2015-01-27 2015-01-27 Webpage data query method and device, and method and device for establishing webpage jump path database

Publications (1)

Publication Number Publication Date
CN105989002A true CN105989002A (en) 2016-10-05

Family

ID=57034767

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510041278.8A Pending CN105989002A (en) 2015-01-27 2015-01-27 Webpage data query method and device, and method and device for establishing webpage jump path database

Country Status (1)

Country Link
CN (1) CN105989002A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107239970A (en) * 2017-05-12 2017-10-10 百川通联(北京)网络技术有限公司 A kind of Behavior-based control daily record determines the method and system of ad click rate
CN110020364A (en) * 2017-11-27 2019-07-16 北京京东尚科信息技术有限公司 The method and apparatus for determining the traffic source of page access
CN113434556A (en) * 2021-07-22 2021-09-24 支付宝(杭州)信息技术有限公司 Data processing method and system
CN114491371A (en) * 2022-01-27 2022-05-13 佛山众陶联供应链服务有限公司 Front-end multi-system skip method and system for web system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1949259A (en) * 2006-01-28 2007-04-18 商助科技(北京)有限公司 Method for point contacting information of collecting web page by embedding code in web page
CN101072122A (en) * 2007-03-30 2007-11-14 腾讯科技(深圳)有限公司 Method, system and user end device for obtaining access amount statistical data
CN102054004A (en) * 2009-11-04 2011-05-11 清华大学 Webpage recommendation method and device adopting same
CN104252348A (en) * 2013-06-27 2014-12-31 腾讯科技(深圳)有限公司 Webpage access statistics method and device based on browser

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1949259A (en) * 2006-01-28 2007-04-18 商助科技(北京)有限公司 Method for point contacting information of collecting web page by embedding code in web page
CN101072122A (en) * 2007-03-30 2007-11-14 腾讯科技(深圳)有限公司 Method, system and user end device for obtaining access amount statistical data
CN102054004A (en) * 2009-11-04 2011-05-11 清华大学 Webpage recommendation method and device adopting same
CN104252348A (en) * 2013-06-27 2014-12-31 腾讯科技(深圳)有限公司 Webpage access statistics method and device based on browser

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
白凯 等: "旅游信息来源类型对消费者行为意图的影响", 《人文地理》 *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107239970A (en) * 2017-05-12 2017-10-10 百川通联(北京)网络技术有限公司 A kind of Behavior-based control daily record determines the method and system of ad click rate
CN110020364A (en) * 2017-11-27 2019-07-16 北京京东尚科信息技术有限公司 The method and apparatus for determining the traffic source of page access
CN110020364B (en) * 2017-11-27 2021-11-30 北京京东尚科信息技术有限公司 Method and device for determining flow source of page access
CN113434556A (en) * 2021-07-22 2021-09-24 支付宝(杭州)信息技术有限公司 Data processing method and system
CN114491371A (en) * 2022-01-27 2022-05-13 佛山众陶联供应链服务有限公司 Front-end multi-system skip method and system for web system
CN114491371B (en) * 2022-01-27 2022-09-16 佛山众陶联供应链服务有限公司 Front-end multi-system jump method and system of web system

Similar Documents

Publication Publication Date Title
CN102521251B (en) Method for directly realizing personalized search, device for realizing method, and search server
CN103389983B (en) A kind of capturing webpage contents method and device for network crawler system
US8903800B2 (en) System and method for indexing food providers and use of the index in search engines
KR101130108B1 (en) Method, system and computer readable recording medium for detecting web page traps based on perpectual calendar and building the search database using the same
CN102054004B (en) Webpage recommendation method and device adopting same
US20080104113A1 (en) Uniform resource locator scoring for targeted web crawling
CN102142033B (en) Method and device for providing relative sub-link information in search result
CN102663048B (en) Method and device for providing search result
Jain et al. Page ranking algorithms in web mining, limitations of existing methods and a new method for indexing web pages
CN102708132A (en) Method and system for webpage recommendation
CN103618696B (en) Method and server for processing cookie information
CN105989002A (en) Webpage data query method and device, and method and device for establishing webpage jump path database
CN102663054A (en) Method and device for determining weight of website
CN104202418B (en) Recommend the method and system of the content distributing network of business for content supplier
CN104252348A (en) Webpage access statistics method and device based on browser
CN102629265B (en) A kind of method and system setting up web database
Sethi et al. A novel page ranking mechanism based on user browsing patterns
US20180337930A1 (en) Method and apparatus for providing website authentication data for search engine
CN109586942A (en) Web site performance assessment method and device
US9183299B2 (en) Search engine for ranking a set of pages returned as search results from a search query
CN105224555A (en) A kind of methods, devices and systems of search
CN102541947A (en) Method and equipment for updating authority score of webpage based on friefox event
Aggarwal An efficient focused web crawling approach
CN105930385A (en) Data crawling method and system
CN103631793A (en) Method, device and equipment for sorting search results

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20161005