CN109145230A - Information output method and device - Google Patents

Information output method and device Download PDF

Info

Publication number
CN109145230A
CN109145230A CN201710454694.XA CN201710454694A CN109145230A CN 109145230 A CN109145230 A CN 109145230A CN 201710454694 A CN201710454694 A CN 201710454694A CN 109145230 A CN109145230 A CN 109145230A
Authority
CN
China
Prior art keywords
uniform resource
sequence
finger url
user
page
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710454694.XA
Other languages
Chinese (zh)
Inventor
李曼
覃健
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Baidu Online Network Technology Beijing Co Ltd
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201710454694.XA priority Critical patent/CN109145230A/en
Publication of CN109145230A publication Critical patent/CN109145230A/en
Pending legal-status Critical Current

Links

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

This application discloses information output methods and device.One specific embodiment of this method includes: to obtain at least one user, access log for recording the page request sent;Uniform resource locator in the page request sent in the access log of each user is sorted to obtain the uniform resource locator sequence of the user by the sequence after arriving first according to the time;Obtain destination Uniform Resource finger URL arrangement set to be counted;The uniform resource locator sequence of each user and destination Uniform Resource finger URL arrangement set are subjected to pattern match, obtain the number that each destination Uniform Resource finger URL sequence occurs in the uniform resource locator sequence of each user;The number that each destination Uniform Resource finger URL sequence occurs in the uniform resource locator sequence of each user is converted to the conversion ratio between the page to export.The embodiment can reduce the complexity of analysis conversion of page rate.

Description

Information output method and device
Technical field
This application involves field of computer technology, and in particular to Internet technical field more particularly to information output method And device.
Background technique
In Internet application, continuous conversion and loss statistics of the customer flow between multiple pages produce analysis The layer-by-layer conversion of user's performance of product and user on product core path, is very necessary.Layer-by-layer traffic transformation, no Only the two neighboring page jumps ratio, further includes customer flow on a fullpath of user access path length > 2 Conversion, i.e., the flow diagram of one funnel shaped.
Current flow conversion rate analysis scheme needs to preassign the product page to be counted, in page source code Bury a statistics.Then to the log recording being collected into, counting statistics are carried out, and each product line is required to customize, it cannot be general Change processing.Therefore the complexity implemented is high.
Summary of the invention
The purpose of the application is to propose a kind of improved information output method and device, to solve background above technology department Divide the technical issues of mentioning.
In a first aspect, the embodiment of the present application provides a kind of information output method, this method comprises: obtaining at least one use Family, access log for recording the page request sent, wherein page request includes uniform resource locator, access Log includes the time for sending page request;For the access log of each user, the page that will have been sent in the access log Uniform resource locator in request is sorted to obtain the system of the user by the sequence after arriving first according to the time for sending page request One Resource Locator sequence;Obtain destination Uniform Resource finger URL arrangement set to be counted, wherein each destination Uniform Resource Each destination Uniform Resource finger URL in finger URL sequence is sorted by the time for sending page request by the sequence after arriving first, sequence The page corresponding to posterior destination Uniform Resource finger URL is by and sequence adjacent with the destination Uniform Resource finger URL preceding Destination Uniform Resource finger URL corresponding to page jump;The uniform resource locator sequence of each user and target are united One Resource Locator arrangement set carries out pattern match, obtains each destination Uniform Resource finger URL sequence in the unification of each user The number occurred in Resource Locator sequence;According to destination Uniform Resource finger URL institute in each destination Uniform Resource finger URL sequence Jump relationship between the corresponding page, by each destination Uniform Resource finger URL sequence each user uniform resource locator The number occurred in sequence is converted to page corresponding to destination Uniform Resource finger URL in each destination Uniform Resource finger URL sequence Conversion ratio between face is exported.
In some embodiments, by the uniform resource locator sequence of each user and destination Uniform Resource finger URL sequence Before column set carries out pattern match, this method further include: according to preset character map by each uniform resource locator It is mapped as single character, the uniform resource locator sequence after being mapped, wherein character map is for characterizing unified resource The corresponding relationship of finger URL and single character;According to character map by each of destination Uniform Resource finger URL arrangement set Destination Uniform Resource finger URL sequence is mapped as the destination Uniform Resource finger URL sequence being made of the character on character map.
In some embodiments, by the uniform resource locator sequence of each user and destination Uniform Resource finger URL sequence Set carries out pattern match, comprising: is positioned the unified resource after character map maps of each user by KMP algorithm It accords with sequence and carries out pattern match with the destination Uniform Resource finger URL arrangement set after character map maps.
In some embodiments, at least one user, access log for recording the page request sent are obtained, It include: the original log for obtaining the access information including at least one user, wherein access information includes at least one of the following: Page request, page elements request and style sheet request;Page elements request and style sheet are filtered out from original log It requests, and the page request of same user is combined into the access log of the user.
In some embodiments, single character is corresponding at least one uniform resource locator in character map.
In some embodiments, this method further include: provided according to target in each destination Uniform Resource finger URL sequence is unified Conversion ratio between the page corresponding to the finger URL of source generates crater blasting.
Second aspect, the embodiment of the present application provide a kind of information output apparatus, which includes: first acquisition unit, For obtaining at least one user, access log for recording the page request sent, wherein page request includes system One Resource Locator, access log include the time for sending page request;Sequencing unit, for the access day for each user Will, by the uniform resource locator in the page request sent in the access log according to the time of transmission page request by elder generation Sequence after sorts to obtain the uniform resource locator sequence of the user;Second acquisition unit, for obtaining mesh to be counted Mark uniform resource locator arrangement set, wherein each destination Uniform Resource in each destination Uniform Resource finger URL sequence is fixed Position symbol is sorted by the time for sending page request by the sequence after arriving first, and is sorted corresponding to posterior destination Uniform Resource finger URL The page be the page as corresponding to and sequence preceding destination Uniform Resource finger URL adjacent with the destination Uniform Resource finger URL What face jumped;Matching unit, for by the uniform resource locator sequence of each user and destination Uniform Resource finger URL sequence Set carries out pattern match, obtains each destination Uniform Resource finger URL sequence in the uniform resource locator sequence of each user The number of appearance;Output unit, for right according to destination Uniform Resource finger URL in each destination Uniform Resource finger URL sequence Jump relationship between the page answered, by each destination Uniform Resource finger URL sequence each user uniform resource locator sequence The number occurred in column is converted to the page corresponding to destination Uniform Resource finger URL in each destination Uniform Resource finger URL sequence Between conversion ratio exported.
In some embodiments, which further includes map unit, is used for: by the uniform resource locator of each user Before sequence and destination Uniform Resource finger URL arrangement set carry out pattern match, according to preset character map by each system One Resource Locator is mapped as single character, the uniform resource locator sequence after being mapped, wherein character map is used for Characterize the corresponding relationship of uniform resource locator and single character;According to character map by destination Uniform Resource finger URL sequence Each destination Uniform Resource finger URL sequence in set is mapped as the unified money of the target being made of the character on character map Source finger URL sequence.
In some embodiments, matching unit is further used for: by KMP algorithm by each user through character map Uniform resource locator sequence after mapping and the destination Uniform Resource finger URL arrangement set after character map maps into Row pattern match.
In some embodiments, first acquisition unit is further used for: obtaining the access information including at least one user Original log, wherein access information include at least one of the following: page request, page elements request and style sheet request; Page elements request and style sheet request are filtered out from original log, and the page request of same user is combined into the use The access log at family.
In some embodiments, single character is corresponding at least one uniform resource locator in character map.
In some embodiments, the device further include: generation unit, for according to each destination Uniform Resource finger URL sequence Conversion ratio between the page corresponding to middle destination Uniform Resource finger URL generates crater blasting.
The third aspect, the embodiment of the present application provide a kind of server, comprising: one or more processors;Storage device, For storing one or more programs, when one or more programs are executed by one or more processors, so that one or more Processor is realized such as method any in first aspect.
Fourth aspect, the embodiment of the present application provide a kind of computer readable storage medium, are stored thereon with computer journey Sequence is realized when the program is executed by processor such as method any in first aspect.
Information output method and device provided by the embodiments of the present application, by by the page request recorded in access log URL (Uniform Resource Locator, uniform resource locator) is ranked up by the time order and function for sending page request, Then pattern match is carried out with target URL arrangement set, obtains the conversion ratio between the corresponding page of each URL.So as to letter The process for changing the conversion ratio between the analysis page, saves time cost.
Detailed description of the invention
By reading a detailed description of non-restrictive embodiments in the light of the attached drawings below, the application's is other Feature, objects and advantages will become more apparent upon:
Fig. 1 is that this application can be applied to exemplary system architecture figures therein;
Fig. 2 is the flow chart according to one embodiment of the information output method of the application;
Fig. 3 is the schematic diagram according to an application scenarios of the information output method of the application;
Fig. 4 is the flow chart according to another embodiment of the information output method of the application;
Fig. 5 is the structural schematic diagram according to one embodiment of the information output apparatus of the application;
Fig. 6 is adapted for the structural schematic diagram for the computer system for realizing the server of the embodiment of the present application.
Specific embodiment
The application is described in further detail with reference to the accompanying drawings and examples.It is understood that this place is retouched The specific embodiment stated is used only for explaining related invention, rather than the restriction to the invention.It also should be noted that in order to Convenient for description, part relevant to related invention is illustrated only in attached drawing.
It should be noted that in the absence of conflict, the features in the embodiments and the embodiments of the present application can phase Mutually combination.The application is described in detail below with reference to the accompanying drawings and in conjunction with the embodiments.
Fig. 1 is shown can be using the exemplary system of the embodiment of the information output method or information output apparatus of the application System framework 100.
As shown in Figure 1, system architecture 100 may include terminal device 101,102,103, network 104 and server 105. Network 104 between terminal device 101,102,103 and server 105 to provide the medium of communication link.Network 104 can be with Including various connection types, such as wired, wireless communication link or fiber optic cables etc..
User can be used terminal device 101,102,103 and be interacted by network 104 with server 105, to receive or send out Send message etc..Various telecommunication customer end applications can be installed, such as web browser is answered on terminal device 101,102,103 With, shopping class application, searching class application, instant messaging tools, mailbox client, social platform software etc..
Terminal device 101,102,103 can be the various electronic equipments with display screen and supported web page browsing, packet Include but be not limited to smart phone, tablet computer, E-book reader, MP3 player (Moving Picture Experts Group Audio Layer III, dynamic image expert's compression standard audio level 3), MP4 (Moving Picture Experts Group Audio Layer IV, dynamic image expert's compression standard audio level 4) it is player, on knee portable Computer and desktop computer etc..
Server 105 can be to provide the server of various services, such as to showing on terminal device 101,102,103 Webpage provides the backstage web page server supported.Backstage web page server can to receive Webpage request etc. data into The processing such as row analysis, and processing result (such as webpage data) is fed back into terminal device.
It should be noted that information output method provided by the embodiment of the present application is generally executed by server 105, accordingly Ground, information output apparatus are generally positioned in server 105.
It should be understood that the number of terminal device, network and server in Fig. 1 is only schematical.According to realization need It wants, can have any number of terminal device, network and server.
With continued reference to Fig. 2, the process 200 of one embodiment of the information output method according to the application is shown.The letter Cease output method, comprising the following steps:
Step 201, at least one user, access log for recording the page request sent are obtained.
In the present embodiment, the electronic equipment (such as server shown in FIG. 1) of information output method operation thereon can To receive page request using its terminal for carrying out web page browsing from user by wired connection mode or radio connection And record and send the time of page request and the content of page request to generate access log, it can also be obtained from third-party server The access log for the page request for taking record user to send.Wherein, the content of above-mentioned page request includes that user's expectation is clear The address for the webpage look at, i.e. network address.In practice, network address is generally indicated by URL.It is pointed out that above-mentioned wireless connection side Formula can include but is not limited to 3G/4G connection, WiFi connection, bluetooth connection, WiMAX connection, Zigbee connection, UWB (ultra Wideband) connection and other currently known or exploitation in the future radio connections.
In general, user browses webpage using the web browser installed in terminal, at this moment, user can be by directly defeated Enter the chain in the webpage presented in network address or webpage clicking browser to fetch to web page server initiation page request.In this reality Apply in example, above-mentioned webpage may include html format, xhtml format, asp format, php format, jsp format, shtml format, The webpage of nsp format, the webpage of xml format or other following formats by exploitation is (as long as the web page files of this format can With opened and browsed with browser it includes the contents such as picture, animation, text).
In some optional implementations of the present embodiment, obtain at least one user, sent for recording The access log of page request, comprising: obtain the original log of the access information including at least one user, wherein access letter Breath includes at least one of the following: page request, page elements request and style sheet request;The page is filtered out from original log Element request and style sheet are requested, and the page request of same user is combined into the access log of the user.For example, can make With product line PHP (Hypertext Preprocessor, HyperText Preprocessor) original log of default print.Then To the data cleansing process of original log, specifically includes that and filter out CSS (Cascading Style Sheets, cascading style Table), the requests of the page elements and style sheet such as JPG, reduce the data volume of follow-up phase processing.Finally, original log is pressed Cutting is carried out according to user's granularity, generates the access log of each user
Step 202, for the access log of each user, by the unification in the page request sent in the access log Resource Locator is sorted to obtain the uniform resource locator of the user by the sequence after arriving first according to the time for sending page request Sequence.
In the present embodiment, URL is arranged according to the timestamp that the user recorded in access log sends page request Sequence obtains uniform resource locator sequence, i.e. URL sequence, and URL sequence is exemplified below:
The uniform resource locator sequence of user U1:
Http:// www.test.com/1/index.php, http://www.test.com/1/detail.php
The uniform resource locator sequence of user U2:
Http:// www.test.com/2/index.php, http://www.test.com/1/index.php, Http:// www.test.com/1/detail.php, http://www.test.com/2/detail.php
The uniform resource locator sequence of user U3:
http://www.test.com/1/index.php
Step 203, destination Uniform Resource finger URL arrangement set to be counted is obtained.
In the present embodiment, each destination Uniform Resource finger URL in each destination Uniform Resource finger URL sequence is by transmission The time of page request by after arriving first sequence sort, the page corresponding to posterior destination Uniform Resource finger URL that sorts be by Page jump corresponding to and the preceding destination Uniform Resource finger URL of sequence adjacent with the destination Uniform Resource finger URL.Mesh Marking longest destination Uniform Resource finger URL sequence in uniform resource locator arrangement set is the longest page jump institute in path Need by all uniform resource locator, remaining destination Uniform Resource finger URL sequence be the longest destination Uniform Resource Each length substring of finger URL sequence, length mentioned here are the numbers of URL.
For example, destination Uniform Resource finger URL sequence to be counted are as follows:
Http:// www.test.com/1/index.php, http://www.test.com/1/detail.php, http://www.test.com/2/detail.php
Then destination Uniform Resource finger URL arrangement set includes the substring http://www.test.com/1/ that length is 1 Index.php, substring the http://www.test.com/1/index.php, http://www.test.com/1/ that length is 2 Substring the http://www.test.com/1/index.php, http that detail.php and length are 3: // Www.test.com/1/detail.php, http://www.test.com/2/detail.php.
Step 204, by the uniform resource locator sequence of each user and destination Uniform Resource finger URL arrangement set into Row pattern match obtains what each destination Uniform Resource finger URL sequence occurred in the uniform resource locator sequence of each user Number.
In the present embodiment, pattern match is a kind of basic operation of character string in data structure, gives a substring, It asks and finds out all substrings identical with the substring in some character string, here it is pattern match.Common schema matching algorithm packet Include simple pattern matching algorithm, KMP matching algorithm and BM matching algorithm etc..Wherein, the algorithm of simple pattern matching algorithm Thought are as follows: from the first character of target strings compared with the first character of pattern string, if equal, continue to carry out character Subsequent comparison, otherwise target strings from second character with the first character of pattern string again compared with, until pattern string in Each character it is successively equal with a continuous character string in target strings until, referred to as successful match at this time, otherwise With failure.The full name of KMP matching algorithm is Knuth-Morris-Pratt algorithm, be by D.E.Knuth, J.H.Morris and The innovatory algorithm that V.R.Pratt is proposed jointly eliminates in simple pattern matching algorithm and recalls problem, completes the mould of string Formula matching.BM algorithm is a kind of precise character string matching algorithm (being different from fuzzy matching).Using the method compared from right to left, Two kinds of heuristic rules have been applied to it simultaneously, i.e., batter accords with rule and becomes reconciled suffix rule, to determine the distance jumped to the right.
For example, pattern string 1 can be set by the uniform resource locator sequence of user U1:
Http:// www.test.com/1/index.php, http://www.test.com/1/detail.php
Pattern string 2 is set by the uniform resource locator sequence of user U2:
Http:// www.test.com/2/index.php, http://www.test.com/1/index.php, Http:// www.test.com/1/detail.php, http://www.test.com/2/detail.php
Pattern string 3 is set by the uniform resource locator sequence of user U3:
http://www.test.com/1/index.php
Target strings are set by destination Uniform Resource finger URL sequence:
Http:// www.test.com/1/index.php, http://www.test.com/1/detail.php, http://www.test.com/2/detail.php
The substring http://www.test.com/1/index.php that then length of target strings is 1 is in the pattern string of place Occur 3 times;
Then the length of target strings be 2 substring http://www.test.com/1/index.php, http: // Www.test.com/1/detail.php occurs 2 times in the pattern string of place;
Then the length of target strings be 3 substring http://www.test.com/1/index.php, http: // Www.test.com/1/detail.php, http://www.test.com/2/detail.php occur 1 in the pattern string of place It is secondary.
Step 205, the page according to corresponding to destination Uniform Resource finger URL in each destination Uniform Resource finger URL sequence Between jump relationship, each destination Uniform Resource finger URL sequence is occurred in the uniform resource locator sequence of each user Number be converted in each destination Uniform Resource finger URL sequence and to turn between the page corresponding to destination Uniform Resource finger URL The rate of changing is exported.
In the present embodiment, in upper example from the page corresponding to http://www.test.com/1/index.php to The conversion ratio of the page corresponding to http://www.test.com/1/detail.php are as follows: 2/3=67%.From http: // The page corresponding to www.test.com/1/index.php is through corresponding to http://www.test.com/1/detail.php The page to http://www.test.com/2/detail.php corresponding to the page conversion ratio are as follows: 1/3=33%.To The layer-by-layer conversion ratio of target URL can be counted, then in digital form or diagrammatic form output.
In some optional implementations of the present embodiment, united according to target in each destination Uniform Resource finger URL sequence Conversion ratio between the page corresponding to one Resource Locator generates crater blasting.Crater blasting be suitable for operation flow compare specification, Process analysis more than period length, link intuitively can be found and be described the problem by the comparison of each link business datum of funnel Place.In web analytics, compare commonly used in conversion ratio, it can not only show that user buys most from into website to realization Whole conversion ratio can also show the conversion ratio of each step.
With continued reference to the schematic diagram that Fig. 3, Fig. 3 are according to the application scenarios of the information output method of the present embodiment.? In the application scenarios of Fig. 3, multiple users send the page request of browsing pages 301, and certain customers continue to send browsing pages 302 Page request, certain customers continue send browsing pages 303 page request.The page request that user sends all is documented in clothes It is engaged in the access log on device.Server obtains URL and target URL progress pattern match in different user page request From the page 301 to the conversion ratio X% of the page 302, and the conversion ratio Y% from the page 301 through the page 302 to the page 303.
The method provided by the above embodiment of the application passes through the conversion ratio between the page is related to the pattern match of URL Connection reduces the complexity of the conversion ratio between the analysis page.
With further reference to Fig. 4, it illustrates the processes 400 of another embodiment of information output method.Information output The process 400 of method, comprising the following steps:
Step 401, at least one user, access log for recording the page request sent are obtained.
Step 402, for the access log of each user, by the unification in the page request sent in the access log Resource Locator is sorted to obtain the uniform resource locator of the user by the sequence after arriving first according to the time for sending page request Sequence.
Step 401,402 with step 201,202 essentially identical, therefore repeat no more.
Step 403, each uniform resource locator single character is mapped as according to preset character map to be reflected Uniform resource locator sequence after penetrating.
In the present embodiment, character map is used to characterize the corresponding relationship of uniform resource locator Yu single character.This The generation of character map, there are two types of scheme is available: artificially collecting, automatically generates.In the character map of generation, every URL record is marked with monocase, monocase can include: 0-9 Arabic numerals, lowercase, capitalization, spcial character Deng, can according to ASCII character obtain respective value.
The two schemes that character map generates are described in detail individually below:
(a) it artificially collects: the URL of all page requests of product line is collected and is summarized
(b) it automatically generates: to the logged result after original log cleaning, sending curl (CommandLine respectively Uniform Resource Locator, order line uniform resource locator) it requests, acquisition returns the result: if wrapped in result Include " <!DOCTYPE html " class page html label illustrates that the corresponding request of this URL is page request, rather than interface is asked It asks.The set of URL for successively obtaining all pages of product closes.
The character map of generation, every row URL are recorded as two column:
Original URL mapping character
http://www.test.com/1/index.php A
http://www.test.com/1/detail.php B
http://www.test.com/2/index.php C
http://www.test.com/2/detail.php D
In this way, the mapping URL sequence that user U1 is generated are as follows: AB
The mapping URL sequence that user U2 is generated are as follows: CABD
The mapping URL sequence that user U3 is generated are as follows: A
In some optional implementations of the present embodiment, single character and at least one unified money in character map Source finger URL is corresponding.That is, multiple URL can be mapped as same character.For example, to same article with the different pages from multiple angles It can be the same character by this series of page-map when degree is shown.In this way convenient for by the conversion ratio of the class statistics page.
Step 404, destination Uniform Resource finger URL arrangement set to be counted is obtained, and according to character map by target Each destination Uniform Resource finger URL sequence in uniform resource locator arrangement set is mapped as by the word on character map Accord with the destination Uniform Resource finger URL sequence of composition.
In the present embodiment, shown in example as above, URL sequence to be counted, i.e. destination path map URL sequence are as follows: ABD.
Step 405, by the uniform resource locator sequence of each user and destination Uniform Resource finger URL arrangement set into Row pattern match obtains what each destination Uniform Resource finger URL sequence occurred in the uniform resource locator sequence of each user Number.
In the present embodiment, the unified resource after character map maps of each user is positioned by KMP algorithm It accords with sequence and carries out pattern match with the destination Uniform Resource finger URL arrangement set after character map maps.
The pattern matching process indicated in upper example with URL can simplify are as follows:
Each user maps URL sequence=> pattern string;
Destination path to be counted maps URL sequence=> target strings;
By using KMP Matching Algorithm of String Pattern, each length substring of target strings is determined, it is matched in pattern string Number.Reduce the complexity of analysis.
Such as:
User's U1 sequence of mapping: AB (pattern string 1)
User's U2 sequence of mapping: CABD (pattern string 2)
User's U3 sequence of mapping: A (pattern string 3)
Destination path sequence of mapping to be counted are as follows: ABD (target strings)
That is: it need to count in target strings respectively, length=1 character string A, length=2 character string AB, length=3 character strings ABD, the frequency occurred in all pattern strings.It, can using KMP Matching Algorithm of String Pattern according to the problem after such conversion It solves.
Length=1 character string A: occur 3 times in all pattern strings;
Length=2 character string AB: occur 2 times in all pattern strings;
Length=3 character string ABD: occur 1 time in all pattern strings;
Step 406, the page according to corresponding to destination Uniform Resource finger URL in each destination Uniform Resource finger URL sequence Between jump relationship, each destination Uniform Resource finger URL sequence is occurred in the uniform resource locator sequence of each user Number be converted in each destination Uniform Resource finger URL sequence and to turn between the page corresponding to destination Uniform Resource finger URL The rate of changing is exported.
In the present embodiment, the number occurred in pattern string by the target strings that step 405 obtains, can be obtained the page it Between conversion ratio, for example, A- > B conversion ratio: 2/3=67%, A- > B- > D conversion ratio: 1/3=33%.
Figure 4, it is seen that compared with the corresponding embodiment of Fig. 2, the process of the information output method in the present embodiment 400 highlight the step of carrying out character mapping to URL.The scheme of the present embodiment description can simplify pattern matching process as a result, The matched complexity of reduction mode, to reduce the complexity for calculating conversion ratio between the page.
With further reference to Fig. 5, as the realization to method shown in above-mentioned each figure, this application provides a kind of outputs of information to fill The one embodiment set, the Installation practice is corresponding with embodiment of the method shown in Fig. 2, which specifically can be applied to respectively In kind electronic equipment.
As shown in figure 5, the information output apparatus 500 of the present embodiment include: first acquisition unit 501, sequencing unit 502, Second acquisition unit 503, matching unit 504 and output unit 505.Wherein, first acquisition unit 501 is for obtaining at least one User, access log for recording the page request sent, wherein page request includes uniform resource locator, is visited Ask that log includes the time for sending page request;Sequencing unit 502 is used for the access log for each user, by the access day The uniform resource locator in page request sent in will is arranged according to the time for sending page request by the sequence after arriving first Sequence obtains the uniform resource locator sequence of the user;Second acquisition unit 503 is for obtaining destination Uniform Resource to be counted Finger URL arrangement set, wherein each destination Uniform Resource finger URL in each destination Uniform Resource finger URL sequence is by transmission The time of page request by after arriving first sequence sort, the page corresponding to posterior destination Uniform Resource finger URL that sorts be by Page jump corresponding to and the preceding destination Uniform Resource finger URL of sequence adjacent with the destination Uniform Resource finger URL;? It is used to the uniform resource locator sequence of each user and destination Uniform Resource finger URL arrangement set carrying out mould with unit 504 Formula matching obtains time that each destination Uniform Resource finger URL sequence occurs in the uniform resource locator sequence of each user Number;Output unit 505 is used for the page according to corresponding to destination Uniform Resource finger URL in each destination Uniform Resource finger URL sequence Relationship is jumped between face, each destination Uniform Resource finger URL sequence is gone out in the uniform resource locator sequence of each user Existing number is converted in each destination Uniform Resource finger URL sequence between the page corresponding to destination Uniform Resource finger URL Conversion ratio is exported.
In the present embodiment, the first acquisition unit 501 of information output apparatus 500, sequencing unit 502, second obtain single The specific processing of member 503, matching unit 504 and output unit 505 can be with reference to step 201, the step in Fig. 2 corresponding embodiment 202, step 203, step 204 and step 205.
In some optional implementations of the present embodiment, device 500 further includes map unit (not shown), is used for: Before the uniform resource locator sequence of each user and destination Uniform Resource finger URL arrangement set are carried out pattern match, Each uniform resource locator is mapped as single character according to preset character map, the unified resource after being mapped is fixed Position symbol sequence, wherein character map is used to characterize the corresponding relationship of uniform resource locator Yu single character;It is reflected according to character Each destination Uniform Resource finger URL sequence in destination Uniform Resource finger URL arrangement set is mapped as being reflected by character by firing table The destination Uniform Resource finger URL sequence of character composition on firing table.
In some optional implementations of the present embodiment, matching unit 504 is further used for: will by KMP algorithm The uniform resource locator sequence after character map maps of each user is united with the target after character map maps One Resource Locator arrangement set carries out pattern match.
In some optional implementations of the present embodiment, first acquisition unit 501 is further used for: obtaining includes extremely The original log of the access information of a few user, wherein access information includes at least one of the following: page request, page member Element request and style sheet request;Page elements request and style sheet request are filtered out from original log, and by same use The page request at family is combined into the access log of the user.
In some optional implementations of the present embodiment, single character and at least one unified money in character map Source finger URL is corresponding.
In some optional implementations of the present embodiment, device 500 further include: generation unit (not shown) is used for It is generated according to the conversion ratio between the page corresponding to destination Uniform Resource finger URL in each destination Uniform Resource finger URL sequence Crater blasting.
Below with reference to Fig. 6, it illustrates the computer systems 600 for the server for being suitable for being used to realize the embodiment of the present application Structural schematic diagram.Server shown in Fig. 6 is only an example, should not function and use scope band to the embodiment of the present application Carry out any restrictions.
As shown in fig. 6, computer system 600 includes central processing unit (CPU) 601, it can be read-only according to being stored in Program in memory (ROM) 602 or be loaded into the program in random access storage device (RAM) 603 from storage section 608 and Execute various movements appropriate and processing.In RAM 603, also it is stored with system 600 and operates required various programs and data. CPU 601, ROM 602 and RAM 603 are connected with each other by bus 604.Input/output (I/O) interface 605 is also connected to always Line 604.
I/O interface 605 is connected to lower component: the importation 606 including keyboard, mouse etc.;It is penetrated including such as cathode The output par, c 607 of spool (CRT), liquid crystal display (LCD) etc. and loudspeaker etc.;Storage section 608 including hard disk etc.; And the communications portion 609 of the network interface card including LAN card, modem etc..Communications portion 609 via such as because The network of spy's net executes communication process.Driver 610 is also connected to I/O interface 605 as needed.Detachable media 611, such as Disk, CD, magneto-optic disk, semiconductor memory etc. are mounted on as needed on driver 610, in order to read from thereon Computer program be mounted into storage section 608 as needed.
Particularly, in accordance with an embodiment of the present disclosure, it may be implemented as computer above with reference to the process of flow chart description Software program.For example, embodiment of the disclosure includes a kind of computer program product comprising be carried on computer-readable medium On computer program, which includes the program code for method shown in execution flow chart.In such reality It applies in example, which can be downloaded and installed from network by communications portion 609, and/or from detachable media 611 are mounted.When the computer program is executed by central processing unit (CPU) 601, limited in execution the present processes Above-mentioned function.It should be noted that computer-readable medium described herein can be computer-readable signal media or Computer readable storage medium either the two any combination.Computer readable storage medium for example can be --- but Be not limited to --- electricity, magnetic, optical, electromagnetic, infrared ray or semiconductor system, device or device, or any above combination. The more specific example of computer readable storage medium can include but is not limited to: have one or more conducting wires electrical connection, Portable computer diskette, hard disk, random access storage device (RAM), read-only memory (ROM), erasable type may be programmed read-only deposit Reservoir (EPROM or flash memory), optical fiber, portable compact disc read-only memory (CD-ROM), light storage device, magnetic memory Part or above-mentioned any appropriate combination.In this application, computer readable storage medium, which can be, any include or stores The tangible medium of program, the program can be commanded execution system, device or device use or in connection.And In the application, computer-readable signal media may include in a base band or the data as the propagation of carrier wave a part are believed Number, wherein carrying computer-readable program code.The data-signal of this propagation can take various forms, including but not It is limited to electromagnetic signal, optical signal or above-mentioned any appropriate combination.Computer-readable signal media can also be computer Any computer-readable medium other than readable storage medium storing program for executing, the computer-readable medium can send, propagate or transmit use In by the use of instruction execution system, device or device or program in connection.Include on computer-readable medium Program code can transmit with any suitable medium, including but not limited to: wireless, electric wire, optical cable, RF etc., Huo Zheshang Any appropriate combination stated.
Flow chart and block diagram in attached drawing are illustrated according to the system of the various embodiments of the application, method and computer journey The architecture, function and operation in the cards of sequence product.In this regard, each box in flowchart or block diagram can generation A part of one module, program segment or code of table, a part of the module, program segment or code include one or more use The executable instruction of the logic function as defined in realizing.It should also be noted that in some implementations as replacements, being marked in box The function of note can also occur in a different order than that indicated in the drawings.For example, two boxes succeedingly indicated are actually It can be basically executed in parallel, they can also be executed in the opposite order sometimes, and this depends on the function involved.Also it to infuse Meaning, the combination of each box in block diagram and or flow chart and the box in block diagram and or flow chart can be with holding The dedicated hardware based system of functions or operations as defined in row is realized, or can use specialized hardware and computer instruction Combination realize.
Being described in unit involved in the embodiment of the present application can be realized by way of software, can also be by hard The mode of part is realized.Described unit also can be set in the processor, for example, can be described as: a kind of processor packet Include first acquisition unit, sequencing unit, second acquisition unit, matching unit and output unit.Wherein, the title of these units exists The restriction to the unit itself is not constituted in the case of certain, for example, acquiring unit is also described as " obtaining at least one User, access log for recording the page request sent unit ".
As on the other hand, present invention also provides a kind of computer-readable medium, which be can be Included in device described in above-described embodiment;It is also possible to individualism, and without in the supplying device.Above-mentioned calculating Machine readable medium carries one or more program, when said one or multiple programs are executed by the device, so that should Device: at least one user, access log for recording the page request sent are obtained, wherein page request includes Uniform resource locator, access log include the time for sending page request;For the access log of each user, by the access The uniform resource locator in page request sent in log is according to the time of transmission page request by the sequence after arriving first Sequence obtains the uniform resource locator sequence of the user;Destination Uniform Resource finger URL arrangement set to be counted is obtained, In, each destination Uniform Resource finger URL in each destination Uniform Resource finger URL sequence is by the time of transmission page request by elder generation Sequence sequence after arriving, the page corresponding to posterior destination Uniform Resource finger URL that sorts is by determining with the destination Uniform Resource Position accords with page jump corresponding to the adjacent and preceding destination Uniform Resource finger URL of sequence;By the unified resource of each user Finger URL sequence and destination Uniform Resource finger URL arrangement set carry out pattern match, obtain each destination Uniform Resource finger URL The number that sequence occurs in the uniform resource locator sequence of each user;According to mesh in each destination Uniform Resource finger URL sequence Relationship is jumped between the page corresponding to mark uniform resource locator, by each destination Uniform Resource finger URL sequence in each use The number occurred in the uniform resource locator sequence at family is converted to the unified money of target in each destination Uniform Resource finger URL sequence Conversion ratio between the page corresponding to the finger URL of source is exported.
Above description is only the preferred embodiment of the application and the explanation to institute's application technology principle.Those skilled in the art Member is it should be appreciated that invention scope involved in the application, however it is not limited to technology made of the specific combination of above-mentioned technical characteristic Scheme, while should also cover in the case where not departing from the inventive concept, it is carried out by above-mentioned technical characteristic or its equivalent feature Any combination and the other technical solutions formed.Such as features described above has similar function with (but being not limited to) disclosed herein Can technical characteristic replaced mutually and the technical solution that is formed.

Claims (14)

1. a kind of information output method, which is characterized in that the described method includes:
Obtain at least one user, access log for recording the page request sent, wherein page request includes system One Resource Locator, access log include the time for sending page request;
For the access log of each user, the uniform resource locator in the page request sent in the access log is pressed It is approved for distribution that the time of page request is sent to be sorted to obtain the uniform resource locator sequence of the user by the sequence after arriving first;
Obtain destination Uniform Resource finger URL arrangement set to be counted, wherein in each destination Uniform Resource finger URL sequence Each destination Uniform Resource finger URL sorted by the time for sending page request by the sequence after arriving first, the posterior target of sorting system The page corresponding to one Resource Locator is provided by adjacent with the destination Uniform Resource finger URL and the preceding target of sequence is unified Page jump corresponding to the finger URL of source;
The uniform resource locator sequence of each user and the destination Uniform Resource finger URL arrangement set are subjected to mode Match, obtains the number that each destination Uniform Resource finger URL sequence occurs in the uniform resource locator sequence of each user;
Pass is jumped according between the page corresponding to destination Uniform Resource finger URL in each destination Uniform Resource finger URL sequence System, the number that each destination Uniform Resource finger URL sequence occurs in the uniform resource locator sequence of each user is converted to Conversion ratio in each destination Uniform Resource finger URL sequence between the page corresponding to destination Uniform Resource finger URL is exported.
2. the method according to claim 1, wherein by the uniform resource locator sequence of each user and institute Before stating destination Uniform Resource finger URL arrangement set progress pattern match, the method also includes:
Each uniform resource locator is mapped as single character according to preset character map, the unified money after being mapped Source finger URL sequence, wherein the character map is used to characterize the corresponding relationship of uniform resource locator Yu single character;
Each destination Uniform Resource in the destination Uniform Resource finger URL arrangement set is determined according to the character map Position symbol sequence is mapped as the destination Uniform Resource finger URL sequence being made of the character on the character map.
3. according to the method described in claim 2, it is characterized in that, the uniform resource locator sequence by each user with The destination Uniform Resource finger URL arrangement set carries out pattern match, comprising:
The uniform resource locator sequence after character map maps of each user is mapped with through character by KMP algorithm Destination Uniform Resource finger URL arrangement set after table mapping carries out pattern match.
4. the method according to claim 1, wherein it is described obtain at least one user, for record sent out The access log for the page request sent, comprising:
Obtain the original log of the access information including at least one user, wherein access information includes at least one of the following: page Request in person ask, page elements request and style sheet request;
Page elements request and style sheet request are filtered out from the original log, and by the page request group of same user Synthesize the access log of the user.
5. according to the method described in claim 2, single character and at least one unified resource position in the character map It accords with corresponding.
6. method described in one of -5 according to claim 1, which is characterized in that the method also includes:
According to the conversion ratio between the page corresponding to destination Uniform Resource finger URL in each destination Uniform Resource finger URL sequence Generate crater blasting.
7. a kind of information output apparatus, which is characterized in that described device includes:
First acquisition unit, for obtaining at least one user, access log for recording the page request sent, In, page request includes uniform resource locator, and access log includes the time for sending page request;
Sequencing unit, for the access log for each user, by the system in the page request sent in the access log One Resource Locator is positioned according to the time for sending page request by the unified resource that the sequence after arriving first sorts to obtain the user Accord with sequence;
Second acquisition unit, for obtaining destination Uniform Resource finger URL arrangement set to be counted, wherein each target is unified Each destination Uniform Resource finger URL in Resource Locator sequence is sorted by the time for sending page request by the sequence after arriving first, The page corresponding to posterior destination Uniform Resource finger URL that sorts is by and sequence adjacent with the destination Uniform Resource finger URL Page jump corresponding to preceding destination Uniform Resource finger URL;
Matching unit, for by the uniform resource locator sequence of each user and the destination Uniform Resource finger URL sequence sets It closes and carries out pattern match, obtain each destination Uniform Resource finger URL sequence and go out in the uniform resource locator sequence of each user Existing number;
Output unit, for the page according to corresponding to destination Uniform Resource finger URL in each destination Uniform Resource finger URL sequence Between jump relationship, each destination Uniform Resource finger URL sequence is occurred in the uniform resource locator sequence of each user Number be converted in each destination Uniform Resource finger URL sequence and to turn between the page corresponding to destination Uniform Resource finger URL The rate of changing is exported.
8. device according to claim 7, which is characterized in that described device further includes map unit, is used for:
The uniform resource locator sequence of each user and the destination Uniform Resource finger URL arrangement set are being subjected to mode Before matching, each uniform resource locator is mapped as by single character according to preset character map, after being mapped Uniform resource locator sequence, wherein the character map is corresponding with single character for characterizing uniform resource locator Relationship;
Each destination Uniform Resource in the destination Uniform Resource finger URL arrangement set is determined according to the character map Position symbol sequence is mapped as the destination Uniform Resource finger URL sequence being made of the character on the character map.
9. device according to claim 8, which is characterized in that the matching unit is further used for:
The uniform resource locator sequence after character map maps of each user is mapped with through character by KMP algorithm Destination Uniform Resource finger URL arrangement set after table mapping carries out pattern match.
10. device according to claim 7, which is characterized in that the first acquisition unit is further used for:
Obtain the original log of the access information including at least one user, wherein access information includes at least one of the following: page Request in person ask, page elements request and style sheet request;
Page elements request and style sheet request are filtered out from the original log, and by the page request group of same user Synthesize the access log of the user.
11. device according to claim 8, single character and at least one unified resource are positioned in the character map It accords with corresponding.
12. the device according to one of claim 7-11, which is characterized in that described device further include:
Generation unit, for the page according to corresponding to destination Uniform Resource finger URL in each destination Uniform Resource finger URL sequence Between conversion ratio generate crater blasting.
13. a kind of server, comprising:
One or more processors;
Storage device, for storing one or more programs,
When one or more of programs are executed by one or more of processors, so that one or more of processors are real Now such as method as claimed in any one of claims 1 to 6.
14. a kind of computer readable storage medium, is stored thereon with computer program, which is characterized in that the program is by processor Such as method as claimed in any one of claims 1 to 6 is realized when execution.
CN201710454694.XA 2017-06-15 2017-06-15 Information output method and device Pending CN109145230A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710454694.XA CN109145230A (en) 2017-06-15 2017-06-15 Information output method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710454694.XA CN109145230A (en) 2017-06-15 2017-06-15 Information output method and device

Publications (1)

Publication Number Publication Date
CN109145230A true CN109145230A (en) 2019-01-04

Family

ID=64830305

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710454694.XA Pending CN109145230A (en) 2017-06-15 2017-06-15 Information output method and device

Country Status (1)

Country Link
CN (1) CN109145230A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111310044A (en) * 2020-02-14 2020-06-19 北京百度网讯科技有限公司 Method, device, equipment and storage medium for extracting page element information
CN114429360A (en) * 2021-12-29 2022-05-03 神策网络科技(北京)有限公司 Conversion rate determination method, device, electronic equipment and computer readable storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103810184A (en) * 2012-11-07 2014-05-21 阿里巴巴集团控股有限公司 Method for determining web page address velocity, optimization method and device of methods
CN106055572A (en) * 2016-05-20 2016-10-26 百度在线网络技术(北京)有限公司 Method and device for processing page transformation parameter
CN106095979A (en) * 2016-06-20 2016-11-09 百度在线网络技术(北京)有限公司 URL merging treatment method and apparatus
CN106294559A (en) * 2016-07-26 2017-01-04 北京三快在线科技有限公司 A kind of application traffic analysis method and device
CN106528569A (en) * 2015-09-11 2017-03-22 北京国双科技有限公司 Method and device for calculating validity of site search

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103810184A (en) * 2012-11-07 2014-05-21 阿里巴巴集团控股有限公司 Method for determining web page address velocity, optimization method and device of methods
CN106528569A (en) * 2015-09-11 2017-03-22 北京国双科技有限公司 Method and device for calculating validity of site search
CN106055572A (en) * 2016-05-20 2016-10-26 百度在线网络技术(北京)有限公司 Method and device for processing page transformation parameter
CN106095979A (en) * 2016-06-20 2016-11-09 百度在线网络技术(北京)有限公司 URL merging treatment method and apparatus
CN106294559A (en) * 2016-07-26 2017-01-04 北京三快在线科技有限公司 A kind of application traffic analysis method and device

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111310044A (en) * 2020-02-14 2020-06-19 北京百度网讯科技有限公司 Method, device, equipment and storage medium for extracting page element information
CN111310044B (en) * 2020-02-14 2023-09-26 北京百度网讯科技有限公司 Page element information extraction method, device, equipment and storage medium
CN114429360A (en) * 2021-12-29 2022-05-03 神策网络科技(北京)有限公司 Conversion rate determination method, device, electronic equipment and computer readable storage medium

Similar Documents

Publication Publication Date Title
CN107679211A (en) Method and apparatus for pushed information
CN109460513A (en) Method and apparatus for generating clicking rate prediction model
CN107105031A (en) Information-pushing method and device
CN107609890A (en) A kind of method and apparatus of order tracking
CN105718559B (en) Search forms pages and the method and apparatus of target pages transforming relationship
CN108572990A (en) Information-pushing method and device
CN107885873A (en) Method and apparatus for output information
CN109981322A (en) The method and apparatus of cloud resource management based on label
CN108170843B (en) Method and apparatus for obtaining data
CN109857971A (en) Page rendering method and apparatus
CN109002440A (en) Method, apparatus and system for big data multidimensional analysis
CN108776692A (en) Method and apparatus for handling information
CN107169077A (en) Method and apparatus for pushed information
CN109408754A (en) Processing method, device, electronic equipment and the storage medium of web page operation data
CN107517251A (en) Information-pushing method and device
CN110515968A (en) Method and apparatus for output information
CN109101309A (en) For updating user interface method and device
CN109284367A (en) Method and apparatus for handling text
CN109062560A (en) Method and apparatus for generating information
CN108052290A (en) For storing the method and apparatus of data
CN108062423B (en) Information-pushing method and device
CN109145230A (en) Information output method and device
CN108932640A (en) Method and apparatus for handling order
CN108984070A (en) Method, apparatus, electronic equipment and readable medium for thermodynamic chart imaging
CN108182180B (en) Method and apparatus for generating information

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination