CN106557473A - The method and apparatus for generating path - Google Patents

The method and apparatus for generating path Download PDF

Info

Publication number
CN106557473A
CN106557473A CN201510617172.8A CN201510617172A CN106557473A CN 106557473 A CN106557473 A CN 106557473A CN 201510617172 A CN201510617172 A CN 201510617172A CN 106557473 A CN106557473 A CN 106557473A
Authority
CN
China
Prior art keywords
page
unit
path
subchain
key word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201510617172.8A
Other languages
Chinese (zh)
Other versions
CN106557473B (en
Inventor
王江伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Gridsum Technology Co Ltd
Original Assignee
Beijing Gridsum Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Gridsum Technology Co Ltd filed Critical Beijing Gridsum Technology Co Ltd
Priority to CN201510617172.8A priority Critical patent/CN106557473B/en
Publication of CN106557473A publication Critical patent/CN106557473A/en
Application granted granted Critical
Publication of CN106557473B publication Critical patent/CN106557473B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/955Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • G06F16/9558Details of hyperlinks; Management of linked annotations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241Advertisements
    • G06Q30/0251Targeted advertisements
    • G06Q30/0255Targeted advertisements based on user history
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241Advertisements
    • G06Q30/0277Online advertisement

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Databases & Information Systems (AREA)
  • Finance (AREA)
  • Strategic Management (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Accounting & Taxation (AREA)
  • Development Economics (AREA)
  • General Engineering & Computer Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Game Theory and Decision Science (AREA)
  • Data Mining & Analysis (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention discloses a kind of method and apparatus for generating path, is related to Internet technical field, the path and the unmatched problem of user's real demand of artificial subjective setting in prior art is can solve the problem that.The method of the present invention includes:Obtain the access information of each page;Determine page rank of the access information in each page;According to the page rank, the source key word of corresponding each page of the access information is extracted;Find out all popularization units comprising the source key word;Targeted promotion unit is obtained from all popularization units for finding, the targeted promotion unit is comprising the most popularization unit of source key word;The uniform resource position mark URL of at least one page of correspondence described targeted promotion unit is defined as into path subchain.The present invention is suitable for the scene that path is generated using user access activity.

Description

The method and apparatus for generating path
Technical field
The present invention relates to Internet technical field, more particularly to a kind of method and apparatus for generating path.
Background technology
Path is a kind of advertisement promotion pattern of search engine, and many strips are incorporated in common promotional content Chain, makes extension service possess more information expressive function, and which represents subchain and is referred to as " path subchain ".
, with website main link in same page presentation, user can be by clicking on main link for path subchain Into website homepage, then by repeatedly click, the page (target pages) wanted is found, also may be used By clicking on path subchain, directly to reach target pages.Therefore, path subchain can be greatly shortened User clicks on process, improves conversion ratio.If it follows that the path subchain for arranging is paid close attention to for user Content, then the clicking rate of website can be improved.
In prior art, path mainly by search marketing personnel rule of thumb or business demand arrange, It is subjective, tend not to meet the real demand of user, therefore there is the footpath of artificial subjective setting Footpath and the unmatched problem of user's real demand.
The content of the invention
In view of this, the present invention provides a kind of method and apparatus for generating path, can solve the problem that existing skill The path and the unmatched problem of user's real demand of artificial subjective setting in art.
According to one aspect of the invention, there is provided a kind of method of generation path, methods described include:
Obtain the access information of each page;
Determine page rank of the access information in each page;
According to the page rank, the source key word of corresponding each page of the access information is extracted;
Find out all popularization units comprising the source key word;
Targeted promotion unit is obtained from all popularization units for finding, the targeted promotion unit is Comprising the most popularization unit of source key word;
Will be the uniform resource position mark URL of at least one page of correspondence described targeted promotion unit true It is set to path subchain.
According to another aspect of the invention, there is provided a kind of device of generation path, described device include:
Acquiring unit, for obtaining the access information of each page;
Sequencing unit, for determining page rank of the access information in each page;
Extraction unit, for according to the page rank, extracting corresponding each page of the access information The source key word in face;
Searching unit, for finding out the institute of the source key word extracted comprising the extraction unit There is popularization unit;
The acquiring unit, obtains in being additionally operable to from the searching unit all popularization units for finding Targeted promotion unit, the targeted promotion unit are comprising the most popularization unit of source key word;
Determining unit, for the targeted promotion unit by the acquiring unit acquisition is corresponded at least The uniform resource position mark URL of one page is defined as path subchain.
By above-mentioned technical proposal, the method and apparatus of the generation path that the present invention is provided can obtained After taking the access information that family accesses website, each page is ranked up, the N name pages before obtaining, The all popularization units comprising the source key word of each page in the front N names page are then looked up, and Targeted promotion unit is obtained therefrom, finally by the URL of at least one page of correspondence targeted promotion unit It is defined as path subchain.Compared with prior art by artificial subjective determination path subchain, the present invention It is analyzed by the access information that website is accessed to user, first obtains the high page of user's attention rate (i.e. The front N names page), reentry the most popularization unit of the source key word comprising the front N names page (i.e. Targeted promotion unit), finally the URL of at least one page of correspondence targeted promotion unit is defined as Path subchain, so that path subchain of the path subchain for generating for user's real demand, Jin Erti High user enters the efficiency of target pages.
Described above is only the general introduction of technical solution of the present invention, in order to better understand the present invention's Technological means, and being practiced according to the content of description, and in order to allow the above-mentioned of the present invention and Other objects, features and advantages can become apparent, below especially exemplified by the specific embodiment of the present invention.
Description of the drawings
By the detailed description for reading hereafter preferred implementation, various other advantages and benefit for Those of ordinary skill in the art will be clear from understanding.Accompanying drawing is only used for the mesh for illustrating preferred implementation , and it is not considered as limitation of the present invention.And in whole accompanying drawing, with identical with reference to symbol Number represent identical part.In the accompanying drawings:
The flow chart that Fig. 1 shows a kind of method for generating path provided in an embodiment of the present invention;
Fig. 2 shows a kind of composition frame chart of device for generating path provided in an embodiment of the present invention;
Fig. 3 shows the composition frame chart of another kind of device for generating path provided in an embodiment of the present invention.
Specific embodiment
The exemplary embodiment of the disclosure is more fully described below with reference to accompanying drawings.Although showing in accompanying drawing The exemplary embodiment of the disclosure is shown, it being understood, however, that may be realized in various forms the disclosure And should not be limited by embodiments set forth here.On the contrary, there is provided these embodiments are able to more Thoroughly understand the disclosure, and can be by the scope of the present disclosure complete technology for conveying to this area Personnel.
A kind of method for generating path is embodiments provided, as shown in figure 1, the method includes:
101st, obtain the access information of each page.
In actual applications, advertisement master terminal can collect user's visit by various data acquisition technologys The access information of website is asked, then these access informations are stored in data warehouse, be easy to follow-up point Analysis and management.Wherein, the access information of user's access website includes the operating system class used by user Key word (i.e. the source key word of the page) that type, browser type, searched page are used, user Browse the time of each page and the essential information (such as account) of user etc..Obtaining above-mentioned basic letter After breath, by all access informations are counted and analyzed, corresponding each page can also be obtained The pageview of other information, such as page, conversion ratio and jump out rate etc..
It should be noted that during data of the terminal on Website server is gathered, some can be collected dirty Data, the data for for example repeating, other data unrelated with user access information, therefore when terminal is obtained After initial data, first the initial data can be carried out cleaning, the optimization operation such as format conversion, so as to Obtain valid data, then these valid data are saved in data warehouse carry out follow-up management with analysis.
Further, since user can be different in different phase content of interest, it is possible to gather User accesses the access information of website in the recent period, future access information is analyzed with will pass through, and obtains The higher information of recent user's attention rate.
102nd, determine page rank of the access information in each page.
After acquisition user accesses the access information of each page, can be according to certain index to each page Face is ranked up (be for example ranked up according to the pageview of the page), so as to obtain the front N names page. Wherein, N is positive integer.
In actual applications, ranking, such as OLAP (Online can be carried out to the page using various ways Analytical Processing, on-line analytical processing) technology, data mining technology etc..
103rd, according to the page rank, the source for extracting corresponding each page of the access information is closed Keyword.
Due to the key word that used comprising searched page in access information, (i.e. the source of the page is crucial Word), so in the N name pages, the source of each page is closed before terminal can extract correspondence from access information Keyword, so as to obtain N number of source keyword set.
It should be noted that each source key word originated in keyword set that terminal is obtained is The key word used when scanning for each page by different user, and corresponding to the same page Source key word be it is same or like, therefore terminal obtained each source keyword set in Source key word be repeated.User accessed into the page included by behavior each time why Face source key word is all recorded in the keyword set of source, and does not carry out duplicate removal process, is because using The behavior that accesses each time at family is all the once concern to corresponding page, the identical source of the same page Key word is more, illustrates that user is more to the attention rate of the page, if carrying out duplicate removal process, cannot Actual concern situation of the user to the page is obtained accurately.
104th, all popularization units comprising source key word are found out.
Wherein, it can be search engine marketing (Search Engine Marketing, abbreviation to promote unit For SEM) in be used for managing key word, search creative content etc..For example, certain promotes the pass in unit Keyword is the key word of tourist attractions class, and another key word promoted in unit is Expert English language training by qualified teachers The key word of class's class.
After the source key word of each page in the N name pages before acquisition correspondence, needs are locally being searched The popularization unit of corresponding each source key word, that is, search during which promotes unit and include front N names page At least one source key word in face.
105th, the acquisition targeted promotion unit from all popularization units for finding.
Wherein, targeted promotion unit is comprising the most popularization unit of source key word.When finding bag After all popularization units of containing the front N names page at least one source key word, terminal can count each The quantity of the source key word included in unit is promoted, to obtain comprising source key word quantity most Many popularization units, so that it is determined which kind of content is current user most paying close attention to.
106th, the URL of at least one page of correspondence targeted promotion unit is defined as into path subchain.
After obtaining comprising source key word most targeted promotion unit, terminal can select at least one The URL of the individual page is used as path subchain.Wherein, at least one page is originated to have at least one Key word is included in the page in targeted promotion unit.In actual applications, can there will be at least one Individual source key word is included in URL (the Uniform Resource of all pages in targeted promotion unit Locator, URL) it is defined as path subchain, also therefrom select the URL of partial page It is defined as path subchain.
The method for generating path provided in an embodiment of the present invention, can be in the visit for obtaining user's access website After asking information, each page is ranked up, the N name pages before obtaining are then looked up comprising front N All popularization units of the source key word of each page in the name page, and therefrom obtain targeted promotion list The URL of at least one page of correspondence targeted promotion unit is finally defined as path subchain by unit.With Compared by artificial subjective determination path subchain in prior art, the present invention is by accessing website to user Access information be analyzed, first obtain the high page (the i.e. front N names page) of user's attention rate, then The most popularization unit (i.e. targeted promotion unit) of the source key word comprising the front N names page is obtained, The URL of at least one page of correspondence targeted promotion unit is defined as into path subchain finally, so that Path subchain of the path subchain that must be generated for user's real demand, and then improve user and enter target The efficiency of the page.
Further, after the access information for obtaining user's access each page of website, it is thus necessary to determine that institute State page rank of the access information in each page.Wherein it is determined that the side of implementing of page rank Formula is:First, each page is ranked up according to ordering rule;Then, using the sequence As a result, determine page rank of the access information in each page;Wherein, the row of the page Name, chooses in the sequence according to pre-conditioned.
In actual applications, ordering rule can be with certain indication information be according to being ranked up, It is according to being ranked up with the comprehensive condition of some indication informations that can be.For example, terminal can basis The size of pageview is ranked up to each page.And for example, terminal can according to pageview, jump out rate With the comprehensive condition (as 50% pageview+30% jumps out+20% conversion ratio of rate) of conversion ratio to each page It is ranked up.For another example, terminal first can be ranked up according to pageview, when there is the clear of some pages During the amount of looking at identical situation, can be ranked up further according to the rate of jumping out, when jumping out for some pages of appearance During rate identical situation, can be ranked up further according to conversion ratio.
Further, refer in the above-described embodiments, adopted technical approach is ranked up to the page Can have various, one way in which is:Under OLAP technologies, according to ordering rule to each page Face is ranked up.Wherein, OLAP can extract a subset of detailed data from data warehouse, and Read and analysis for frontal chromatography instrument in OLAP memorizeies through necessary aggregating storing.
Further, when it is determined that after targeted promotion unit, terminal can correspondence targeted promotion unit extremely The URL of few page is defined as path subchain.But the URL of at least one page for randomly selecting It is not necessarily what user needed most, therefore in order to further such that the path subchain for arranging is needed with user Path subchain match, following scheme can be adopted:Calculating is included in every in targeted promotion unit The number of the source key word of the individual page, and the URL of at least one number most pages is defined as Path subchain.
Specifically, terminal calculates the source key of each page being included in targeted promotion unit respectively Then number is ranked up by the number of word from big to small, and number ranking is located at front M names finally The URL of the page is defined as path subchain.Wherein, M is positive integer, and M≤N.
Exemplary, if N is 10, and there is at least one source key word positioned at targeted promotion unit The page be the page 1, the page 3, the page 4, the page 5, the page 7 and the page 10, then terminal difference Statistics is included in the number of the source key word of each page in targeted promotion unit, and statistical result is The source key word number of the page 1 is 100, the page 3 is 200, the page 4 is 160, The page 5 is 240, the page 7 is 150, the page 10 is 300.Now, if the footpath for arranging The number of footpath subchain is 4, then the URL of the page that source key word number is first 4 is defined as footpath Footpath subchain, will the URL of the page 10, the page 5, the page 3 and the page 4 be defined as path subchain.
Additionally, generally, one group of path subchain is from left to right illustrated on the page successively, and is used Family custom is from left to right clicked on successively, so entering target pages to further simplify user Operating procedure, can show corresponding path subchain successively according to the sequencing of number ranking, that is, go up State in example successively by the path subchain of corresponding page 10, the page 5, the page 3 and the page 4 from left-hand The right side is illustrated on the page successively.During implementing, typically only need to the URL of the page 10 First path subchain is set to, the URL of the page 5 second path subchain is set to into, by the page 3 URL is set to the 3rd path subchain, and the URL of the page 4 is set to the 4th path subchain .
Further, when it is determined that after the URL of path subchain, in addition it is also necessary to arrange the title of path subchain. As page title can summarize the subject content of the page, so page title can be set to by terminal The title of the path subchain of corresponding page.For example, page title is Beijing hotel reservation, then can be by To should the title of path subchain of the page be set to Beijing hotel reservation.
Further, in actual applications, partial page title may be long, in order that path is sub The title of chain is more succinct, can extract at least one keyword from page title, and by this at least One keyword is set to the title of the path subchain of corresponding page.For example, page title is Beijing wine Shop reservation-Beijing hotel price-Beijing inquiry about the hotels, then can extract Beijing hotel as to should the page Path subchain title.
Additionally, the source key word of a page has multiple, these source key words may correspond to difference Popularization unit, the multiple source key words comprising the same page are likely in same popularization unit. When the multiple source key words comprising certain page in targeted promotion unit, these sources can be extracted The same section of key word as to should the page path subchain title.For example, targeted promotion list In unit containing three of a certain page source key word, i.e. Beijing hotel reservations, Beijing hotel price, Beijing inquiry about the hotels, then can extract Beijing hotel as to should the page path subchain title.
Further, according to said method embodiment, an alternative embodiment of the invention additionally provides life Into the device of path, as shown in Fig. 2 the device includes:Acquiring unit 21, sequencing unit 22, carry Take unit 23, searching unit 24 and determining unit 25.Wherein,
Acquiring unit 21, for obtaining the access information of each page;
Sequencing unit 22, the access information for being obtained according to acquiring unit 21 determine the access information Page rank in each page;
Extraction unit 23, for according to the page rank, extract the access information it is corresponding each The source key word of the page;
Searching unit 24, pushes away comprising source all of key word that extraction unit 23 is extracted for finding out Wide unit;
Acquiring unit 21, obtains target in being additionally operable to from searching unit 24 all popularization units for finding Unit is promoted, targeted promotion unit is comprising the most popularization unit of source key word;
Determining unit 25, at least one of the targeted promotion unit that acquiring unit 21 is obtained will be corresponded to The uniform resource position mark URL of the page is defined as path subchain.
The device for generating path provided in an embodiment of the present invention, can be in the visit for obtaining user's access website After asking information, each page is ranked up, the N name pages before obtaining are then looked up comprising front N All popularization units of the source key word of each page in the name page, and therefrom obtain targeted promotion list The URL of at least one page of correspondence targeted promotion unit is finally defined as path subchain by unit.With Compared by artificial subjective determination path subchain in prior art, the present invention is by accessing website to user Access information be analyzed, first obtain the high page (the i.e. front N names page) of user's attention rate, then The most popularization unit (i.e. targeted promotion unit) of the source key word comprising the front N names page is obtained, The URL of at least one page of correspondence targeted promotion unit is defined as into path subchain finally, so that Path subchain of the path subchain that must be generated for user's real demand, and then improve user and enter target The efficiency of the page.
Further, sequencing unit 22, for being ranked up to each page according to ordering rule;Profit With the result of the sequence, page rank of the access information in each page is determined;Wherein, The ranking of the page, chooses in the sequence according to pre-conditioned.
Further, sequencing unit 22, under On Line Analysis Process technology, according to row Sequence rule is ranked up to each page.
Further, as shown in figure 3, determining unit 25, including:
Computing module 251, the source for calculating each page being included in targeted promotion unit are crucial The number of word;
Determining module 252, at least one most page of the number for computing module 251 is calculated URL is defined as path subchain.
Further, as shown in figure 3, the device also includes:
Setting unit 26, for page title to be set to the title of the path subchain of corresponding page.
Further, as shown in figure 3, setting unit 26, including:
Extraction module 261, at least one keyword is extracted from page title;
Setup module 262, at least one keyword for extraction module 261 is extracted are set to correspondence The title of the path subchain of the page.
In the above-described embodiments, the description to each embodiment all emphasizes particularly on different fields, and does not have in certain embodiment The part being described in detail, may refer to the associated description of other embodiment.
It is understood that said method and the correlated characteristic in device mutually can be referred to.In addition, " first ", " second " in above-described embodiment etc. is, for distinguishing each embodiment, and not represent each enforcement The quality of example.
Those skilled in the art can be understood that, for convenience and simplicity of description, above-mentioned The specific work process of the system, apparatus, and unit of description, may be referred in preceding method embodiment Corresponding process, will not be described here.
Provided herein algorithm and show not with any certain computer, virtual system or miscellaneous equipment It is intrinsic related.Various general-purpose systems can also be used together based on teaching in this.According to above Description, the structure constructed required by this kind of system is obvious.Additionally, the present invention is also not for Any certain programmed language.It is understood that, it is possible to use various programming languages realize described here The content of invention, and the description done to language-specific above is for the optimal reality for disclosing the present invention Apply mode.
In description mentioned herein, a large amount of details are illustrated.It is to be appreciated, however, that Embodiments of the invention can be put into practice in the case where not having these details.In some instances, Known method, structure and technology are not been shown in detail, so as not to obscure the understanding of this description.
Similarly, it will be appreciated that in order to simplify the disclosure and help understand in each inventive aspect It is individual or multiple, in above to the description of the exemplary embodiment of the present invention, each feature of the invention Sometimes it is grouped together in single embodiment, figure or descriptions thereof.However, should be by The method of the disclosure is construed to reflect following intention:I.e. the present invention for required protection requires ratio at each The more features of feature being expressly recited in claim.More precisely, as following right will As asking book reflected, inventive aspect is less than all spies of single embodiment disclosed above Levy.Therefore, it then follows thus claims of specific embodiment are expressly incorporated in the specific embodiment party Separate embodiments of the formula, wherein each claim as the present invention itself.
Those skilled in the art are appreciated that can be carried out to the module in the equipment in embodiment Adaptively change and they are arranged in one or more different from embodiment equipment. Module or unit or component in embodiment can be combined into a module or unit or component, and In addition multiple submodule or subelement or sub-component can be divided into.Except such feature and/or Outside at least some in process or unit is excluded each other, can be using any combinations to this explanation All features disclosed in book (including adjoint claim, summary and accompanying drawing) and such as the displosure Any method or all processes or unit of equipment be combined.Unless expressly stated otherwise, originally Each feature disclosed in description (including adjoint claim, summary and accompanying drawing) can be by carrying For identical, equivalent or similar purpose alternative features replacing.
Although additionally, it will be appreciated by those of skill in the art that some embodiments described herein include Some included features rather than further feature in other embodiments, but the feature of different embodiments Combination mean to be within the scope of the present invention and formed different embodiments.For example, under In the claims in face, embodiment required for protection one of arbitrarily can be in any combination Mode is using.
The all parts embodiment of the present invention can be realized with hardware, or with one or more The software module run on reason device is realized, or is realized with combinations thereof.Those skilled in the art It should be appreciated that can be realized using microprocessor or digital signal processor (DSP) in practice The condition detection method of accompanied electronic anti-theft device according to embodiments of the present invention, equipment, server and The some or all functions of some or all parts in system equipment.The present invention can also be realized It is some or all equipment or program of device (example for performing method as described herein Such as, computer program and computer program).Such program for realizing the present invention can be stored in On computer-readable medium, or there can be the form of one or more signal.Such signal Can download from internet website and obtain, or provide on carrier signal, or with any other Form is provided.
It should be noted that above-described embodiment the present invention will be described rather than the present invention is limited Make, and those skilled in the art can design without departing from the scope of the appended claims Alternative embodiment.In the claims, any reference markss between bracket should not be configured to Limitations on claims.Word "comprising" does not exclude the presence of element not listed in the claims or step Suddenly.Word "a" or "an" before element does not exclude the presence of multiple such elements.The present invention Can come real by means of the hardware for including some different elements and by means of properly programmed computer It is existing.If in the unit claim for listing equipment for drying, several in these devices can be logical Cross same hardware branch to embody.The use of word first, second, and third is not indicated that Any order.These words can be construed to title.

Claims (10)

1. it is a kind of generate path method, it is characterised in that methods described includes:
Obtain the access information of each page;
Determine page rank of the access information in each page;
According to the page rank, the source key word of corresponding each page of the access information is extracted;
Find out all popularization units comprising the source key word;
Targeted promotion unit is obtained from all popularization units for finding, the targeted promotion unit is Comprising the most popularization unit of source key word;
Will be the uniform resource position mark URL of at least one page of correspondence described targeted promotion unit true It is set to path subchain.
2. method according to claim 1, it is characterised in that the determination access information Page rank in each page, including:
Each page is ranked up according to ordering rule;
Using the result of the sequence, page rank of the access information in each page is determined; Wherein, the ranking of the page, chooses in the sequence according to pre-conditioned.
3. method according to claim 2, it is characterised in that it is described according to ordering rule to each The individual page is ranked up, including:
Under On Line Analysis Process technology, each page is arranged according to the ordering rule Sequence.
4. method according to claim 1, it is characterised in that described that correspondence described target be pushed away The URL of at least one page of wide unit is defined as path subchain, including:
Calculating is included in the number of the source key word of each page in the targeted promotion unit;
The URL of at least one number most pages is defined as into path subchain.
5. method according to any one of claim 1 to 4, it is characterised in that methods described Also include:
Page title is set to the title of the path subchain of corresponding page.
6. method according to claim 5, it is characterised in that described that page title is set to The title of the path subchain of corresponding page, including:
At least one keyword is extracted from page title, and at least one keyword is set to The title of the path subchain of corresponding page.
7. it is a kind of generate path device, it is characterised in that described device includes:
Acquiring unit, for obtaining the access information of each page;
Sequencing unit, for determining page rank of the access information in each page;
Extraction unit, for according to the page rank, extracting corresponding each page of the access information The source key word in face;
Searching unit, for finding out the institute of the source key word extracted comprising the extraction unit There is popularization unit;
The acquiring unit, obtains in being additionally operable to from the searching unit all popularization units for finding Targeted promotion unit, the targeted promotion unit are comprising the most popularization unit of source key word;
Determining unit, for the targeted promotion unit by the acquiring unit acquisition is corresponded at least The uniform resource position mark URL of one page is defined as path subchain.
8. device according to claim 7, it is characterised in that the sequencing unit, for root Each page is ranked up according to ordering rule;Using the result of the sequence, determine that described access is believed Page rank of the breath in each page;Wherein, the ranking of the page, be according to it is pre-conditioned Choose in the sequence.
9. device according to claim 7, it is characterised in that the determining unit, including:
Computing module, the source for calculating each page being included in the targeted promotion unit are closed The number of keyword;
Determining module, at least one most page of the number for the computing module is calculated URL be defined as path subchain.
10. the device according to any one of claim 7 to 9, it is characterised in that the dress Putting also includes:
Setting unit, for page title to be set to the title of the path subchain of corresponding page.
CN201510617172.8A 2015-09-24 2015-09-24 Method and device for generating new channel Active CN106557473B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510617172.8A CN106557473B (en) 2015-09-24 2015-09-24 Method and device for generating new channel

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510617172.8A CN106557473B (en) 2015-09-24 2015-09-24 Method and device for generating new channel

Publications (2)

Publication Number Publication Date
CN106557473A true CN106557473A (en) 2017-04-05
CN106557473B CN106557473B (en) 2020-01-07

Family

ID=58414204

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510617172.8A Active CN106557473B (en) 2015-09-24 2015-09-24 Method and device for generating new channel

Country Status (1)

Country Link
CN (1) CN106557473B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108596433A (en) * 2018-03-22 2018-09-28 安徽建筑大学 One kind being applied to coal mine safety management risk factors evaluation method

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050149395A1 (en) * 2003-10-29 2005-07-07 Kontera Technologies, Inc. System and method for real-time web page context analysis for the real-time insertion of textual markup objects and dynamic content
CN102073960A (en) * 2010-09-15 2011-05-25 江苏仕德伟网络科技股份有限公司 Method for assessing operation effect in website marketing process
CN102142033A (en) * 2010-05-20 2011-08-03 百度在线网络技术(北京)有限公司 Method and device for providing relative sub-link information in search result
CN102411589A (en) * 2010-09-26 2012-04-11 百度在线网络技术(北京)有限公司 Method and equipment for monitoring and managing keywords
CN103164521A (en) * 2013-03-11 2013-06-19 亿赞普(北京)科技有限公司 Keyword calculation method and device based on user browse and search actions
US20130226690A1 (en) * 2005-11-30 2013-08-29 John Nicholas Gross System & Method of Presenting Content Based Advertising
CN103514193A (en) * 2012-06-21 2014-01-15 百度在线网络技术(北京)有限公司 Method and device used for determining popularization result information of popularization keyword
CN104021209A (en) * 2014-06-19 2014-09-03 北京博雅立方科技有限公司 Statistical method for keyword advertising effect and browsing client
CN104077290A (en) * 2013-03-26 2014-10-01 腾讯科技(深圳)有限公司 Method and device for generating promoted accounts
CN104462397A (en) * 2014-12-10 2015-03-25 北京国双科技有限公司 Promotion information processing method and promotion information processing device
CN104504135A (en) * 2014-12-31 2015-04-08 北京国双科技有限公司 Promotion account structure generation method and device
CN104572960A (en) * 2014-12-29 2015-04-29 北京奇虎科技有限公司 Searching method and searching device
CN104778606A (en) * 2015-04-10 2015-07-15 北京京东尚科信息技术有限公司 Account structure data processing method and device

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050149395A1 (en) * 2003-10-29 2005-07-07 Kontera Technologies, Inc. System and method for real-time web page context analysis for the real-time insertion of textual markup objects and dynamic content
US20130226690A1 (en) * 2005-11-30 2013-08-29 John Nicholas Gross System & Method of Presenting Content Based Advertising
CN102142033A (en) * 2010-05-20 2011-08-03 百度在线网络技术(北京)有限公司 Method and device for providing relative sub-link information in search result
CN102073960A (en) * 2010-09-15 2011-05-25 江苏仕德伟网络科技股份有限公司 Method for assessing operation effect in website marketing process
CN102411589A (en) * 2010-09-26 2012-04-11 百度在线网络技术(北京)有限公司 Method and equipment for monitoring and managing keywords
CN103514193A (en) * 2012-06-21 2014-01-15 百度在线网络技术(北京)有限公司 Method and device used for determining popularization result information of popularization keyword
CN103164521A (en) * 2013-03-11 2013-06-19 亿赞普(北京)科技有限公司 Keyword calculation method and device based on user browse and search actions
CN104077290A (en) * 2013-03-26 2014-10-01 腾讯科技(深圳)有限公司 Method and device for generating promoted accounts
CN104021209A (en) * 2014-06-19 2014-09-03 北京博雅立方科技有限公司 Statistical method for keyword advertising effect and browsing client
CN104462397A (en) * 2014-12-10 2015-03-25 北京国双科技有限公司 Promotion information processing method and promotion information processing device
CN104572960A (en) * 2014-12-29 2015-04-29 北京奇虎科技有限公司 Searching method and searching device
CN104504135A (en) * 2014-12-31 2015-04-08 北京国双科技有限公司 Promotion account structure generation method and device
CN104778606A (en) * 2015-04-10 2015-07-15 北京京东尚科信息技术有限公司 Account structure data processing method and device

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108596433A (en) * 2018-03-22 2018-09-28 安徽建筑大学 One kind being applied to coal mine safety management risk factors evaluation method

Also Published As

Publication number Publication date
CN106557473B (en) 2020-01-07

Similar Documents

Publication Publication Date Title
US8799310B2 (en) Method and system for processing a uniform resource locator
US9015176B2 (en) Automatic identification of related search keywords
US9384289B2 (en) Method and system to identify geographical locations associated with queries received at a search engine
CN106919625B (en) Internet user attribute identification method and device
CN108090104B (en) Method and device for acquiring webpage information
CN104217031B (en) A kind of method and apparatus that user's classification is carried out according to server search daily record data
US20090299964A1 (en) Presenting search queries related to navigational search queries
CN102037464A (en) Search results with most clicked next objects
WO2011008848A2 (en) Activity based users' interests modeling for determining content relevance
US20090150345A1 (en) Web Domain Data Replication System
KR101566616B1 (en) Advertisement decision supporting system using big data-processing and method thereof
EP2628097A1 (en) Systems and methods for using a behavior history of a user to augment content of a webpage
CN104572863A (en) Product recommending method and system
CN106709073A (en) Browser notification pushing method and browser terminal
CN103412881A (en) Method and system for providing search result
US9886711B2 (en) Product recommendations over multiple stores
EP2933734A1 (en) Method and system for the structural analysis of websites
KR100987058B1 (en) Method and system for providing advertising service using the keywords of internet contents and program recording medium
Jiang et al. A clickstream data analysis of Chinese academic library OPAC users' information behavior
CN111414410A (en) Data processing method, device, equipment and storage medium
WO2020051416A1 (en) Entity-based search system using user engagement
WO2007011129A1 (en) Information search method and information search apparatus on which information value is reflected
CN102819384A (en) Method and device for prompting display at input field
CN105468627A (en) Method and system for shielding and filtering web page contents
US11341141B2 (en) Search system using multiple search streams

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 100083 No. 401, 4th Floor, Haitai Building, 229 North Fourth Ring Road, Haidian District, Beijing

Applicant after: Beijing Guoshuang Technology Co.,Ltd.

Address before: 100086 Cuigong Hotel, 76 Zhichun Road, Shuangyushu District, Haidian District, Beijing

Applicant before: Beijing Guoshuang Technology Co.,Ltd.

CB02 Change of applicant information
GR01 Patent grant
GR01 Patent grant