CN106557473A - The method and apparatus for generating path - Google Patents
The method and apparatus for generating path Download PDFInfo
- Publication number
- CN106557473A CN106557473A CN201510617172.8A CN201510617172A CN106557473A CN 106557473 A CN106557473 A CN 106557473A CN 201510617172 A CN201510617172 A CN 201510617172A CN 106557473 A CN106557473 A CN 106557473A
- Authority
- CN
- China
- Prior art keywords
- page
- unit
- path
- subchain
- key word
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/955—Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
- G06F16/9558—Details of hyperlinks; Management of linked annotations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/02—Marketing; Price estimation or determination; Fundraising
- G06Q30/0241—Advertisements
- G06Q30/0251—Targeted advertisements
- G06Q30/0255—Targeted advertisements based on user history
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/02—Marketing; Price estimation or determination; Fundraising
- G06Q30/0241—Advertisements
- G06Q30/0277—Online advertisement
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Business, Economics & Management (AREA)
- Databases & Information Systems (AREA)
- Finance (AREA)
- Strategic Management (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Accounting & Taxation (AREA)
- Development Economics (AREA)
- General Engineering & Computer Science (AREA)
- Entrepreneurship & Innovation (AREA)
- Game Theory and Decision Science (AREA)
- Data Mining & Analysis (AREA)
- Economics (AREA)
- Marketing (AREA)
- General Business, Economics & Management (AREA)
- Information Transfer Between Computers (AREA)
Abstract
The invention discloses a kind of method and apparatus for generating path, is related to Internet technical field, the path and the unmatched problem of user's real demand of artificial subjective setting in prior art is can solve the problem that.The method of the present invention includes:Obtain the access information of each page;Determine page rank of the access information in each page;According to the page rank, the source key word of corresponding each page of the access information is extracted;Find out all popularization units comprising the source key word;Targeted promotion unit is obtained from all popularization units for finding, the targeted promotion unit is comprising the most popularization unit of source key word;The uniform resource position mark URL of at least one page of correspondence described targeted promotion unit is defined as into path subchain.The present invention is suitable for the scene that path is generated using user access activity.
Description
Technical field
The present invention relates to Internet technical field, more particularly to a kind of method and apparatus for generating path.
Background technology
Path is a kind of advertisement promotion pattern of search engine, and many strips are incorporated in common promotional content
Chain, makes extension service possess more information expressive function, and which represents subchain and is referred to as " path subchain ".
, with website main link in same page presentation, user can be by clicking on main link for path subchain
Into website homepage, then by repeatedly click, the page (target pages) wanted is found, also may be used
By clicking on path subchain, directly to reach target pages.Therefore, path subchain can be greatly shortened
User clicks on process, improves conversion ratio.If it follows that the path subchain for arranging is paid close attention to for user
Content, then the clicking rate of website can be improved.
In prior art, path mainly by search marketing personnel rule of thumb or business demand arrange,
It is subjective, tend not to meet the real demand of user, therefore there is the footpath of artificial subjective setting
Footpath and the unmatched problem of user's real demand.
The content of the invention
In view of this, the present invention provides a kind of method and apparatus for generating path, can solve the problem that existing skill
The path and the unmatched problem of user's real demand of artificial subjective setting in art.
According to one aspect of the invention, there is provided a kind of method of generation path, methods described include:
Obtain the access information of each page;
Determine page rank of the access information in each page;
According to the page rank, the source key word of corresponding each page of the access information is extracted;
Find out all popularization units comprising the source key word;
Targeted promotion unit is obtained from all popularization units for finding, the targeted promotion unit is
Comprising the most popularization unit of source key word;
Will be the uniform resource position mark URL of at least one page of correspondence described targeted promotion unit true
It is set to path subchain.
According to another aspect of the invention, there is provided a kind of device of generation path, described device include:
Acquiring unit, for obtaining the access information of each page;
Sequencing unit, for determining page rank of the access information in each page;
Extraction unit, for according to the page rank, extracting corresponding each page of the access information
The source key word in face;
Searching unit, for finding out the institute of the source key word extracted comprising the extraction unit
There is popularization unit;
The acquiring unit, obtains in being additionally operable to from the searching unit all popularization units for finding
Targeted promotion unit, the targeted promotion unit are comprising the most popularization unit of source key word;
Determining unit, for the targeted promotion unit by the acquiring unit acquisition is corresponded at least
The uniform resource position mark URL of one page is defined as path subchain.
By above-mentioned technical proposal, the method and apparatus of the generation path that the present invention is provided can obtained
After taking the access information that family accesses website, each page is ranked up, the N name pages before obtaining,
The all popularization units comprising the source key word of each page in the front N names page are then looked up, and
Targeted promotion unit is obtained therefrom, finally by the URL of at least one page of correspondence targeted promotion unit
It is defined as path subchain.Compared with prior art by artificial subjective determination path subchain, the present invention
It is analyzed by the access information that website is accessed to user, first obtains the high page of user's attention rate (i.e.
The front N names page), reentry the most popularization unit of the source key word comprising the front N names page (i.e.
Targeted promotion unit), finally the URL of at least one page of correspondence targeted promotion unit is defined as
Path subchain, so that path subchain of the path subchain for generating for user's real demand, Jin Erti
High user enters the efficiency of target pages.
Described above is only the general introduction of technical solution of the present invention, in order to better understand the present invention's
Technological means, and being practiced according to the content of description, and in order to allow the above-mentioned of the present invention and
Other objects, features and advantages can become apparent, below especially exemplified by the specific embodiment of the present invention.
Description of the drawings
By the detailed description for reading hereafter preferred implementation, various other advantages and benefit for
Those of ordinary skill in the art will be clear from understanding.Accompanying drawing is only used for the mesh for illustrating preferred implementation
, and it is not considered as limitation of the present invention.And in whole accompanying drawing, with identical with reference to symbol
Number represent identical part.In the accompanying drawings:
The flow chart that Fig. 1 shows a kind of method for generating path provided in an embodiment of the present invention;
Fig. 2 shows a kind of composition frame chart of device for generating path provided in an embodiment of the present invention;
Fig. 3 shows the composition frame chart of another kind of device for generating path provided in an embodiment of the present invention.
Specific embodiment
The exemplary embodiment of the disclosure is more fully described below with reference to accompanying drawings.Although showing in accompanying drawing
The exemplary embodiment of the disclosure is shown, it being understood, however, that may be realized in various forms the disclosure
And should not be limited by embodiments set forth here.On the contrary, there is provided these embodiments are able to more
Thoroughly understand the disclosure, and can be by the scope of the present disclosure complete technology for conveying to this area
Personnel.
A kind of method for generating path is embodiments provided, as shown in figure 1, the method includes:
101st, obtain the access information of each page.
In actual applications, advertisement master terminal can collect user's visit by various data acquisition technologys
The access information of website is asked, then these access informations are stored in data warehouse, be easy to follow-up point
Analysis and management.Wherein, the access information of user's access website includes the operating system class used by user
Key word (i.e. the source key word of the page) that type, browser type, searched page are used, user
Browse the time of each page and the essential information (such as account) of user etc..Obtaining above-mentioned basic letter
After breath, by all access informations are counted and analyzed, corresponding each page can also be obtained
The pageview of other information, such as page, conversion ratio and jump out rate etc..
It should be noted that during data of the terminal on Website server is gathered, some can be collected dirty
Data, the data for for example repeating, other data unrelated with user access information, therefore when terminal is obtained
After initial data, first the initial data can be carried out cleaning, the optimization operation such as format conversion, so as to
Obtain valid data, then these valid data are saved in data warehouse carry out follow-up management with analysis.
Further, since user can be different in different phase content of interest, it is possible to gather
User accesses the access information of website in the recent period, future access information is analyzed with will pass through, and obtains
The higher information of recent user's attention rate.
102nd, determine page rank of the access information in each page.
After acquisition user accesses the access information of each page, can be according to certain index to each page
Face is ranked up (be for example ranked up according to the pageview of the page), so as to obtain the front N names page.
Wherein, N is positive integer.
In actual applications, ranking, such as OLAP (Online can be carried out to the page using various ways
Analytical Processing, on-line analytical processing) technology, data mining technology etc..
103rd, according to the page rank, the source for extracting corresponding each page of the access information is closed
Keyword.
Due to the key word that used comprising searched page in access information, (i.e. the source of the page is crucial
Word), so in the N name pages, the source of each page is closed before terminal can extract correspondence from access information
Keyword, so as to obtain N number of source keyword set.
It should be noted that each source key word originated in keyword set that terminal is obtained is
The key word used when scanning for each page by different user, and corresponding to the same page
Source key word be it is same or like, therefore terminal obtained each source keyword set in
Source key word be repeated.User accessed into the page included by behavior each time why
Face source key word is all recorded in the keyword set of source, and does not carry out duplicate removal process, is because using
The behavior that accesses each time at family is all the once concern to corresponding page, the identical source of the same page
Key word is more, illustrates that user is more to the attention rate of the page, if carrying out duplicate removal process, cannot
Actual concern situation of the user to the page is obtained accurately.
104th, all popularization units comprising source key word are found out.
Wherein, it can be search engine marketing (Search Engine Marketing, abbreviation to promote unit
For SEM) in be used for managing key word, search creative content etc..For example, certain promotes the pass in unit
Keyword is the key word of tourist attractions class, and another key word promoted in unit is Expert English language training by qualified teachers
The key word of class's class.
After the source key word of each page in the N name pages before acquisition correspondence, needs are locally being searched
The popularization unit of corresponding each source key word, that is, search during which promotes unit and include front N names page
At least one source key word in face.
105th, the acquisition targeted promotion unit from all popularization units for finding.
Wherein, targeted promotion unit is comprising the most popularization unit of source key word.When finding bag
After all popularization units of containing the front N names page at least one source key word, terminal can count each
The quantity of the source key word included in unit is promoted, to obtain comprising source key word quantity most
Many popularization units, so that it is determined which kind of content is current user most paying close attention to.
106th, the URL of at least one page of correspondence targeted promotion unit is defined as into path subchain.
After obtaining comprising source key word most targeted promotion unit, terminal can select at least one
The URL of the individual page is used as path subchain.Wherein, at least one page is originated to have at least one
Key word is included in the page in targeted promotion unit.In actual applications, can there will be at least one
Individual source key word is included in URL (the Uniform Resource of all pages in targeted promotion unit
Locator, URL) it is defined as path subchain, also therefrom select the URL of partial page
It is defined as path subchain.
The method for generating path provided in an embodiment of the present invention, can be in the visit for obtaining user's access website
After asking information, each page is ranked up, the N name pages before obtaining are then looked up comprising front N
All popularization units of the source key word of each page in the name page, and therefrom obtain targeted promotion list
The URL of at least one page of correspondence targeted promotion unit is finally defined as path subchain by unit.With
Compared by artificial subjective determination path subchain in prior art, the present invention is by accessing website to user
Access information be analyzed, first obtain the high page (the i.e. front N names page) of user's attention rate, then
The most popularization unit (i.e. targeted promotion unit) of the source key word comprising the front N names page is obtained,
The URL of at least one page of correspondence targeted promotion unit is defined as into path subchain finally, so that
Path subchain of the path subchain that must be generated for user's real demand, and then improve user and enter target
The efficiency of the page.
Further, after the access information for obtaining user's access each page of website, it is thus necessary to determine that institute
State page rank of the access information in each page.Wherein it is determined that the side of implementing of page rank
Formula is:First, each page is ranked up according to ordering rule;Then, using the sequence
As a result, determine page rank of the access information in each page;Wherein, the row of the page
Name, chooses in the sequence according to pre-conditioned.
In actual applications, ordering rule can be with certain indication information be according to being ranked up,
It is according to being ranked up with the comprehensive condition of some indication informations that can be.For example, terminal can basis
The size of pageview is ranked up to each page.And for example, terminal can according to pageview, jump out rate
With the comprehensive condition (as 50% pageview+30% jumps out+20% conversion ratio of rate) of conversion ratio to each page
It is ranked up.For another example, terminal first can be ranked up according to pageview, when there is the clear of some pages
During the amount of looking at identical situation, can be ranked up further according to the rate of jumping out, when jumping out for some pages of appearance
During rate identical situation, can be ranked up further according to conversion ratio.
Further, refer in the above-described embodiments, adopted technical approach is ranked up to the page
Can have various, one way in which is:Under OLAP technologies, according to ordering rule to each page
Face is ranked up.Wherein, OLAP can extract a subset of detailed data from data warehouse, and
Read and analysis for frontal chromatography instrument in OLAP memorizeies through necessary aggregating storing.
Further, when it is determined that after targeted promotion unit, terminal can correspondence targeted promotion unit extremely
The URL of few page is defined as path subchain.But the URL of at least one page for randomly selecting
It is not necessarily what user needed most, therefore in order to further such that the path subchain for arranging is needed with user
Path subchain match, following scheme can be adopted:Calculating is included in every in targeted promotion unit
The number of the source key word of the individual page, and the URL of at least one number most pages is defined as
Path subchain.
Specifically, terminal calculates the source key of each page being included in targeted promotion unit respectively
Then number is ranked up by the number of word from big to small, and number ranking is located at front M names finally
The URL of the page is defined as path subchain.Wherein, M is positive integer, and M≤N.
Exemplary, if N is 10, and there is at least one source key word positioned at targeted promotion unit
The page be the page 1, the page 3, the page 4, the page 5, the page 7 and the page 10, then terminal difference
Statistics is included in the number of the source key word of each page in targeted promotion unit, and statistical result is
The source key word number of the page 1 is 100, the page 3 is 200, the page 4 is 160,
The page 5 is 240, the page 7 is 150, the page 10 is 300.Now, if the footpath for arranging
The number of footpath subchain is 4, then the URL of the page that source key word number is first 4 is defined as footpath
Footpath subchain, will the URL of the page 10, the page 5, the page 3 and the page 4 be defined as path subchain.
Additionally, generally, one group of path subchain is from left to right illustrated on the page successively, and is used
Family custom is from left to right clicked on successively, so entering target pages to further simplify user
Operating procedure, can show corresponding path subchain successively according to the sequencing of number ranking, that is, go up
State in example successively by the path subchain of corresponding page 10, the page 5, the page 3 and the page 4 from left-hand
The right side is illustrated on the page successively.During implementing, typically only need to the URL of the page 10
First path subchain is set to, the URL of the page 5 second path subchain is set to into, by the page
3 URL is set to the 3rd path subchain, and the URL of the page 4 is set to the 4th path subchain
.
Further, when it is determined that after the URL of path subchain, in addition it is also necessary to arrange the title of path subchain.
As page title can summarize the subject content of the page, so page title can be set to by terminal
The title of the path subchain of corresponding page.For example, page title is Beijing hotel reservation, then can be by
To should the title of path subchain of the page be set to Beijing hotel reservation.
Further, in actual applications, partial page title may be long, in order that path is sub
The title of chain is more succinct, can extract at least one keyword from page title, and by this at least
One keyword is set to the title of the path subchain of corresponding page.For example, page title is Beijing wine
Shop reservation-Beijing hotel price-Beijing inquiry about the hotels, then can extract Beijing hotel as to should the page
Path subchain title.
Additionally, the source key word of a page has multiple, these source key words may correspond to difference
Popularization unit, the multiple source key words comprising the same page are likely in same popularization unit.
When the multiple source key words comprising certain page in targeted promotion unit, these sources can be extracted
The same section of key word as to should the page path subchain title.For example, targeted promotion list
In unit containing three of a certain page source key word, i.e. Beijing hotel reservations, Beijing hotel price,
Beijing inquiry about the hotels, then can extract Beijing hotel as to should the page path subchain title.
Further, according to said method embodiment, an alternative embodiment of the invention additionally provides life
Into the device of path, as shown in Fig. 2 the device includes:Acquiring unit 21, sequencing unit 22, carry
Take unit 23, searching unit 24 and determining unit 25.Wherein,
Acquiring unit 21, for obtaining the access information of each page;
Sequencing unit 22, the access information for being obtained according to acquiring unit 21 determine the access information
Page rank in each page;
Extraction unit 23, for according to the page rank, extract the access information it is corresponding each
The source key word of the page;
Searching unit 24, pushes away comprising source all of key word that extraction unit 23 is extracted for finding out
Wide unit;
Acquiring unit 21, obtains target in being additionally operable to from searching unit 24 all popularization units for finding
Unit is promoted, targeted promotion unit is comprising the most popularization unit of source key word;
Determining unit 25, at least one of the targeted promotion unit that acquiring unit 21 is obtained will be corresponded to
The uniform resource position mark URL of the page is defined as path subchain.
The device for generating path provided in an embodiment of the present invention, can be in the visit for obtaining user's access website
After asking information, each page is ranked up, the N name pages before obtaining are then looked up comprising front N
All popularization units of the source key word of each page in the name page, and therefrom obtain targeted promotion list
The URL of at least one page of correspondence targeted promotion unit is finally defined as path subchain by unit.With
Compared by artificial subjective determination path subchain in prior art, the present invention is by accessing website to user
Access information be analyzed, first obtain the high page (the i.e. front N names page) of user's attention rate, then
The most popularization unit (i.e. targeted promotion unit) of the source key word comprising the front N names page is obtained,
The URL of at least one page of correspondence targeted promotion unit is defined as into path subchain finally, so that
Path subchain of the path subchain that must be generated for user's real demand, and then improve user and enter target
The efficiency of the page.
Further, sequencing unit 22, for being ranked up to each page according to ordering rule;Profit
With the result of the sequence, page rank of the access information in each page is determined;Wherein,
The ranking of the page, chooses in the sequence according to pre-conditioned.
Further, sequencing unit 22, under On Line Analysis Process technology, according to row
Sequence rule is ranked up to each page.
Further, as shown in figure 3, determining unit 25, including:
Computing module 251, the source for calculating each page being included in targeted promotion unit are crucial
The number of word;
Determining module 252, at least one most page of the number for computing module 251 is calculated
URL is defined as path subchain.
Further, as shown in figure 3, the device also includes:
Setting unit 26, for page title to be set to the title of the path subchain of corresponding page.
Further, as shown in figure 3, setting unit 26, including:
Extraction module 261, at least one keyword is extracted from page title;
Setup module 262, at least one keyword for extraction module 261 is extracted are set to correspondence
The title of the path subchain of the page.
In the above-described embodiments, the description to each embodiment all emphasizes particularly on different fields, and does not have in certain embodiment
The part being described in detail, may refer to the associated description of other embodiment.
It is understood that said method and the correlated characteristic in device mutually can be referred to.In addition,
" first ", " second " in above-described embodiment etc. is, for distinguishing each embodiment, and not represent each enforcement
The quality of example.
Those skilled in the art can be understood that, for convenience and simplicity of description, above-mentioned
The specific work process of the system, apparatus, and unit of description, may be referred in preceding method embodiment
Corresponding process, will not be described here.
Provided herein algorithm and show not with any certain computer, virtual system or miscellaneous equipment
It is intrinsic related.Various general-purpose systems can also be used together based on teaching in this.According to above
Description, the structure constructed required by this kind of system is obvious.Additionally, the present invention is also not for
Any certain programmed language.It is understood that, it is possible to use various programming languages realize described here
The content of invention, and the description done to language-specific above is for the optimal reality for disclosing the present invention
Apply mode.
In description mentioned herein, a large amount of details are illustrated.It is to be appreciated, however, that
Embodiments of the invention can be put into practice in the case where not having these details.In some instances,
Known method, structure and technology are not been shown in detail, so as not to obscure the understanding of this description.
Similarly, it will be appreciated that in order to simplify the disclosure and help understand in each inventive aspect
It is individual or multiple, in above to the description of the exemplary embodiment of the present invention, each feature of the invention
Sometimes it is grouped together in single embodiment, figure or descriptions thereof.However, should be by
The method of the disclosure is construed to reflect following intention:I.e. the present invention for required protection requires ratio at each
The more features of feature being expressly recited in claim.More precisely, as following right will
As asking book reflected, inventive aspect is less than all spies of single embodiment disclosed above
Levy.Therefore, it then follows thus claims of specific embodiment are expressly incorporated in the specific embodiment party
Separate embodiments of the formula, wherein each claim as the present invention itself.
Those skilled in the art are appreciated that can be carried out to the module in the equipment in embodiment
Adaptively change and they are arranged in one or more different from embodiment equipment.
Module or unit or component in embodiment can be combined into a module or unit or component, and
In addition multiple submodule or subelement or sub-component can be divided into.Except such feature and/or
Outside at least some in process or unit is excluded each other, can be using any combinations to this explanation
All features disclosed in book (including adjoint claim, summary and accompanying drawing) and such as the displosure
Any method or all processes or unit of equipment be combined.Unless expressly stated otherwise, originally
Each feature disclosed in description (including adjoint claim, summary and accompanying drawing) can be by carrying
For identical, equivalent or similar purpose alternative features replacing.
Although additionally, it will be appreciated by those of skill in the art that some embodiments described herein include
Some included features rather than further feature in other embodiments, but the feature of different embodiments
Combination mean to be within the scope of the present invention and formed different embodiments.For example, under
In the claims in face, embodiment required for protection one of arbitrarily can be in any combination
Mode is using.
The all parts embodiment of the present invention can be realized with hardware, or with one or more
The software module run on reason device is realized, or is realized with combinations thereof.Those skilled in the art
It should be appreciated that can be realized using microprocessor or digital signal processor (DSP) in practice
The condition detection method of accompanied electronic anti-theft device according to embodiments of the present invention, equipment, server and
The some or all functions of some or all parts in system equipment.The present invention can also be realized
It is some or all equipment or program of device (example for performing method as described herein
Such as, computer program and computer program).Such program for realizing the present invention can be stored in
On computer-readable medium, or there can be the form of one or more signal.Such signal
Can download from internet website and obtain, or provide on carrier signal, or with any other
Form is provided.
It should be noted that above-described embodiment the present invention will be described rather than the present invention is limited
Make, and those skilled in the art can design without departing from the scope of the appended claims
Alternative embodiment.In the claims, any reference markss between bracket should not be configured to
Limitations on claims.Word "comprising" does not exclude the presence of element not listed in the claims or step
Suddenly.Word "a" or "an" before element does not exclude the presence of multiple such elements.The present invention
Can come real by means of the hardware for including some different elements and by means of properly programmed computer
It is existing.If in the unit claim for listing equipment for drying, several in these devices can be logical
Cross same hardware branch to embody.The use of word first, second, and third is not indicated that
Any order.These words can be construed to title.
Claims (10)
1. it is a kind of generate path method, it is characterised in that methods described includes:
Obtain the access information of each page;
Determine page rank of the access information in each page;
According to the page rank, the source key word of corresponding each page of the access information is extracted;
Find out all popularization units comprising the source key word;
Targeted promotion unit is obtained from all popularization units for finding, the targeted promotion unit is
Comprising the most popularization unit of source key word;
Will be the uniform resource position mark URL of at least one page of correspondence described targeted promotion unit true
It is set to path subchain.
2. method according to claim 1, it is characterised in that the determination access information
Page rank in each page, including:
Each page is ranked up according to ordering rule;
Using the result of the sequence, page rank of the access information in each page is determined;
Wherein, the ranking of the page, chooses in the sequence according to pre-conditioned.
3. method according to claim 2, it is characterised in that it is described according to ordering rule to each
The individual page is ranked up, including:
Under On Line Analysis Process technology, each page is arranged according to the ordering rule
Sequence.
4. method according to claim 1, it is characterised in that described that correspondence described target be pushed away
The URL of at least one page of wide unit is defined as path subchain, including:
Calculating is included in the number of the source key word of each page in the targeted promotion unit;
The URL of at least one number most pages is defined as into path subchain.
5. method according to any one of claim 1 to 4, it is characterised in that methods described
Also include:
Page title is set to the title of the path subchain of corresponding page.
6. method according to claim 5, it is characterised in that described that page title is set to
The title of the path subchain of corresponding page, including:
At least one keyword is extracted from page title, and at least one keyword is set to
The title of the path subchain of corresponding page.
7. it is a kind of generate path device, it is characterised in that described device includes:
Acquiring unit, for obtaining the access information of each page;
Sequencing unit, for determining page rank of the access information in each page;
Extraction unit, for according to the page rank, extracting corresponding each page of the access information
The source key word in face;
Searching unit, for finding out the institute of the source key word extracted comprising the extraction unit
There is popularization unit;
The acquiring unit, obtains in being additionally operable to from the searching unit all popularization units for finding
Targeted promotion unit, the targeted promotion unit are comprising the most popularization unit of source key word;
Determining unit, for the targeted promotion unit by the acquiring unit acquisition is corresponded at least
The uniform resource position mark URL of one page is defined as path subchain.
8. device according to claim 7, it is characterised in that the sequencing unit, for root
Each page is ranked up according to ordering rule;Using the result of the sequence, determine that described access is believed
Page rank of the breath in each page;Wherein, the ranking of the page, be according to it is pre-conditioned
Choose in the sequence.
9. device according to claim 7, it is characterised in that the determining unit, including:
Computing module, the source for calculating each page being included in the targeted promotion unit are closed
The number of keyword;
Determining module, at least one most page of the number for the computing module is calculated
URL be defined as path subchain.
10. the device according to any one of claim 7 to 9, it is characterised in that the dress
Putting also includes:
Setting unit, for page title to be set to the title of the path subchain of corresponding page.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510617172.8A CN106557473B (en) | 2015-09-24 | 2015-09-24 | Method and device for generating new channel |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510617172.8A CN106557473B (en) | 2015-09-24 | 2015-09-24 | Method and device for generating new channel |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106557473A true CN106557473A (en) | 2017-04-05 |
CN106557473B CN106557473B (en) | 2020-01-07 |
Family
ID=58414204
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510617172.8A Active CN106557473B (en) | 2015-09-24 | 2015-09-24 | Method and device for generating new channel |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106557473B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108596433A (en) * | 2018-03-22 | 2018-09-28 | 安徽建筑大学 | One kind being applied to coal mine safety management risk factors evaluation method |
Citations (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050149395A1 (en) * | 2003-10-29 | 2005-07-07 | Kontera Technologies, Inc. | System and method for real-time web page context analysis for the real-time insertion of textual markup objects and dynamic content |
CN102073960A (en) * | 2010-09-15 | 2011-05-25 | 江苏仕德伟网络科技股份有限公司 | Method for assessing operation effect in website marketing process |
CN102142033A (en) * | 2010-05-20 | 2011-08-03 | 百度在线网络技术(北京)有限公司 | Method and device for providing relative sub-link information in search result |
CN102411589A (en) * | 2010-09-26 | 2012-04-11 | 百度在线网络技术(北京)有限公司 | Method and equipment for monitoring and managing keywords |
CN103164521A (en) * | 2013-03-11 | 2013-06-19 | 亿赞普(北京)科技有限公司 | Keyword calculation method and device based on user browse and search actions |
US20130226690A1 (en) * | 2005-11-30 | 2013-08-29 | John Nicholas Gross | System & Method of Presenting Content Based Advertising |
CN103514193A (en) * | 2012-06-21 | 2014-01-15 | 百度在线网络技术(北京)有限公司 | Method and device used for determining popularization result information of popularization keyword |
CN104021209A (en) * | 2014-06-19 | 2014-09-03 | 北京博雅立方科技有限公司 | Statistical method for keyword advertising effect and browsing client |
CN104077290A (en) * | 2013-03-26 | 2014-10-01 | 腾讯科技(深圳)有限公司 | Method and device for generating promoted accounts |
CN104462397A (en) * | 2014-12-10 | 2015-03-25 | 北京国双科技有限公司 | Promotion information processing method and promotion information processing device |
CN104504135A (en) * | 2014-12-31 | 2015-04-08 | 北京国双科技有限公司 | Promotion account structure generation method and device |
CN104572960A (en) * | 2014-12-29 | 2015-04-29 | 北京奇虎科技有限公司 | Searching method and searching device |
CN104778606A (en) * | 2015-04-10 | 2015-07-15 | 北京京东尚科信息技术有限公司 | Account structure data processing method and device |
-
2015
- 2015-09-24 CN CN201510617172.8A patent/CN106557473B/en active Active
Patent Citations (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050149395A1 (en) * | 2003-10-29 | 2005-07-07 | Kontera Technologies, Inc. | System and method for real-time web page context analysis for the real-time insertion of textual markup objects and dynamic content |
US20130226690A1 (en) * | 2005-11-30 | 2013-08-29 | John Nicholas Gross | System & Method of Presenting Content Based Advertising |
CN102142033A (en) * | 2010-05-20 | 2011-08-03 | 百度在线网络技术(北京)有限公司 | Method and device for providing relative sub-link information in search result |
CN102073960A (en) * | 2010-09-15 | 2011-05-25 | 江苏仕德伟网络科技股份有限公司 | Method for assessing operation effect in website marketing process |
CN102411589A (en) * | 2010-09-26 | 2012-04-11 | 百度在线网络技术(北京)有限公司 | Method and equipment for monitoring and managing keywords |
CN103514193A (en) * | 2012-06-21 | 2014-01-15 | 百度在线网络技术(北京)有限公司 | Method and device used for determining popularization result information of popularization keyword |
CN103164521A (en) * | 2013-03-11 | 2013-06-19 | 亿赞普(北京)科技有限公司 | Keyword calculation method and device based on user browse and search actions |
CN104077290A (en) * | 2013-03-26 | 2014-10-01 | 腾讯科技(深圳)有限公司 | Method and device for generating promoted accounts |
CN104021209A (en) * | 2014-06-19 | 2014-09-03 | 北京博雅立方科技有限公司 | Statistical method for keyword advertising effect and browsing client |
CN104462397A (en) * | 2014-12-10 | 2015-03-25 | 北京国双科技有限公司 | Promotion information processing method and promotion information processing device |
CN104572960A (en) * | 2014-12-29 | 2015-04-29 | 北京奇虎科技有限公司 | Searching method and searching device |
CN104504135A (en) * | 2014-12-31 | 2015-04-08 | 北京国双科技有限公司 | Promotion account structure generation method and device |
CN104778606A (en) * | 2015-04-10 | 2015-07-15 | 北京京东尚科信息技术有限公司 | Account structure data processing method and device |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108596433A (en) * | 2018-03-22 | 2018-09-28 | 安徽建筑大学 | One kind being applied to coal mine safety management risk factors evaluation method |
Also Published As
Publication number | Publication date |
---|---|
CN106557473B (en) | 2020-01-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8799310B2 (en) | Method and system for processing a uniform resource locator | |
US9015176B2 (en) | Automatic identification of related search keywords | |
US9384289B2 (en) | Method and system to identify geographical locations associated with queries received at a search engine | |
CN106919625B (en) | Internet user attribute identification method and device | |
CN108090104B (en) | Method and device for acquiring webpage information | |
CN104217031B (en) | A kind of method and apparatus that user's classification is carried out according to server search daily record data | |
US20090299964A1 (en) | Presenting search queries related to navigational search queries | |
CN102037464A (en) | Search results with most clicked next objects | |
WO2011008848A2 (en) | Activity based users' interests modeling for determining content relevance | |
US20090150345A1 (en) | Web Domain Data Replication System | |
KR101566616B1 (en) | Advertisement decision supporting system using big data-processing and method thereof | |
EP2628097A1 (en) | Systems and methods for using a behavior history of a user to augment content of a webpage | |
CN104572863A (en) | Product recommending method and system | |
CN106709073A (en) | Browser notification pushing method and browser terminal | |
CN103412881A (en) | Method and system for providing search result | |
US9886711B2 (en) | Product recommendations over multiple stores | |
EP2933734A1 (en) | Method and system for the structural analysis of websites | |
KR100987058B1 (en) | Method and system for providing advertising service using the keywords of internet contents and program recording medium | |
Jiang et al. | A clickstream data analysis of Chinese academic library OPAC users' information behavior | |
CN111414410A (en) | Data processing method, device, equipment and storage medium | |
WO2020051416A1 (en) | Entity-based search system using user engagement | |
WO2007011129A1 (en) | Information search method and information search apparatus on which information value is reflected | |
CN102819384A (en) | Method and device for prompting display at input field | |
CN105468627A (en) | Method and system for shielding and filtering web page contents | |
US11341141B2 (en) | Search system using multiple search streams |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information |
Address after: 100083 No. 401, 4th Floor, Haitai Building, 229 North Fourth Ring Road, Haidian District, Beijing Applicant after: Beijing Guoshuang Technology Co.,Ltd. Address before: 100086 Cuigong Hotel, 76 Zhichun Road, Shuangyushu District, Haidian District, Beijing Applicant before: Beijing Guoshuang Technology Co.,Ltd. |
|
CB02 | Change of applicant information | ||
GR01 | Patent grant | ||
GR01 | Patent grant |