CN103530364A - Method and system for providing download link - Google Patents

Method and system for providing download link Download PDF

Info

Publication number
CN103530364A
CN103530364A CN201310476117.2A CN201310476117A CN103530364A CN 103530364 A CN103530364 A CN 103530364A CN 201310476117 A CN201310476117 A CN 201310476117A CN 103530364 A CN103530364 A CN 103530364A
Authority
CN
China
Prior art keywords
download link
search result
web page
user
redirect
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201310476117.2A
Other languages
Chinese (zh)
Other versions
CN103530364B (en
Inventor
田乐逍
胡又欢
肖镜辉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Sogou Technology Development Co Ltd
Beijing Sogou Information Service Co Ltd
Original Assignee
Beijing Sogou Technology Development Co Ltd
Beijing Sogou Information Service Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Sogou Technology Development Co Ltd, Beijing Sogou Information Service Co Ltd filed Critical Beijing Sogou Technology Development Co Ltd
Priority to CN201310476117.2A priority Critical patent/CN103530364B/en
Publication of CN103530364A publication Critical patent/CN103530364A/en
Application granted granted Critical
Publication of CN103530364B publication Critical patent/CN103530364B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/954Navigation, e.g. using categorised browsing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Abstract

The invention discloses a method and system for providing a download link. The method for providing the download link comprises the steps that a website of a search result webpage is obtained in the process that a user skips to a target webpage from the search result webpage, a search term and/or a target webpage title contained in the website of the search result webpage are/is obtained to be used as query information, a preset download link base is inquired according to the query information, the download link matched with the query information is obtained, and a matched download link is provided. According to the skip process of user web browsing, the download link can be more efficiently and accurately provided, and the provided download link can more accurately meet the actual requirement of the user, and efficiency for downloading resources of the user is improved.

Description

The method and system of download link are provided
Technical field
The present invention relates to technical field of the computer network, be specifically related to provide the method and system of download link.
Background technology
Along with the fast development of the universal and internet of computer utility, network is downloaded becomes the Main Means that user obtains Internet resources gradually., having there are many web websites that resource downloading service is provided in the download demand improving constantly in order better to meet user, comprises that the web of a lot of portal websites provides the download service of Internet resources to provide convenience for users find Internet resources.
When user finds Internet resources on the internet, need to first navigate to target network resource, then according to the network site of Internet resources, download.Common Internet resources positioning means comprises, uses search engine, with the information of resource keyword or website, retrieves, and the results list providing by search engine arrives the resource downloading page; Input Address access download website or downloading page etc.But in the resource downloading page, existence need to maybe need to login by input validation code the situation of the resource downloading link that just can obtain; What also have designs download link in subordinate's subpage frame of current page; Even, in order to promote the objects such as product, in some downloading page, comprise false resource downloading link and mislead user's click; These obtain authentic and valid resource downloading link all to user and have caused inconvenience from downloading page.In addition, for the Internet resources of different classifications, the download link that user is right has different precision demands, and prior art is recommended for whole internet, does not consider the precision demand of user to different vertical classification.
To sum up, the problem solving in the urgent need to those skilled in the art is just how more efficiently and exactly to provide download link, makes provided download link user's real demand more accurately, the efficiency of raising user downloaded resources.
Summary of the invention
In view of the above problems, the present invention has been proposed to a kind of method that overcomes the problems referred to above or the system that download link is provided addressing the above problem at least in part and download link is provided is accordingly provided.
According to one aspect of the present invention, a kind of method that download link is provided is provided, it is characterized in that, comprising:
Obtain user and jump to the process of target web from search result web page, the network address of described search result web page;
Obtain the title of query word that the network address of described search result web page comprises and/or described target web as Query Information;
With described Query Information, inquire about preset download link storehouse, obtain the download link matching with described Query Information;
The download link matching described in providing.
Optionally, described in obtain user and jump to the process of target web from search result web page, the network address of described search result web page, comprising:
Obtain user with the redirect behavior of the mode accession page of webpage redirect;
From described redirect behavior, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page.
Optionally, described in obtain user with the redirect behavior of the mode accession page of webpage redirect, comprising:
Obtain in the process of user with the mode accession page of webpage redirect, user's identification information, the network address of the webpage of accessing, accesses time of each webpage;
Described from described redirect behavior, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page, comprising:
According to the described user's who gets identification information, the network address of the webpage of accessing, access the time of each webpage, also original subscriber's redirect behavior, and from the described redirect behavior restoring, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page.
Optionally, described in obtain in the process of user with the mode accession page of webpage redirect, user's identification information, the network address of the webpage of accessing, accesses time of each webpage, comprising:
By browser or browser plug-in, obtain in the process of user with the mode accession page of webpage redirect, user's identification information, the network address of the webpage of accessing, accesses the time of each webpage, and is recorded as daily record, by described Log Sender to described server end;
Described from described redirect behavior, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page, comprising:
By described server end, according to the information comprising in the described daily record receiving, also original subscriber's redirect behavior, and from the described redirect behavior restoring, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page.
Optionally, described from described redirect behavior, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page, comprising:
From described redirect behavior, filter out user from search result web page, through the redirect of preset threshold value number of times, arrive the access behavior of target web, and obtain the network address of accessed search result web page.
Optionally, described from described redirect behavior, filter out user and from search result web page, jump to the access behavior of target web, comprising:
Utilize preset regular expression, from described redirect behavior, filter out user and from search result web page, jump to the access behavior of target web.
Optionally, described in obtain query word that the network address of described search result web page comprises as Query Information, comprising:
Obtain the query word comprising in the network address of described search result web page, and described query word is carried out to participle, go stop words to process; The query word obtaining after processing is as described Query Information;
Describedly with described Query Information, inquire about preset download link storehouse, obtain the download link matching with described Query Information, comprising:
With the query word obtaining after described processing, inquire about preset download link storehouse, obtain the shared word of described target pages and download link;
According to described each shared word ratio of the searching times in search result web page described in each, and the search word accounting of each shared word in this download link, determine the comprehensive weights of download link;
More described comprehensive weights and preset weight threshold, be greater than by comprehensive weights the download link that the download link of described weight threshold is defined as matching.
Optionally, described in obtain described target web title as Query Information, comprising:
Obtain the title of described target web, and the title of described target web is carried out to participle and filtration treatment, using the title keyword obtaining after participle and filtration treatment as described Query Information; Wherein said filtration treatment comprises: described title is carried out to noise reduction, remove the garbage in title;
Describedly with described Query Information, inquire about preset download link storehouse, obtain the download link matching with described Query Information, comprising:
With described title keyword, inquire about preset download link storehouse, obtain the matching degree of the download link in described title keyword and download link storehouse;
More described matching degree and preset matching threshold, the download link that matching degree is greater than to described preset matching threshold is defined as the download link matching with described Query Information.
Optionally, described preset matching threshold is according to the difference of the resource class of download link and difference, and described method also comprises:
Determine the resource class of described download link;
Described matching degree and preset matching threshold, the download link that matching degree is greater than to described preset matching threshold is defined as the download link matching with described Query Information, comprising:
The matching threshold that more described matching degree is corresponding with the resource class of this download link, the download link that matching degree is greater than to described corresponding with the resource class of this download link matching threshold is defined as the download link matching with described Query Information.
Optionally, described in the download link that matches described in providing, comprising:
The mode that the described download link matching is ejected to the drawer type bullet window in subwindow or system tray pop-up window or browser window with operating system pop-up window or browser provides.
According to a further aspect in the invention, provide a kind of system that download link is provided, it is characterized in that, having comprised:
Network address acquiring unit, jumps to the process of target web, the network address of described search result web page for obtaining user from search result web page;
Query Information acquiring unit, for the title that obtains query word that the network address of described search result web page comprises and/or described target web as Query Information;
Download link acquiring unit, for inquire about preset download link storehouse with described Query Information, obtains the download link matching with described Query Information;
Link provides unit, for the download link matching described in providing.
Optionally, described network address acquiring unit, comprising:
Subelement is obtained in redirect behavior, for obtaining user with the redirect behavior of the mode accession page of webpage redirect;
Filter subelement, for from described redirect behavior, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page.
Optionally, subelement is obtained in described redirect behavior, specifically for:
Obtain in the process of user with the mode accession page of webpage redirect, user's identification information, the network address of the webpage of accessing, accesses time of each webpage;
Described filtration subelement, specifically for:
According to the described user's who gets identification information, the network address of the webpage of accessing, access the time of each webpage, also original subscriber's redirect behavior, and from the described redirect behavior restoring, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page.
Optionally, subelement is obtained in described redirect behavior, specifically for:
By browser or browser plug-in, obtain in the process of user with the mode accession page of webpage redirect, user's identification information, the network address of the webpage of accessing, accesses the time of each webpage, and is recorded as daily record, by described Log Sender to described server end;
Described filtration subelement is positioned at server end, specifically for:
Receive described daily record, and according to the information comprising in the described daily record receiving, also original subscriber's redirect behavior, and from the described redirect behavior restoring, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page.
Optionally, described filtration subelement, specifically for:
From described redirect behavior, filter out user from search result web page, through the redirect of preset threshold value number of times, arrive the access behavior of target web, and obtain the network address of accessed search result web page.
Optionally, described filtration subelement, specifically for:
Utilize preset regular expression, from described redirect behavior, filter out user and from search result web page, jump to the access behavior of target web.
Optionally, described Query Information acquiring unit, comprising:
The first Query Information obtains subelement, the query word comprising for obtaining the network address of described search result web page, and described query word is carried out to participle, go stop words to process; The query word obtaining after processing is as described Query Information;
Described download link acquiring unit, comprising:
Shared word obtains subelement, for inquire about preset download link storehouse with the query word obtaining after described processing, obtains the shared word of described target pages and download link;
Comprehensive weights determining unit, for according to the searching times ratio of described each shared word search result web page described in each, and the search word accounting of each shared word in this download link, determines the comprehensive weights of download link;
Subelement is determined in the first link, for more described comprehensive weights and preset weight threshold, comprehensive weights is greater than to the download link that the download link of described weight threshold is defined as matching.
Optionally, described Query Information acquiring unit, comprising:
The second Query Information obtains subelement, for obtaining the title of described target web, and the title of described target web is carried out to participle and filtration treatment, using the title keyword obtaining after participle and filtration treatment as described Query Information; Wherein said filtration treatment comprises: described title is carried out to noise reduction, remove the garbage in title;
Described download link acquiring unit, comprising:
Matching degree is obtained subelement, for inquire about preset download link storehouse with described title keyword, obtains the matching degree of the download link in described title keyword and download link storehouse;
Subelement is determined in the second link, and for more described matching degree and preset matching threshold, the download link that matching degree is greater than to described preset matching threshold is defined as the download link matching with described Query Information.
Optionally, described preset matching threshold is according to the difference of the resource class of download link and difference, and described system also comprises:
Classification is determined subelement, for determining the resource class of described download link;
Subelement is determined in described the second link, specifically for:
The matching threshold that more described matching degree is corresponding with the resource class of this download link, the download link that matching degree is greater than to described corresponding with the resource class of this download link matching threshold is defined as the download link matching with described Query Information.
Optionally, described link provides unit, specifically for:
The mode that the described download link matching is ejected to the drawer type bullet window in subwindow or system tray pop-up window or browser window with operating system pop-up window or browser provides.
According to the method that download link is provided of the present invention, can obtain user jumps to the process of target web from search result web page, the network address of the search result web page of accessing, and then obtain the query word that the network address of search result web page comprises, the title of target web is as Query Information; With the Query Information getting, inquire about preset download link storehouse, obtain the download link matching with described Query Information; When described target pages is accessed, provide corresponding download link.When user will obtain the download link of a certain resource, usually can use the search engine to obtain the search result web page of this resource dependency, and in the network address of search result web page, conventionally all comprise the keyword of the interested content of user.By Search Results, arrive the process of target pages, can be regarded as a kind of process that visits the page relevant with downloaded resources by search, by the webpage that this process user is accessed, analyze, the query word obtaining, and the title of target pages, objectively reflected to a certain extent user's actual demand, can be for judging accurately user's potential demand, further by using the present chained library of these information inquiries, the download link of relevant user's potential demand resource is provided to user, the download link providing is relevant to the content of user's access process and target pages, more can reflect user's actual demand.Obtained thus more efficiently and exactly download link is provided, provided download link user's real demand has more accurately been provided, improved the beneficial effect of the efficiency of user's downloaded resources.
Further, the present invention is usingd the redirect behavior of the mode accession page of user by webpage redirect as basis, the download link providing can be by user's access process separately, and the target pages that user accesses decides, and the download link providing meets users ' individualized requirement more.Wherein, the title of target web is the information that can more directly react the interested resource of user, and the download link matching that the title of target web of usining inquires as Query Information can obtain more meeting the link of the downloaded resources of user's request.
The present invention can also inquire about download link storehouse with Query Information, while obtaining the download link matching, download demand for different classes of Internet resources, adopts different matching strategies, and the download link that makes to provide can meet user's multiple demand more flexibly.
Above-mentioned explanation is only the general introduction of technical solution of the present invention, in order to better understand technological means of the present invention, and can be implemented according to the content of instructions, and for above and other objects of the present invention, feature and advantage can be become apparent, below especially exemplified by the specific embodiment of the present invention.
Accompanying drawing explanation
In order to be illustrated more clearly in the embodiment of the present invention or technical scheme of the prior art, to the accompanying drawing of required use in embodiment be briefly described below, apparently, accompanying drawing in the following describes is only some embodiments of the present invention, for those of ordinary skills, do not paying under the prerequisite of creative work, can also obtain according to these accompanying drawings other accompanying drawing.In the accompanying drawings:
Fig. 1 is the method flow diagram that download link is provided according to an embodiment of the invention; And,
Fig. 2 is the system schematic that download link is provided according to an embodiment of the invention.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is clearly and completely described, obviously, described embodiment is only the present invention's part embodiment, rather than whole embodiment.Embodiment based in the present invention, the every other embodiment that those of ordinary skills obtain, belongs to the scope of protection of the invention.
Refer to Fig. 1, the method that download link is provided of the embodiment of the present invention can comprise the following steps:
S101: obtain user and jump to the process of target web from search result web page, the network address of described search result web page;
User, in obtaining the process of Internet resources, need to obtain by accessing certain webpage the download address of Internet resources conventionally, such as, the downloading page of download service website; The encyclopaedic knowledge page; Downloading page of the official website that Internet resources are corresponding etc.; The common feature of this class page of these pages is conventionally can comprise the recommended information about Internet resources, as the introduction to the software function of software class Internet resources, system requirements; To introduction of the author of literature resource, chapters and sections information, content summary etc.Can think with respect to other pages, when user accesses an above-mentioned class page, may be more finding corresponding Internet resources, by accessing the above-mentioned page, more may trigger directly or indirectly the download to respective network resource, an above-mentioned class page is referred to as to target web here.Target pages can be by artificially collecting, or network statistics obtains, if the page that in statistics network, directly or indirectly the number of times of trigger network resource downloading surpasses certain threshold value is as target pages.
In addition; user is passing through the mode access destination page of webpage redirect; and then the information such as the function of awareness network resource or content; or while finding the download link of Internet resources; often can arrive rapidly target pages by search; using search to arrive in the process of target pages, also can pass through another webpage---search result web page.The search that user uses can comprise professional search engine, search in Website that download website provides etc.This process is similar to user and uses search to obtain search results pages, then by one or many redirect, arrives target pages by search results pages.Wherein, the network address of search results pages can generate according to the query word of user's input conventionally, and as when using search dog search engine to search for " pinyin " and " ime " two query words, the network address of the search result web page that this search engine returns is simultaneously:
http://www.sogou.com/web?ie=utf8&query=pinyin+ime
And for example, when " search in Website " entrance input " pdf ", " doc " two query words of certain software download site carry out search in Website, the network address of the search result web page that this website returns is:
http://search.….com/search_list.php?searchsid=0&searchname=pdf+doc
Visible, user, use in the network address of the search result web page that when search obtain, usually include the query word that user inputs, and these query words user's keyword required or interested Internet resources just.First, can obtain user and jump to the process of target web from search result web page, the network address of search result web page.
In obtaining the process of Internet resources, usually that mode by webpage redirect realizes, by a series of webpage redirect, the final webpage that arrives the download link that comprises Internet resources, and along with the widespread use of search technique, in this process, by search results pages, pass through redirect and arrive the process that target pages is also the more common access destination page.Specifically when obtaining the network address of search result web page, can first obtain user with the redirect behavior of the mode accession page of webpage redirect, from redirect behavior, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page, collect the redirect behavior of all mode accession pages with webpage redirect, therefrom filtering out by the access behavior of search results pages redirect access destination webpage.
Redirect behavior to the mode accession page with webpage redirect is obtained, can be by thering is the browser program of information function, collect by the redirect behavior of all mode accession pages with webpage redirect, also can be by thering is the browser plug-in of correlation function, or the watchdog routine being arranged in operating system realizes etc., the means that specific implementation is obtained redirect behavior can have multiple, and the embodiment of the present invention is to this not restriction.Concrete, can obtain in the process of user with the mode accession page of webpage redirect, user's identification information, the network address of the webpage of accessing, accesses the time of each webpage etc.; And then according to the user's who gets identification information, the network address of the webpage of accessing, accesses time of each webpage, also original subscriber's redirect behavior.Wherein, user totem information is used for distinguishing different users, in conjunction with the network address of accessed webpage, and the time of accessing each webpage, just can obtain which user and when, access which webpage, obtain the redirect behavior of accessed web page in chronological order.From the redirect behavior restoring, filter out user and from search result web page, jump to the access behavior of target web, obtain the network address of accessed search result web page, can be specifically first from all redirect behaviors, filter out and take the redirect behavior that target pages is object, recycle preset regular expression, from redirect behavior, filter out user and from search result web page, jump to the access behavior of target web, the preset regular expression of usining is searched for as search rule, from redirect behavior, filter out the access behavior that jumps to target web from search result web page.
Obtain user with the redirect behavior of the mode accession page of webpage redirect, and filter and to obtain user and from search result web page, jump to the access behavior of target web, obtain the whole process of the network address of accessed search result web page, can have been coordinated with server end by client-side program (as browser or browser plug-in), concrete, can be to obtain in the process of user with the mode accession page of webpage redirect by browser or browser plug-in, user's identification information, the network address of the webpage of accessing, access the time of each webpage, and be recorded as daily record, by Log Sender to described server end, by server end, according to the information comprising in the daily record receiving, also original subscriber's redirect behavior, and from the described redirect behavior restoring, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page.Under this mode, can utilize the more powerful performance of server end and processing power, redirect behavior is carried out to filtration more rapidly and efficiently, obtain the access behavior that jumps to target web from search result web page, and the network address of search result web page.
During this external filtration, can only from described redirect behavior, filter out user's redirect through preset threshold value number of times from search result web page, arrive the access behavior of target web, and obtain the network address of accessed search result web page.This is that the number of hops experiencing is more because jump in the process of target web by search result web page, and contacting between search result web page and the target web of arrival is more prone to less; Otherwise the number of hops experiencing is fewer, contacting between search result web page and the target web of access is more prone to tightr; If start to experience considerable number of times redirect from search result web page, just arrive a target pages, such as 50 times, can think between search result web page and the target web of final access close to not contacted.So can only filter out user's redirect through preset threshold value number of times from search result web page, arrive the access behavior of target web, as filter out user from search result web page through being less than 5 redirects, arrive the access behavior of target web.
S102: obtain the title of query word that the network address of described search result web page comprises and/or described target web as Query Information;
As previously mentioned, user jumps to the process of target web from search result web page, the query word that usually includes user's input in the network address of search result web page, therefore, the query word that the network address of search result web page can be comprised extracts, and as Query Information, inquires about preset download link storehouse.In addition, the title of target web also usually contains the key word information of Internet resources, as the title of the downloading page of Internet resources, the information such as title that usually comprise this resource, also usually there is being similar to the form that " so-and-so encyclopaedia _ ' Internet resources name ' " etc. includes the information such as title of Internet resources in the title of the encyclopaedic knowledge page, the content that is target pages is also usually contained in the relevant information of Internet resources, can be used as equally Query Information and inquires about preset download link storehouse.So, can obtain the title of query word that the network address of search result web page comprises and/or target web as Query Information.
S103: inquire about preset download link storehouse with described Query Information, obtain the download link matching with described Query Information;
Get after Query Information, can inquire about preset download link storehouse according to the Query Information getting, obtain the download link matching with Query Information.In actual applications, can, according to the difference that gets Query Information, take different inquiry modes.
If got the title of target web, can directly use the title of target web as Query Information, as the title of the target web title that is literary works, or the title of software resource, just can directly use the preset download link storehouse of these name querys, obtain the download link matching.In actual applications, the title of target web is not often the form appearance with simple keyword, also can be mixed with the garbage that inquiry is caused to interference therebetween.Now in order to extract effective query information wherein, can also be according to the title of webpage, obtain the information characteristic of webpage, as the resource name that web page title comprises, software version number, author information etc., can be by the title of target web be carried out to participle and filtration treatment, after participle and filtration treatment, can obtain the title keyword that comprises in the title of target web, using the title keyword obtaining after participle and filtration treatment as Query Information.When specific implementation obtains title keyword by target web title, can pass through the different regular expressions preset to different web sites, come the mode of the title keyword in extracting objects web page title to obtain.Wherein filtration treatment can comprise: the title of the target web getting is carried out to participle noise reduction, and the garbage in noise reduction process and middle removal title, during specific implementation, can complete by preset noise reduction regular expression.Certainly, in actual applications, target web title is carried out to participle, extract title keyword wherein, and the process of noise reduction, also can be completed by the regular expression template instrument combining, improve the extraction efficiency to the title keyword in target web title.Then with the title keyword obtaining after participle and filtration treatment, inquire about preset download link storehouse, obtain the title keyword that obtains after filtration treatment and the matching degree of the download link in download link storehouse; The matching degree getting is compared with preset matching threshold, and the download link that matching degree is greater than to preset matching threshold is defined as the download link matching with Query Information.
In addition, preset matching threshold can be according to the difference of the resource class of download link and difference, for example, for the download link of literature resource, higher matching threshold can be set and carry out strict screening, because conventionally when user inquires about literature resource, more may expect to obtain such as making the name of an article, the download link that author etc. conform to, if there is one of them not conform to, to the resource of, not probably the desired resource of user, therefore in actual applications, for Query Information, comprise literary works title, the situation of the information such as author, can be by preset higher threshold value, filter out the download link that literary works title and author are strictly consistent with Query Information, as linking of matching with Query Information, reach the expectation that user is right.Again such as for software class resource, can preset relatively low threshold value, when comprised dbase and software version at Query Information, the download link inquiring may only meet title and conform to, version is higher or lower with respect to Query Information, now can be by preset lower preset, make the conform to download link of the close software class resource of version of title out screened, as the download link matching with Query Information.The resource class of download link, can preservation corresponding to download link in preset download link storehouse, get after Query Information, when obtaining according to Query Information inquiry download link storehouse the download link matching, can determine according to the classification information of preserving in download link storehouse the resource class of download link, different matching threshold that different resource class is preset, the matching threshold that comparison match degree is corresponding with the resource class of this download link, matching degree is greater than to the download link of the matching threshold corresponding with the resource class of this download link, be defined as the download link matching with described Query Information.
Under another kind of implementation, when the Query Information obtaining comprises the query word obtaining from the network address of search result web page, can inquire about preset download link storehouse according to the query word getting.In query word, may comprise invalid vocabulary equally, or user input time, the content of input is not carried out to participle, but has inputted continuously multiple queries word, now, can carry out participle to described query word, go stop words to process; The query word obtaining after processing is inquired about preset download link storehouse as Query Information.During specific implementation, available query word is inquired about preset download link storehouse, obtains the shared word of target pages and download link.Searching times ratio according to each shared word in each search result web page, and the search word accounting of each shared word in this download link, determine the comprehensive weights of download link; More described comprehensive weights and preset weight threshold, be greater than by comprehensive weights the download link that the download link of described weight threshold is defined as matching.The shared word here refers to the query word obtaining from the network address of search result web page, the common factor of the search word corresponding with download link.The search word of download link, the process statistics that can retrieve download link by the user of colony out, the search word of the download link coming out, the all search words relevant with this download link have been generally comprised, and each search word has corresponding search word accounting, for example, for download link:
http://xiazai.….com/Soft/A/Absinthe_2.0.4_XiaZaiBa.zip
Its search word is as shown in table 1 with corresponding search word accounting:
Table 1
Search word Search word accounting
Absinthe 0.4
2.0.4 0.4
Escape from prison 0.2
As accessed the relevant target pages of Absinthe2.0.4 user by redirect, in the process of redirect, accessed respectively three search result web page, and from the network address of these three search result web page, obtained respectively following query word:
From the network address of search result web page one, obtained query word " Absinthe " " 2.0.4 ";
From the network address of search result web page two, obtained query word " Absinthe " " 2.0.4 ";
From the network address of search result web page three, obtained query word " Absinthe " " download ";
Now, the shared word obtaining according to the above-mentioned query word search word corresponding with download link in table 2 is:
" Absinthe " and " 2.0.4 ", has occurred 3 times as query word " Absinthe ", and " 2.0.4 " occurred 2 times, and the searching times of two query words ratio is:
“Absinthe”:3/(3+2)=0.6
“2.0.4”:2/(3+2)=0.4
Searching times ratio according to each shared word in each search result web page, and the search word accounting of each shared word in this download link, while determining the comprehensive weights of download link, can first obtain (the sharing the search word accounting of the searching times ratio+shared word of word) of each shared word, do again multiplication,
(searching times ratio+search word accounting) of each shared word of ∏,
Final result is as the comprehensive weights of the similarity of reflection query word and download link, and the comprehensive weights that relatively obtain and preset weight threshold are greater than by comprehensive weights the download link that the download link of described weight threshold is defined as matching.As above the comprehensive weights in example are
(0.6+0.4) * (0.4+0.4)=0.8, if preset weight threshold is 0.6, can determine this link
http://xiazai.….com/Soft/A/Absinthe_2.0.4_XiaZaiBa.zip
Meet similarity requirement, and the download link that is defined as matching.
S104: the download link matching described in providing.
After having obtained the download link matching with Query Information, the download link matching getting can be offered to user.The concrete mode that the download link matching can be ejected to the drawer type bullet window in subwindow or system tray pop-up window or browser window with operating system pop-up window or browser offers user.
More than introduced the method that download link is provided of the embodiment of the present invention, pass through the method, can obtain user jumps to the process of target web from search result web page, the network address of the search result web page of accessing, and then the query word that comprises of the network address of obtaining search result web page, the title of target web is as Query Information; With the Query Information getting, inquire about preset download link storehouse, obtain the download link matching with described Query Information; The download link matching is provided.By the webpage that user in jump procedure is accessed, analyze, the query word obtaining and the title of target pages, objectively reflected to a certain extent user's actual demand, can be for judging accurately user's potential demand, further by using the present chained library of these information inquiries, the download link of relevant user's potential demand resource is provided to user, the download link providing is relevant to the content of user's access process and target pages, more can reflect user's actual demand.Obtained thus more efficiently and exactly download link is provided, provided download link user's real demand has more accurately been provided, improved the beneficial effect of the efficiency of user's downloaded resources
The method that download link is provided providing with the embodiment of the present invention is corresponding, and the embodiment of the present invention also provides a kind of device that download link is provided, and referring to Fig. 2, this device specifically can comprise:
Network address acquiring unit 210, jumps to the process of target web, the network address of search result web page for obtaining user from search result web page;
Query Information acquiring unit 220, for the title that obtains query word that the network address of described search result web page comprises and/or described target web as Query Information;
Download link acquiring unit 230, for inquire about preset download link storehouse with described Query Information, obtains the download link matching with described Query Information;
Link provides unit 240, for the download link matching described in providing.
Wherein, network address acquiring unit 210, can comprise:
Subelement is obtained in redirect behavior, for obtaining user with the redirect behavior of the mode accession page of webpage redirect; And,
Filter subelement, for from redirect behavior, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page.
Wherein, subelement is obtained in redirect behavior, specifically can be for:
Obtain in the process of user with the mode accession page of webpage redirect, user's identification information, the network address of the webpage of accessing, accesses time of each webpage; And,
Filter subelement, specifically can be for:
According to the user's who gets identification information, the network address of the webpage of accessing, access the time of each webpage, also original subscriber's redirect behavior, and from the redirect behavior restoring, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page.
Under another kind of implementation, subelement is obtained in redirect behavior, specifically can be for:
By browser or browser plug-in, obtain in the process of user with the mode accession page of webpage redirect, user's identification information, the network address of the webpage of accessing, accesses the time of each webpage, and is recorded as daily record, by described Log Sender to described server end;
Now, filter subelement and be positioned at server end, specifically for:
Receive described daily record, and according to the information comprising in the described daily record receiving, also original subscriber's redirect behavior, and from the described redirect behavior restoring, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page.
In order further to improve the relevance of search result web page and target web, filter subelement, specifically can be for:
From described redirect behavior, filter out user from search result web page, through the redirect of preset threshold value number of times, arrive the access behavior of target web, and obtain the network address of accessed search result web page.
During specific implementation, filter subelement, specifically can be for:
Utilize preset regular expression, from described redirect behavior, filter out user and from search result web page, jump to the access behavior of target web.
In addition, Query Information acquiring unit 220, can comprise:
The first Query Information obtains subelement, the query word comprising for obtaining the network address of search result web page, and query word is carried out to participle, go stop words to process; The query word obtaining after processing is as Query Information;
Described download link acquiring unit, comprising:
Shared word obtains subelement, for inquiring about preset download link storehouse with the query word obtaining after processing, obtains the shared word of described target pages and download link;
Comprehensive weights determining unit, for the searching times ratio in each search result web page according to each shared word, and the search word accounting of each shared word in this download link, determines the comprehensive weights of download link;
Subelement is determined in the first link, for more comprehensive weights and preset weight threshold, comprehensive weights is greater than to the download link that the download link of weight threshold is defined as matching.
Under another implementation, Query Information acquiring unit 220, comprising:
The second Query Information obtains subelement, for obtaining the title of target web, and the title of target web is carried out to participle and filtration treatment, using the title keyword after participle and filtration treatment as described Query Information; Wherein filtration treatment comprises: the title of target web is carried out to noise reduction, remove the garbage in target web title;
Now, download link acquiring unit 230 can comprise:
Matching degree is obtained subelement, for inquiring about preset download link storehouse with the title keyword obtaining after filtration treatment, obtain the title keyword that obtains after filtration treatment and the matching degree of the download link in download link storehouse;
Subelement is determined in the second link, and for comparison match degree and preset matching threshold, the download link that matching degree is greater than to described preset matching threshold is defined as the download link matching with described Query Information.
In addition, preset matching threshold can be according to the difference of the resource class of download link and difference, and now this system can also comprise:
Classification is determined subelement, for determining the resource class of described download link;
Under this implementation, subelement is determined in described the second link, specifically can be for:
The matching threshold that comparison match degree is corresponding with the resource class of this download link, the download link that matching degree is greater than to the matching threshold corresponding with the resource class of this download link is defined as the download link matching with described Query Information.
In addition, link provides unit 240, specifically can be for:
The mode that the described download link matching is ejected to the drawer type bullet window in subwindow or system tray pop-up window or browser window with operating system pop-up window or browser provides.
As seen through the above description of the embodiments, those skilled in the art can be well understood to the mode that the present invention can add essential general hardware platform by software and realizes.Understanding based on such, the part that technical scheme of the present invention contributes to prior art in essence in other words can embody with the form of software product, this computer software product can be stored in storage medium, as ROM/RAM, magnetic disc, CD etc., comprise that some instructions are with so that a computer equipment (can be personal computer, server, or the network equipment etc.) carry out the method described in some part of each embodiment of the present invention or embodiment.
Each embodiment in this instructions all adopts the mode of going forward one by one to describe, between each embodiment identical similar part mutually referring to, each embodiment stresses is the difference with other embodiment.Especially, for device or system embodiment, because it is substantially similar in appearance to embodiment of the method, so describe fairly simplely, relevant part is referring to the part explanation of embodiment of the method.Apparatus and system embodiment described above is only schematic, the wherein said unit as separating component explanation can or can not be also physically to separate, the parts that show as unit can be or can not be also physical locations, can be positioned at a place, or also can be distributed in a plurality of network element.Can select according to the actual needs some or all of module wherein to realize the object of the present embodiment scheme.Those of ordinary skills, in the situation that not paying creative work, are appreciated that and implement.
Above to the method and system that download link is provided provided by the present invention, be described in detail, applied specific case herein principle of the present invention and embodiment are set forth, the explanation of above embodiment is just for helping to understand method of the present invention and core concept thereof; Meanwhile, for one of ordinary skill in the art, according to thought of the present invention, all will change in specific embodiments and applications.In sum, this description should not be construed as limitation of the present invention.

Claims (20)

1. the method that download link is provided, is characterized in that, comprising:
Obtain user and jump to the process of target web from search result web page, the network address of described search result web page;
Obtain the title of query word that the network address of described search result web page comprises and/or described target web as Query Information;
With described Query Information, inquire about preset download link storehouse, obtain the download link matching with described Query Information;
The download link matching described in providing.
2. method according to claim 1, is characterized in that, described in obtain user and jump to the process of target web from search result web page, the network address of described search result web page, comprising:
Obtain user with the redirect behavior of the mode accession page of webpage redirect;
From described redirect behavior, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page.
3. method according to claim 2, is characterized in that, described in obtain user with the redirect behavior of the mode accession page of webpage redirect, comprising:
Obtain in the process of user with the mode accession page of webpage redirect, user's identification information, the network address of the webpage of accessing, accesses time of each webpage;
Described from described redirect behavior, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page, comprising:
According to the described user's who gets identification information, the network address of the webpage of accessing, access the time of each webpage, also original subscriber's redirect behavior, and from the described redirect behavior restoring, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page.
4. method according to claim 3, is characterized in that, described in obtain in the process of user with the mode accession page of webpage redirect, user's identification information, the network address of the webpage of accessing, accesses time of each webpage, comprising:
By browser or browser plug-in, obtain in the process of user with the mode accession page of webpage redirect, user's identification information, the network address of the webpage of accessing, accesses the time of each webpage, and is recorded as daily record, by described Log Sender to described server end;
Described from described redirect behavior, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page, comprising:
By described server end, according to the information comprising in the described daily record receiving, also original subscriber's redirect behavior, and from the described redirect behavior restoring, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page.
5. according to the method described in claim 2 to 4 any one, it is characterized in that, described from described redirect behavior, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page, comprising:
From described redirect behavior, filter out user from search result web page, through the redirect of preset threshold value number of times, arrive the access behavior of target web, and obtain the network address of accessed search result web page.
6. according to the method described in claim 2 to 4 any one, it is characterized in that, described from described redirect behavior, filter out user and from search result web page, jump to the access behavior of target web, comprising:
Utilize preset regular expression, from described redirect behavior, filter out user and from search result web page, jump to the access behavior of target web.
7. according to the method described in any one in claim 1 to 4, it is characterized in that, described in obtain query word that the network address of described search result web page comprises as Query Information, comprising:
Obtain the query word comprising in the network address of described search result web page, and described query word is carried out to participle, go stop words to process; The query word obtaining after processing is as described Query Information;
Describedly with described Query Information, inquire about preset download link storehouse, obtain the download link matching with described Query Information, comprising:
With the query word obtaining after described processing, inquire about preset download link storehouse, obtain the shared word of described target pages and download link;
According to described each shared word ratio of the searching times in search result web page described in each, and the search word accounting of each shared word in this download link, determine the comprehensive weights of download link;
More described comprehensive weights and preset weight threshold, be greater than by comprehensive weights the download link that the download link of described weight threshold is defined as matching.
8. according to the method described in claim 1 to 4 any one, it is characterized in that, described in obtain described target web title as Query Information, comprising:
Obtain the title of described target web, and the title of described target web is carried out to participle and filtration treatment, using the title keyword obtaining after participle and filtration treatment as described Query Information; Wherein said filtration treatment comprises: described title is carried out to noise reduction, remove the garbage in title;
Describedly with described Query Information, inquire about preset download link storehouse, obtain the download link matching with described Query Information, comprising:
With described title keyword, inquire about preset download link storehouse, obtain the matching degree of the download link in described title keyword and download link storehouse;
More described matching degree and preset matching threshold, the download link that matching degree is greater than to described preset matching threshold is defined as the download link matching with described Query Information.
9. method according to claim 8, is characterized in that, described preset matching threshold is according to the difference of the resource class of download link and difference, and described method also comprises:
Determine the resource class of described download link;
Described matching degree and preset matching threshold, the download link that matching degree is greater than to described preset matching threshold is defined as the download link matching with described Query Information, comprising:
The matching threshold that more described matching degree is corresponding with the resource class of this download link, the download link that matching degree is greater than to described corresponding with the resource class of this download link matching threshold is defined as the download link matching with described Query Information.
10. according to the method described in claim 1 to 9 any one, it is characterized in that, described in the download link that matches described in providing, comprising:
The mode that the described download link matching is ejected to the drawer type bullet window in subwindow or system tray pop-up window or browser window with operating system pop-up window or browser provides.
11. 1 kinds of systems that download link is provided, is characterized in that, comprising:
Network address acquiring unit, jumps to the process of target web, the network address of described search result web page for obtaining user from search result web page;
Query Information acquiring unit, for the title that obtains query word that the network address of described search result web page comprises and/or described target web as Query Information;
Download link acquiring unit, for inquire about preset download link storehouse with described Query Information, obtains the download link matching with described Query Information;
Link provides unit, for the download link matching described in providing.
12. systems according to claim 11, is characterized in that, described network address acquiring unit, comprising:
Subelement is obtained in redirect behavior, for obtaining user with the redirect behavior of the mode accession page of webpage redirect;
Filter subelement, for from described redirect behavior, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page.
13. systems according to claim 12, is characterized in that, subelement is obtained in described redirect behavior, specifically for:
Obtain in the process of user with the mode accession page of webpage redirect, user's identification information, the network address of the webpage of accessing, accesses time of each webpage;
Described filtration subelement, specifically for:
According to the described user's who gets identification information, the network address of the webpage of accessing, access the time of each webpage, also original subscriber's redirect behavior, and from the described redirect behavior restoring, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page.
14. systems according to claim 13, is characterized in that, subelement is obtained in described redirect behavior, specifically for:
By browser or browser plug-in, obtain in the process of user with the mode accession page of webpage redirect, user's identification information, the network address of the webpage of accessing, accesses the time of each webpage, and is recorded as daily record, by described Log Sender to described server end;
Described filtration subelement is positioned at server end, specifically for:
Receive described daily record, and according to the information comprising in the described daily record receiving, also original subscriber's redirect behavior, and from the described redirect behavior restoring, filter out user and from search result web page, jump to the access behavior of target web, and obtain the network address of accessed search result web page.
15. according to claim 12 to the system described in 14 any one, it is characterized in that, and described filtration subelement, specifically for:
From described redirect behavior, filter out user from search result web page, through the redirect of preset threshold value number of times, arrive the access behavior of target web, and obtain the network address of accessed search result web page.
16. according to claim 12 to the system described in 14 any one, it is characterized in that, and described filtration subelement, specifically for:
Utilize preset regular expression, from described redirect behavior, filter out user and from search result web page, jump to the access behavior of target web.
17. according to claim 11 to the system described in any one in 14, it is characterized in that, described Query Information acquiring unit, comprising:
The first Query Information obtains subelement, the query word comprising for obtaining the network address of described search result web page, and described query word is carried out to participle, go stop words to process; The query word obtaining after processing is as described Query Information;
Described download link acquiring unit, comprising:
Shared word obtains subelement, for inquire about preset download link storehouse with the query word obtaining after described processing, obtains the shared word of described target pages and download link;
Comprehensive weights determining unit, for according to the searching times ratio of described each shared word search result web page described in each, and the search word accounting of each shared word in this download link, determines the comprehensive weights of download link;
Subelement is determined in the first link, for more described comprehensive weights and preset weight threshold, comprehensive weights is greater than to the download link that the download link of described weight threshold is defined as matching.
18. according to claim 11 to the system described in 14 any one, it is characterized in that, described Query Information acquiring unit, comprising:
The second Query Information obtains subelement, for obtaining the title of described target web, and the title of described target web is carried out to participle and filtration treatment, using the title keyword obtaining after participle and filtration treatment as described Query Information; Wherein said filtration treatment comprises: described title is carried out to noise reduction, remove the garbage in title;
Described download link acquiring unit, comprising:
Matching degree is obtained subelement, for inquire about preset download link storehouse with described title keyword, obtains the matching degree of the download link in described title keyword and download link storehouse;
Subelement is determined in the second link, and for more described matching degree and preset matching threshold, the download link that matching degree is greater than to described preset matching threshold is defined as the download link matching with described Query Information.
19. systems according to claim 18, is characterized in that, described preset matching threshold is according to the difference of the resource class of download link and difference, and described system also comprises:
Classification is determined subelement, for determining the resource class of described download link;
Subelement is determined in described the second link, specifically for:
The matching threshold that more described matching degree is corresponding with the resource class of this download link, the download link that matching degree is greater than to described corresponding with the resource class of this download link matching threshold is defined as the download link matching with described Query Information.
20. according to claim 11 to the system described in 19 any one, it is characterized in that, described link provides unit, specifically for:
The mode that the described download link matching is ejected to the drawer type bullet window in subwindow or system tray pop-up window or browser window with operating system pop-up window or browser provides.
CN201310476117.2A 2013-10-12 2013-10-12 The method and system of download link are provided Active CN103530364B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310476117.2A CN103530364B (en) 2013-10-12 2013-10-12 The method and system of download link are provided

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310476117.2A CN103530364B (en) 2013-10-12 2013-10-12 The method and system of download link are provided

Publications (2)

Publication Number Publication Date
CN103530364A true CN103530364A (en) 2014-01-22
CN103530364B CN103530364B (en) 2018-01-02

Family

ID=49932373

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310476117.2A Active CN103530364B (en) 2013-10-12 2013-10-12 The method and system of download link are provided

Country Status (1)

Country Link
CN (1) CN103530364B (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103838865A (en) * 2014-03-20 2014-06-04 北京奇虎科技有限公司 Method and device for mining timeliness seed page
CN104182485A (en) * 2014-08-08 2014-12-03 北京奇虎科技有限公司 Recording method and system for restarting sites
CN105095527A (en) * 2015-09-29 2015-11-25 北京奇虎科技有限公司 Search method and device based on link address
CN105183896A (en) * 2015-09-29 2015-12-23 北京奇虎科技有限公司 Searching recommending method and device based on link address
CN105701231A (en) * 2016-01-20 2016-06-22 深圳市迅雷网络技术有限公司 Network resource search system and method
CN107943893A (en) * 2017-11-16 2018-04-20 北京奇安信科技有限公司 A kind of search processing method and device based on internet
CN111177566A (en) * 2020-01-02 2020-05-19 北京字节跳动网络技术有限公司 Information processing method and device, electronic equipment and storage medium
CN111680482A (en) * 2020-05-07 2020-09-18 车智互联(北京)科技有限公司 Title image-text generation method and computing device
CN112818197A (en) * 2021-01-22 2021-05-18 北京百度网讯科技有限公司 Search method, search device, electronic equipment and storage medium
CN115374066A (en) * 2022-10-26 2022-11-22 北京芯可鉴科技有限公司 Remote visualization system and remote visualization method

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101789018A (en) * 2010-02-09 2010-07-28 清华大学 Method and device for constructing webpage click describing files based on mutual information
CN102487375A (en) * 2010-12-01 2012-06-06 腾讯科技(深圳)有限公司 Method, device and system for downloading videos online
CN102624967A (en) * 2011-01-28 2012-08-01 腾讯科技(深圳)有限公司 Method and system for realizing document downloading in mobile terminal
US20120239693A1 (en) * 2006-06-16 2012-09-20 Microsoft Corporation Online service for program lookup
US20130085987A1 (en) * 2011-09-29 2013-04-04 Hon Hai Precision Industry Co., Ltd. Downloading method and device
CN103294507A (en) * 2013-05-09 2013-09-11 优视科技有限公司 Method and device for providing information of downloading resources

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120239693A1 (en) * 2006-06-16 2012-09-20 Microsoft Corporation Online service for program lookup
CN101789018A (en) * 2010-02-09 2010-07-28 清华大学 Method and device for constructing webpage click describing files based on mutual information
CN102487375A (en) * 2010-12-01 2012-06-06 腾讯科技(深圳)有限公司 Method, device and system for downloading videos online
CN102624967A (en) * 2011-01-28 2012-08-01 腾讯科技(深圳)有限公司 Method and system for realizing document downloading in mobile terminal
US20130085987A1 (en) * 2011-09-29 2013-04-04 Hon Hai Precision Industry Co., Ltd. Downloading method and device
CN103294507A (en) * 2013-05-09 2013-09-11 优视科技有限公司 Method and device for providing information of downloading resources

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
刘宇波: "面向可下载资源的WEB搜索引擎的设计与实现", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *
姜芳: "基于BT客户端协议实现种子文件搜索", 《万方数据知识服务平台》 *
远渡重洋: "文件素材就要批量下载", 《电脑迷》 *

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103838865A (en) * 2014-03-20 2014-06-04 北京奇虎科技有限公司 Method and device for mining timeliness seed page
CN103838865B (en) * 2014-03-20 2017-04-05 北京奇虎科技有限公司 For excavating the method and device of ageing kind of subpage
CN104182485B (en) * 2014-08-08 2018-01-12 北京奇虎科技有限公司 Restart the recording method and system with website
CN104182485A (en) * 2014-08-08 2014-12-03 北京奇虎科技有限公司 Recording method and system for restarting sites
CN105095527A (en) * 2015-09-29 2015-11-25 北京奇虎科技有限公司 Search method and device based on link address
CN105183896A (en) * 2015-09-29 2015-12-23 北京奇虎科技有限公司 Searching recommending method and device based on link address
CN105701231B (en) * 2016-01-20 2018-04-20 深圳市迅雷网络技术有限公司 Internet resources search system and method
CN105701231A (en) * 2016-01-20 2016-06-22 深圳市迅雷网络技术有限公司 Network resource search system and method
CN107943893A (en) * 2017-11-16 2018-04-20 北京奇安信科技有限公司 A kind of search processing method and device based on internet
CN111177566A (en) * 2020-01-02 2020-05-19 北京字节跳动网络技术有限公司 Information processing method and device, electronic equipment and storage medium
CN111177566B (en) * 2020-01-02 2023-06-23 北京字节跳动网络技术有限公司 Information processing method, device, electronic equipment and storage medium
CN111680482A (en) * 2020-05-07 2020-09-18 车智互联(北京)科技有限公司 Title image-text generation method and computing device
CN111680482B (en) * 2020-05-07 2024-04-12 车智互联(北京)科技有限公司 Title image-text generation method and computing device
CN112818197A (en) * 2021-01-22 2021-05-18 北京百度网讯科技有限公司 Search method, search device, electronic equipment and storage medium
CN112818197B (en) * 2021-01-22 2024-02-23 北京百度网讯科技有限公司 Search method, search device, electronic equipment and storage medium
CN115374066A (en) * 2022-10-26 2022-11-22 北京芯可鉴科技有限公司 Remote visualization system and remote visualization method

Also Published As

Publication number Publication date
CN103530364B (en) 2018-01-02

Similar Documents

Publication Publication Date Title
US11847612B2 (en) Social media profiling for one or more authors using one or more social media platforms
CN103530364A (en) Method and system for providing download link
CN102693271B (en) A kind of network information recommending method and system
US20160034514A1 (en) Providing search results based on an identified user interest and relevance matching
CN101963965B (en) Document indexing method, data query method and server based on search engine
CN103744856A (en) Method, device and system for linkage extended search
CN103617266A (en) Personalized extension search method, device and system
US7962523B2 (en) System and method for detecting templates of a website using hyperlink analysis
CN104217031A (en) Method and device for classifying users according to search log data of server
CN104715064A (en) Method and server for marking keywords on webpage
CN107977678B (en) Method and apparatus for outputting information
CN105893622A (en) Polymerization search method and polymerization search system
Desai et al. Web Crawler: Review of Different Types of Web Crawler, Its Issues, Applications and Research Opportunities.
CN104391978A (en) Method and device for storing and processing web pages of browsers
CN103605848A (en) Method and device for analyzing paths
Suwaileh et al. ArabicWeb16: A new crawl for today's Arabic Web
KR101638535B1 (en) Method of detecting issue patten associated with user search word, server performing the same and storage medium storing the same
US8949254B1 (en) Enhancing the content and structure of a corpus of content
US20210109945A1 (en) Self-orchestrated system for extraction, analysis, and presentation of entity data
CN103226601A (en) Method and device for image search
CN103699590A (en) Method and server for providing graphic tutorial problem solution
CN103678601A (en) Model essay retrieval request processing method and device
CN104462241A (en) Population property classification method and device based on anchor texts and peripheral texts in URLs
KR101568800B1 (en) Real-time issue search word sorting method and system
US9081858B2 (en) Method and system for processing search queries

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant