CN106649740A - Method and device for recommending UGC (User Generated Content) data of computers, communication and consumer electronics based on search - Google Patents

Method and device for recommending UGC (User Generated Content) data of computers, communication and consumer electronics based on search Download PDF

Info

Publication number
CN106649740A
CN106649740A CN201611213542.2A CN201611213542A CN106649740A CN 106649740 A CN106649740 A CN 106649740A CN 201611213542 A CN201611213542 A CN 201611213542A CN 106649740 A CN106649740 A CN 106649740A
Authority
CN
China
Prior art keywords
ugc
classes
data
vocabulary
websites
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201611213542.2A
Other languages
Chinese (zh)
Inventor
王艳丽
陈营营
马华蓉
佟思颖
高苏丹
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qihoo Technology Co Ltd
Original Assignee
Beijing Qihoo Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qihoo Technology Co Ltd filed Critical Beijing Qihoo Technology Co Ltd
Priority to CN201611213542.2A priority Critical patent/CN106649740A/en
Publication of CN106649740A publication Critical patent/CN106649740A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a method and a device for recommending UGC (User Generated Content) data of computers, communication and consumer electronics based on search. The method comprises the following steps: aiming at one or more hot words of the computers, communication and consumer electronics, and grasping respective UGC data corresponding to one or more hot words from a plurality of UGC websites related to the computers, communication and consumer electronics; when a target search word which is related to the computers, communication and consumer electronics is received, matching the target search word with one or more hot words to obtain UGC data corresponding to the matched hot words; aggregating the obtained UGC data to a recommending item of a search result page corresponding to the target search word. According to the technical scheme provided by the embodiment of the invention, high-quality content on the UGC websites can be directly transmitted to the recommending item of the search result page corresponding to the target search word when the target search word which is related to the computers, communication and consumer electronics is received, so that the utilization rate of a search engine is improved by utilizing the advantages of the UGC websites.

Description

Recommendation method and device based on the 3C class UGC data of search
Technical field
The present invention relates to technical field of internet application, the recommendation side of particularly a kind of 3C class UGC data based on search Method and device.
Background technology
Modern network has substantial amounts of UGC (User Gernerated Content, user-generated content), and it is also referred to as UCC (User Created Content, user creates content), such as forum's note, wechat public number, top news number, interest clan note Son etc., wherein video, audio frequency that user records can be included, word content that the picture and user that user shoots is created etc., It is no lack of high-quality information in these contents, but is not fully excavated in each search engine products, and is added to correlation As a result in.
The content of the invention
In view of the above problems, it is proposed that the present invention so as to provide one kind overcome the problems referred to above or at least in part solve on State the recommendation method and corresponding device of the 3C class UGC data based on search of problem.
According to an aspect of of the present present invention, there is provided a kind of recommendation method of the 3C class UGC data based on search, including:
Based on one or more the popular vocabulary for 3C classes, from the multiple UGC websites with regard to 3C classes described one is captured The each self-corresponding UGC data of individual or multiple popular vocabulary;
When the target search word related to 3C classes is received, by the target search word and one or more of hot topics Vocabulary is matched, and obtains the corresponding UGC data of popular vocabulary for matching;
By the recommendation items of the UGC data aggregates for obtaining to the corresponding search results pages of the target search word.
Alternatively, based on one or more the popular vocabulary for 3C classes, grab from the multiple UGC websites with regard to 3C classes Before taking each self-corresponding UGC data of one or more of popular vocabulary, methods described also includes:
One or more the popular vocabulary for 3C classes recommended in crawl appointed website.
Alternatively, one or more the popular vocabulary for 3C classes recommended in appointed website are captured, including:
One or more the popular vocabulary for 3C classes recommended in crawl appointed website, and generate comprising one or The vocabulary of multiple popular vocabulary;
The popular vocabulary for 3C classes that frequency grabs recommendation in appointed website next time is captured when specifying according to first When, using the vocabulary that the popular vocabulary for grabbing is more newly-generated next time.
Alternatively, based on one or more the popular vocabulary for 3C classes, capture from the multiple UGC websites with regard to 3C classes The each self-corresponding UGC data of one or more of popular vocabulary, including:
Based on the vocabulary for generating, crawl frequency is specified to capture from the multiple UGC websites with regard to 3C classes according to second The each self-corresponding UGC data of popular vocabulary in the vocabulary.
Alternatively, the multiple UGC websites with regard to 3C classes include:Filter out in multiple UGC websites from network With regard at least one high-quality UGC website of 3C classes.
Alternatively, at least one high-quality UGC website with regard to 3C classes is filtered out in the multiple UGC websites from network, is wrapped Include:
The multiple UGC websites with regard to 3C classes in collection network;
The quality condition of the plurality of UGC websites is weighed out according to one or more measurement factors, and therefrom screens pledge Amount meets at least one UGC websites of specified quality requirements as high-quality UGC website.
Alternatively, when the measurement factor includes multiple, according to multiple measurement factors the matter of the plurality of UGC websites is weighed out Amount situation, including:
The respective weight of the plurality of measurement factor is determined based on Weight Algorithm;
Obtain the respective numerical value of the plurality of measurement factor of the plurality of UGC websites;
The respective numerical value of the plurality of measurement factor of the plurality of UGC websites and weight are weighted into summation, are obtained Comprehensive numerical value;
The quality condition of the plurality of UGC websites is weighed out according to the respective comprehensive numerical value in the plurality of UGC websites.
Alternatively, each self-corresponding UGC data of one or more of popular vocabulary are then being based on for 3C including a plurality of One or more of class are popular vocabulary, captures one or more of popular vocabulary each from the multiple UGC websites with regard to 3C classes After self-corresponding UGC data, methods described also includes:
Obtain the attribute information of UGC data;
The each self-corresponding a plurality of UGC data of one or more of popular vocabulary are arranged based on the attribute information for obtaining Sequence, the UGC data after being sorted, so as on subsequent match during popular vocabulary, there is provided the UGC after the sequence of this is popular vocabulary Data.
Alternatively, the attribute information includes at least one following:
Issuing time, user's reading number, user comment number, user reprint number, whether there is picture.
Alternatively, the recommendation items are located at the right side area of the search results pages.
Alternatively, if the right side area of the search results pages includes other recommending datas, by the UGC data aggregates for obtaining To the corresponding search results pages of the target search word recommendation items, including:
Duplicate removal process is carried out to the UGC data for obtaining according to described other recommending datas, the UGC data after duplicate removal is processed It is polymerized to the recommendation items of the corresponding search results pages of the target search word.
Alternatively, by the recommendation items of the UGC data aggregates for obtaining to the corresponding search results pages of the target search word, bag Include:
The UGC data for obtaining are polymerized in the form of carousel figure and/or Text Link corresponding to the target search word The recommendation items of search results pages.
Alternatively, in the recommendation items of the UGC data aggregates that will be obtained to the corresponding search results pages of the target search word Afterwards, methods described also includes:
The trigger action of UGC data of the counting user for representing in the search results pages, obtains statistics;
Determine whether represent the UGC data in the corresponding page of subsequent search request according to the statistics.
Alternatively, determine whether represent the UGC in the corresponding page of subsequent search request according to the statistics Data, including:
If the quantity that the statistics is the trigger action is less than specified threshold, it is determined that in subsequent search request pair No longer represent the UGC data in the page answered.
According to another aspect of the present invention, a kind of recommendation apparatus of the 3C class UGC data based on search, bag are additionally provided Include:
UGC data capture modules, are suitable to based on one or more the popular vocabulary for 3C classes, from regard to the multiple of 3C classes The each self-corresponding UGC data of one or more of popular vocabulary are captured in UGC websites;
Matching module, is suitable to when the target search word related to 3C classes is received, by the target search word with it is described One or more are popular, and vocabulary is matched, and obtains the corresponding UGC data of popular vocabulary for matching;
Recommending module, is suitable to UGC data aggregates the pushing away to the corresponding search results pages of target search word that will be obtained Recommend item.
Alternatively, described device also includes:
Vocabulary handling module, is suitable to capture one or more the popular vocabulary for 3C classes recommended in appointed website.
Alternatively, the vocabulary handling module is further adapted for:
One or more the popular vocabulary for 3C classes recommended in crawl appointed website, and generate comprising one or The vocabulary of multiple popular vocabulary;
The popular vocabulary for 3C classes that frequency grabs recommendation in appointed website next time is captured when specifying according to first When, using the vocabulary that the popular vocabulary for grabbing is more newly-generated next time.
Alternatively, the UGC data capture modules are further adapted for:
Based on the vocabulary for generating, crawl frequency is specified to capture from the multiple UGC websites with regard to 3C classes according to second The each self-corresponding UGC data of popular vocabulary in the vocabulary.
Alternatively, the multiple UGC websites with regard to 3C classes include:Filter out in multiple UGC websites from network With regard at least one high-quality UGC website of 3C classes.
Alternatively, described device also includes:
Screening module, the multiple UGC websites with regard to 3C classes being suitable in collection network;According to one or more measurement factors The quality condition of the plurality of UGC websites is weighed out, and therefrom screens mass and meet at least one UGC for specifying quality requirements Website is used as high-quality UGC website.
Alternatively, the screening module is further adapted for:
When the measurement factor includes multiple, the respective weight of the plurality of measurement factor is determined based on Weight Algorithm;
Obtain the respective numerical value of the plurality of measurement factor of the plurality of UGC websites;
The respective numerical value of the plurality of measurement factor of the plurality of UGC websites and weight are weighted into summation, are obtained Comprehensive numerical value;
The quality condition of the plurality of UGC websites is weighed out according to the respective comprehensive numerical value in the plurality of UGC websites.
Alternatively, described device also includes:
Order module, is suitable to be based on one or more the popular vocabulary for 3C classes in the UGC data capture modules, from After with regard to capturing each self-corresponding UGC data of one or more of popular vocabulary in multiple UGC websites of 3C classes, obtain The attribute information of UGC data;
The each self-corresponding a plurality of UGC data of one or more of popular vocabulary are arranged based on the attribute information for obtaining Sequence, the UGC data after being sorted, so as on subsequent match during popular vocabulary, there is provided the UGC after the sequence of this is popular vocabulary Data.
Alternatively, the attribute information includes at least one following:Issuing time, user are read number, user comment number, are used Reprint number, whether there is picture in family.
Alternatively, the recommendation items are located at the right side area of the search results pages.
Alternatively, the recommending module is further adapted for:
If the right side area of the search results pages includes other recommending datas, according to described other recommending datas to obtaining To UGC data carry out duplicate removal process, the UGC data aggregates after duplicate removal is processed to the target search word it is corresponding search knot The recommendation items of fruit page.
Alternatively, the recommending module is further adapted for:
The UGC data for obtaining are polymerized in the form of carousel figure and/or Text Link corresponding to the target search word The recommendation items of search results pages.
Alternatively, described device also includes:
Statistical module, is suitable in the recommending module that the UGC data aggregates for obtaining is corresponding to the target search word After the recommendation items of search results pages, the trigger action of UGC data of the counting user for representing in the search results pages is obtained To statistics;
Determining module, is suitable to determine whether represent institute in the corresponding page of subsequent search request according to the statistics State UGC data.
Alternatively, the determining module is further adapted for:
If the quantity that the statistics is the trigger action is less than specified threshold, it is determined that in subsequent search request pair No longer represent the UGC data in the page answered.
The embodiment of the present invention is based on one or more the popular vocabulary for 3C classes, from the multiple UGC websites with regard to 3C classes The each self-corresponding UGC data of the one or more of popular vocabulary of middle crawl, so as to search receiving the target related to 3C classes During rope word, premium content on these UGC websites is directly transparent to into the recommendation items of the corresponding search results pages of target search word, from And the advantage of UGC websites is utilized, improve the utilization rate of search engine.Further, UGC data, will be each from each UGC website Data in individual UGC websites are preposition to be represented in search results pages, goes to website to search phase by multi-pass operation without the need for user Information is closed, the retrieval cost of user is reduced, the retrieval experience of user is lifted.
Described above is only the general introduction of technical solution of the present invention, in order to better understand the technological means of the present invention, And can be practiced according to the content of specification, and in order to allow the above and other objects of the present invention, feature and advantage can Become apparent, below especially exemplified by the specific embodiment of the present invention.
According to the detailed description below in conjunction with accompanying drawing to the specific embodiment of the invention, those skilled in the art will be brighter Above-mentioned and other purposes, the advantages and features of the present invention.
Description of the drawings
By the detailed description for reading hereafter preferred embodiment, various other advantages and benefit is common for this area Technical staff will be clear from understanding.Accompanying drawing is only used for illustrating the purpose of preferred embodiment, and is not considered as to the present invention Restriction.And in whole accompanying drawing, it is denoted by the same reference numerals identical part.In the accompanying drawings:
Fig. 1 shows the flow chart of the recommendation method of the 3C class UGC data based on search according to an embodiment of the invention;
Fig. 2 shows the flow process of the recommendation method of the 3C class UGC data based on search according to another embodiment of the present invention Figure;
Fig. 3 shows the schematic diagram of the search results pages of the UGC data for including recommendation according to an embodiment of the invention;
Fig. 4 shows that the structure of the recommendation apparatus of the 3C class UGC data based on search according to an embodiment of the invention is shown It is intended to;And
Fig. 5 shows the structure of the recommendation apparatus of the 3C class UGC data based on search according to another embodiment of the present invention Schematic diagram.
Specific embodiment
The exemplary embodiment of the disclosure is more fully described below with reference to accompanying drawings.Although showing the disclosure in accompanying drawing Exemplary embodiment, it being understood, however, that may be realized in various forms the disclosure and should not be by embodiments set forth here Limited.On the contrary, there is provided these embodiments are able to be best understood from the disclosure, and can be by the scope of the present disclosure Complete conveys to those skilled in the art.
To solve above-mentioned technical problem, a kind of recommendation of the 3C class UGC data based on search is embodiments provided Method, the method can be applied on the terminal devices such as PC, smart mobile phone, panel computer.Fig. 1 is shown according to this The flow chart of the recommendation method of the 3C class UGC data based on search of a bright embodiment.As shown in figure 1, the method at least can be with S102 is comprised the following steps to step S106.
Step S102, based on one or more the popular vocabulary for 3C classes, grabs from the multiple UGC websites with regard to 3C classes Take each self-corresponding UGC data of one or more popular vocabulary.
Step S104, when the target search word related to 3C classes is received, by target search word and one or more heat Door vocabulary is matched, and obtains the corresponding UGC data of popular vocabulary for matching.
Step S106, by the recommendation items of the UGC data aggregates for obtaining to the corresponding search results pages of target search word.
The embodiment of the present invention is based on one or more the popular vocabulary for 3C classes, from the multiple UGC websites with regard to 3C classes The each self-corresponding UGC data of the one or more of popular vocabulary of middle crawl, so as to search receiving the target related to 3C classes During rope word, premium content on these UGC websites is directly transparent to into the recommendation items of the corresponding search results pages of target search word, from And the advantage of UGC websites is utilized, improve the utilization rate of search engine.Further, UGC data, will be each from each UGC website Data in individual UGC websites are preposition to be represented in search results pages, goes to website to search phase by multi-pass operation without the need for user Information is closed, the retrieval cost of user is reduced, the retrieval experience of user is lifted.
The 3C referred in above step S102 is computer (Computer), communication (Communication) and consumption electricity The abbreviation of the sub- electronic product of product (ConsumerElectronics) three.One or many for 3C classes in step S102 Individual popular vocabulary, the embodiment of the present invention can capture one or more the popular vocabulary for 3C classes recommended in appointed website, Here appointed website can such as 360 hot lists, Baidu's roll of the hour website, embodiment of the present invention not limited to this.
In the alternative embodiment of the present invention, one or more hot topics for 3C classes recommended in crawl appointed website During vocabulary, one or more the popular vocabulary for 3C classes recommended in appointed website can be captured, and generate comprising one or The vocabulary of multiple popular vocabulary;Further work as and specify crawl frequency (such as 1 or 2 hour) to grab specified net next time according to first Recommend in standing for 3C classes popular vocabulary when, it is possible to use the more newly-generated vocabulary of the popular vocabulary that grabs next time, So as on the one hand enrich the quantity of vocabulary in vocabulary, on the other hand can reduce dittograph in vocabulary and converge.
Based on the vocabulary of above-mentioned generation, step S102 captures one or more heat from the multiple UGC websites with regard to 3C classes During the door each self-corresponding UGC data of vocabulary, the vocabulary for generating can be based on, crawl frequency is specified (such as one day or three according to second It etc.) from the multiple UGC websites with regard to 3C classes capture vocabulary in each self-corresponding UGC data of popular vocabulary.
Further, UCC is also known as introduced UGC above, the word content of user's creation can be included, user shoots Picture and user video, the audio frequency etc. recorded.Additionally, PGC (Professional Generated Content, specially Industry produces content), it is the derivative concept of UGC, and the benefit of UGC is that user can freely upload content, enriches web site contents, but Be disadvantageously content quality it is very different.Compared with UGC, PGC classification is more professional, and content quality is also more guaranteed, Its curriculum offering and product edition are very professional.In fact, both UGC and PGC not contradiction, is not only mutually exclusive, Er Qiexu Complement each other.The internet content of one maturation is to product, no matter website or community, video platform, audio platform, even Media under neomorph, are required for depth and two aspects of range parallel.With reference to the characteristics of itself, UGC is responsible for content range, main Contribute flow and participation, and PGC maintains content depth, main Branding, the creation of value, both are indispensable.Due to PGC is the derivative concept of UGC, in embodiments of the present invention might as well using PGC as UGC a part.
The quality of the content provided due to UGC is very different, and the embodiment of the present invention is in order to increase the credible of 3C class UGC data Degree, the multiple UGC websites with regard to 3C classes in step S102 can be filter out in multiple UGC websites from network with regard to At least one high-quality UGC website of 3C classes.Further, filter out with regard to 3C classes in the multiple UGC websites from network During at least one high-quality UGC website, a kind of optional scheme is embodiments provided, in this scenario, can be with collecting net The multiple UGC websites with regard to 3C classes in network, and then weigh out the quality of multiple UGC websites according to one or more measurement factors Situation, and at least one UGC websites of the specified quality requirements of mass satisfaction are therefrom screened as high-quality UGC website.Here Weigh the factor can the such as confidence level of website, number of users, the visit capacity of website registered on website, the embodiment of the present invention is not It is limited to this.
When the measurement factor includes multiple, when according to multiple measurement factors come the quality condition for weighing multiple UGC websites, A kind of optional scheme is embodiments provided, in this scenario, multiple measurement factors can be determined based on Weight Algorithm Respective weight, obtains the respective numerical value of multiple measurement factors of multiple UGC websites;Subsequently by multiple weighing apparatuses of multiple UGC websites The respective numerical value of the amount factor is weighted summation with weight, obtains comprehensive numerical value, and then according to the respective synthesis in multiple UGC websites Numerical value weighs out the quality condition of multiple UGC websites.
For example, multiple UGC websites are website 1, website 2, website 3, website 4 and website 5, and multiple factors of weighing are for website Number of users, the visit capacity of website registered in confidence level, website, the respective numerical value of multiple measurement factors of website 1 is respectively P11, p12, p13, the respective numerical value of multiple measurement factors of website 2 is respectively p21, p22, p23, multiple measurements of website 3 because The respective numerical value of son is respectively p31, p32, p33, and the respective numerical value of multiple measurement factors of website 4 is respectively p41, p42, p43, The respective numerical value of multiple measurement factors of website 5 is respectively p51, p52, p53.Determine that the respective weight of multiple measurement factors is W1, w2, w3, by the respective numerical value of multiple measurement factors of multiple UGC websites and weight summation is weighted, and obtains multiple UGC The comprehensive numerical value of website.Might as well be by taking website 1 and website 2 as an example, the comprehensive numerical value of website 1 is p11 × w1+p12 after weighted sum × w2+p13 × w3, the comprehensive numerical value of website 2 is p21 × w1+p22 × w2+p23 × w3, and website 3, website 4 and website 5 are with this Analogize, no longer repeat one by one herein.
In the alternative embodiment of the present invention, the popular vocabulary of one or more for obtaining is captured in step S102 and is each corresponded to UGC data include it is a plurality of, then step S102 based on for 3C classes one or more popular vocabulary, from regard to many of 3C classes After each self-corresponding UGC data of one or more popular vocabulary are captured in individual UGC websites, the embodiment of the present invention can also be to this A little UGC data are ranked up, so as to realize optimizing the purpose of UGC data.Specifically, the embodiment of the present invention can obtain UGC numbers According to attribute information, and then based on the attribute information for obtaining to each self-corresponding a plurality of UGC data of one or more popular vocabulary It is ranked up, the UGC data after being sorted, so as on subsequent match during popular vocabulary, there is provided the sequence of this is popular vocabulary UGC data afterwards.Here attribute information can include issuing time, user read number, user comment number, user reprint number, Whether there is picture etc., embodiment of the present invention not limited to this.For example, the issuing time of UGC data can from front to back be arranged Sequence, by issuing time UGC data sortings rearward front, by the forward UGC data sortings of issuing time rear.Again for example, can It is ranked up with descending to the user comment number of UGC data, by the UGC data sortings more than user comment number front, will be used Comment number few UGC data sortings in family are rear.Again for example, when being ranked up by multiple attribute informations, it may be determined that multiple The respective weight of attribute information, obtains each self-corresponding numerical value of multiple attribute informations of UGC data, by multiple category of UGC data Property each self-corresponding numerical value of information and weight be weighted summation, obtain comprehensive numerical value;And then according to the comprehensive numerical value pair for obtaining UGC data are ranked up.
In the alternative embodiment of the present invention, the recommendation items referred in step S106 may be located at the right side of search results pages Region, so as to when the target search word related to 3C classes is received, can be by the direct transparent transmission of premium content on these UGC websites Position is recommended on right side to the corresponding search results pages of target search word, so as to using the advantage of UGC websites, improve search engine Utilization rate.
In step s 106 by the right side area of the UGC data aggregates for obtaining to the corresponding search results pages of target search word Recommendation items when, if the right side area of search results pages also include other recommending datas, then in order to the repetition for reducing data is pushed away Recommend, then duplicate removal process can be carried out to the UGC data for obtaining according to other recommending datas, the UGC data after duplicate removal is processed are gathered It is bonded to the recommendation items of the corresponding search results pages of target search word.
Further, in step s 106 by the UGC data aggregates for obtaining to the corresponding search results pages of target search word Recommendation items when, the UGC data for obtaining can also be polymerized to target search word pair in the form of carousel figure and/or Text Link The recommendation items of the search results pages answered, so that the displaying of recommending data becomes apparent from and intuitively.
Step S106 by the recommendation items of the UGC data aggregates for obtaining to the corresponding search results pages of target search word it Afterwards, the embodiment of the present invention can be judging CTR (the Click To of UGC data according to (the such as 1 hour) cycle specified time Rate, clicking rate), and processed accordingly according to judged result.Specifically, the embodiment of the present invention can be directed to counting user The trigger action of the UGC data represented in search results pages, obtains statistics, and then is subsequently being searched according to statistics determination Rope asks whether represent UGC data in the corresponding page.For example, if statistics is less than for the quantity of trigger action specifies threshold Value, it is determined that no longer represent UGC data in the corresponding page of subsequent search request, new UGC data can be waited to update After re-start and represent;If statistics is more than or equal to specified threshold for the quantity of trigger action, it is determined that in subsequent searches Ask to represent UGC data in the corresponding page.On implementing, UGC data can be set or adjusted represents weight, if system Meter result is less than specified threshold for the quantity of trigger action, then reduce UGC data represents weight so that in subsequent search request No longer represent UGC data in the corresponding page;If statistics is more than or equal to specified threshold for the quantity of trigger action, increase Big UGC data represent weight so that represent UGC data in the corresponding page of subsequent search request.It should be noted that this The implementation that place is enumerated is only illustrative, and can also be realized by arranging the modes such as label in actual applications, belongs to In protection scope of the present invention.
Various implementations of the links of the embodiment being described above shown in Fig. 1, are embodied as below by one Example realizes process come the recommendation method that the 3C class UGC data based on search of the present invention are discussed in detail.
Fig. 2 shows the flow process of the recommendation method of the 3C class UGC data based on search according to another embodiment of the present invention Figure.As shown in Fig. 2 the method at least may comprise steps of S202 to step S212.
Step S202, at least one high-quality UGC net with regard to 3C classes filtered out in the multiple UGC websites from network Stand.
In this step, can with collection network in the multiple UGC websites with regard to 3C classes, and then according to one or more weighing apparatus The amount factor weighs out the quality condition of multiple UGC websites, and therefrom screening mass meets at least one of specified quality requirements UGC websites are used as high-quality UGC website.Here the measurement factor can be such as number of users, the net registered on the confidence level of website, website Visit capacity stood etc., embodiment of the present invention not limited to this.When the measurement factor includes multiple, the side introduced above is may refer to Weighing the quality condition of multiple UGC websites, here is omitted for case.Here, at least one high-quality UGC website for filtering out can With websites such as such as interest clan, top news number, bubble net, mobile phone China, the online, Pacific Ocean computer nets in Zhong Guan-cun.
Step S204, according to first specify crawl frequency crawl appointed website in recommend for 3C classes one or more Popular vocabulary, and generate the vocabulary comprising one or more popular vocabulary.
In this step, when according to first specify crawl frequency (such as 1 or 2 hour) grab next time in appointed website Recommend for 3C classes popular vocabulary when, it is possible to use the more newly-generated vocabulary of the popular vocabulary that grabs next time, so as to On the one hand the quantity of vocabulary in vocabulary is enriched, dittograph in vocabulary on the other hand can be reduced and be converged.Here appointed website Can such as 360 hot lists, Baidu's roll of the hour website, embodiment of the present invention not limited to this.
Step S206, based on the vocabulary for generating, crawl frequency is specified from least one high-quality with regard to 3C classes according to second The each self-corresponding UGC data of popular vocabulary in vocabulary are captured in UGC websites.
Step S208, obtains the attribute information of UGC data, and then popular to one or more based on the attribute information for obtaining The each self-corresponding a plurality of UGC data of vocabulary are ranked up, the UGC data after being sorted, so as to the popular word on subsequent match During remittance, there is provided the UGC data after the sequence of this is popular vocabulary.
In this step, attribute information can include issuing time, user read number, user comment number, user reprint number, Whether there is picture etc., embodiment of the present invention not limited to this.For example, the issuing time of UGC data can from front to back be arranged Sequence, by issuing time UGC data sortings rearward front, by the forward UGC data sortings of issuing time rear.Again for example, can It is ranked up with descending to the user comment number of UGC data, by the UGC data sortings more than user comment number front, will be used Comment number few UGC data sortings in family are rear.Again for example, when being ranked up by multiple attribute informations, it may be determined that multiple The respective weight of attribute information, obtains each self-corresponding numerical value of multiple attribute informations of UGC data, by multiple category of UGC data Property each self-corresponding numerical value of information and weight be weighted summation, obtain comprehensive numerical value;And then according to the comprehensive numerical value pair for obtaining UGC data are ranked up.
Step S210, when the target search word related to 3C classes is received, by target search word and one or more heat Door vocabulary is matched, and obtains the corresponding UGC data of popular vocabulary for matching.
Step S212, by the right side area of the UGC data aggregates for obtaining to the corresponding search results pages of target search word Recommendation items.
In this step, in the right side region of the UGC data aggregates that will be obtained to the corresponding search results pages of target search word During the recommendation items in domain, if the right side area of search results pages also includes other recommending datas, then in order to reduce the repetition of data Recommend, then duplicate removal process can be carried out to the UGC data for obtaining according to other recommending datas, the UGC data after duplicate removal is processed It is polymerized to the recommendation items of the corresponding search results pages of target search word.
Further, in the right side area of the UGC data aggregates that will be obtained to the corresponding search results pages of target search word Recommendation items when, the UGC data for obtaining can also be polymerized to target search word pair in the form of carousel figure and/or Text Link The recommendation items of the search results pages answered, so that the displaying of recommending data becomes apparent from and intuitively.
When user is input into " Samsung mobile phone " in search box, the Search Results obtained using the scheme of the embodiment of the present invention As shown in figure 3, in figure 3, the right side area of search results pages shows the UGC data for having Samsung mobile phone to page, specifically includes carousel Picture and word chain.
In the alternative embodiment of the present invention, after step s 212, can be with according to week specified time (such as 1 hour) Phase judges the CTR of UGC data, and is processed accordingly according to judged result.Specifically, the embodiment of the present invention can count use The trigger action of UGC data of the family for representing in search results pages, obtains statistics, and then is determined according to statistics Whether represent UGC data in the corresponding page of subsequent search request.For example, refer to if statistics is less than for the quantity of trigger action Determine threshold value, it is determined that no longer represent UGC data in the corresponding page of subsequent search request, new UGC data can be waited Re-start after renewal and represent;If statistics is more than or equal to specified threshold for the quantity of trigger action, it is determined that follow-up Represent UGC data in the corresponding page of searching request.On implementing, UGC data can be set or adjusted represents weight, If statistics is less than specified threshold for the quantity of trigger action, reduce UGC data represents weight so that in subsequent searches Ask no longer to represent UGC data in the corresponding page;If statistics is more than or equal to specified threshold for the quantity of trigger action, Then increase UGC data represents weight so that represent UGC data in the corresponding page of subsequent search request.Need explanation It is that implementation listed herewith is only illustrative, can also be realized by arranging the modes such as label in actual applications, Belong to protection scope of the present invention.
It should be noted that in practical application, above-mentioned all optional embodiments can be with any group by the way of combining Close, form the alternative embodiment of the present invention, this is no longer going to repeat them.
The recommendation method of the 3C class UGC data based on search provided based on each embodiment above, based on same invention Design, the embodiment of the present invention additionally provides a kind of recommendation apparatus of the 3C class UGC data based on search.
Fig. 4 shows that the structure of the recommendation apparatus of the 3C class UGC data based on search according to an embodiment of the invention is shown It is intended to.As shown in figure 4, the device can at least include UGC data capture modules 410, matching module 420 and recommending module 430。
Now introduce each composition or the work(of device of the recommendation apparatus of the 3C class UGC data based on search of the embodiment of the present invention Annexation between energy and each several part:
UGC data capture modules 410, are suitable to based on one or more the popular vocabulary for 3C classes, from regard to 3C classes The each self-corresponding UGC data of one or more of popular vocabulary are captured in multiple UGC websites;
Matching module 420, is coupled with UGC data capture modules 410, is suitable to search when receiving the target related to 3C classes During rope word, the target search word is matched with one or more of popular vocabulary, obtained the popular vocabulary pair for matching The UGC data answered;
Recommending module 430, is coupled with matching module 420, is suitable to the UGC data aggregates that will be obtained to the target search The recommendation items of the corresponding search results pages of word.
In an embodiment of the present invention, as shown in figure 5, the device that figure 4 above shows can also include:
Vocabulary handling module 510, is coupled with UGC data capture modules 410, is suitable to capture the pin recommended in appointed website One or more popular vocabulary to 3C classes.
In an embodiment of the present invention, the vocabulary handling module 510 is further adapted for:
One or more the popular vocabulary for 3C classes recommended in crawl appointed website, and generate comprising one or The vocabulary of multiple popular vocabulary;
The popular vocabulary for 3C classes that frequency grabs recommendation in appointed website next time is captured when specifying according to first When, using the vocabulary that the popular vocabulary for grabbing is more newly-generated next time.
In an embodiment of the present invention, the UGC data capture modules 410 are further adapted for:
Based on the vocabulary for generating, crawl frequency is specified to capture from the multiple UGC websites with regard to 3C classes according to second The each self-corresponding UGC data of popular vocabulary in the vocabulary.
In an embodiment of the present invention, the multiple UGC websites with regard to 3C classes include:Multiple UGC nets from network At least one high-quality UGC website with regard to 3C classes filtered out in standing.
In an embodiment of the present invention, as shown in figure 5, the device that figure 4 above shows can also include:
Screening module 520, is coupled with UGC data capture modules 410, be suitable in collection network with regard to the multiple of 3C classes UGC websites;The quality condition of the plurality of UGC websites is weighed out according to one or more measurement factors, and therefrom screens pledge Amount meets at least one UGC websites of specified quality requirements as high-quality UGC website.
In an embodiment of the present invention, the screening module 520 is further adapted for:
When the measurement factor includes multiple, the respective weight of the plurality of measurement factor is determined based on Weight Algorithm;
Obtain the respective numerical value of the plurality of measurement factor of the plurality of UGC websites;
The respective numerical value of the plurality of measurement factor of the plurality of UGC websites and weight are weighted into summation, are obtained Comprehensive numerical value;
The quality condition of the plurality of UGC websites is weighed out according to the respective comprehensive numerical value in the plurality of UGC websites.
In an embodiment of the present invention, as shown in figure 5, the device that figure 4 above shows can also include:
Order module 530, is coupled with UGC data capture modules 410, matching module 420, is suitable in the UGC data Handling module 410 captures described based on one or more the popular vocabulary for 3C classes, from the multiple UGC websites with regard to 3C classes After each self-corresponding UGC data of vocabulary that one or more are popular, the attribute information of UGC data is obtained;
The each self-corresponding a plurality of UGC data of one or more of popular vocabulary are arranged based on the attribute information for obtaining Sequence, the UGC data after being sorted, so as on subsequent match during popular vocabulary, there is provided the UGC after the sequence of this is popular vocabulary Data.
In an embodiment of the present invention, the attribute information includes at least one following:Issuing time, user read number, User comment number, user reprint number, whether there is picture.
In an embodiment of the present invention, the recommendation items are located at the right side area of the search results pages.
In an embodiment of the present invention, the recommending module 430 is further adapted for:
If the right side area of the search results pages includes other recommending datas, according to described other recommending datas to obtaining To UGC data carry out duplicate removal process, the UGC data aggregates after duplicate removal is processed to the target search word it is corresponding search knot The recommendation items of fruit page.
In an embodiment of the present invention, the recommending module 430 is further adapted for:
The UGC data for obtaining are polymerized in the form of carousel figure and/or Text Link corresponding to the target search word The recommendation items of search results pages.
In an embodiment of the present invention, as shown in figure 5, the device that figure 4 above shows can also include:
Statistical module 540, is coupled with recommending module 430, is suitable to the UGC data for obtaining in the recommending module 430 It is polymerized to the recommendation items of the corresponding search results pages of the target search word, counting user is in the search results pages The trigger action of the UGC data for representing, obtains statistics;
Determining module 550, is coupled with statistical module 540, is suitable to be determined according to the statistics and asks in subsequent searches Ask and whether represent in the corresponding page UGC data.
In an embodiment of the present invention, the determining module 550 is further adapted for:
If the quantity that the statistics is the trigger action is less than specified threshold, it is determined that in subsequent search request pair No longer represent the UGC data in the page answered.
According to the combination of above-mentioned any one alternative embodiment or multiple alternative embodiments, the embodiment of the present invention can reach Following beneficial effect:
The embodiment of the present invention is based on one or more the popular vocabulary for 3C classes, from the multiple UGC websites with regard to 3C classes The each self-corresponding UGC data of the one or more of popular vocabulary of middle crawl, so as to search receiving the target related to 3C classes During rope word, premium content on these UGC websites is directly transparent to into the recommendation items of the corresponding search results pages of target search word, from And the advantage of UGC websites is utilized, improve the utilization rate of search engine.Further, UGC data, will be each from each UGC website Data in individual UGC websites are preposition to be represented in search results pages, goes to website to search phase by multi-pass operation without the need for user Information is closed, the retrieval cost of user is reduced, the retrieval experience of user is lifted.
In specification mentioned herein, a large amount of details are illustrated.It is to be appreciated, however, that the enforcement of the present invention Example can be put into practice in the case of without these details.In some instances, known method, structure is not been shown in detail And technology, so as not to obscure the understanding of this description.
Similarly, it will be appreciated that in order to simplify the disclosure and help understand one or more in each inventive aspect, exist Above in the description of the exemplary embodiment of the present invention, each feature of the present invention is grouped together into single enforcement sometimes In example, figure or descriptions thereof.However, the method for the disclosure should be construed to reflect following intention:I.e. required guarantor The more features of feature that the application claims ratio of shield is expressly recited in each claim.More precisely, such as following Claims reflect as, inventive aspect is all features less than single embodiment disclosed above.Therefore, Thus the claims for following specific embodiment are expressly incorporated in the specific embodiment, wherein each claim itself All as the separate embodiments of the present invention.
Those skilled in the art are appreciated that can be carried out adaptively to the module in the equipment in embodiment Change and they are arranged in one or more equipment different from the embodiment.Can be the module or list in embodiment Unit or component are combined into a module or unit or component, and can be divided in addition multiple submodule or subelement or Sub-component.In addition at least some in such feature and/or process or unit is excluded each other, can adopt any Combine to all features disclosed in this specification (including adjoint claim, summary and accompanying drawing) and so disclosed Where all processes or unit of method or equipment are combined.Unless expressly stated otherwise, this specification is (including adjoint power Profit is required, summary and accompanying drawing) disclosed in each feature can it is identical by offers, be equal to or the alternative features of similar purpose carry out generation Replace.
Although additionally, it will be appreciated by those of skill in the art that some embodiments described herein include other embodiments In included some features rather than further feature, but the combination of the feature of different embodiments means in of the invention Within the scope of and form different embodiments.For example, in detail in the claims, embodiment required for protection one of arbitrarily Can in any combination mode using.
The present invention all parts embodiment can be realized with hardware, or with one or more processor operation Software module realize, or with combinations thereof realization.It will be understood by those of skill in the art that can use in practice Microprocessor or digital signal processor (DSP) are realizing the 3C class UGC data based on search according to embodiments of the present invention Recommendation apparatus in some or all parts some or all functions.The present invention is also implemented as performing this In described method some or all equipment or program of device (for example, computer program and computer program Product).Such program for realizing the present invention can be stored on a computer-readable medium, either can be with one or many The form of individual signal.Such signal can be downloaded from internet website and obtained, or be provided on carrier signal, or with Any other form is provided.
It should be noted that above-described embodiment the present invention will be described rather than limits the invention, and ability Field technique personnel can design without departing from the scope of the appended claims alternative embodiment.In the claims, Any reference symbol between bracket should not be configured to limitations on claims.Word "comprising" is not excluded the presence of not Element listed in the claims or step.Word "a" or "an" before element does not exclude the presence of multiple such Element.The present invention can come real by means of the hardware for including some different elements and by means of properly programmed computer It is existing.If in the unit claim for listing equipment for drying, several in these devices can be by same hardware branch To embody.The use of word first, second, and third does not indicate that any order.These words can be explained and be run after fame Claim.
So far, although those skilled in the art will appreciate that detailed herein illustrate and describe multiple showing for the present invention Example property embodiment, but, without departing from the spirit and scope of the present invention, still can be direct according to present disclosure It is determined that or deriving many other variations or modifications for meeting the principle of the invention.Therefore, the scope of the present invention is understood that and recognizes It is set to and covers all these other variations or modifications.
The one side of the embodiment of the present invention, there is provided A1, a kind of recommendation method of the 3C class UGC data based on search, bag Include:
Based on one or more the popular vocabulary for 3C classes, from the multiple UGC websites with regard to 3C classes described one is captured The each self-corresponding UGC data of individual or multiple popular vocabulary;
When the target search word related to 3C classes is received, by the target search word and one or more of hot topics Vocabulary is matched, and obtains the corresponding UGC data of popular vocabulary for matching;
By the recommendation items of the UGC data aggregates for obtaining to the corresponding search results pages of the target search word.
A2, the method according to A1, wherein, one or more the popular vocabulary for 3C classes are being based on, from regard to 3C Before each self-corresponding UGC data of one or more of popular vocabulary are captured in multiple UGC websites of class, methods described is also wrapped Include:
One or more the popular vocabulary for 3C classes recommended in crawl appointed website.
A3, the method according to A1 or A2, wherein, capture appointed website in recommend for 3C classes one or more Popular vocabulary, including:
One or more the popular vocabulary for 3C classes recommended in crawl appointed website, and generate comprising one or The vocabulary of multiple popular vocabulary;
The popular vocabulary for 3C classes that frequency grabs recommendation in appointed website next time is captured when specifying according to first When, using the vocabulary that the popular vocabulary for grabbing is more newly-generated next time.
A4, the method according to any one of A1-A3, wherein, based on one or more the popular vocabulary for 3C classes, The each self-corresponding UGC data of one or more of popular vocabulary are captured from the multiple UGC websites with regard to 3C classes, including:
Based on the vocabulary for generating, crawl frequency is specified to capture from the multiple UGC websites with regard to 3C classes according to second The each self-corresponding UGC data of popular vocabulary in the vocabulary.
A5, the method according to any one of A1-A4, wherein, the multiple UGC websites with regard to 3C classes include:From At least one high-quality UGC website with regard to 3C classes filtered out in multiple UGC websites in network.
A6, the method according to any one of A1-A5, wherein, filter out in the multiple UGC websites from network with regard to At least one high-quality UGC website of 3C classes, including:
The multiple UGC websites with regard to 3C classes in collection network;
The quality condition of the plurality of UGC websites is weighed out according to one or more measurement factors, and therefrom screens pledge Amount meets at least one UGC websites of specified quality requirements as high-quality UGC website.
A7, the method according to any one of A1-A6, wherein, when the measurement factor includes multiple, according to multiple measurements The factor weighs out the quality condition of the plurality of UGC websites, including:
The respective weight of the plurality of measurement factor is determined based on Weight Algorithm;
Obtain the respective numerical value of the plurality of measurement factor of the plurality of UGC websites;
The respective numerical value of the plurality of measurement factor of the plurality of UGC websites and weight are weighted into summation, are obtained Comprehensive numerical value;
The quality condition of the plurality of UGC websites is weighed out according to the respective comprehensive numerical value in the plurality of UGC websites.
A8, the method according to any one of A1-A7, wherein, one or more of popular vocabulary are each self-corresponding UGC data are then being based on one or more the popular vocabulary for 3C classes, from the multiple UGC websites with regard to 3C classes including a plurality of Middle to capture after each self-corresponding UGC data of one or more of popular vocabulary, methods described also includes:
Obtain the attribute information of UGC data;
The each self-corresponding a plurality of UGC data of one or more of popular vocabulary are arranged based on the attribute information for obtaining Sequence, the UGC data after being sorted, so as on subsequent match during popular vocabulary, there is provided the UGC after the sequence of this is popular vocabulary Data.
A9, the method according to any one of A1-A8, wherein, the attribute information includes at least one following:
Issuing time, user's reading number, user comment number, user reprint number, whether there is picture.
A10, the method according to any one of A1-A9, wherein, the recommendation items are located at the right side of the search results pages Side region.
A11, the method according to any one of A1-A10, wherein, if the right side area of the search results pages is included Other recommending datas, by the recommendation items of the UGC data aggregates for obtaining to the corresponding search results pages of the target search word, bag Include:
Duplicate removal process is carried out to the UGC data for obtaining according to described other recommending datas, the UGC data after duplicate removal is processed It is polymerized to the recommendation items of the corresponding search results pages of the target search word.
A12, the method according to any one of A1-A11, wherein, the UGC data aggregates for obtaining are searched to the target The recommendation items of the corresponding search results pages of rope word, including:
The UGC data for obtaining are polymerized in the form of carousel figure and/or Text Link corresponding to the target search word The recommendation items of search results pages.
A13, the method according to any one of A1-A12, wherein, in the UGC data aggregates that will be obtained to the target After the recommendation items of the corresponding search results pages of search word, methods described also includes:
The trigger action of UGC data of the counting user for representing in the search results pages, obtains statistics;
Determine whether represent the UGC data in the corresponding page of subsequent search request according to the statistics.
A14, the method according to any one of A1-A13, wherein, determined in subsequent searches according to the statistics Ask whether to represent the UGC data in the corresponding page, including:
If the quantity that the statistics is the trigger action is less than specified threshold, it is determined that in subsequent search request pair No longer represent the UGC data in the page answered.
The another aspect of the embodiment of the present invention, additionally provides the recommendation dress of B15, a kind of 3C class UGC data based on search Put, including:
UGC data capture modules, are suitable to based on one or more the popular vocabulary for 3C classes, from regard to the multiple of 3C classes The each self-corresponding UGC data of one or more of popular vocabulary are captured in UGC websites;
Matching module, is suitable to when the target search word related to 3C classes is received, by the target search word with it is described One or more are popular, and vocabulary is matched, and obtains the corresponding UGC data of popular vocabulary for matching;
Recommending module, is suitable to UGC data aggregates the pushing away to the corresponding search results pages of target search word that will be obtained Recommend item.
B16, the device according to B15, wherein, also include:
Vocabulary handling module, is suitable to capture one or more the popular vocabulary for 3C classes recommended in appointed website.
B17, the device according to B15 or B16, wherein, the vocabulary handling module is further adapted for:
One or more the popular vocabulary for 3C classes recommended in crawl appointed website, and generate comprising one or The vocabulary of multiple popular vocabulary;
The popular vocabulary for 3C classes that frequency grabs recommendation in appointed website next time is captured when specifying according to first When, using the vocabulary that the popular vocabulary for grabbing is more newly-generated next time.
B18, the device according to any one of B15-B17, wherein, the UGC data capture modules are further adapted for:
Based on the vocabulary for generating, crawl frequency is specified to capture from the multiple UGC websites with regard to 3C classes according to second The each self-corresponding UGC data of popular vocabulary in the vocabulary.
B19, the device according to any one of B15-B18, wherein, the multiple UGC websites with regard to 3C classes include: At least one high-quality UGC website with regard to 3C classes filtered out in multiple UGC websites from network.
B20, the device according to any one of B15-B19, wherein, also include:
Screening module, the multiple UGC websites with regard to 3C classes being suitable in collection network;According to one or more measurement factors The quality condition of the plurality of UGC websites is weighed out, and therefrom screens mass and meet at least one UGC for specifying quality requirements Website is used as high-quality UGC website.
B21, the device according to any one of B15-B20, wherein, the screening module is further adapted for:
When the measurement factor includes multiple, the respective weight of the plurality of measurement factor is determined based on Weight Algorithm;
Obtain the respective numerical value of the plurality of measurement factor of the plurality of UGC websites;
The respective numerical value of the plurality of measurement factor of the plurality of UGC websites and weight are weighted into summation, are obtained Comprehensive numerical value;
The quality condition of the plurality of UGC websites is weighed out according to the respective comprehensive numerical value in the plurality of UGC websites.
B22, the device according to any one of B15-B21, wherein, also include:
Order module, is suitable to be based on one or more the popular vocabulary for 3C classes in the UGC data capture modules, from After with regard to capturing each self-corresponding UGC data of one or more of popular vocabulary in multiple UGC websites of 3C classes, obtain The attribute information of UGC data;
The each self-corresponding a plurality of UGC data of one or more of popular vocabulary are arranged based on the attribute information for obtaining Sequence, the UGC data after being sorted, so as on subsequent match during popular vocabulary, there is provided the UGC after the sequence of this is popular vocabulary Data.
B23, the device according to any one of B15-B22, wherein, the attribute information includes at least one following: Issuing time, user's reading number, user comment number, user reprint number, whether there is picture.
B24, the device according to any one of B15-B23, wherein, the recommendation items are located at the search results pages Right side area.
B25, the device according to any one of B15-B24, wherein, the recommending module is further adapted for:
If the right side area of the search results pages includes other recommending datas, according to described other recommending datas to obtaining To UGC data carry out duplicate removal process, the UGC data aggregates after duplicate removal is processed to the target search word it is corresponding search knot The recommendation items of fruit page.
B26, the device according to any one of B15-B25, wherein, the recommending module is further adapted for:
The UGC data for obtaining are polymerized in the form of carousel figure and/or Text Link corresponding to the target search word The recommendation items of search results pages.
B27, the device according to any one of B15-B26, wherein, also include:
Statistical module, is suitable in the recommending module that the UGC data aggregates for obtaining is corresponding to the target search word After the recommendation items of search results pages, the trigger action of UGC data of the counting user for representing in the search results pages is obtained To statistics;
Determining module, is suitable to determine whether represent institute in the corresponding page of subsequent search request according to the statistics State UGC data.
B28, the device according to any one of B15-B27, wherein, the determining module is further adapted for:
If the quantity that the statistics is the trigger action is less than specified threshold, it is determined that in subsequent search request pair No longer represent the UGC data in the page answered.

Claims (10)

1. it is a kind of based on search 3C class UGC data recommendation method, including:
Based on for 3C classes one or more popular vocabulary, capture from the multiple UGC websites with regard to 3C classes it is one or The each self-corresponding UGC data of multiple popular vocabulary;
When the target search word related to 3C classes is received, by the target search word and one or more of popular vocabulary Matched, obtained the corresponding UGC data of popular vocabulary for matching;
By the recommendation items of the UGC data aggregates for obtaining to the corresponding search results pages of the target search word.
2. method according to claim 1, wherein, based on one or more the popular vocabulary for 3C classes, from regard to Before each self-corresponding UGC data of one or more of popular vocabulary are captured in multiple UGC websites of 3C classes, methods described is also Including:
One or more the popular vocabulary for 3C classes recommended in crawl appointed website.
3. method according to claim 1 and 2, wherein, capture or many for 3C classes recommended in appointed website Individual popular vocabulary, including:
One or more the popular vocabulary for 3C classes recommended in crawl appointed website, and generate comprising one or more of The vocabulary of popular vocabulary;
When according to first specify crawl frequency grab next time recommend in appointed website for 3C classes popular vocabulary when, profit With the vocabulary that the popular vocabulary for grabbing is more newly-generated next time.
4. the method according to any one of claim 1-3, wherein, based on one or more the popular words for 3C classes Converge, each self-corresponding UGC data of one or more of popular vocabulary are captured from the multiple UGC websites with regard to 3C classes, including:
Based on the vocabulary for generating, crawl frequency is specified to capture from the multiple UGC websites with regard to 3C classes according to second described The each self-corresponding UGC data of popular vocabulary in vocabulary.
5. the method according to any one of claim 1-4, wherein, the multiple UGC websites with regard to 3C classes include:From At least one high-quality UGC website with regard to 3C classes filtered out in multiple UGC websites in network.
6. the method according to any one of claim 1-5, wherein, filter out pass in the multiple UGC websites from network In at least one high-quality UGC website of 3C classes, including:
The multiple UGC websites with regard to 3C classes in collection network;
Weighing out the quality condition of the plurality of UGC websites according to one or more measurement factors, and therefrom screen mass expires Toe determines at least one UGC websites of quality requirements as high-quality UGC website.
7. the method according to any one of claim 1-6, wherein, when weighing the factor and including multiple, according to multiple weighing apparatuses The amount factor weighs out the quality condition of the plurality of UGC websites, including:
The respective weight of the plurality of measurement factor is determined based on Weight Algorithm;
Obtain the respective numerical value of the plurality of measurement factor of the plurality of UGC websites;
The respective numerical value of the plurality of measurement factor of the plurality of UGC websites and weight are weighted into summation, synthesis is obtained Numerical value;
The quality condition of the plurality of UGC websites is weighed out according to the respective comprehensive numerical value in the plurality of UGC websites.
8. the method according to any one of claim 1-7, wherein, one or more of popular vocabulary are each self-corresponding UGC data are then being based on one or more the popular vocabulary for 3C classes, from the multiple UGC websites with regard to 3C classes including a plurality of Middle to capture after each self-corresponding UGC data of one or more of popular vocabulary, methods described also includes:
Obtain the attribute information of UGC data;
The each self-corresponding a plurality of UGC data of one or more of popular vocabulary are ranked up based on the attribute information for obtaining, UGC data after being sorted, so as on subsequent match during popular vocabulary, there is provided the UGC numbers after the sequence of this is popular vocabulary According to.
9. the method according to any one of claim 1-8, wherein, the attribute information includes at least one following:
Issuing time, user's reading number, user comment number, user reprint number, whether there is picture.
10. it is a kind of based on search 3C class UGC data recommendation apparatus, including:
UGC data capture modules, are suitable to based on one or more the popular vocabulary for 3C classes, from the multiple UGC with regard to 3C classes The each self-corresponding UGC data of one or more of popular vocabulary are captured in website;
Matching module, is suitable to when the target search word related to 3C classes is received, by the target search word with it is one Or multiple popular vocabulary are matched, the corresponding UGC data of popular vocabulary for matching are obtained;
Recommending module, is suitable to the recommendation items of UGC data aggregates to the corresponding search results pages of the target search word that will be obtained.
CN201611213542.2A 2016-12-23 2016-12-23 Method and device for recommending UGC (User Generated Content) data of computers, communication and consumer electronics based on search Pending CN106649740A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611213542.2A CN106649740A (en) 2016-12-23 2016-12-23 Method and device for recommending UGC (User Generated Content) data of computers, communication and consumer electronics based on search

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611213542.2A CN106649740A (en) 2016-12-23 2016-12-23 Method and device for recommending UGC (User Generated Content) data of computers, communication and consumer electronics based on search

Publications (1)

Publication Number Publication Date
CN106649740A true CN106649740A (en) 2017-05-10

Family

ID=58827797

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611213542.2A Pending CN106649740A (en) 2016-12-23 2016-12-23 Method and device for recommending UGC (User Generated Content) data of computers, communication and consumer electronics based on search

Country Status (1)

Country Link
CN (1) CN106649740A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111310018A (en) * 2018-12-11 2020-06-19 阿里巴巴集团控股有限公司 Determining method of timeliness search vocabulary and search engine
CN111310017A (en) * 2018-12-11 2020-06-19 阿里巴巴集团控股有限公司 Method and device for generating timeliness scene content
CN111309999A (en) * 2018-12-11 2020-06-19 阿里巴巴集团控股有限公司 Method and device for generating interactive scene content

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103605808A (en) * 2013-12-10 2014-02-26 合一网络技术(北京)有限公司 Search-based UGC (user generated content) recommendation method and search-based UGC recommendation system
CN104008139A (en) * 2014-05-08 2014-08-27 北京奇艺世纪科技有限公司 Method and device for creating video index table and method and device for recommending video
CN104239495A (en) * 2014-09-09 2014-12-24 百度在线网络技术(北京)有限公司 Search method and search device
CN105354227A (en) * 2015-09-30 2016-02-24 北京奇虎科技有限公司 Search-based method and apparatus for providing high-quality comment information
CN105740473A (en) * 2016-03-14 2016-07-06 腾讯科技(深圳)有限公司 User-generated content display method and device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103605808A (en) * 2013-12-10 2014-02-26 合一网络技术(北京)有限公司 Search-based UGC (user generated content) recommendation method and search-based UGC recommendation system
CN104008139A (en) * 2014-05-08 2014-08-27 北京奇艺世纪科技有限公司 Method and device for creating video index table and method and device for recommending video
CN104239495A (en) * 2014-09-09 2014-12-24 百度在线网络技术(北京)有限公司 Search method and search device
CN105354227A (en) * 2015-09-30 2016-02-24 北京奇虎科技有限公司 Search-based method and apparatus for providing high-quality comment information
CN105740473A (en) * 2016-03-14 2016-07-06 腾讯科技(深圳)有限公司 User-generated content display method and device

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111310018A (en) * 2018-12-11 2020-06-19 阿里巴巴集团控股有限公司 Determining method of timeliness search vocabulary and search engine
CN111310017A (en) * 2018-12-11 2020-06-19 阿里巴巴集团控股有限公司 Method and device for generating timeliness scene content
CN111309999A (en) * 2018-12-11 2020-06-19 阿里巴巴集团控股有限公司 Method and device for generating interactive scene content
CN111310017B (en) * 2018-12-11 2023-05-12 阿里巴巴集团控股有限公司 Method and device for generating time-efficient scene content
CN111309999B (en) * 2018-12-11 2023-05-16 阿里巴巴集团控股有限公司 Method and device for generating interactive scene content
CN111310018B (en) * 2018-12-11 2024-03-01 阿里巴巴集团控股有限公司 Method for determining timeliness search vocabulary and search engine

Similar Documents

Publication Publication Date Title
CN105701216B (en) A kind of information-pushing method and device
CN103778548B (en) Merchandise news and key word matching method, merchandise news put-on method and device
CN105956161B (en) A kind of information recommendation method and device
CN105989074B (en) A kind of method and apparatus recommend by mobile device information cold start-up
CN104965905B (en) A kind of method and apparatus of Web page classifying
CN106708821A (en) User personalized shopping behavior-based commodity recommendation method
CN107679211A (en) Method and apparatus for pushed information
CN107229730A (en) Data query method and device
CN106709777A (en) Order clustering method and apparatus thereof, and anti-malicious information method and apparatus thereof
CN106777206A (en) Movie and television play class keywords search for exhibiting method and device
CN102663064B (en) A kind of disposal route of favorites data and device
CN104766224B (en) A kind of shopping evaluation display method and system
CN106649740A (en) Method and device for recommending UGC (User Generated Content) data of computers, communication and consumer electronics based on search
CN107783993A (en) The storage method and device of data
CN106709073A (en) Browser notification pushing method and browser terminal
CN109189990A (en) A kind of generation method of search term, device and electronic equipment
CN106649738A (en) Method and device for aggregating personage information message in search engine result page
CN109727047A (en) A kind of method and apparatus, data recommendation method and the device of determining data correlation degree
CN108090807A (en) Information recommendation method and device
CN109949172A (en) Social account influence power evaluation method, device and storage medium
CN106919582A (en) The association of network articles and related information statistical method and device
CN109446431A (en) For the method, apparatus of information recommendation, medium and calculate equipment
CN112100221A (en) Information recommendation method and device, recommendation server and storage medium
CN106844488A (en) With reference to the stock class UGC data recommendation methods and device of search
CN108268357A (en) real-time data processing method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20170510