CN106649740A - Method and device for recommending UGC (User Generated Content) data of computers, communication and consumer electronics based on search - Google Patents
Method and device for recommending UGC (User Generated Content) data of computers, communication and consumer electronics based on search Download PDFInfo
- Publication number
- CN106649740A CN106649740A CN201611213542.2A CN201611213542A CN106649740A CN 106649740 A CN106649740 A CN 106649740A CN 201611213542 A CN201611213542 A CN 201611213542A CN 106649740 A CN106649740 A CN 106649740A
- Authority
- CN
- China
- Prior art keywords
- ugc
- classes
- data
- vocabulary
- websites
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention provides a method and a device for recommending UGC (User Generated Content) data of computers, communication and consumer electronics based on search. The method comprises the following steps: aiming at one or more hot words of the computers, communication and consumer electronics, and grasping respective UGC data corresponding to one or more hot words from a plurality of UGC websites related to the computers, communication and consumer electronics; when a target search word which is related to the computers, communication and consumer electronics is received, matching the target search word with one or more hot words to obtain UGC data corresponding to the matched hot words; aggregating the obtained UGC data to a recommending item of a search result page corresponding to the target search word. According to the technical scheme provided by the embodiment of the invention, high-quality content on the UGC websites can be directly transmitted to the recommending item of the search result page corresponding to the target search word when the target search word which is related to the computers, communication and consumer electronics is received, so that the utilization rate of a search engine is improved by utilizing the advantages of the UGC websites.
Description
Technical field
The present invention relates to technical field of internet application, the recommendation side of particularly a kind of 3C class UGC data based on search
Method and device.
Background technology
Modern network has substantial amounts of UGC (User Gernerated Content, user-generated content), and it is also referred to as
UCC (User Created Content, user creates content), such as forum's note, wechat public number, top news number, interest clan note
Son etc., wherein video, audio frequency that user records can be included, word content that the picture and user that user shoots is created etc.,
It is no lack of high-quality information in these contents, but is not fully excavated in each search engine products, and is added to correlation
As a result in.
The content of the invention
In view of the above problems, it is proposed that the present invention so as to provide one kind overcome the problems referred to above or at least in part solve on
State the recommendation method and corresponding device of the 3C class UGC data based on search of problem.
According to an aspect of of the present present invention, there is provided a kind of recommendation method of the 3C class UGC data based on search, including:
Based on one or more the popular vocabulary for 3C classes, from the multiple UGC websites with regard to 3C classes described one is captured
The each self-corresponding UGC data of individual or multiple popular vocabulary;
When the target search word related to 3C classes is received, by the target search word and one or more of hot topics
Vocabulary is matched, and obtains the corresponding UGC data of popular vocabulary for matching;
By the recommendation items of the UGC data aggregates for obtaining to the corresponding search results pages of the target search word.
Alternatively, based on one or more the popular vocabulary for 3C classes, grab from the multiple UGC websites with regard to 3C classes
Before taking each self-corresponding UGC data of one or more of popular vocabulary, methods described also includes:
One or more the popular vocabulary for 3C classes recommended in crawl appointed website.
Alternatively, one or more the popular vocabulary for 3C classes recommended in appointed website are captured, including:
One or more the popular vocabulary for 3C classes recommended in crawl appointed website, and generate comprising one or
The vocabulary of multiple popular vocabulary;
The popular vocabulary for 3C classes that frequency grabs recommendation in appointed website next time is captured when specifying according to first
When, using the vocabulary that the popular vocabulary for grabbing is more newly-generated next time.
Alternatively, based on one or more the popular vocabulary for 3C classes, capture from the multiple UGC websites with regard to 3C classes
The each self-corresponding UGC data of one or more of popular vocabulary, including:
Based on the vocabulary for generating, crawl frequency is specified to capture from the multiple UGC websites with regard to 3C classes according to second
The each self-corresponding UGC data of popular vocabulary in the vocabulary.
Alternatively, the multiple UGC websites with regard to 3C classes include:Filter out in multiple UGC websites from network
With regard at least one high-quality UGC website of 3C classes.
Alternatively, at least one high-quality UGC website with regard to 3C classes is filtered out in the multiple UGC websites from network, is wrapped
Include:
The multiple UGC websites with regard to 3C classes in collection network;
The quality condition of the plurality of UGC websites is weighed out according to one or more measurement factors, and therefrom screens pledge
Amount meets at least one UGC websites of specified quality requirements as high-quality UGC website.
Alternatively, when the measurement factor includes multiple, according to multiple measurement factors the matter of the plurality of UGC websites is weighed out
Amount situation, including:
The respective weight of the plurality of measurement factor is determined based on Weight Algorithm;
Obtain the respective numerical value of the plurality of measurement factor of the plurality of UGC websites;
The respective numerical value of the plurality of measurement factor of the plurality of UGC websites and weight are weighted into summation, are obtained
Comprehensive numerical value;
The quality condition of the plurality of UGC websites is weighed out according to the respective comprehensive numerical value in the plurality of UGC websites.
Alternatively, each self-corresponding UGC data of one or more of popular vocabulary are then being based on for 3C including a plurality of
One or more of class are popular vocabulary, captures one or more of popular vocabulary each from the multiple UGC websites with regard to 3C classes
After self-corresponding UGC data, methods described also includes:
Obtain the attribute information of UGC data;
The each self-corresponding a plurality of UGC data of one or more of popular vocabulary are arranged based on the attribute information for obtaining
Sequence, the UGC data after being sorted, so as on subsequent match during popular vocabulary, there is provided the UGC after the sequence of this is popular vocabulary
Data.
Alternatively, the attribute information includes at least one following:
Issuing time, user's reading number, user comment number, user reprint number, whether there is picture.
Alternatively, the recommendation items are located at the right side area of the search results pages.
Alternatively, if the right side area of the search results pages includes other recommending datas, by the UGC data aggregates for obtaining
To the corresponding search results pages of the target search word recommendation items, including:
Duplicate removal process is carried out to the UGC data for obtaining according to described other recommending datas, the UGC data after duplicate removal is processed
It is polymerized to the recommendation items of the corresponding search results pages of the target search word.
Alternatively, by the recommendation items of the UGC data aggregates for obtaining to the corresponding search results pages of the target search word, bag
Include:
The UGC data for obtaining are polymerized in the form of carousel figure and/or Text Link corresponding to the target search word
The recommendation items of search results pages.
Alternatively, in the recommendation items of the UGC data aggregates that will be obtained to the corresponding search results pages of the target search word
Afterwards, methods described also includes:
The trigger action of UGC data of the counting user for representing in the search results pages, obtains statistics;
Determine whether represent the UGC data in the corresponding page of subsequent search request according to the statistics.
Alternatively, determine whether represent the UGC in the corresponding page of subsequent search request according to the statistics
Data, including:
If the quantity that the statistics is the trigger action is less than specified threshold, it is determined that in subsequent search request pair
No longer represent the UGC data in the page answered.
According to another aspect of the present invention, a kind of recommendation apparatus of the 3C class UGC data based on search, bag are additionally provided
Include:
UGC data capture modules, are suitable to based on one or more the popular vocabulary for 3C classes, from regard to the multiple of 3C classes
The each self-corresponding UGC data of one or more of popular vocabulary are captured in UGC websites;
Matching module, is suitable to when the target search word related to 3C classes is received, by the target search word with it is described
One or more are popular, and vocabulary is matched, and obtains the corresponding UGC data of popular vocabulary for matching;
Recommending module, is suitable to UGC data aggregates the pushing away to the corresponding search results pages of target search word that will be obtained
Recommend item.
Alternatively, described device also includes:
Vocabulary handling module, is suitable to capture one or more the popular vocabulary for 3C classes recommended in appointed website.
Alternatively, the vocabulary handling module is further adapted for:
One or more the popular vocabulary for 3C classes recommended in crawl appointed website, and generate comprising one or
The vocabulary of multiple popular vocabulary;
The popular vocabulary for 3C classes that frequency grabs recommendation in appointed website next time is captured when specifying according to first
When, using the vocabulary that the popular vocabulary for grabbing is more newly-generated next time.
Alternatively, the UGC data capture modules are further adapted for:
Based on the vocabulary for generating, crawl frequency is specified to capture from the multiple UGC websites with regard to 3C classes according to second
The each self-corresponding UGC data of popular vocabulary in the vocabulary.
Alternatively, the multiple UGC websites with regard to 3C classes include:Filter out in multiple UGC websites from network
With regard at least one high-quality UGC website of 3C classes.
Alternatively, described device also includes:
Screening module, the multiple UGC websites with regard to 3C classes being suitable in collection network;According to one or more measurement factors
The quality condition of the plurality of UGC websites is weighed out, and therefrom screens mass and meet at least one UGC for specifying quality requirements
Website is used as high-quality UGC website.
Alternatively, the screening module is further adapted for:
When the measurement factor includes multiple, the respective weight of the plurality of measurement factor is determined based on Weight Algorithm;
Obtain the respective numerical value of the plurality of measurement factor of the plurality of UGC websites;
The respective numerical value of the plurality of measurement factor of the plurality of UGC websites and weight are weighted into summation, are obtained
Comprehensive numerical value;
The quality condition of the plurality of UGC websites is weighed out according to the respective comprehensive numerical value in the plurality of UGC websites.
Alternatively, described device also includes:
Order module, is suitable to be based on one or more the popular vocabulary for 3C classes in the UGC data capture modules, from
After with regard to capturing each self-corresponding UGC data of one or more of popular vocabulary in multiple UGC websites of 3C classes, obtain
The attribute information of UGC data;
The each self-corresponding a plurality of UGC data of one or more of popular vocabulary are arranged based on the attribute information for obtaining
Sequence, the UGC data after being sorted, so as on subsequent match during popular vocabulary, there is provided the UGC after the sequence of this is popular vocabulary
Data.
Alternatively, the attribute information includes at least one following:Issuing time, user are read number, user comment number, are used
Reprint number, whether there is picture in family.
Alternatively, the recommendation items are located at the right side area of the search results pages.
Alternatively, the recommending module is further adapted for:
If the right side area of the search results pages includes other recommending datas, according to described other recommending datas to obtaining
To UGC data carry out duplicate removal process, the UGC data aggregates after duplicate removal is processed to the target search word it is corresponding search knot
The recommendation items of fruit page.
Alternatively, the recommending module is further adapted for:
The UGC data for obtaining are polymerized in the form of carousel figure and/or Text Link corresponding to the target search word
The recommendation items of search results pages.
Alternatively, described device also includes:
Statistical module, is suitable in the recommending module that the UGC data aggregates for obtaining is corresponding to the target search word
After the recommendation items of search results pages, the trigger action of UGC data of the counting user for representing in the search results pages is obtained
To statistics;
Determining module, is suitable to determine whether represent institute in the corresponding page of subsequent search request according to the statistics
State UGC data.
Alternatively, the determining module is further adapted for:
If the quantity that the statistics is the trigger action is less than specified threshold, it is determined that in subsequent search request pair
No longer represent the UGC data in the page answered.
The embodiment of the present invention is based on one or more the popular vocabulary for 3C classes, from the multiple UGC websites with regard to 3C classes
The each self-corresponding UGC data of the one or more of popular vocabulary of middle crawl, so as to search receiving the target related to 3C classes
During rope word, premium content on these UGC websites is directly transparent to into the recommendation items of the corresponding search results pages of target search word, from
And the advantage of UGC websites is utilized, improve the utilization rate of search engine.Further, UGC data, will be each from each UGC website
Data in individual UGC websites are preposition to be represented in search results pages, goes to website to search phase by multi-pass operation without the need for user
Information is closed, the retrieval cost of user is reduced, the retrieval experience of user is lifted.
Described above is only the general introduction of technical solution of the present invention, in order to better understand the technological means of the present invention,
And can be practiced according to the content of specification, and in order to allow the above and other objects of the present invention, feature and advantage can
Become apparent, below especially exemplified by the specific embodiment of the present invention.
According to the detailed description below in conjunction with accompanying drawing to the specific embodiment of the invention, those skilled in the art will be brighter
Above-mentioned and other purposes, the advantages and features of the present invention.
Description of the drawings
By the detailed description for reading hereafter preferred embodiment, various other advantages and benefit is common for this area
Technical staff will be clear from understanding.Accompanying drawing is only used for illustrating the purpose of preferred embodiment, and is not considered as to the present invention
Restriction.And in whole accompanying drawing, it is denoted by the same reference numerals identical part.In the accompanying drawings:
Fig. 1 shows the flow chart of the recommendation method of the 3C class UGC data based on search according to an embodiment of the invention;
Fig. 2 shows the flow process of the recommendation method of the 3C class UGC data based on search according to another embodiment of the present invention
Figure;
Fig. 3 shows the schematic diagram of the search results pages of the UGC data for including recommendation according to an embodiment of the invention;
Fig. 4 shows that the structure of the recommendation apparatus of the 3C class UGC data based on search according to an embodiment of the invention is shown
It is intended to;And
Fig. 5 shows the structure of the recommendation apparatus of the 3C class UGC data based on search according to another embodiment of the present invention
Schematic diagram.
Specific embodiment
The exemplary embodiment of the disclosure is more fully described below with reference to accompanying drawings.Although showing the disclosure in accompanying drawing
Exemplary embodiment, it being understood, however, that may be realized in various forms the disclosure and should not be by embodiments set forth here
Limited.On the contrary, there is provided these embodiments are able to be best understood from the disclosure, and can be by the scope of the present disclosure
Complete conveys to those skilled in the art.
To solve above-mentioned technical problem, a kind of recommendation of the 3C class UGC data based on search is embodiments provided
Method, the method can be applied on the terminal devices such as PC, smart mobile phone, panel computer.Fig. 1 is shown according to this
The flow chart of the recommendation method of the 3C class UGC data based on search of a bright embodiment.As shown in figure 1, the method at least can be with
S102 is comprised the following steps to step S106.
Step S102, based on one or more the popular vocabulary for 3C classes, grabs from the multiple UGC websites with regard to 3C classes
Take each self-corresponding UGC data of one or more popular vocabulary.
Step S104, when the target search word related to 3C classes is received, by target search word and one or more heat
Door vocabulary is matched, and obtains the corresponding UGC data of popular vocabulary for matching.
Step S106, by the recommendation items of the UGC data aggregates for obtaining to the corresponding search results pages of target search word.
The embodiment of the present invention is based on one or more the popular vocabulary for 3C classes, from the multiple UGC websites with regard to 3C classes
The each self-corresponding UGC data of the one or more of popular vocabulary of middle crawl, so as to search receiving the target related to 3C classes
During rope word, premium content on these UGC websites is directly transparent to into the recommendation items of the corresponding search results pages of target search word, from
And the advantage of UGC websites is utilized, improve the utilization rate of search engine.Further, UGC data, will be each from each UGC website
Data in individual UGC websites are preposition to be represented in search results pages, goes to website to search phase by multi-pass operation without the need for user
Information is closed, the retrieval cost of user is reduced, the retrieval experience of user is lifted.
The 3C referred in above step S102 is computer (Computer), communication (Communication) and consumption electricity
The abbreviation of the sub- electronic product of product (ConsumerElectronics) three.One or many for 3C classes in step S102
Individual popular vocabulary, the embodiment of the present invention can capture one or more the popular vocabulary for 3C classes recommended in appointed website,
Here appointed website can such as 360 hot lists, Baidu's roll of the hour website, embodiment of the present invention not limited to this.
In the alternative embodiment of the present invention, one or more hot topics for 3C classes recommended in crawl appointed website
During vocabulary, one or more the popular vocabulary for 3C classes recommended in appointed website can be captured, and generate comprising one or
The vocabulary of multiple popular vocabulary;Further work as and specify crawl frequency (such as 1 or 2 hour) to grab specified net next time according to first
Recommend in standing for 3C classes popular vocabulary when, it is possible to use the more newly-generated vocabulary of the popular vocabulary that grabs next time,
So as on the one hand enrich the quantity of vocabulary in vocabulary, on the other hand can reduce dittograph in vocabulary and converge.
Based on the vocabulary of above-mentioned generation, step S102 captures one or more heat from the multiple UGC websites with regard to 3C classes
During the door each self-corresponding UGC data of vocabulary, the vocabulary for generating can be based on, crawl frequency is specified (such as one day or three according to second
It etc.) from the multiple UGC websites with regard to 3C classes capture vocabulary in each self-corresponding UGC data of popular vocabulary.
Further, UCC is also known as introduced UGC above, the word content of user's creation can be included, user shoots
Picture and user video, the audio frequency etc. recorded.Additionally, PGC (Professional Generated Content, specially
Industry produces content), it is the derivative concept of UGC, and the benefit of UGC is that user can freely upload content, enriches web site contents, but
Be disadvantageously content quality it is very different.Compared with UGC, PGC classification is more professional, and content quality is also more guaranteed,
Its curriculum offering and product edition are very professional.In fact, both UGC and PGC not contradiction, is not only mutually exclusive, Er Qiexu
Complement each other.The internet content of one maturation is to product, no matter website or community, video platform, audio platform, even
Media under neomorph, are required for depth and two aspects of range parallel.With reference to the characteristics of itself, UGC is responsible for content range, main
Contribute flow and participation, and PGC maintains content depth, main Branding, the creation of value, both are indispensable.Due to
PGC is the derivative concept of UGC, in embodiments of the present invention might as well using PGC as UGC a part.
The quality of the content provided due to UGC is very different, and the embodiment of the present invention is in order to increase the credible of 3C class UGC data
Degree, the multiple UGC websites with regard to 3C classes in step S102 can be filter out in multiple UGC websites from network with regard to
At least one high-quality UGC website of 3C classes.Further, filter out with regard to 3C classes in the multiple UGC websites from network
During at least one high-quality UGC website, a kind of optional scheme is embodiments provided, in this scenario, can be with collecting net
The multiple UGC websites with regard to 3C classes in network, and then weigh out the quality of multiple UGC websites according to one or more measurement factors
Situation, and at least one UGC websites of the specified quality requirements of mass satisfaction are therefrom screened as high-quality UGC website.Here
Weigh the factor can the such as confidence level of website, number of users, the visit capacity of website registered on website, the embodiment of the present invention is not
It is limited to this.
When the measurement factor includes multiple, when according to multiple measurement factors come the quality condition for weighing multiple UGC websites,
A kind of optional scheme is embodiments provided, in this scenario, multiple measurement factors can be determined based on Weight Algorithm
Respective weight, obtains the respective numerical value of multiple measurement factors of multiple UGC websites;Subsequently by multiple weighing apparatuses of multiple UGC websites
The respective numerical value of the amount factor is weighted summation with weight, obtains comprehensive numerical value, and then according to the respective synthesis in multiple UGC websites
Numerical value weighs out the quality condition of multiple UGC websites.
For example, multiple UGC websites are website 1, website 2, website 3, website 4 and website 5, and multiple factors of weighing are for website
Number of users, the visit capacity of website registered in confidence level, website, the respective numerical value of multiple measurement factors of website 1 is respectively
P11, p12, p13, the respective numerical value of multiple measurement factors of website 2 is respectively p21, p22, p23, multiple measurements of website 3 because
The respective numerical value of son is respectively p31, p32, p33, and the respective numerical value of multiple measurement factors of website 4 is respectively p41, p42, p43,
The respective numerical value of multiple measurement factors of website 5 is respectively p51, p52, p53.Determine that the respective weight of multiple measurement factors is
W1, w2, w3, by the respective numerical value of multiple measurement factors of multiple UGC websites and weight summation is weighted, and obtains multiple UGC
The comprehensive numerical value of website.Might as well be by taking website 1 and website 2 as an example, the comprehensive numerical value of website 1 is p11 × w1+p12 after weighted sum
× w2+p13 × w3, the comprehensive numerical value of website 2 is p21 × w1+p22 × w2+p23 × w3, and website 3, website 4 and website 5 are with this
Analogize, no longer repeat one by one herein.
In the alternative embodiment of the present invention, the popular vocabulary of one or more for obtaining is captured in step S102 and is each corresponded to
UGC data include it is a plurality of, then step S102 based on for 3C classes one or more popular vocabulary, from regard to many of 3C classes
After each self-corresponding UGC data of one or more popular vocabulary are captured in individual UGC websites, the embodiment of the present invention can also be to this
A little UGC data are ranked up, so as to realize optimizing the purpose of UGC data.Specifically, the embodiment of the present invention can obtain UGC numbers
According to attribute information, and then based on the attribute information for obtaining to each self-corresponding a plurality of UGC data of one or more popular vocabulary
It is ranked up, the UGC data after being sorted, so as on subsequent match during popular vocabulary, there is provided the sequence of this is popular vocabulary
UGC data afterwards.Here attribute information can include issuing time, user read number, user comment number, user reprint number,
Whether there is picture etc., embodiment of the present invention not limited to this.For example, the issuing time of UGC data can from front to back be arranged
Sequence, by issuing time UGC data sortings rearward front, by the forward UGC data sortings of issuing time rear.Again for example, can
It is ranked up with descending to the user comment number of UGC data, by the UGC data sortings more than user comment number front, will be used
Comment number few UGC data sortings in family are rear.Again for example, when being ranked up by multiple attribute informations, it may be determined that multiple
The respective weight of attribute information, obtains each self-corresponding numerical value of multiple attribute informations of UGC data, by multiple category of UGC data
Property each self-corresponding numerical value of information and weight be weighted summation, obtain comprehensive numerical value;And then according to the comprehensive numerical value pair for obtaining
UGC data are ranked up.
In the alternative embodiment of the present invention, the recommendation items referred in step S106 may be located at the right side of search results pages
Region, so as to when the target search word related to 3C classes is received, can be by the direct transparent transmission of premium content on these UGC websites
Position is recommended on right side to the corresponding search results pages of target search word, so as to using the advantage of UGC websites, improve search engine
Utilization rate.
In step s 106 by the right side area of the UGC data aggregates for obtaining to the corresponding search results pages of target search word
Recommendation items when, if the right side area of search results pages also include other recommending datas, then in order to the repetition for reducing data is pushed away
Recommend, then duplicate removal process can be carried out to the UGC data for obtaining according to other recommending datas, the UGC data after duplicate removal is processed are gathered
It is bonded to the recommendation items of the corresponding search results pages of target search word.
Further, in step s 106 by the UGC data aggregates for obtaining to the corresponding search results pages of target search word
Recommendation items when, the UGC data for obtaining can also be polymerized to target search word pair in the form of carousel figure and/or Text Link
The recommendation items of the search results pages answered, so that the displaying of recommending data becomes apparent from and intuitively.
Step S106 by the recommendation items of the UGC data aggregates for obtaining to the corresponding search results pages of target search word it
Afterwards, the embodiment of the present invention can be judging CTR (the Click To of UGC data according to (the such as 1 hour) cycle specified time
Rate, clicking rate), and processed accordingly according to judged result.Specifically, the embodiment of the present invention can be directed to counting user
The trigger action of the UGC data represented in search results pages, obtains statistics, and then is subsequently being searched according to statistics determination
Rope asks whether represent UGC data in the corresponding page.For example, if statistics is less than for the quantity of trigger action specifies threshold
Value, it is determined that no longer represent UGC data in the corresponding page of subsequent search request, new UGC data can be waited to update
After re-start and represent;If statistics is more than or equal to specified threshold for the quantity of trigger action, it is determined that in subsequent searches
Ask to represent UGC data in the corresponding page.On implementing, UGC data can be set or adjusted represents weight, if system
Meter result is less than specified threshold for the quantity of trigger action, then reduce UGC data represents weight so that in subsequent search request
No longer represent UGC data in the corresponding page;If statistics is more than or equal to specified threshold for the quantity of trigger action, increase
Big UGC data represent weight so that represent UGC data in the corresponding page of subsequent search request.It should be noted that this
The implementation that place is enumerated is only illustrative, and can also be realized by arranging the modes such as label in actual applications, belongs to
In protection scope of the present invention.
Various implementations of the links of the embodiment being described above shown in Fig. 1, are embodied as below by one
Example realizes process come the recommendation method that the 3C class UGC data based on search of the present invention are discussed in detail.
Fig. 2 shows the flow process of the recommendation method of the 3C class UGC data based on search according to another embodiment of the present invention
Figure.As shown in Fig. 2 the method at least may comprise steps of S202 to step S212.
Step S202, at least one high-quality UGC net with regard to 3C classes filtered out in the multiple UGC websites from network
Stand.
In this step, can with collection network in the multiple UGC websites with regard to 3C classes, and then according to one or more weighing apparatus
The amount factor weighs out the quality condition of multiple UGC websites, and therefrom screening mass meets at least one of specified quality requirements
UGC websites are used as high-quality UGC website.Here the measurement factor can be such as number of users, the net registered on the confidence level of website, website
Visit capacity stood etc., embodiment of the present invention not limited to this.When the measurement factor includes multiple, the side introduced above is may refer to
Weighing the quality condition of multiple UGC websites, here is omitted for case.Here, at least one high-quality UGC website for filtering out can
With websites such as such as interest clan, top news number, bubble net, mobile phone China, the online, Pacific Ocean computer nets in Zhong Guan-cun.
Step S204, according to first specify crawl frequency crawl appointed website in recommend for 3C classes one or more
Popular vocabulary, and generate the vocabulary comprising one or more popular vocabulary.
In this step, when according to first specify crawl frequency (such as 1 or 2 hour) grab next time in appointed website
Recommend for 3C classes popular vocabulary when, it is possible to use the more newly-generated vocabulary of the popular vocabulary that grabs next time, so as to
On the one hand the quantity of vocabulary in vocabulary is enriched, dittograph in vocabulary on the other hand can be reduced and be converged.Here appointed website
Can such as 360 hot lists, Baidu's roll of the hour website, embodiment of the present invention not limited to this.
Step S206, based on the vocabulary for generating, crawl frequency is specified from least one high-quality with regard to 3C classes according to second
The each self-corresponding UGC data of popular vocabulary in vocabulary are captured in UGC websites.
Step S208, obtains the attribute information of UGC data, and then popular to one or more based on the attribute information for obtaining
The each self-corresponding a plurality of UGC data of vocabulary are ranked up, the UGC data after being sorted, so as to the popular word on subsequent match
During remittance, there is provided the UGC data after the sequence of this is popular vocabulary.
In this step, attribute information can include issuing time, user read number, user comment number, user reprint number,
Whether there is picture etc., embodiment of the present invention not limited to this.For example, the issuing time of UGC data can from front to back be arranged
Sequence, by issuing time UGC data sortings rearward front, by the forward UGC data sortings of issuing time rear.Again for example, can
It is ranked up with descending to the user comment number of UGC data, by the UGC data sortings more than user comment number front, will be used
Comment number few UGC data sortings in family are rear.Again for example, when being ranked up by multiple attribute informations, it may be determined that multiple
The respective weight of attribute information, obtains each self-corresponding numerical value of multiple attribute informations of UGC data, by multiple category of UGC data
Property each self-corresponding numerical value of information and weight be weighted summation, obtain comprehensive numerical value;And then according to the comprehensive numerical value pair for obtaining
UGC data are ranked up.
Step S210, when the target search word related to 3C classes is received, by target search word and one or more heat
Door vocabulary is matched, and obtains the corresponding UGC data of popular vocabulary for matching.
Step S212, by the right side area of the UGC data aggregates for obtaining to the corresponding search results pages of target search word
Recommendation items.
In this step, in the right side region of the UGC data aggregates that will be obtained to the corresponding search results pages of target search word
During the recommendation items in domain, if the right side area of search results pages also includes other recommending datas, then in order to reduce the repetition of data
Recommend, then duplicate removal process can be carried out to the UGC data for obtaining according to other recommending datas, the UGC data after duplicate removal is processed
It is polymerized to the recommendation items of the corresponding search results pages of target search word.
Further, in the right side area of the UGC data aggregates that will be obtained to the corresponding search results pages of target search word
Recommendation items when, the UGC data for obtaining can also be polymerized to target search word pair in the form of carousel figure and/or Text Link
The recommendation items of the search results pages answered, so that the displaying of recommending data becomes apparent from and intuitively.
When user is input into " Samsung mobile phone " in search box, the Search Results obtained using the scheme of the embodiment of the present invention
As shown in figure 3, in figure 3, the right side area of search results pages shows the UGC data for having Samsung mobile phone to page, specifically includes carousel
Picture and word chain.
In the alternative embodiment of the present invention, after step s 212, can be with according to week specified time (such as 1 hour)
Phase judges the CTR of UGC data, and is processed accordingly according to judged result.Specifically, the embodiment of the present invention can count use
The trigger action of UGC data of the family for representing in search results pages, obtains statistics, and then is determined according to statistics
Whether represent UGC data in the corresponding page of subsequent search request.For example, refer to if statistics is less than for the quantity of trigger action
Determine threshold value, it is determined that no longer represent UGC data in the corresponding page of subsequent search request, new UGC data can be waited
Re-start after renewal and represent;If statistics is more than or equal to specified threshold for the quantity of trigger action, it is determined that follow-up
Represent UGC data in the corresponding page of searching request.On implementing, UGC data can be set or adjusted represents weight,
If statistics is less than specified threshold for the quantity of trigger action, reduce UGC data represents weight so that in subsequent searches
Ask no longer to represent UGC data in the corresponding page;If statistics is more than or equal to specified threshold for the quantity of trigger action,
Then increase UGC data represents weight so that represent UGC data in the corresponding page of subsequent search request.Need explanation
It is that implementation listed herewith is only illustrative, can also be realized by arranging the modes such as label in actual applications,
Belong to protection scope of the present invention.
It should be noted that in practical application, above-mentioned all optional embodiments can be with any group by the way of combining
Close, form the alternative embodiment of the present invention, this is no longer going to repeat them.
The recommendation method of the 3C class UGC data based on search provided based on each embodiment above, based on same invention
Design, the embodiment of the present invention additionally provides a kind of recommendation apparatus of the 3C class UGC data based on search.
Fig. 4 shows that the structure of the recommendation apparatus of the 3C class UGC data based on search according to an embodiment of the invention is shown
It is intended to.As shown in figure 4, the device can at least include UGC data capture modules 410, matching module 420 and recommending module
430。
Now introduce each composition or the work(of device of the recommendation apparatus of the 3C class UGC data based on search of the embodiment of the present invention
Annexation between energy and each several part:
UGC data capture modules 410, are suitable to based on one or more the popular vocabulary for 3C classes, from regard to 3C classes
The each self-corresponding UGC data of one or more of popular vocabulary are captured in multiple UGC websites;
Matching module 420, is coupled with UGC data capture modules 410, is suitable to search when receiving the target related to 3C classes
During rope word, the target search word is matched with one or more of popular vocabulary, obtained the popular vocabulary pair for matching
The UGC data answered;
Recommending module 430, is coupled with matching module 420, is suitable to the UGC data aggregates that will be obtained to the target search
The recommendation items of the corresponding search results pages of word.
In an embodiment of the present invention, as shown in figure 5, the device that figure 4 above shows can also include:
Vocabulary handling module 510, is coupled with UGC data capture modules 410, is suitable to capture the pin recommended in appointed website
One or more popular vocabulary to 3C classes.
In an embodiment of the present invention, the vocabulary handling module 510 is further adapted for:
One or more the popular vocabulary for 3C classes recommended in crawl appointed website, and generate comprising one or
The vocabulary of multiple popular vocabulary;
The popular vocabulary for 3C classes that frequency grabs recommendation in appointed website next time is captured when specifying according to first
When, using the vocabulary that the popular vocabulary for grabbing is more newly-generated next time.
In an embodiment of the present invention, the UGC data capture modules 410 are further adapted for:
Based on the vocabulary for generating, crawl frequency is specified to capture from the multiple UGC websites with regard to 3C classes according to second
The each self-corresponding UGC data of popular vocabulary in the vocabulary.
In an embodiment of the present invention, the multiple UGC websites with regard to 3C classes include:Multiple UGC nets from network
At least one high-quality UGC website with regard to 3C classes filtered out in standing.
In an embodiment of the present invention, as shown in figure 5, the device that figure 4 above shows can also include:
Screening module 520, is coupled with UGC data capture modules 410, be suitable in collection network with regard to the multiple of 3C classes
UGC websites;The quality condition of the plurality of UGC websites is weighed out according to one or more measurement factors, and therefrom screens pledge
Amount meets at least one UGC websites of specified quality requirements as high-quality UGC website.
In an embodiment of the present invention, the screening module 520 is further adapted for:
When the measurement factor includes multiple, the respective weight of the plurality of measurement factor is determined based on Weight Algorithm;
Obtain the respective numerical value of the plurality of measurement factor of the plurality of UGC websites;
The respective numerical value of the plurality of measurement factor of the plurality of UGC websites and weight are weighted into summation, are obtained
Comprehensive numerical value;
The quality condition of the plurality of UGC websites is weighed out according to the respective comprehensive numerical value in the plurality of UGC websites.
In an embodiment of the present invention, as shown in figure 5, the device that figure 4 above shows can also include:
Order module 530, is coupled with UGC data capture modules 410, matching module 420, is suitable in the UGC data
Handling module 410 captures described based on one or more the popular vocabulary for 3C classes, from the multiple UGC websites with regard to 3C classes
After each self-corresponding UGC data of vocabulary that one or more are popular, the attribute information of UGC data is obtained;
The each self-corresponding a plurality of UGC data of one or more of popular vocabulary are arranged based on the attribute information for obtaining
Sequence, the UGC data after being sorted, so as on subsequent match during popular vocabulary, there is provided the UGC after the sequence of this is popular vocabulary
Data.
In an embodiment of the present invention, the attribute information includes at least one following:Issuing time, user read number,
User comment number, user reprint number, whether there is picture.
In an embodiment of the present invention, the recommendation items are located at the right side area of the search results pages.
In an embodiment of the present invention, the recommending module 430 is further adapted for:
If the right side area of the search results pages includes other recommending datas, according to described other recommending datas to obtaining
To UGC data carry out duplicate removal process, the UGC data aggregates after duplicate removal is processed to the target search word it is corresponding search knot
The recommendation items of fruit page.
In an embodiment of the present invention, the recommending module 430 is further adapted for:
The UGC data for obtaining are polymerized in the form of carousel figure and/or Text Link corresponding to the target search word
The recommendation items of search results pages.
In an embodiment of the present invention, as shown in figure 5, the device that figure 4 above shows can also include:
Statistical module 540, is coupled with recommending module 430, is suitable to the UGC data for obtaining in the recommending module 430
It is polymerized to the recommendation items of the corresponding search results pages of the target search word, counting user is in the search results pages
The trigger action of the UGC data for representing, obtains statistics;
Determining module 550, is coupled with statistical module 540, is suitable to be determined according to the statistics and asks in subsequent searches
Ask and whether represent in the corresponding page UGC data.
In an embodiment of the present invention, the determining module 550 is further adapted for:
If the quantity that the statistics is the trigger action is less than specified threshold, it is determined that in subsequent search request pair
No longer represent the UGC data in the page answered.
According to the combination of above-mentioned any one alternative embodiment or multiple alternative embodiments, the embodiment of the present invention can reach
Following beneficial effect:
The embodiment of the present invention is based on one or more the popular vocabulary for 3C classes, from the multiple UGC websites with regard to 3C classes
The each self-corresponding UGC data of the one or more of popular vocabulary of middle crawl, so as to search receiving the target related to 3C classes
During rope word, premium content on these UGC websites is directly transparent to into the recommendation items of the corresponding search results pages of target search word, from
And the advantage of UGC websites is utilized, improve the utilization rate of search engine.Further, UGC data, will be each from each UGC website
Data in individual UGC websites are preposition to be represented in search results pages, goes to website to search phase by multi-pass operation without the need for user
Information is closed, the retrieval cost of user is reduced, the retrieval experience of user is lifted.
In specification mentioned herein, a large amount of details are illustrated.It is to be appreciated, however, that the enforcement of the present invention
Example can be put into practice in the case of without these details.In some instances, known method, structure is not been shown in detail
And technology, so as not to obscure the understanding of this description.
Similarly, it will be appreciated that in order to simplify the disclosure and help understand one or more in each inventive aspect, exist
Above in the description of the exemplary embodiment of the present invention, each feature of the present invention is grouped together into single enforcement sometimes
In example, figure or descriptions thereof.However, the method for the disclosure should be construed to reflect following intention:I.e. required guarantor
The more features of feature that the application claims ratio of shield is expressly recited in each claim.More precisely, such as following
Claims reflect as, inventive aspect is all features less than single embodiment disclosed above.Therefore,
Thus the claims for following specific embodiment are expressly incorporated in the specific embodiment, wherein each claim itself
All as the separate embodiments of the present invention.
Those skilled in the art are appreciated that can be carried out adaptively to the module in the equipment in embodiment
Change and they are arranged in one or more equipment different from the embodiment.Can be the module or list in embodiment
Unit or component are combined into a module or unit or component, and can be divided in addition multiple submodule or subelement or
Sub-component.In addition at least some in such feature and/or process or unit is excluded each other, can adopt any
Combine to all features disclosed in this specification (including adjoint claim, summary and accompanying drawing) and so disclosed
Where all processes or unit of method or equipment are combined.Unless expressly stated otherwise, this specification is (including adjoint power
Profit is required, summary and accompanying drawing) disclosed in each feature can it is identical by offers, be equal to or the alternative features of similar purpose carry out generation
Replace.
Although additionally, it will be appreciated by those of skill in the art that some embodiments described herein include other embodiments
In included some features rather than further feature, but the combination of the feature of different embodiments means in of the invention
Within the scope of and form different embodiments.For example, in detail in the claims, embodiment required for protection one of arbitrarily
Can in any combination mode using.
The present invention all parts embodiment can be realized with hardware, or with one or more processor operation
Software module realize, or with combinations thereof realization.It will be understood by those of skill in the art that can use in practice
Microprocessor or digital signal processor (DSP) are realizing the 3C class UGC data based on search according to embodiments of the present invention
Recommendation apparatus in some or all parts some or all functions.The present invention is also implemented as performing this
In described method some or all equipment or program of device (for example, computer program and computer program
Product).Such program for realizing the present invention can be stored on a computer-readable medium, either can be with one or many
The form of individual signal.Such signal can be downloaded from internet website and obtained, or be provided on carrier signal, or with
Any other form is provided.
It should be noted that above-described embodiment the present invention will be described rather than limits the invention, and ability
Field technique personnel can design without departing from the scope of the appended claims alternative embodiment.In the claims,
Any reference symbol between bracket should not be configured to limitations on claims.Word "comprising" is not excluded the presence of not
Element listed in the claims or step.Word "a" or "an" before element does not exclude the presence of multiple such
Element.The present invention can come real by means of the hardware for including some different elements and by means of properly programmed computer
It is existing.If in the unit claim for listing equipment for drying, several in these devices can be by same hardware branch
To embody.The use of word first, second, and third does not indicate that any order.These words can be explained and be run after fame
Claim.
So far, although those skilled in the art will appreciate that detailed herein illustrate and describe multiple showing for the present invention
Example property embodiment, but, without departing from the spirit and scope of the present invention, still can be direct according to present disclosure
It is determined that or deriving many other variations or modifications for meeting the principle of the invention.Therefore, the scope of the present invention is understood that and recognizes
It is set to and covers all these other variations or modifications.
The one side of the embodiment of the present invention, there is provided A1, a kind of recommendation method of the 3C class UGC data based on search, bag
Include:
Based on one or more the popular vocabulary for 3C classes, from the multiple UGC websites with regard to 3C classes described one is captured
The each self-corresponding UGC data of individual or multiple popular vocabulary;
When the target search word related to 3C classes is received, by the target search word and one or more of hot topics
Vocabulary is matched, and obtains the corresponding UGC data of popular vocabulary for matching;
By the recommendation items of the UGC data aggregates for obtaining to the corresponding search results pages of the target search word.
A2, the method according to A1, wherein, one or more the popular vocabulary for 3C classes are being based on, from regard to 3C
Before each self-corresponding UGC data of one or more of popular vocabulary are captured in multiple UGC websites of class, methods described is also wrapped
Include:
One or more the popular vocabulary for 3C classes recommended in crawl appointed website.
A3, the method according to A1 or A2, wherein, capture appointed website in recommend for 3C classes one or more
Popular vocabulary, including:
One or more the popular vocabulary for 3C classes recommended in crawl appointed website, and generate comprising one or
The vocabulary of multiple popular vocabulary;
The popular vocabulary for 3C classes that frequency grabs recommendation in appointed website next time is captured when specifying according to first
When, using the vocabulary that the popular vocabulary for grabbing is more newly-generated next time.
A4, the method according to any one of A1-A3, wherein, based on one or more the popular vocabulary for 3C classes,
The each self-corresponding UGC data of one or more of popular vocabulary are captured from the multiple UGC websites with regard to 3C classes, including:
Based on the vocabulary for generating, crawl frequency is specified to capture from the multiple UGC websites with regard to 3C classes according to second
The each self-corresponding UGC data of popular vocabulary in the vocabulary.
A5, the method according to any one of A1-A4, wherein, the multiple UGC websites with regard to 3C classes include:From
At least one high-quality UGC website with regard to 3C classes filtered out in multiple UGC websites in network.
A6, the method according to any one of A1-A5, wherein, filter out in the multiple UGC websites from network with regard to
At least one high-quality UGC website of 3C classes, including:
The multiple UGC websites with regard to 3C classes in collection network;
The quality condition of the plurality of UGC websites is weighed out according to one or more measurement factors, and therefrom screens pledge
Amount meets at least one UGC websites of specified quality requirements as high-quality UGC website.
A7, the method according to any one of A1-A6, wherein, when the measurement factor includes multiple, according to multiple measurements
The factor weighs out the quality condition of the plurality of UGC websites, including:
The respective weight of the plurality of measurement factor is determined based on Weight Algorithm;
Obtain the respective numerical value of the plurality of measurement factor of the plurality of UGC websites;
The respective numerical value of the plurality of measurement factor of the plurality of UGC websites and weight are weighted into summation, are obtained
Comprehensive numerical value;
The quality condition of the plurality of UGC websites is weighed out according to the respective comprehensive numerical value in the plurality of UGC websites.
A8, the method according to any one of A1-A7, wherein, one or more of popular vocabulary are each self-corresponding
UGC data are then being based on one or more the popular vocabulary for 3C classes, from the multiple UGC websites with regard to 3C classes including a plurality of
Middle to capture after each self-corresponding UGC data of one or more of popular vocabulary, methods described also includes:
Obtain the attribute information of UGC data;
The each self-corresponding a plurality of UGC data of one or more of popular vocabulary are arranged based on the attribute information for obtaining
Sequence, the UGC data after being sorted, so as on subsequent match during popular vocabulary, there is provided the UGC after the sequence of this is popular vocabulary
Data.
A9, the method according to any one of A1-A8, wherein, the attribute information includes at least one following:
Issuing time, user's reading number, user comment number, user reprint number, whether there is picture.
A10, the method according to any one of A1-A9, wherein, the recommendation items are located at the right side of the search results pages
Side region.
A11, the method according to any one of A1-A10, wherein, if the right side area of the search results pages is included
Other recommending datas, by the recommendation items of the UGC data aggregates for obtaining to the corresponding search results pages of the target search word, bag
Include:
Duplicate removal process is carried out to the UGC data for obtaining according to described other recommending datas, the UGC data after duplicate removal is processed
It is polymerized to the recommendation items of the corresponding search results pages of the target search word.
A12, the method according to any one of A1-A11, wherein, the UGC data aggregates for obtaining are searched to the target
The recommendation items of the corresponding search results pages of rope word, including:
The UGC data for obtaining are polymerized in the form of carousel figure and/or Text Link corresponding to the target search word
The recommendation items of search results pages.
A13, the method according to any one of A1-A12, wherein, in the UGC data aggregates that will be obtained to the target
After the recommendation items of the corresponding search results pages of search word, methods described also includes:
The trigger action of UGC data of the counting user for representing in the search results pages, obtains statistics;
Determine whether represent the UGC data in the corresponding page of subsequent search request according to the statistics.
A14, the method according to any one of A1-A13, wherein, determined in subsequent searches according to the statistics
Ask whether to represent the UGC data in the corresponding page, including:
If the quantity that the statistics is the trigger action is less than specified threshold, it is determined that in subsequent search request pair
No longer represent the UGC data in the page answered.
The another aspect of the embodiment of the present invention, additionally provides the recommendation dress of B15, a kind of 3C class UGC data based on search
Put, including:
UGC data capture modules, are suitable to based on one or more the popular vocabulary for 3C classes, from regard to the multiple of 3C classes
The each self-corresponding UGC data of one or more of popular vocabulary are captured in UGC websites;
Matching module, is suitable to when the target search word related to 3C classes is received, by the target search word with it is described
One or more are popular, and vocabulary is matched, and obtains the corresponding UGC data of popular vocabulary for matching;
Recommending module, is suitable to UGC data aggregates the pushing away to the corresponding search results pages of target search word that will be obtained
Recommend item.
B16, the device according to B15, wherein, also include:
Vocabulary handling module, is suitable to capture one or more the popular vocabulary for 3C classes recommended in appointed website.
B17, the device according to B15 or B16, wherein, the vocabulary handling module is further adapted for:
One or more the popular vocabulary for 3C classes recommended in crawl appointed website, and generate comprising one or
The vocabulary of multiple popular vocabulary;
The popular vocabulary for 3C classes that frequency grabs recommendation in appointed website next time is captured when specifying according to first
When, using the vocabulary that the popular vocabulary for grabbing is more newly-generated next time.
B18, the device according to any one of B15-B17, wherein, the UGC data capture modules are further adapted for:
Based on the vocabulary for generating, crawl frequency is specified to capture from the multiple UGC websites with regard to 3C classes according to second
The each self-corresponding UGC data of popular vocabulary in the vocabulary.
B19, the device according to any one of B15-B18, wherein, the multiple UGC websites with regard to 3C classes include:
At least one high-quality UGC website with regard to 3C classes filtered out in multiple UGC websites from network.
B20, the device according to any one of B15-B19, wherein, also include:
Screening module, the multiple UGC websites with regard to 3C classes being suitable in collection network;According to one or more measurement factors
The quality condition of the plurality of UGC websites is weighed out, and therefrom screens mass and meet at least one UGC for specifying quality requirements
Website is used as high-quality UGC website.
B21, the device according to any one of B15-B20, wherein, the screening module is further adapted for:
When the measurement factor includes multiple, the respective weight of the plurality of measurement factor is determined based on Weight Algorithm;
Obtain the respective numerical value of the plurality of measurement factor of the plurality of UGC websites;
The respective numerical value of the plurality of measurement factor of the plurality of UGC websites and weight are weighted into summation, are obtained
Comprehensive numerical value;
The quality condition of the plurality of UGC websites is weighed out according to the respective comprehensive numerical value in the plurality of UGC websites.
B22, the device according to any one of B15-B21, wherein, also include:
Order module, is suitable to be based on one or more the popular vocabulary for 3C classes in the UGC data capture modules, from
After with regard to capturing each self-corresponding UGC data of one or more of popular vocabulary in multiple UGC websites of 3C classes, obtain
The attribute information of UGC data;
The each self-corresponding a plurality of UGC data of one or more of popular vocabulary are arranged based on the attribute information for obtaining
Sequence, the UGC data after being sorted, so as on subsequent match during popular vocabulary, there is provided the UGC after the sequence of this is popular vocabulary
Data.
B23, the device according to any one of B15-B22, wherein, the attribute information includes at least one following:
Issuing time, user's reading number, user comment number, user reprint number, whether there is picture.
B24, the device according to any one of B15-B23, wherein, the recommendation items are located at the search results pages
Right side area.
B25, the device according to any one of B15-B24, wherein, the recommending module is further adapted for:
If the right side area of the search results pages includes other recommending datas, according to described other recommending datas to obtaining
To UGC data carry out duplicate removal process, the UGC data aggregates after duplicate removal is processed to the target search word it is corresponding search knot
The recommendation items of fruit page.
B26, the device according to any one of B15-B25, wherein, the recommending module is further adapted for:
The UGC data for obtaining are polymerized in the form of carousel figure and/or Text Link corresponding to the target search word
The recommendation items of search results pages.
B27, the device according to any one of B15-B26, wherein, also include:
Statistical module, is suitable in the recommending module that the UGC data aggregates for obtaining is corresponding to the target search word
After the recommendation items of search results pages, the trigger action of UGC data of the counting user for representing in the search results pages is obtained
To statistics;
Determining module, is suitable to determine whether represent institute in the corresponding page of subsequent search request according to the statistics
State UGC data.
B28, the device according to any one of B15-B27, wherein, the determining module is further adapted for:
If the quantity that the statistics is the trigger action is less than specified threshold, it is determined that in subsequent search request pair
No longer represent the UGC data in the page answered.
Claims (10)
1. it is a kind of based on search 3C class UGC data recommendation method, including:
Based on for 3C classes one or more popular vocabulary, capture from the multiple UGC websites with regard to 3C classes it is one or
The each self-corresponding UGC data of multiple popular vocabulary;
When the target search word related to 3C classes is received, by the target search word and one or more of popular vocabulary
Matched, obtained the corresponding UGC data of popular vocabulary for matching;
By the recommendation items of the UGC data aggregates for obtaining to the corresponding search results pages of the target search word.
2. method according to claim 1, wherein, based on one or more the popular vocabulary for 3C classes, from regard to
Before each self-corresponding UGC data of one or more of popular vocabulary are captured in multiple UGC websites of 3C classes, methods described is also
Including:
One or more the popular vocabulary for 3C classes recommended in crawl appointed website.
3. method according to claim 1 and 2, wherein, capture or many for 3C classes recommended in appointed website
Individual popular vocabulary, including:
One or more the popular vocabulary for 3C classes recommended in crawl appointed website, and generate comprising one or more of
The vocabulary of popular vocabulary;
When according to first specify crawl frequency grab next time recommend in appointed website for 3C classes popular vocabulary when, profit
With the vocabulary that the popular vocabulary for grabbing is more newly-generated next time.
4. the method according to any one of claim 1-3, wherein, based on one or more the popular words for 3C classes
Converge, each self-corresponding UGC data of one or more of popular vocabulary are captured from the multiple UGC websites with regard to 3C classes, including:
Based on the vocabulary for generating, crawl frequency is specified to capture from the multiple UGC websites with regard to 3C classes according to second described
The each self-corresponding UGC data of popular vocabulary in vocabulary.
5. the method according to any one of claim 1-4, wherein, the multiple UGC websites with regard to 3C classes include:From
At least one high-quality UGC website with regard to 3C classes filtered out in multiple UGC websites in network.
6. the method according to any one of claim 1-5, wherein, filter out pass in the multiple UGC websites from network
In at least one high-quality UGC website of 3C classes, including:
The multiple UGC websites with regard to 3C classes in collection network;
Weighing out the quality condition of the plurality of UGC websites according to one or more measurement factors, and therefrom screen mass expires
Toe determines at least one UGC websites of quality requirements as high-quality UGC website.
7. the method according to any one of claim 1-6, wherein, when weighing the factor and including multiple, according to multiple weighing apparatuses
The amount factor weighs out the quality condition of the plurality of UGC websites, including:
The respective weight of the plurality of measurement factor is determined based on Weight Algorithm;
Obtain the respective numerical value of the plurality of measurement factor of the plurality of UGC websites;
The respective numerical value of the plurality of measurement factor of the plurality of UGC websites and weight are weighted into summation, synthesis is obtained
Numerical value;
The quality condition of the plurality of UGC websites is weighed out according to the respective comprehensive numerical value in the plurality of UGC websites.
8. the method according to any one of claim 1-7, wherein, one or more of popular vocabulary are each self-corresponding
UGC data are then being based on one or more the popular vocabulary for 3C classes, from the multiple UGC websites with regard to 3C classes including a plurality of
Middle to capture after each self-corresponding UGC data of one or more of popular vocabulary, methods described also includes:
Obtain the attribute information of UGC data;
The each self-corresponding a plurality of UGC data of one or more of popular vocabulary are ranked up based on the attribute information for obtaining,
UGC data after being sorted, so as on subsequent match during popular vocabulary, there is provided the UGC numbers after the sequence of this is popular vocabulary
According to.
9. the method according to any one of claim 1-8, wherein, the attribute information includes at least one following:
Issuing time, user's reading number, user comment number, user reprint number, whether there is picture.
10. it is a kind of based on search 3C class UGC data recommendation apparatus, including:
UGC data capture modules, are suitable to based on one or more the popular vocabulary for 3C classes, from the multiple UGC with regard to 3C classes
The each self-corresponding UGC data of one or more of popular vocabulary are captured in website;
Matching module, is suitable to when the target search word related to 3C classes is received, by the target search word with it is one
Or multiple popular vocabulary are matched, the corresponding UGC data of popular vocabulary for matching are obtained;
Recommending module, is suitable to the recommendation items of UGC data aggregates to the corresponding search results pages of the target search word that will be obtained.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611213542.2A CN106649740A (en) | 2016-12-23 | 2016-12-23 | Method and device for recommending UGC (User Generated Content) data of computers, communication and consumer electronics based on search |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611213542.2A CN106649740A (en) | 2016-12-23 | 2016-12-23 | Method and device for recommending UGC (User Generated Content) data of computers, communication and consumer electronics based on search |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106649740A true CN106649740A (en) | 2017-05-10 |
Family
ID=58827797
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201611213542.2A Pending CN106649740A (en) | 2016-12-23 | 2016-12-23 | Method and device for recommending UGC (User Generated Content) data of computers, communication and consumer electronics based on search |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106649740A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111310018A (en) * | 2018-12-11 | 2020-06-19 | 阿里巴巴集团控股有限公司 | Determining method of timeliness search vocabulary and search engine |
CN111310017A (en) * | 2018-12-11 | 2020-06-19 | 阿里巴巴集团控股有限公司 | Method and device for generating timeliness scene content |
CN111309999A (en) * | 2018-12-11 | 2020-06-19 | 阿里巴巴集团控股有限公司 | Method and device for generating interactive scene content |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103605808A (en) * | 2013-12-10 | 2014-02-26 | 合一网络技术(北京)有限公司 | Search-based UGC (user generated content) recommendation method and search-based UGC recommendation system |
CN104008139A (en) * | 2014-05-08 | 2014-08-27 | 北京奇艺世纪科技有限公司 | Method and device for creating video index table and method and device for recommending video |
CN104239495A (en) * | 2014-09-09 | 2014-12-24 | 百度在线网络技术(北京)有限公司 | Search method and search device |
CN105354227A (en) * | 2015-09-30 | 2016-02-24 | 北京奇虎科技有限公司 | Search-based method and apparatus for providing high-quality comment information |
CN105740473A (en) * | 2016-03-14 | 2016-07-06 | 腾讯科技(深圳)有限公司 | User-generated content display method and device |
-
2016
- 2016-12-23 CN CN201611213542.2A patent/CN106649740A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103605808A (en) * | 2013-12-10 | 2014-02-26 | 合一网络技术(北京)有限公司 | Search-based UGC (user generated content) recommendation method and search-based UGC recommendation system |
CN104008139A (en) * | 2014-05-08 | 2014-08-27 | 北京奇艺世纪科技有限公司 | Method and device for creating video index table and method and device for recommending video |
CN104239495A (en) * | 2014-09-09 | 2014-12-24 | 百度在线网络技术(北京)有限公司 | Search method and search device |
CN105354227A (en) * | 2015-09-30 | 2016-02-24 | 北京奇虎科技有限公司 | Search-based method and apparatus for providing high-quality comment information |
CN105740473A (en) * | 2016-03-14 | 2016-07-06 | 腾讯科技(深圳)有限公司 | User-generated content display method and device |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111310018A (en) * | 2018-12-11 | 2020-06-19 | 阿里巴巴集团控股有限公司 | Determining method of timeliness search vocabulary and search engine |
CN111310017A (en) * | 2018-12-11 | 2020-06-19 | 阿里巴巴集团控股有限公司 | Method and device for generating timeliness scene content |
CN111309999A (en) * | 2018-12-11 | 2020-06-19 | 阿里巴巴集团控股有限公司 | Method and device for generating interactive scene content |
CN111310017B (en) * | 2018-12-11 | 2023-05-12 | 阿里巴巴集团控股有限公司 | Method and device for generating time-efficient scene content |
CN111309999B (en) * | 2018-12-11 | 2023-05-16 | 阿里巴巴集团控股有限公司 | Method and device for generating interactive scene content |
CN111310018B (en) * | 2018-12-11 | 2024-03-01 | 阿里巴巴集团控股有限公司 | Method for determining timeliness search vocabulary and search engine |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105701216B (en) | A kind of information-pushing method and device | |
CN103778548B (en) | Merchandise news and key word matching method, merchandise news put-on method and device | |
CN105956161B (en) | A kind of information recommendation method and device | |
CN105989074B (en) | A kind of method and apparatus recommend by mobile device information cold start-up | |
CN104965905B (en) | A kind of method and apparatus of Web page classifying | |
CN106708821A (en) | User personalized shopping behavior-based commodity recommendation method | |
CN107679211A (en) | Method and apparatus for pushed information | |
CN107229730A (en) | Data query method and device | |
CN106709777A (en) | Order clustering method and apparatus thereof, and anti-malicious information method and apparatus thereof | |
CN106777206A (en) | Movie and television play class keywords search for exhibiting method and device | |
CN102663064B (en) | A kind of disposal route of favorites data and device | |
CN104766224B (en) | A kind of shopping evaluation display method and system | |
CN106649740A (en) | Method and device for recommending UGC (User Generated Content) data of computers, communication and consumer electronics based on search | |
CN107783993A (en) | The storage method and device of data | |
CN106709073A (en) | Browser notification pushing method and browser terminal | |
CN109189990A (en) | A kind of generation method of search term, device and electronic equipment | |
CN106649738A (en) | Method and device for aggregating personage information message in search engine result page | |
CN109727047A (en) | A kind of method and apparatus, data recommendation method and the device of determining data correlation degree | |
CN108090807A (en) | Information recommendation method and device | |
CN109949172A (en) | Social account influence power evaluation method, device and storage medium | |
CN106919582A (en) | The association of network articles and related information statistical method and device | |
CN109446431A (en) | For the method, apparatus of information recommendation, medium and calculate equipment | |
CN112100221A (en) | Information recommendation method and device, recommendation server and storage medium | |
CN106844488A (en) | With reference to the stock class UGC data recommendation methods and device of search | |
CN108268357A (en) | real-time data processing method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20170510 |