WO2018113673A1 - Method and device for pushing search results for variety-show queries - Google Patents

Method and device for pushing search results for variety-show queries

Info

Publication number
WO2018113673A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
ugc
video
search result
search
Prior art date
Application number
PCT/CN2017/117220
Other languages
English (en)
French (fr)
Inventor
王艳丽
陈营营
马华蓉
佟思颖
高苏丹
Original Assignee
北京奇虎科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from CN201611209249.9A (CN106777206A)
Priority claimed from CN201611209280.2A (CN106649737B)
Priority claimed from CN201611209248.4A (CN106777205A)
Application filed by 北京奇虎科技有限公司
Publication of WO2018113673A1

Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 - Information retrieval; Database structures therefor; File system structures therefor

Definitions

  • The present invention relates to the field of Internet technologies, and in particular to a method and device for pushing search results for variety-show queries, a method and device for searching and displaying film and television drama keywords, and a search method and device for game search terms.
  • High-quality UGC websites have several content advantages, for example: (1) the data comes from individuals and is therefore independent; (2) forum-style ("post bar") websites resonate with users because many people can comment on the same post; (3) for the same search question, a high-quality UGC website can supplement the results with high-quality items and thus offers extended reading to a certain extent.
  • Current search engine products aggregate all search results for a variety-show query on the left side of the search result page and recommend related programs or stars to the user on the right side of the page in list and image-text form.
  • The recommended programs or stars are often unrelated to the variety-show query the user searched for, so the click-through rate is low and the recommendation area on the right cannot perform its recommendation function well.
  • In addition, users have to pick out the content they are interested in from the large number of search result items displayed on the left, which wastes time.
  • An object of the embodiments of the present invention is to provide a method and device for pushing search results for variety-show queries that overcome, or at least partially solve, the above problems.
  • A method for pushing search results for a variety-show query, comprising: capturing variety-show UGC search result items from designated UGC websites, wherein the designated UGC websites contain UGC search result items at big-data scale; selecting a plurality of variety-show keywords from the UGC content corresponding to the variety-show UGC search result items, and collecting these keywords to create a variety-show keyword vocabulary for variety-show queries; when a search request for any variety-show keyword in the vocabulary is received, obtaining high-quality UGC search result items matching that keyword from the variety-show UGC search result items; and pushing the matching high-quality UGC search result items to the search result page in their own display form, wherein the display form includes at least one of a picture display form that uses a screenshot of the variety-show video as the cover and a text display form that uses the title of the variety-show UGC content as a link.
  • A device for pushing search results for a variety-show query, comprising: a capture module, adapted to capture variety-show UGC search result items from designated UGC websites, wherein the designated UGC websites contain UGC search result items at big-data scale; a creation module, adapted to select a plurality of variety-show keywords from the UGC content corresponding to the variety-show UGC search result items and to collect these keywords to create a variety-show keyword vocabulary for variety-show queries; an obtaining module, adapted to obtain, when a search request for any variety-show keyword in the vocabulary is received, high-quality UGC search result items matching that keyword from the variety-show UGC search result items; and a first pushing module, adapted to push the matching high-quality UGC search result items to the search result page in their own display form.
  • The display form of a high-quality UGC search result item includes at least one of a picture display form that uses a screenshot of the variety-show video as the cover and a text display form that uses the title of the variety-show UGC content as a link.
  • In the embodiments of the present invention, variety-show UGC search result items are captured from designated UGC websites that contain a large number of UGC search result items, a plurality of variety-show keywords are selected from the corresponding UGC content, and these keywords are collected to create a variety-show keyword vocabulary for variety-show queries.
  • When a search request for any variety-show keyword in the vocabulary is received, the high-quality UGC search result items matching that keyword are obtained from the variety-show UGC search result items and pushed to the search result page in their own display form.
  • The search result page can therefore display high-quality UGC search result items for the variety-show query, such as videos edited by users themselves, guest spoilers, result spoilers, and celebrity gossip, which are the items users are most interested in. Users no longer have to look through a large number of search result items for the content that interests them, which raises the click-through rate on search result items and improves the user experience of the search engine.
  • A film and television drama keyword search and display method, including: determining N film and television drama keywords, where N is an integer greater than 1; obtaining, from one or more predetermined user-generated content (UGC) websites, information and/or videos related to each of the N film and television drama keywords; and obtaining the information and/or videos related to the respective film and television drama keywords.
  • A film and television drama keyword search and display device, including: a determining module, configured to determine N film and television drama keywords, where N is an integer greater than 1; an obtaining module, configured to obtain, from one or more predetermined user-generated content (UGC) websites, information and/or videos related to each of the N film and television drama keywords; a storage module, configured to store the obtained information and/or videos related to each film and television drama keyword in a film and television drama information content database; and a search module, configured to search the Internet for a target search term in response to the user entering that target search term in the search engine.
  • The search result page corresponding to the target search term is presented to the user.
  • In the embodiments of the present invention, information and/or videos related to film and television drama keywords are captured from UGC websites and stored in the film and television drama information content database.
  • When a target search term of the film and television drama type entered by a user in the search engine is received, the target search term is searched for on the Internet and, at the same time, information and/or videos matching the target search term are looked up in the film and television drama information content database.
  • The information and/or videos found in the database are aggregated into the search result page corresponding to the target search term and presented to the user.
  • The film and television drama information of the UGC websites can thus be aggregated in the search result page, providing the user with more comprehensive information and broadening the content coverage.
  • Because the film and television drama information content database draws on various UGC websites and forwards their data to the search result page for presentation, the user does not need to visit each website and perform multiple operations to find the relevant information, which reduces the user's search cost.
  • A search method for a game search term, including: determining whether a predetermined plurality of user-generated content (UGC) websites contain information items related to N preset game identifiers, where N is an integer greater than 1; according to the determination result, capturing data related to the N preset game identifiers from the one or more UGC websites that contain such information items; processing the captured data to obtain a UGC game information database, where each piece of data in the database includes at least keywords, information content, and attributes; in response to a target search term entered by the user in the search engine, determining whether the target search term is a game search term; if it is, searching the Internet for the target search term while also searching the UGC game information database for data matching the target search term; and aggregating the information content of the data matching the target search term
  • A search device for a game search term, including: a first determining module, configured to determine whether a predetermined plurality of user-generated content (UGC) websites contain information items related to N preset game identifiers; a capture module, configured to capture, according to the determination result, the data related to the N preset game identifiers from the one or more UGC websites that contain such information items;
  • a data module, configured to process the captured data related to the N preset game identifiers to obtain a UGC game information database, where each piece of data in the database includes at least: keywords
  • a response module, configured to determine, in response to the target search term entered by the user in the search engine, whether the target search term is a game search term; a search module, configured to search the UGC game information database for the target search term, while also searching the Internet for it, if the target search term is determined to be a game search term;
  • a presentation module configured to
  • In the embodiments of the present invention, information related to the preset game identifiers is captured from UGC websites and stored in the UGC game information database.
  • When a target search term of the game type entered by the user in the search engine is received, the target search term is searched for on the Internet and, at the same time, information matching the target search term is looked up in the UGC game information database.
  • The information found in the UGC game information database is aggregated into the search result page corresponding to the target search term and presented to the user. The game information of the UGC websites can thus be aggregated in the search result page, providing users with more comprehensive information and broadening the content coverage.
  • Because the UGC game information database draws on various UGC websites and forwards their data to the search result page for presentation, the user does not need to visit each website and perform multiple operations to find the relevant information, which reduces the user's search cost.
  • A computer program is also provided, comprising computer-readable code which, when run on a computing device, causes the computing device to perform the method for pushing search results for variety-show queries, the film and television drama keyword search and display method, or the search method for game search terms described above.
  • FIG. 1 is a schematic flow chart of a method for pushing search results for a variety-show query according to an embodiment of the present invention.
  • FIG. 2 is a schematic interface diagram of a search result page in a method for pushing search results for a variety-show query according to an embodiment of the present invention.
  • FIG. 3 is a schematic block diagram of a device for pushing search results for a variety-show query according to an embodiment of the present invention.
  • FIG. 4 is a schematic block diagram of a device for pushing search results for a variety-show query according to another embodiment of the present invention.
  • FIG. 5 is a flow chart of a film and television drama keyword search and display method according to an embodiment of the present invention.
  • FIG. 6 is a schematic diagram of a search result page in which film and television drama information and/or videos are aggregated according to another embodiment of the present invention.
  • FIG. 7 is a schematic structural diagram of a film and television drama keyword search and display device according to an embodiment of the present invention.
  • FIG. 8 is a schematic structural diagram of a film and television drama keyword search and display device according to another embodiment of the present invention.
  • FIG. 9 is a flow chart of a search method for a game search term according to an embodiment of the present invention.
  • FIG. 10 is a schematic diagram of a search result page in which game information is aggregated according to another embodiment of the present invention.
  • FIG. 11 is a structural block diagram of a search device for a game search term according to an embodiment of the present invention.
  • FIG. 12 is a structural block diagram of a search device for a game search term according to another embodiment of the present invention.
  • FIG. 13 is a block diagram schematically showing a computing device for performing the method for pushing search results for a variety-show query, the film and television drama keyword search and display method, or the search method for a game search term according to the present invention.
  • FIG. 14 schematically shows a storage unit for holding or carrying program code that implements the method for pushing search results for a variety-show query, the film and television drama keyword search and display method, or the search method for a game search term according to the present invention.
  • FIG. 1 is a schematic flow chart of a method for pushing search results for a variety-show query according to an embodiment of the present invention. As shown in FIG. 1, the method generally includes the following steps S101 to S104:
  • Step S101: capture variety-show UGC search result items from designated UGC websites, where the designated UGC websites contain UGC search result items at big-data scale.
  • Step S102: select a plurality of variety-show keywords from the UGC content corresponding to the variety-show UGC search result items, and collect these keywords to create a variety-show keyword vocabulary for variety-show queries.
  • Step S103: when a search request for any variety-show keyword in the vocabulary is received, obtain the high-quality UGC search result items matching that keyword from the variety-show UGC search result items.
  • Step S104: push the high-quality UGC search result items matching the variety-show keyword to the search result page in their own display form.
  • The display form of a high-quality UGC search result item includes at least one of a picture display form that uses a screenshot of the variety-show video as the cover and a text display form that uses the title of the variety-show UGC content as a link.
  • First, step S101 is executed: variety-show UGC search result items are captured from designated UGC websites, where the designated UGC websites contain UGC search result items at big-data scale.
  • The designated UGC websites may be the better-quality sites among the various UGC websites, such as "headlines" and "tribes" websites.
  • The variety-show UGC search result items can be captured in at least one of the following ways:
  • Method 1: obtain the information channel path of the variety-show category, and capture the variety-show UGC search result items from the designated UGC websites according to that path.
  • Method 2: extract the keywords in the title corresponding to each UGC search result item in the designated UGC websites; determine the type of each UGC search result item according to the keywords; and capture the variety-show UGC search result items according to the type of each item.
  • For example, if the title corresponding to a UGC search result item includes variety-show keywords, the item is determined to be a variety-show UGC search result item; if the title includes keywords such as "sports", "soccer", or "swimming competition", the item is determined to be a sports UGC search result item.
  • Method 3: capture, as variety-show UGC search result items, the UGC search result items corresponding to at least one of the following kinds of UGC content: entertainment gossip, variety-show information, variety-show comments.
  • When the third method is used, whether a UGC search result item is a variety-show UGC search result item is determined mainly from its UGC content.
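The title-keyword approach of Method 2 can be illustrated with a minimal Python sketch. The keyword table, the `title` field, and the function names are assumptions made for illustration only; the source names the sports examples but does not specify a data model or a matching rule.

```python
from typing import Dict, List, Optional

# Hypothetical keyword table. Only the sports keywords come from the text;
# the variety entries are illustrative assumptions.
TYPE_KEYWORDS: Dict[str, List[str]] = {
    "variety": ["variety show", "Happy Camp"],
    "sports": ["sports", "soccer", "swimming competition"],
}


def classify_title(title: str) -> Optional[str]:
    """Return the first type whose keyword occurs in the title, else None."""
    lowered = title.lower()
    for item_type, keywords in TYPE_KEYWORDS.items():
        if any(kw.lower() in lowered for kw in keywords):
            return item_type
    return None


def grab_variety_items(items: List[dict]) -> List[dict]:
    """Keep only the items whose title is classified as variety."""
    return [it for it in items if classify_title(it["title"]) == "variety"]
```

Plain substring matching is the simplest possible rule; a production system would more likely use word segmentation or a trained classifier, which the source does not describe.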
  • Next, step S102 is executed: a plurality of variety-show keywords are selected from the UGC content corresponding to the variety-show UGC search result items and collected to create the variety-show keyword vocabulary.
  • Examples of variety-show keywords include: entertainment, games, Happy Camp.
  • After the variety-show keyword vocabulary is created, step S103 is executed: when a search request for any variety-show keyword in the vocabulary is received, the high-quality UGC search result items matching that keyword are obtained from the variety-show UGC search result items.
  • Specifically, the UGC search result items that match the variety-show keyword and satisfy a preset condition are selected from the variety-show UGC search result items as the high-quality UGC search result items matching that keyword.
  • The preset condition includes at least one of the following: the item is a UGC video clip; the UGC content contains variety-show content that has not yet been broadcast; the UGC content contains information about people related to the variety-show keyword.
  • For example, a UGC video clip is a video clipped and uploaded by a user; UGC content containing variety-show content that has not yet been broadcast is, for instance, a spoiler for a variety show; UGC content containing information about people related to the variety-show keyword is, for instance, guest information for a variety show.
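The preset-condition filter of step S103 can be sketched as follows. The `UgcItem` record and its field names are hypothetical; the source lists the three conditions but not how they are detected or stored.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class UgcItem:
    """Hypothetical record for one variety-show UGC search result item."""
    title: str
    keywords: List[str]                # variety-show keywords the item relates to
    is_user_video_clip: bool = False   # condition 1: UGC video clip
    has_unaired_content: bool = False  # condition 2: content not yet broadcast
    people: List[str] = field(default_factory=list)  # condition 3: related people


def meets_preset_condition(item: UgcItem) -> bool:
    """True if at least one of the three preset conditions holds."""
    return item.is_user_video_clip or item.has_unaired_content or bool(item.people)


def high_quality_matches(keyword: str, items: List[UgcItem]) -> List[UgcItem]:
    """Items that match the keyword and satisfy a preset condition."""
    return [it for it in items
            if keyword in it.keywords and meets_preset_condition(it)]
```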
  • Finally, step S104 is executed: the high-quality UGC search result items matching the variety-show keyword are pushed to the search result page in their own display form.
  • The display form of a high-quality UGC search result item includes at least one of a picture display form that uses a screenshot of the variety-show video as the cover and a text display form that uses the title of the variety-show UGC content as a link.
  • In one embodiment, the high-quality UGC search result items matching the variety-show keyword are pushed, in their own display form, to the related recommendation area of the search result page.
  • When there are multiple high-quality UGC search result items matching the variety-show keyword, they may first be sorted according to preset sorting elements, which include at least one of the publication time, the number of comments, and the most recent comment time of each item; the sorted items are then pushed to the search result page in their own display form. Because users are usually more interested in the latest episodes of a variety show than in earlier ones, the items can also be sorted by the broadcast time of the corresponding variety-show episode: the more recent the broadcast time, the higher the corresponding high-quality UGC search result item is ranked.
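A minimal sketch of sorting by the preset sorting elements is shown below. The `RankedItem` record is hypothetical, and the priority order used here (publication time, then comment count, then most recent comment time, all descending) is an assumption; the source only states that at least one of these elements is used.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List


@dataclass
class RankedItem:
    """Hypothetical record holding the three sorting elements."""
    title: str
    published: datetime
    comment_count: int
    last_comment: datetime


def sort_items(items: List[RankedItem]) -> List[RankedItem]:
    """Newest first; ties broken by comment count, then latest comment time."""
    return sorted(
        items,
        key=lambda it: (it.published, it.comment_count, it.last_comment),
        reverse=True,
    )
```

Sorting by broadcast time of the corresponding episode would follow the same pattern with a broadcast-time field as the leading key.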
  • The display form of a high-quality UGC search result item includes at least one of a picture display form that uses a screenshot of the variety-show video as the cover and a text display form that uses the title of the variety-show UGC content as a link. Accordingly, a high-quality UGC search result item that is a video item is displayed in the picture display form with the video screenshot as its cover, and an item that is a text link item is displayed in the text display form. The two display forms are described below.
  • In the first case, the high-quality UGC search result item includes a video item.
  • The embodiment may display the item as follows: first, the cover picture corresponding to the video item is pushed to a first position of the search result page; second, a first identifier for playing the video corresponding to the video item is provided at a corresponding position on the cover picture; third, when a trigger operation on the first identifier is received, the video page corresponding to the video item is entered and the video is played.
  • When there are multiple video items, their cover pictures may be displayed as a carousel at the first position of the search result page.
  • The first position may be any specified position in the related recommendation area of the search result page, for example the topmost display position in that area.
  • While the cover pictures of the video items rotate in the carousel, the title of the video item corresponding to each cover picture may be displayed on or below the cover picture.
  • A "More" button can be added at a specified position in the related recommendation area. When the user triggers the "More" button, the current page jumps to a page containing more video items that match the variety-show keyword.
  • In the second case, the high-quality UGC search result item includes a text link item.
  • The embodiment may display the item as follows: first, the text link item is pushed to a second position of the search result page; second, when a trigger operation on the text link item is received, the UGC content corresponding to the text link item is entered and displayed.
  • The second position may be any specified position in the related recommendation area of the search result page, for example a position below the first position.
  • Placing the second position below the first position means the video items matching the variety-show keyword are presented to the user first, in picture form, which is more likely to arouse the user's interest in the high-quality UGC search result items recommended in the related recommendation area.
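The two display forms and their positions can be sketched as a small layout function. The dictionary keys (`kind`, `cover`, `title`, `url`) and the returned layout structure are illustrative assumptions, not part of the source.

```python
from typing import Dict, List


def build_recommendation_area(items: List[Dict[str, str]]) -> dict:
    """Place video covers at the first position and text links below them."""
    videos = [it for it in items if it["kind"] == "video"]
    texts = [it for it in items if it["kind"] == "text_link"]

    layout = {"first_position": None, "second_position": []}
    if videos:
        layout["first_position"] = {
            # Several video items are shown as a carousel of cover pictures.
            "mode": "carousel" if len(videos) > 1 else "single",
            "covers": [{"cover": v["cover"], "title": v["title"]} for v in videos],
        }
    # Text link items use the content title as the link text.
    layout["second_position"] = [
        {"title": t["title"], "url": t["url"]} for t in texts
    ]
    return layout
```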
  • In either case, when there are multiple high-quality UGC search result items, they may be sorted according to the preset sorting elements, which include at least one of the publication time, the number of comments, and the most recent comment time of each item.
  • In one embodiment, after the high-quality UGC search result items matching the variety-show keyword have been pushed to the search result page, the method further includes: counting the click rate of the high-quality UGC search result items; when the click rate is lower than a preset click rate, canceling the display of those items on the search result page and continuously monitoring the items whose display has been canceled; and, if the monitored items are updated, displaying the updated high-quality UGC search result items on the search result page again.
  • For example, suppose the preset click rate is set to 70% and the click rate of the high-quality UGC search result items matching the variety-show keyword "Happy Camp" is below 70%.
  • The display of the items matching "Happy Camp" is then canceled in the related recommendation area, and the original recommended content is shown to the user there instead, for example entertainment celebrities recommended in image-text form.
  • Meanwhile, the high-quality UGC search result items matching "Happy Camp" are continuously monitored for updates. When an update is detected (for example, a user uploads a new video clip of "Happy Camp"), the high-quality UGC search result items matching "Happy Camp" are displayed in the related recommendation area again.
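The cancel-and-redisplay behaviour driven by the click rate can be sketched as a small state holder. The class and method names are hypothetical, and computing the click rate as clicks divided by impressions is an assumption; the 70% threshold is the example value given in the text.

```python
PRESET_CLICK_RATE = 0.70  # example threshold from the text


class RecommendationSlot:
    """Tracks whether the items for one keyword are shown in the recommendation area."""

    def __init__(self, keyword: str) -> None:
        self.keyword = keyword
        self.displayed = True    # items are shown after the initial push
        self.monitoring = False  # True once display has been canceled

    def record_click_rate(self, clicks: int, impressions: int) -> float:
        """Cancel display and start monitoring if the rate falls below the threshold."""
        rate = clicks / impressions if impressions else 0.0
        if self.displayed and rate < PRESET_CLICK_RATE:
            self.displayed = False
            self.monitoring = True
        return rate

    def on_items_updated(self) -> None:
        """Show the updated items again if their display was previously canceled."""
        if self.monitoring:
            self.displayed = True
            self.monitoring = False
```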
  • FIG. 2 shows the interface of the search result page when the variety-show keyword "Happy Camp" is searched in an embodiment of the present invention.
  • The natural search result display area on the left side of the search result page displays the natural search results related to "Happy Camp".
  • The related recommendation area on the right side displays the high-quality UGC search result items matching "Happy Camp".
  • These items include spoilers for the variety show "Happy Camp", guest information, video clips edited by users, and so on.
  • The cover pictures of the video items matching "Happy Camp" are displayed as a carousel, and the title of the corresponding high-quality UGC search result item is displayed below each cover picture.
  • The user can click the cover picture or the title to start playback of the corresponding video item.
  • A "More" button is displayed at the top right of the video items; when the user clicks it, the current page jumps to a page that includes more video items.
  • With this embodiment, the related recommendation area on the right side of the search result page no longer merely recommends variety-show celebrities the user is not interested in. Instead, it shows high-quality UGC search result items that match the variety-show query, such as spoilers for variety shows, guest information, and user-edited video clips. The related recommendation area can thus recommend content that is relevant to the user's needs and of interest to the user, which improves the user experience of the search engine, greatly increases the click-through rate of the related recommendation area, and makes full use of its recommendation function.
  • The apparatus includes: a capture module 310, adapted to capture variety-show UGC search result items from designated UGC websites, where the designated UGC websites contain UGC search result items at big-data scale;
  • a creation module 320, coupled with the capture module 310 and adapted to select a plurality of variety-show keywords from the UGC content corresponding to the variety-show UGC search result items and to collect them to create a variety-show keyword vocabulary for variety-show queries;
  • an obtaining module 330, coupled with the creation module 320 and adapted, when a search request for any variety-show keyword in the vocabulary is received, to obtain the high-quality UGC search result items matching that keyword from the variety-show UGC search result items;
  • a first pushing module 340, coupled with the obtaining module 330 and adapted to push the high-quality UGC search result items matching the variety-show keyword to the search result page in their own display form.
  • The capture module 310 is further adapted to capture the variety-show UGC search result items in at least one of the following ways: obtaining the information channel path of the variety-show category and capturing the variety-show UGC search result items from the designated UGC websites according to that path; extracting the keywords in the title corresponding to each UGC search result item in the designated UGC websites, determining the type of each UGC search result item according to the keywords, and capturing the variety-show UGC search result items according to the type;
  • capturing, as variety-show UGC search result items, the UGC search result items corresponding to at least one of the following kinds of UGC content: entertainment gossip, variety-show information, variety-show comments.
  • The obtaining module 330 is further adapted to select, from the variety-show UGC search result items, the UGC search result items that match the variety-show keyword and satisfy the preset condition, as the high-quality UGC search result items matching that keyword.
  • The preset condition includes at least one of the following: the item is a UGC video clip; the UGC content contains variety-show content that has not yet been broadcast; the UGC content contains information about people related to the variety-show keyword.
  • The first pushing module 340 is further adapted to: when there are multiple high-quality UGC search result items, sort them according to preset sorting elements, which include at least one of the publication time, the number of comments, and the most recent comment time of each item; and push the sorted items to the search result page in their own display form.
  • When the high-quality UGC search result item includes a video item, the first pushing module 340 is further adapted to: push the cover picture corresponding to the video item to a first position of the search result page; provide, at a corresponding position on the cover picture, a first identifier for playing the video corresponding to the video item; and, when a trigger operation on the first identifier is received, enter the video page corresponding to the video item and play the video.
  • The first pushing module 340 is further adapted to: when there are multiple video items, display their cover pictures as a carousel at the first position of the search result page.
  • When the high-quality UGC search result item includes a text link item, the first pushing module 340 is further adapted to: push the text link item to a second position of the search result page;
  • and, when a trigger operation on the text link item is received, enter and display the UGC content corresponding to the text link item.
  • The apparatus further includes: a statistics module 350, coupled with the first pushing module 340 and adapted to count the click rate of the high-quality UGC search result items after the items matching the variety-show keyword have been pushed to the search result page; and a canceling module 360, coupled with the statistics module 350 and adapted to cancel the display of the high-quality UGC search result items on the search result page when their click rate is lower than the preset click rate.
  • The first pushing module 340 is further configured to: push the high-quality UGC search result item matching the variety-show keyword, in its own display form, to the related recommendation area of the search result page.
  • The apparatus for pushing search results for variety-show queries shown in FIG. 3 and FIG. 4 can be used to implement the pushing scheme for variety-show query search results described above. Its detailed description is similar to that given in the method section above and, to avoid repetition, is not repeated here.
  • An embodiment of the present invention further provides a method for aggregating film and television drama news information in a search result page. The method can be applied to a terminal device such as a personal computer, a smartphone, or a tablet computer.
  • FIG. 5 is a flow chart of a method for aggregating film and television drama news information in a search result page according to an embodiment of the present invention. As shown in FIG. 5, the method may include at least the following steps S502 to S510.
  • Step S502: determine N film and television drama keywords, where N is an integer greater than 1.
  • The N film and television drama keywords may be determined according to the click rate and/or the search rate of each keyword in a predetermined database.
  • For example, the N film and television drama keywords may consist of the entries that rank highest, by ranking or by click rate and/or search rate, on the 360 hot list and the film and television site. The value of N may be determined according to the specific application and is not limited in this embodiment.
  • Step S504: obtain, from one or more predetermined user generated content (UGC) websites, the news information and/or videos related to each of the N film and television drama keywords.
  • UGC: User Generated Content.
  • UCC: User Created Content.
  • PGC: Professionally Generated Content.
  • UGC has the advantage that users can freely upload content and enrich the content of the website, but the downside is that the quality of the content is mixed.
  • PGC classification is more professional, content quality is more guaranteed, and its content setting and product editing are very professional.
  • UGC and PGC are not contradictory; they not only exist side by side but also need to complement each other.
  • UGC is responsible for the breadth of content, mainly contributing to traffic and participation, while PGC maintains content depth, mainly establishing brands and creating value, both of which are indispensable. Since PGC is a derivative concept of UGC, PGC may be included as part of UGC in the embodiment of the present invention.
  • In this step, when capturing film and television drama news information from multiple UGC websites, the embodiment of the present invention may first screen at least one high-quality UGC website from the multiple UGC websites and then capture the film and television news information from the at least one high-quality UGC website.
  • When screening at least one high-quality UGC website from multiple UGC websites, the screening may be based on measurement factors. Specifically, one or more measurement factors are determined, the quality of the multiple UGC websites is measured according to the determined measurement factors, and at least one UGC website whose quality meets a specified quality condition is selected as a high-quality UGC website.
  • The measurement factors may include, for example, the credibility of the website, the number of users registered on the website, and the number of visits to the website.
  • When there are multiple measurement factors, the embodiment of the present invention provides an optional solution in which the respective weights of the measurement factors are determined based on a weighting policy and the respective values of the measurement factors are obtained for each UGC website; the values are then weighted by the corresponding weights and summed to obtain a comprehensive value for each website, and the quality of the UGC websites is measured according to these comprehensive values.
  • For example, the multiple UGC websites are Website 1, Website 2, Website 3, Website 4, and Website 5.
  • The multiple measurement factors are the credibility of the website, the number of users registered on the website, and the number of visits to the website. The respective values of the measurement factors are p11, p12, and p13 for Website 1; p21, p22, and p23 for Website 2; p31, p32, and p33 for Website 3; p41, p42, and p43 for Website 4; and p51, p52, and p53 for Website 5.
  • The weights of the measurement factors are determined as w1, w2, and w3, and the values of each website's measurement factors are weighted and summed to obtain the comprehensive value of each UGC website.
  • After weighted summation, the comprehensive value of Website 1 is p11×w1 + p12×w2 + p13×w3, and the comprehensive value of Website 2 is p21×w1 + p22×w2 + p23×w3.
  • The comprehensive values of Website 3, Website 4, and Website 5 are obtained by analogy and are not repeated here.
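  • The weighted summation described above can be illustrated with a minimal Python sketch. The function names, the example numbers, and the use of a simple score threshold as the "specified quality condition" are assumptions made for illustration only; the description does not fix any of them.

```python
from typing import Dict, List, Sequence


def comprehensive_value(values: Sequence[float], weights: Sequence[float]) -> float:
    """Weighted sum p_i1*w1 + p_i2*w2 + ... for one website."""
    if len(values) != len(weights):
        raise ValueError("each measurement factor needs exactly one weight")
    return sum(p * w for p, w in zip(values, weights))


def select_quality_sites(
    site_values: Dict[str, Sequence[float]],
    weights: Sequence[float],
    threshold: float,  # hypothetical quality condition: score >= threshold
) -> List[str]:
    """Return the websites whose comprehensive value meets the threshold."""
    return [
        name
        for name, values in site_values.items()
        if comprehensive_value(values, weights) >= threshold
    ]


if __name__ == "__main__":
    # Hypothetical factor values: credibility, registered users, visits
    # (assumed to be normalized to a common scale beforehand).
    sites = {"Website 1": [0.9, 0.5, 0.7], "Website 2": [0.2, 0.3, 0.1]}
    print(select_quality_sites(sites, [0.5, 0.25, 0.25], threshold=0.6))
```

  • In this sketch the factor values are assumed to be on comparable scales; raw counts such as registered users would need normalizing before weighting.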
  • For a UGC website of the professional information publishing platform class (for example, Toutiao accounts ("headline numbers"), or video websites such as iQiyi and Youku), the news information and/or videos related to each of the N film and television drama keywords may be obtained as follows.
  • For example, each preset word in the film and television drama preset vocabulary may be searched in the search box of Toutiao, iQiyi, or Youku, and the news information and/or videos related to each preset word captured according to release time.
  • Alternatively, the news information and videos of the film and television drama category are marked, and the news information and/or videos related to the N film and television drama keywords are extracted from the marked news information and videos. For example, the Toutiao accounts of the film and television drama category may be marked manually, data captured from those accounts, and the captured items then classified according to the names of persons included in the titles of the captured news information and/or videos.
  • For a UGC website of the network theme community class (for example, Interest Tribe or Douban), obtaining the news information and/or videos related to each of the N film and television drama keywords includes: determining, in the UGC website of the network theme community class, the theme communities related to each of the N film and television drama keywords; selecting the largest one or more theme communities from the related theme communities; searching for the N film and television drama keywords in the titles or bodies of the information published in the selected theme communities; and extracting, according to the search results, the news information and/or videos related to the N film and television drama keywords from the selected theme communities.
  • For example, for each preset word in the film and television drama preset vocabulary (for example, "Return of the Great Sage"), first locate the tribes devoted to the target film and television drama (for example, the "Return of the Great Sage" tribes), then select the largest tribe and capture the information whose title or article body contains the keyword (for example, "Return of the Great Sage").
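  • The community selection and keyword filtering described above might look like the following Python sketch. The `Post` and `Community` types are hypothetical, a community is treated as "related" when the keyword appears in its name, and "largest" is measured by follower count; all three are assumptions, since the description does not specify how relatedness or size is determined.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Post:
    title: str
    body: str


@dataclass
class Community:
    name: str
    followers: int  # assumed measure of community size
    posts: List[Post] = field(default_factory=list)


def posts_for_keyword(
    communities: List[Community], keyword: str, top_k: int = 1
) -> List[Post]:
    """Pick the top_k largest related communities, then keep matching posts."""
    # Assumption: a community is related if its name contains the keyword.
    related = [c for c in communities if keyword in c.name]
    largest = sorted(related, key=lambda c: c.followers, reverse=True)[:top_k]
    # Keep posts whose title or body contains the keyword.
    return [
        p
        for c in largest
        for p in c.posts
        if keyword in p.title or keyword in p.body
    ]
```

  • Raising `top_k` corresponds to selecting more than one of the largest theme communities.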
  • For a UGC website of the network question-and-answer community class, obtaining the news information and/or videos related to each of the N film and television drama keywords includes: obtaining, from the questions published on the UGC website of the network question-and-answer community class, the information whose question category is related to the film and television drama category; searching that information for items whose name and/or text includes one or more of the N film and television drama keywords; and extracting, from the search results, the news information and/or videos related to the N film and television drama keywords.
  • For example, it may first be determined whether the category in which a question is published is related to entertainment. If so, it is further determined whether the question and its answers include a preset word from the film and television drama preset vocabulary (for example, "Return of the Great Sage"); if so, the question and the answers are captured as information related to that preset word.
  • Step S506: store the obtained news information and/or videos related to each film and television drama keyword into the film and television drama information content database.
  • The obtained news information and/or videos may be classified by their related film and television drama keyword and stored by class in the film and television drama information content database, and the items stored in each class may be sorted according to the content attributes of each piece of news information and/or video. For example, the news information and/or videos related to "Return of the Great Sage" are stored together.
  • Storing by class organizes the content of the film and television drama information content database by film and television drama keyword, which facilitates subsequent searches.
  • After the captured news information and/or videos are classified according to their related film and television drama preset words, a structured film and television drama information content database is generated that holds the preset words together with the content attributes and content of the news information and/or videos. That is, the database may include three attribute columns: the film and television drama preset word, the content attribute of the news information and/or video, and the information content.
  • The content attribute of the news information and/or video may include multiple items, for example, the publishing time of the information and the number of comments on the information; the information content may include the title of the information and the link address of the information.
  • Table 1 is an example of the structure of the video content information database in the present embodiment.
  • The stored items may be optimized according to the content attributes of each piece of news information and/or video.
  • The content attributes of the news information and/or video may include the type of the content (for example, news information or video), the publishing time, the number of views, and/or the number of comments. That is, the film and television drama information content database may be sorted by the timeliness and/or popularity of the information to improve subsequent search efficiency.
  • The embodiment of the present invention also provides an optional solution in which a film and television drama keyword used for processing the captured film and television news information is determined, and the corresponding attribute content is then extracted from the captured film and television drama information based on the determined keyword.
  • The film and television drama keyword may be the name of a film and television drama, its director, its screenwriter, and so on; the embodiment of the present invention is not limited in this respect.
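  • As a rough illustration of the three attribute columns and of sorting by timeliness and popularity, the following Python sketch models one database row and a sort. The `DramaRecord` type, its field names, and the choice of "newest first, then most commented" as the ordering are all hypothetical; they are not the structure of Table 1.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List


@dataclass
class DramaRecord:
    # Column 1: the film and television drama preset word.
    preset_word: str
    # Column 2: content attributes of the news information or video.
    content_type: str  # "news" or "video"
    published: datetime
    comments: int
    # Column 3: the information content itself.
    title: str
    url: str


def sort_by_timeliness_and_heat(records: List[DramaRecord]) -> List[DramaRecord]:
    """Newest first; ties on publishing time are broken by comment count."""
    return sorted(records, key=lambda r: (r.published, r.comments), reverse=True)
```

  • Other orderings (for example, by number of views) would only change the sort key.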
  • Step S508: in response to a target search term input by the user on the search engine, search the Internet for results matching the target search term, and search the film and television drama information content database for news information and/or videos matching the target search term.
  • When the target search term input by the user is received, it may first be determined whether the target search term hits one or more of the N film and television drama keywords. If so, the Internet is searched for results matching the target search term and, at the same time, the film and television drama information content database is searched for news information and/or videos matching the target search term; otherwise, the search proceeds in the normal search mode, that is, only the Internet is searched for the target search term.
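  • The branching in step S508 can be sketched in Python as follows. The two search callables are hypothetical placeholders for the Internet search and the database lookup, and a "hit" is assumed to mean that the search term contains one of the keywords; the description does not define the matching rule.

```python
from typing import Callable, List, Sequence, Tuple


def search_with_aggregation(
    term: str,
    keywords: Sequence[str],
    web_search: Callable[[str], List[str]],  # hypothetical Internet search
    db_search: Callable[[str], List[str]],   # hypothetical database lookup
) -> Tuple[List[str], List[str]]:
    """Always search the Internet; search the database only on a keyword hit."""
    web_results = web_search(term)
    # Assumption: the term "hits" a keyword when it contains that keyword.
    hit = any(keyword in term for keyword in keywords)
    db_results = db_search(term) if hit else []
    return web_results, db_results
```

  • Returning the two result lists separately keeps them available for the later aggregation step.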
  • Step S510: where news information and/or videos matching the target search term are found in the film and television drama information content database, aggregate the found news information and/or videos into the search result page corresponding to the target search term and present them to the user.
  • Where a video matching the target search term is found in the film and television drama information content database, in step S510 the found video may be played on the search result page, and text links to the found news information and/or videos may be displayed on the search result page.
  • Where the search result page corresponding to the target search term also displays the results searched from the Internet, step S510 may include the following steps:
  • Step 1: display the results of searching the Internet for the target search term on the left side of the search result page;
  • Step 2: determine whether the found news information and/or videos include items identical to the results displayed on the left side of the search result page and, if so, remove the identical items from the found news information and/or videos;
  • Step 3: aggregate the found news information and/or videos remaining after the identical items are removed into the right area of the search result page corresponding to the target search term, and present them to the user.
  • The search result page includes two areas: a left area and a right area.
  • The left area displays the results obtained by the search engine when searching the Internet for the target search term, similar to the content displayed on the left side of the search result pages of search engines such as Baidu and Google.
  • The right area displays the results found in the film and television drama information content database, which expands the content on the right side of the search result page and provides the user with more complete search results.
  • The contents displayed on the left side and the right side do not overlap, which guarantees the uniqueness of each search result.
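  • The de-duplication between the left and right areas can be sketched in Python as below. Treating two results as "the same" when their URLs are equal is an assumption for illustration, as is the dictionary shape of a result item.

```python
from typing import Dict, List


def right_side_items(
    left_results: List[Dict[str, str]], db_results: List[Dict[str, str]]
) -> List[Dict[str, str]]:
    """Drop database results that already appear on the left side."""
    # Assumption: identical items are recognized by an identical "url".
    seen = {item["url"] for item in left_results}
    unique = []
    for item in db_results:
        if item["url"] not in seen:
            seen.add(item["url"])  # also avoids repeats within db_results
            unique.append(item)
    return unique
```

  • The order of the database results is preserved, so any earlier sorting by timeliness or popularity carries over to the right area.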
  • After the matched news information and/or videos are aggregated into the search result page corresponding to the target search term and presented to the user, the method further includes: counting the user's trigger operations on the matched news information and/or videos displayed on the search result page to obtain a statistical result; and determining, according to the statistical result, whether the matched news information and/or videos are displayed in the pages corresponding to subsequent search requests.
  • The user's trigger operations on the matched news information and/or videos displayed on the search result page may be measured as the CTR (click-through rate) of the matched news information and/or videos.
  • That is, according to the click rate of the news information and/or videos from the film and television drama information content database after they are displayed, it is determined whether they are displayed on the right side of the search result page when they are matched by later searches.
  • When determining, according to the statistical result, whether the matched news information and/or videos are displayed in the pages corresponding to subsequent search requests, it may be judged whether the statistical result shows that the number of trigger operations is less than a specified threshold; if so, it is determined that the matched news information and/or videos are no longer displayed in the pages corresponding to subsequent search requests.
  • The CTR of the news information and/or videos may be judged over a specified time period (such as 1 or 2 hours), and corresponding processing is performed according to the judgment result.
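  • A minimal Python sketch of the threshold check over a time window follows. Representing trigger operations as a list of click timestamps, and the parameter names, are assumptions made for illustration.

```python
from datetime import datetime, timedelta
from typing import List


def should_keep_showing(
    click_times: List[datetime],
    now: datetime,
    window: timedelta,  # e.g. timedelta(hours=1) or timedelta(hours=2)
    min_clicks: int,    # hypothetical specified threshold
) -> bool:
    """Keep an item only if it received enough clicks within the window."""
    recent = [t for t in click_times if now - window <= t <= now]
    return len(recent) >= min_clicks
```

  • An item for which this returns `False` would no longer be shown in pages for subsequent search requests.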
  • In the embodiments of the present invention, when a target search term related to the film and television drama category is received from the user on the search engine, it is first determined whether the target search term hits the film and television drama preset vocabulary. If so, the structured film and television drama information content database, which is composed of data captured from UGC websites, is searched for news information and/or videos matching the target search term, and the items found are aggregated into the search result page corresponding to the target search term and presented to the user.
  • In this way, the news information and/or videos of UGC websites can be aggregated in the search result page, providing the user with more comprehensive news information and/or videos and broadening content coverage.
  • Because the film and television drama information content database is structured by preset word and by the content attributes of the news information and/or videos, it is highly readable and helps the user quickly find the required information.
  • The content of the film and television drama information content database comes from various UGC websites, and the data from each UGC website is brought into the search result page for presentation, so the user does not need multiple operations to find related news information and/or videos, which reduces the user's search cost.
  • Based on the above method for aggregating film and television drama news information in a search result page, an embodiment of the present invention further provides an apparatus for aggregating film and television drama news information in a search result page.
  • FIG. 7 is a schematic structural diagram of a film and television drama keyword search and presentation apparatus according to an embodiment of the present invention.
  • The apparatus may include at least a determining module 710, an obtaining module 720, a storage module 730, a search module 740, and a presentation module 750.
  • The determining module 710 is configured to determine N film and television drama keywords, where N is an integer greater than 1; the obtaining module 720 is configured to obtain, from one or more predetermined user generated content (UGC) websites, the news information and/or videos related to each of the N film and television drama keywords; the storage module 730 is configured to store the obtained news information and/or videos related to each film and television drama keyword into the film and television drama information content database;
  • the search module 740 is configured to, in response to a target search term input by the user on the search engine, search the Internet for results matching the target search term, and search the film and television drama information content database for news information and/or videos matching the target search term;
  • the presentation module 750 is configured to, where news information and/or videos matching the target search term are found in the film and television drama information content database, aggregate the found news information and/or videos into the search result page corresponding to the target search term and present them to the user.
  • The obtaining module 720 is specifically configured to store the obtained news information and/or videos related to each film and television drama keyword into the film and television drama information content database in the following manner: the obtained items are classified by their related keyword and stored by class in the film and television drama information content database, and the items stored in each class are sorted according to the content attributes of each piece of news information and/or video.
  • The content attributes of each piece of news information and/or video may include the type of the content, the publishing time of the content, the number of comments on the content, and the number of views of the content. That is, the film and television drama information content database may be sorted by the timeliness and/or popularity of the information to improve search efficiency.
  • The presentation module 750 is specifically configured to aggregate the found news information and/or videos into the search result page corresponding to the target search term and present them to the user in the following manner: play the found video on the search result page, and display text links to the found news information and/or videos on the search result page.
  • The obtaining module 720 is specifically configured to obtain the news information and/or videos related to each of the N film and television drama keywords in the following manner: the news information and videos of the film and television drama category are marked, and the news information and/or videos related to the N film and television drama keywords are extracted from the marked news information and videos.
  • The obtaining module 720 is specifically configured to obtain the news information and/or videos related to each of the N film and television drama keywords in the following manner: determine, in the UGC website of the network theme community class, the theme communities related to each of the N film and television drama keywords; select the largest one or more theme communities from the related theme communities; search for the N film and television drama keywords in the titles or bodies of the information published in the selected theme communities; and extract, according to the search results, the news information and/or videos related to the N film and television drama keywords from the selected theme communities.
  • For example, for each preset word in the film and television drama preset vocabulary (for example, "Return of the Great Sage"), first locate the tribes devoted to the target film and television drama (for example, the "Return of the Great Sage" tribes), then select the largest tribe (for example, judged by its number of followers) and capture the information whose title or article body contains the keyword (for example, "Return of the Great Sage").
  • The obtaining module 720 is specifically configured to obtain the news information and/or videos related to each of the N film and television drama keywords in the following manner: obtain, from the questions published on the UGC website of the network question-and-answer community class, the information whose question category is related to the film and television drama category; search that information for items whose name and/or text includes one or more of the N film and television drama keywords; and extract, from the search results, the news information and/or videos related to the N film and television drama keywords.
  • For example, it may first be determined whether the category in which a question is published is related to entertainment. If so, it is further determined whether the question and its answers include a preset word from the film and television drama preset vocabulary; if so, the question and the answers are captured as information related to that preset word.
  • The presentation module 750 is specifically configured to aggregate the found news information and/or videos into the search result page corresponding to the target search term and present them to the user in the following manner: display the results of searching the Internet for the target search term on the left side of the search result page; determine whether the found news information and/or videos include items identical to the results displayed on the left side of the search result page and, if so, remove the identical items from the found news information and/or videos; and aggregate the found news information and/or videos remaining after the removal into the right area of the search result page corresponding to the target search term and present them to the user.
  • The apparatus may further include: a statistics module 760, configured to, after the found news information and/or videos are aggregated into the search result page corresponding to the target search term and presented to the user, count the user's trigger operations on each piece of found news information and/or video displayed on the search result page to obtain a statistical result; and a determining module 770, configured to determine, according to the statistical result, whether each piece of found news information and/or video is displayed in the pages corresponding to subsequent search requests.
  • The determining module 770 is specifically configured to determine whether to display each piece of found news information and/or video in the pages corresponding to subsequent search requests in the following manner: determine that news information and/or videos whose number of trigger operations is less than a specified threshold are no longer displayed in the pages corresponding to subsequent search requests.
  • An embodiment of the present invention further provides a method for aggregating game news information in a search result page. The method can be applied to a terminal device such as a personal computer, a smartphone, or a tablet computer.
  • FIG. 9 is a flow chart of a method for aggregating game news information in a search result page according to an embodiment of the present invention. As shown in FIG. 9, the method may include at least the following steps S902 to S912.
  • Step S902: determine whether information items related to N preset game identifiers exist in a predetermined plurality of user generated content (UGC) websites, where N is an integer greater than 1.
  • The N preset game identifiers may be the names of the N currently popular games.
  • The N preset game identifiers may be determined based on the click rate and/or the search rate of each keyword in a predetermined database.
  • For example, the N preset game identifiers may consist of the top N game names on the Baidu Billboard.
  • The value of N may be determined according to the specific application and is not limited in this embodiment.
  • UGC: User Generated Content.
  • UCC: User Created Content.
  • PGC: Professionally Generated Content.
  • UGC has the advantage that users can freely upload content and enrich the content of the website, but the downside is that the quality of the content is mixed.
  • PGC classification is more professional, content quality is more guaranteed, and its content setting and product editing are very professional.
  • UGC and PGC are not contradictory; they not only exist side by side but also need to complement each other.
  • UGC is responsible for the breadth of content, mainly contributing to traffic and participation, while PGC maintains content depth, mainly establishing brands and creating value, both of which are indispensable. Since PGC is a derivative concept of UGC, PGC may be included as part of UGC in the embodiment of the present invention.
  • To increase the credibility of the game information content, when capturing game news information from multiple UGC websites in this step, the embodiment of the present invention may first screen at least one high-quality UGC website from the multiple UGC websites and then capture the game information from the at least one high-quality UGC website.
  • When screening at least one high-quality UGC website from multiple UGC websites, the screening may be based on measurement factors. Specifically, one or more measurement factors are determined, the quality of the multiple UGC websites is measured according to the determined measurement factors, and at least one UGC website whose quality meets a specified quality condition is selected as a high-quality UGC website.
  • The measurement factors may include, for example, the credibility of the website, the number of users registered on the website, and the number of visits to the website.
  • When there are multiple measurement factors, the embodiment of the present invention provides an optional solution in which the respective weights of the measurement factors are determined based on a weighting policy and the respective values of the measurement factors are obtained for each UGC website; the values are then weighted by the corresponding weights and summed to obtain a comprehensive value for each website, and the quality of the UGC websites is measured according to these comprehensive values.
  • For example, the multiple UGC websites are Website 1, Website 2, Website 3, Website 4, and Website 5.
  • The multiple measurement factors are the credibility of the website, the number of users registered on the website, and the number of visits to the website. The respective values of the measurement factors are p11, p12, and p13 for Website 1; p21, p22, and p23 for Website 2; p31, p32, and p33 for Website 3; p41, p42, and p43 for Website 4; and p51, p52, and p53 for Website 5.
  • The weights of the measurement factors are determined as w1, w2, and w3, and the values of each website's measurement factors are weighted and summed to obtain the comprehensive value of each UGC website.
  • After weighted summation, the comprehensive value of Website 1 is p11×w1 + p12×w2 + p13×w3, and the comprehensive value of Website 2 is p21×w1 + p22×w2 + p23×w3.
  • The comprehensive values of Website 3, Website 4, and Website 5 are obtained by analogy and are not repeated here.
  • The N preset game identifiers may be used as keywords to search each high-quality UGC website. If the hit rate reaches a predetermined value, for example 80%, the corresponding UGC website is taken as a UGC website having information items related to the N preset game identifiers. That is, when a UGC website contains 80% of the N preset game identifiers, it is taken as a UGC website having information items related to the N preset game identifiers.
  • the predetermined value may be set according to actual needs, which is not limited in this embodiment.
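The hit-rate check can be sketched as below. The sketch assumes that the set of identifiers for which each website returned at least one result has already been collected; the mapping `site_hits` and both function names are hypothetical, and how a "hit" is detected on a real website is outside the scope of this illustration.

```python
from typing import Dict, Iterable, List, Set


def hit_rate(identifiers: Iterable[str], found: Set[str]) -> float:
    """Fraction of preset game identifiers that produced a hit on one website."""
    ids = list(identifiers)
    if not ids:
        return 0.0
    hits = sum(1 for ident in ids if ident in found)
    return hits / len(ids)


def sites_with_related_items(
    site_hits: Dict[str, Set[str]],
    identifiers: List[str],
    threshold: float = 0.8,  # the 80% example value from the text
) -> List[str]:
    """Keep the websites whose hit rate reaches the predetermined value."""
    return [
        site
        for site, found in site_hits.items()
        if hit_rate(identifiers, found) >= threshold
    ]
```

With five preset identifiers, a website that returns results for four of them reaches the 80% example threshold, while a website that returns results for only three does not.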
  • Step S904: according to the determination result, capture the data related to the N preset game identifiers from the one or more UGC websites that have information items related to the N preset game identifiers.
  • For a UGC website of the professional information publishing platform type (for example, Toutiao Hao, or a video website such as iQiyi or Youku), capturing the data related to the N preset game identifiers from the one or more UGC websites that have information items related to the N preset game identifiers includes:
  • searching for each preset game identifier in the search box of Toutiao Hao, iQiyi, or Youku, and capturing the information related to each game identifier in order of release time; or
  • marking the game-related information among the information published by the UGC website of the professional information publishing platform type, and capturing the information related to the N preset game identifiers from the marked game information. For example, the game-related Toutiao Hao accounts can be marked manually, data captured from those accounts, and the captured information then classified according to the names of the people included in its titles.
  • For a UGC website of the online topic community type (for example, Interest Tribe or Douban), capturing the data related to the N preset game identifiers from the one or more UGC websites that have information items related to the N preset game identifiers includes: for each of the N preset game identifiers, determining, in the UGC website of the online topic community type, the topic communities related to that preset game identifier; selecting M topic communities from them; and capturing from the M topic communities the information whose title or body contains the preset game identifier.
  • For example, information whose title or article body contains the keyword (for example, World of Warcraft) is captured.
  • For a UGC website of the online Q&A community type (for example, Zhihu), the information whose published question is categorized as game-related is obtained from the website as game information, and it is determined whether that game information includes one or more of the N preset game identifiers; if so, the information is captured as data related to the N preset game identifiers. For example, it can first be determined whether the category of a Zhihu question is related to games (for example, the question "What is the story background of World of Warcraft?"); if so, it is determined whether the question and its answers contain a game-class preset word from the game-class preset word list (for example, World of Warcraft); if they do, the question and answers are captured as information related to that game-class preset word.
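The two-stage check for Q&A communities (category first, then preset words in the question and answers) might look like the sketch below. The `QAItem` structure, the `"game"` category label, and the plain substring match are assumptions made for illustration; the source does not say how categories are represented or how matching is performed.

```python
from dataclasses import dataclass, field
from typing import List, Sequence, Tuple


@dataclass
class QAItem:
    category: str                      # category of the published question
    question: str
    answers: List[str] = field(default_factory=list)


def capture_game_qa(
    items: Sequence[QAItem],
    identifiers: Sequence[str],
    game_category: str = "game",       # hypothetical category label
) -> List[Tuple[QAItem, List[str]]]:
    """Return (item, matched identifiers) for game questions that mention a preset word."""
    captured = []
    for item in items:
        # Stage 1: skip questions that are not categorised as game-related.
        if item.category != game_category:
            continue
        # Stage 2: look for preset game identifiers in the question and its answers.
        text = " ".join([item.question, *item.answers])
        matched = [ident for ident in identifiers if ident in text]
        if matched:
            captured.append((item, matched))
    return captured
```

Returning the matched identifiers alongside each item makes it straightforward to file the captured question under the corresponding preset word later on.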
  • Step S906: process the captured data related to the N preset game identifiers to obtain a UGC game information database, where each piece of data in the UGC game information database includes at least a keyword, information content, and attributes.
  • Each captured piece of data may be stored, and the pieces sorted according to one or more of their information attributes, to obtain the UGC game information database.
  • The captured information may be classified according to the game-class preset word related to each piece of information, generating a structured UGC game information database organized by game-class preset word and information attribute. That is, the UGC game information database may include three attribute columns: the preset game identifier, the information attributes, and the information content.
  • The information attributes may include multiple items, for example the time at which the information was published and its number of comments, and the information content may include the title of the information and its link address.
  • Table 2 is an example of the structure of the UGC game information database in the present embodiment.
  • The information may be ranked according to the information attributes of each piece of information.
  • The information attributes may include the type of content (for example, article or video), posting time, number of views, and/or number of comments; that is, the UGC game information database can be sorted by the timeliness and/or popularity of the information to improve subsequent search efficiency.
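One possible shape for a database record, together with a timeliness-then-popularity ordering, is sketched below. The field names and the tie-breaking order (newest first, then comments, then views) are assumptions; the text only states that the records carry a keyword, information content, and attributes and that they may be sorted by timeliness and/or popularity.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List


@dataclass
class InfoRecord:
    game_id: str          # preset game identifier (keyword column)
    title: str            # information content: title
    url: str              # information content: link address
    content_type: str     # attribute: e.g. "article" or "video"
    published: datetime   # attribute: posting time
    views: int = 0        # attribute: number of views
    comments: int = 0     # attribute: number of comments


def sort_records(records: List[InfoRecord]) -> List[InfoRecord]:
    """Newest first; ties broken by comment count, then by view count."""
    return sorted(
        records,
        key=lambda r: (r.published, r.comments, r.views),
        reverse=True,
    )
```

Storing the records pre-sorted means a lookup for a given game identifier can return the top few entries without re-ranking at query time.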
  • An embodiment of the present invention provides an optional solution in which a preset game identifier used for processing the captured game information may be determined, and the corresponding attribute content is then extracted from the captured game information based on the determined preset game identifier.
  • The preset game identifier may be a game name, the name of a game dungeon (instance), or the like, and the embodiment of the present invention is not limited thereto.
  • The information content contained in the UGC game information database may include live video. Because live video is time-sensitive, and in order to prevent users from retrieving a live video that no longer exists, in an optional implementation of this embodiment, after the UGC game information database is obtained, the method further includes: for data items in the UGC game information database whose information content is live video, periodically detecting whether the live video has ended, and deleting the corresponding data item from the UGC game information database when the end of the live video is detected.
  • The detection period can be set according to actual conditions, for example, 1 hour or 2 hours.
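A single detection pass over the database could be sketched as follows. The dictionary keys `content_type` and `url`, the `"live_video"` label, and the `has_ended` callback are hypothetical; how the end of a broadcast is actually detected, and how the pass is scheduled every 1 or 2 hours, is left to the caller.

```python
from typing import Callable, Dict, List


def prune_ended_live(
    records: List[Dict[str, str]],
    has_ended: Callable[[str], bool],
) -> List[Dict[str, str]]:
    """Return the records that remain after dropping live videos that have ended."""
    kept = []
    for rec in records:
        # Only live-video items are probed; other content is always kept.
        if rec.get("content_type") == "live_video" and has_ended(rec["url"]):
            continue
        kept.append(rec)
    return kept
```

Running this function from a periodic job, and replacing the stored records with its return value, gives the behaviour described in the text.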
  • Step S908: in response to a target search term input by the user on the search engine, determine whether the target search term is a game-class search term. If so, execute step S910; otherwise, perform the search in the normal search mode, that is, search for the target search term only on the Internet.
  • Step S910: while searching the Internet for the target search term, search the UGC game information database for data matching the target search term.
  • Step S912: aggregate the information content of the data matching the target search term into the search result page corresponding to the target search term for presentation.
  • Where a live video matching the target search term is found in the UGC game information database in step S910, then in step S912 the found live video may be played on the search result page and text links of the found information displayed on the search result page, where the displayed picture of the live video can be synchronized with the live broadcast at a certain frequency.
  • The search result page corresponding to the target search term shows the results of the search from the Internet.
  • Step S912 may include the following steps:
  • Step 1: display the results of searching the Internet for the target search term on the left side of the search result page;
  • Step 2: determine whether the data in the UGC game information database matching the target search term contains any data identical to the results displayed on the left side of the search result page, and if so, remove the identical data;
  • Step 3: aggregate the information content of the data matching the target search term, after the identical data has been removed, into the right area of the search result page corresponding to the target search term for presentation.
  • The search result page includes two areas: a left area and a right area. The left area displays the results obtained by the search engine from searching the Internet for the target search term, for example the content displayed on the left side of the search result page of a search engine such as Baidu or Google. The right area displays the results found in the UGC game information database, so that the content of the right area of the search result page is expanded and users are provided with more complete search results.
  • The contents displayed on the left side and the right side do not overlap, which guarantees the uniqueness of the search results.
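Steps 1 to 3 can be sketched as below. The sketch assumes that two results are "the same data" when their link addresses are equal; the source does not define the comparison, so the `url` key and the function name `build_result_page` are illustrative only.

```python
from typing import Dict, List


def build_result_page(
    web_results: List[Dict[str, str]],
    ugc_results: List[Dict[str, str]],
) -> Dict[str, List[Dict[str, str]]]:
    """Lay out Internet results on the left and de-duplicated UGC results on the right."""
    # Step 1: the Internet results go to the left area unchanged.
    left = list(web_results)
    # Step 2: drop UGC entries already shown on the left (compared by URL here).
    seen = {r["url"] for r in left}
    # Step 3: the remaining UGC entries are aggregated into the right area.
    right = [r for r in ugc_results if r["url"] not in seen]
    return {"left": left, "right": right}
```

A production system would likely normalise URLs (scheme, trailing slashes, tracking parameters) before comparing them; that step is omitted here.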
  • After the information content of the data matching the target search term is aggregated into the search result page corresponding to the target search term and presented, the method further includes: counting the trigger operations performed by users on the information content of the data matching the target search term displayed on the search result page, to obtain a statistical result; and determining, according to the statistical result, whether the information content of the data matching the target search term is displayed in the pages corresponding to subsequent search requests.
  • The statistic on trigger operations may be the CTR (click-through rate) of the information content of the matched data; that is, whether a piece of information content is displayed is determined according to its click-through rate after it has been displayed from the UGC game information database.
  • It may be determined whether the statistical result is that the number of trigger operations is less than a specified threshold; if so, it is determined that the information content of the matched data is no longer displayed in the pages corresponding to subsequent search requests.
  • The CTR of the information content of the data may be evaluated at a specified interval (for example, every 1 or 2 hours), and corresponding processing performed according to the result.
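The display decision can be illustrated with the following sketch. It expresses the rule in terms of click-through rate (clicks divided by impressions); the text also allows a raw count of trigger operations to be compared with the threshold. The function names and the choice to treat zero impressions as a CTR of 0 are assumptions.

```python
def click_through_rate(clicks: int, impressions: int) -> float:
    """Clicks per impression; defined as 0.0 when the item was never shown."""
    return clicks / impressions if impressions else 0.0


def keep_showing(clicks: int, impressions: int, threshold: float) -> bool:
    """True if the item should still appear for subsequent search requests."""
    return click_through_rate(clicks, impressions) >= threshold
```

The counters would be evaluated at the end of each statistics window (for example every 1 or 2 hours) and reset when the database content is refreshed.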
  • It may be determined whether the information content of the data related to each game-class preset word in the game-class preset word list has been updated on the captured UGC websites; if so, the newly added information content is captured and updated into the UGC game information database.
  • After the update, the CTR of each piece of information content in the UGC game information database is cleared. That is, after the update, when the information content of a piece of data in the UGC game information database is hit, it is displayed on the search result page, the CTR of each piece of information content is counted again, and when the specified time period is reached it is determined whether the CTR of the information content is greater than the threshold, and thus whether the information content is presented in subsequent search results.
  • In the embodiment of the present invention, when a game-related target search term input by the user on the search engine is received, it is first determined whether the target search term hits a preset game identifier. If so, the structured UGC game information database built from data crawled from the UGC websites is searched for the information content of the data matching the target search term, and the found information content is aggregated into the search result page corresponding to the target search term and presented to the user. It can be seen that, in the technical solution provided by the embodiment of the present invention, the game-related information content of UGC websites can be aggregated in the search result page, so that users are provided with more comprehensive information content and content coverage is widened.
  • Because the UGC game information database is structured by game-class preset word and information attribute, it is readable and can help users quickly find the information they need. Further, the UGC game information database is drawn from various UGC websites, and the data of each UGC website is brought forward to the search result page for presentation, so users do not need to visit the websites and perform multiple operations to find the related information content, which reduces their search cost.
  • Based on the above method for aggregating game information in a search result page, an embodiment of the present invention further provides an apparatus for aggregating game information in a search result page.
  • FIG. 11 is a block diagram showing the structure of a search device for game-class search terms according to an embodiment of the present invention.
  • The apparatus may include at least a first determining module 1110, a crawling module 1120, a storage module 1130, a response module 1140, a search module 1150, and a presentation module 1160.
  • The first determining module 1110 is configured to determine whether information items related to N preset game identifiers exist in a predetermined plurality of user-generated content (UGC) websites; the crawling module 1120 is configured to capture, according to the determination result, data related to the N preset game identifiers from the one or more UGC websites that have information items related to the N preset game identifiers; the storage module 1130 is configured to process the captured data related to the N preset game identifiers to obtain a UGC game information database, where each piece of data in the UGC game information database includes at least a keyword, information content, and attributes; the response module 1140 is configured to, in response to a target search term input by the user on the search engine, determine whether the target search term is a game-class search term; the search module 1150 is configured to, if the target search term is a game-class search term, search the UGC game information database for data matching the target search term while searching the Internet for the target search term; and the presentation module 1160 is configured to aggregate the information content of the data matching the target search term into the search result page corresponding to the target search term for presentation.
  • The storage module 1130 is specifically configured to process the captured data related to the N preset game identifiers in the following manner to obtain the UGC game information database: storing each captured piece of data, and sorting the pieces according to one or more of their information attributes.
  • The information attributes of each piece of information may include the publishing time of the content, the number of comments on the content, and so on; that is, the UGC game information database may be sorted by the timeliness and/or popularity of the information to improve search efficiency.
  • The apparatus may further include an updating module 1170, configured to: for data items in the UGC game information database whose information content is live video, periodically detect whether the live video has ended, and delete the corresponding data item from the UGC game information database when the end of the live video is detected.
  • For a UGC website of the professional information publishing platform type, the crawling module 1120 is specifically configured to capture the data related to the N preset game identifiers from the one or more UGC websites that have information items related to the N preset game identifiers in the following manner:
  • marking the game-related information among the information published by the UGC website of the professional information publishing platform type, and capturing the information related to the N preset game identifiers from the marked game information.
  • For a UGC website of the online topic community type, the crawling module 1120 is specifically configured to capture the data related to the N preset game identifiers from the one or more UGC websites that have information items related to the N preset game identifiers in the following manner:
  • for each preset game identifier, determining the topic communities related to that identifier in the UGC website of the online topic community type, selecting M topic communities from the topic communities related to the preset game identifier, and capturing from the M topic communities the information whose title or body contains the preset game identifier.
  • For example, in Interest Tribe, for each game-class preset word in the game-class preset word list (for example, World of Warcraft), first find how many tribes the target game has (for example, Warcraft tribes), then select the largest tribes to crawl (for example, by number of followers), capturing the information whose title or article body contains the keyword (for example, Warcraft).
  • For a UGC website of the online Q&A community type, the crawling module 1120 is specifically configured to capture the data related to the N preset game identifiers from the one or more UGC websites that have information items related to the N preset game identifiers in the following manner: obtaining, from the UGC website of the online Q&A community type, the information whose published question is categorized as game-related, as game information; and determining whether that game information includes one or more of the N preset game identifiers, and if so, capturing the information as data related to the N preset game identifiers.
  • For example, it is first determined whether the category of the published question is related to entertainment; if so, it is determined whether the question and its answers contain a game-class preset word from the game-class preset word list; if they do, the question and answers are captured as information related to that game-class preset word.
  • The presentation module 1160 is specifically configured to aggregate the information content of the data matching the target search term into the search result page corresponding to the target search term in the following manner: displaying, on the left side of the search result page, the results of searching the Internet for the target search term; determining whether the data in the UGC game information database matching the target search term contains any data identical to the results displayed on the left side of the search result page, and if so, removing the identical data; and aggregating the information content of the data matching the target search term, after the identical data has been removed, into the right area of the search result page corresponding to the target search term for presentation.
  • The search result page includes two areas: a left area and a right area. The left area displays the results obtained by the search engine from searching the Internet for the target search term, for example the content displayed on the left side of the search result page of a search engine such as Baidu or Google. The right area displays the results found in the UGC game information database, so that the content of the right area of the search result page is expanded and users are provided with more complete search results.
  • The apparatus may further include: a statistics module 1180, configured to, after the information content of the data matching the target search term has been aggregated into the search result page corresponding to the target search term and presented to the user, count the trigger operations performed by users on the information content of the data matching the target search term displayed on the search result page, to obtain a statistical result; and a second determining module 1190, configured to determine, according to the statistical result, whether the information content of the data matching the target search term is displayed in the pages corresponding to subsequent search requests.
  • The statistic on trigger operations performed by users on the matched information displayed on the search result page may be the CTR (click-through rate) of the matched information; that is, whether a piece of information is displayed on the right side of the search page in later searches is determined according to its click-through rate after it has been displayed from the UGC game information database.
  • The second determining module 1190 is specifically configured to determine, in the following manner, whether the information content of the data matching the target search term is displayed in the pages corresponding to subsequent search requests: if the statistical result is that the number of trigger operations is less than the specified threshold, it is determined that the information content of the data matching the target search term is no longer displayed in the pages corresponding to subsequent search requests.
  • The modules in the devices of the embodiments can be adaptively changed and placed in one or more devices different from those of the embodiment.
  • The modules or units or components of the embodiments may be combined into one module or unit or component, and they may further be divided into a plurality of sub-modules or sub-units or sub-components.
  • The features disclosed in this specification (including the accompanying claims, the abstract and the drawings), and all processes or units of any method or device so disclosed, may be combined in any combination.
  • Each feature disclosed in this specification (including the accompanying claims, the abstract and the drawings) may be replaced by alternative features that serve the same, equivalent or similar purpose.
  • The various component embodiments of the present invention may be implemented in hardware, in software modules running on one or more processors, or in a combination thereof. It should be understood by those skilled in the art that a microprocessor or a digital signal processor (DSP) can be used in practice to implement some or all of the functions of some or all of the components in the device for pushing search results for variety-show queries, the film and television drama keyword search and presentation device, or the game-class search term search device according to embodiments of the present invention.
  • The invention may also be embodied as a device or apparatus program (for example, a computer program and a computer program product) for performing some or all of the methods described herein. Such a program implementing the invention may be stored on a computer-readable medium or may be in the form of one or more signals. Such signals may be downloaded from an Internet website, provided on a carrier signal, or provided in any other form.
  • FIG. 13 shows a computing device that can implement the method for pushing search results for variety-show queries, the method for searching and presenting film and television drama keywords, or the search method for game-class search terms according to the present invention.
  • The computing device conventionally includes a processor 1310 and a computer program product or computer-readable medium in the form of a memory 1320.
  • The memory 1320 may be an electronic memory such as a flash memory, an EEPROM (Electrically Erasable Programmable Read-Only Memory), an EPROM, a hard disk, or a ROM.
  • The memory 1320 has a storage space 1330 that stores program code 1331 for performing any of the method steps described above.
  • For example, the storage space 1330 may store respective program codes 1331 for implementing the various steps of the above methods.
  • The program code can be read from or written to one or more computer program products.
  • These computer program products include program code carriers such as hard disks, compact disks (CDs), memory cards or floppy disks.
  • Such a computer program product is typically a portable or fixed storage unit such as that shown in FIG. 14.
  • The storage unit may have storage segments, storage space, and the like arranged similarly to the memory 1320 in the computing device of FIG. 13.
  • The program code may, for example, be compressed in an appropriate form.
  • Typically, the storage unit stores computer-readable code 1331' for performing the method steps of the present invention, that is, code that can be read by a processor such as 1310 and that, when run by a computing device, causes the computing device to perform the various steps of the methods described above.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

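A method and device for pushing search results for variety-show queries, a method and device for searching and presenting film and television drama keywords, and a search method and device for game-class search terms. The method for pushing search results for variety-show queries includes: capturing variety-show UGC search result items from designated UGC websites, the designated UGC websites containing UGC search result items at big-data scale (S101); filtering out multiple variety-show keywords from the variety-show UGC content corresponding to the variety-show UGC search result items, and collecting the multiple variety-show keywords to create a variety-show keyword vocabulary for variety-show queries (S102); when a search request for any variety-show keyword in the variety-show keyword vocabulary is received, obtaining, from the variety-show UGC search result items, high-quality UGC search result items matching the variety-show keyword (S103); and pushing the high-quality UGC search result items matching the variety-show keyword to the search result page in their own display forms (S104). The method can recommend high-quality UGC search result items for variety-show queries to users, increasing the click-through rate of search result items and improving users' experience of the search engine.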

Description

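Method and device for pushing search results for variety-show queries
Technical Field
The present invention relates to the field of Internet technologies, and in particular to a method and device for pushing search results for variety-show queries, a method and device for searching and presenting film and television drama keywords, and a search method and device for game-class search terms.
Background
Modern networks contain a large amount of user-contributed content, such as forum posts, Toutiao Hao accounts, and Interest Tribe posts, which may be collectively referred to as high-quality UGC websites. High-quality UGC websites have many advantages in terms of content, for example: (1) the data comes from individuals and is independent; (2) Tieba-style websites allow many people to comment and can therefore resonate with users; (3) for the same search question, high-quality UGC websites can supplement high-quality search results, which extends reading to some degree; and so on.
It follows that high-quality UGC websites contain a great deal of high-quality information. However, existing search engine products cannot fully mine this information and bring it into the relevant search results. That is, when users want to search for certain high-quality information content, they still have to look through a massive number of search result items, which wastes users' time and degrades their experience of the search engine.
This is especially true for variety-show queries, where most users look at data on the network such as videos clipped by other users, spoilers about guests, spoilers about results, and celebrity gossip. Current search engine products blend all search results for a variety-show query together and present them on the left side of the search result page, while recommending related programs or celebrities to the user on the right side of the search result page in the form of ranking lists and image-text recommendations. However, the related programs or celebrities recommended in the right-hand recommendation area often have little to do with the variety-show query the user searched for, so their click-through rate is low and the right-hand recommendation area cannot play its recommending role well. In addition, users must find the content they are interested in among the large number of search result items shown on the left, which wastes a great deal of time.
Moreover, there is as yet no effective solution to the problem of how to provide users with search results that include film and television drama information from UGC websites.
Furthermore, there is as yet no effective solution to the problem of how to provide users with search results that include game information from UGC websites.
Summary of the Invention
In view of the defects in the prior art, an object of embodiments of the present invention is to provide a method and device for pushing search results for variety-show queries, a method for searching and presenting film and television drama keywords and a corresponding device, and a search method for game-class search terms and a corresponding device, which overcome the above problems or at least partially solve them.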
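According to one aspect of the present invention, a method for pushing search results for variety-show queries is provided, including: capturing variety-show UGC search result items from designated UGC websites, the designated UGC websites containing UGC search result items at big-data scale; filtering out multiple variety-show keywords from the variety-show UGC content corresponding to the variety-show UGC search result items, and collecting the multiple variety-show keywords to create a variety-show keyword vocabulary for variety-show queries; when a search request for any variety-show keyword in the variety-show keyword vocabulary is received, obtaining, from the variety-show UGC search result items, high-quality UGC search result items matching the variety-show keyword; and pushing the high-quality UGC search result items matching the variety-show keyword to the search result page in their own display forms, where the display forms of the high-quality UGC search result items include at least one of a picture display form with a variety-show video screenshot as the cover and a text display form with the title of the variety-show UGC content as the link.
According to another aspect of the present invention, a device for pushing search results for variety-show queries is provided, including: a capturing module, adapted to capture variety-show UGC search result items from designated UGC websites, the designated UGC websites containing UGC search result items at big-data scale; a creating module, adapted to filter out multiple variety-show keywords from the variety-show UGC content corresponding to the variety-show UGC search result items, and to collect the multiple variety-show keywords to create a variety-show keyword vocabulary for variety-show queries; an obtaining module, adapted to obtain, when a search request for any variety-show keyword in the variety-show keyword vocabulary is received, high-quality UGC search result items matching the variety-show keyword from the variety-show UGC search result items; and a first pushing module, adapted to push the high-quality UGC search result items matching the variety-show keyword to the search result page in their own display forms, where the display forms of the high-quality UGC search result items include at least one of a picture display form with a variety-show video screenshot as the cover and a text display form with the title of the variety-show UGC content as the link.
With the technical solution provided by the embodiments of the present invention, variety-show UGC search result items are captured from designated UGC websites containing a large number of UGC search result items, multiple variety-show keywords are filtered out from the variety-show UGC content corresponding to those items, and the keywords are collected to create a variety-show keyword vocabulary for variety-show queries. When a search request for any variety-show keyword in the vocabulary is received, high-quality UGC search result items matching the keyword are obtained from the variety-show UGC search result items and pushed to the search result page in their own display forms, so that the search result page can present users with high-quality UGC search result items for variety-show queries, such as videos clipped by users, spoilers about guests, spoilers about results, and celebrity gossip. These are precisely the items most likely to interest users. The technical solution can therefore show high-quality UGC search result items for variety-show queries to users directly, without requiring them to find the content they are interested in among a large number of search result items, which increases the click-through rate of search result items and improves users' experience of the search engine.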
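According to one aspect of the present invention, a method for searching and presenting film and television drama keywords is provided, including: determining N film and television drama keywords, where N is an integer greater than 1; obtaining, from one or more predetermined user-generated content (UGC) websites, information items and/or videos related to each of the N film and television drama keywords; storing the obtained information items and/or videos related to each film and television drama keyword in a film and television drama information content database; in response to a target search term input by a user on a search engine, searching the Internet for results matching the target search term, and searching the film and television drama information content database for information items and/or videos matching the target search term; and, where information items and/or videos matching the target search term are found in the film and television drama information content database, aggregating the found information items and/or videos into the search result page corresponding to the target search term for presentation to the user.
According to another aspect of the present invention, a device for searching and presenting film and television drama keywords is provided, including: a determining module, configured to determine N film and television drama keywords, where N is an integer greater than 1; an obtaining module, configured to obtain, from one or more predetermined user-generated content (UGC) websites, information items and/or videos related to each of the N film and television drama keywords; a storage module, configured to store the obtained information items and/or videos related to each film and television drama keyword in a film and television drama information content database; a search module, configured to, in response to a target search term input by a user on a search engine, search the Internet for results matching the target search term and search the film and television drama information content database for information items and/or videos matching the target search term; and a presentation module, configured to, where information items and/or videos matching the target search term are found in the film and television drama information content database, aggregate the found information items and/or videos into the search result page corresponding to the target search term for presentation to the user.
In the embodiments of the present invention, information items and/or videos related to film and television drama keywords are first captured from UGC websites and stored in the film and television drama information content database. When a target search term related to film and television dramas and input by a user on the search engine is received, the database is searched for information items and/or videos matching the target search term while the Internet is searched for the target search term, and the information items and/or videos found in the database are aggregated into the search result page corresponding to the target search term for presentation to the user. It can be seen that, in the technical solution provided by the embodiments of the present invention, film and television drama information from UGC websites can be aggregated in the search result page, so that users are provided with more comprehensive information and content coverage is widened. Further, the film and television drama information content database is drawn from various UGC websites, and the data of each UGC website is brought forward to the search result page for presentation, so users do not need to visit the websites and perform multiple operations to find the related information, which reduces their search cost.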
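According to one aspect of the present invention, a search method for game-class search terms is provided, including: determining whether information items related to N preset game identifiers exist in a predetermined plurality of user-generated content (UGC) websites, where N is an integer greater than 1; according to the determination result, capturing data related to the N preset game identifiers from the one or more UGC websites that have information items related to the N preset game identifiers; processing the captured data related to the N preset game identifiers to obtain a UGC game information database, where each piece of data in the UGC game information database includes at least a keyword, information content, and attributes; in response to a target search term input by a user on a search engine, determining whether the target search term is a game-class search term; where the target search term is determined to be a game-class search term, searching the UGC game information database for data matching the target search term while searching the Internet for the target search term; and aggregating the information content of the data matching the target search term into the search result page corresponding to the target search term for presentation.
According to another aspect of the present invention, a search device for game-class search terms is provided, including: a first determining module, configured to determine whether information items related to N preset game identifiers exist in a predetermined plurality of user-generated content (UGC) websites; a crawling module, configured to capture, according to the determination result, data related to the N preset game identifiers from the one or more UGC websites that have information items related to the N preset game identifiers; a storage module, configured to process the captured data related to the N preset game identifiers to obtain a UGC game information database, where each piece of data in the UGC game information database includes at least a keyword, information content, and attributes; a response module, configured to, in response to a target search term input by a user on a search engine, determine whether the target search term is a game-class search term; a search module, configured to, where the target search term is determined to be a game-class search term, search the UGC game information database for data matching the target search term while searching the Internet for the target search term; and a presentation module, configured to aggregate the information content of the data matching the target search term into the search result page corresponding to the target search term for presentation.
In the embodiments of the present invention, information related to preset game identifiers is first captured from UGC websites and stored in the UGC game information database. When a game-related target search term input by a user on the search engine is received, the UGC game information database is searched for information matching the target search term while the Internet is searched for the target search term, and the information found in the UGC game information database is aggregated into the search result page corresponding to the target search term for presentation to the user. It can be seen that, in the technical solution provided by the embodiments of the present invention, game information from UGC websites can be aggregated in the search result page, so that users are provided with more comprehensive information and content coverage is widened. Further, the UGC game information database is drawn from various UGC websites, and the data of each UGC website is brought forward to the search result page for presentation, so users do not need to visit the websites and perform multiple operations to find the related information, which reduces their search cost.
According to yet another aspect of the present invention, a computer program is also provided, comprising computer-readable code which, when run on a computing device, causes the computing device to execute the method for pushing search results for variety-show queries, the method for searching and presenting film and television drama keywords, or the search method for game-class search terms described above.
According to still another aspect of the present invention, a computer-readable medium is also provided, in which the computer program described above is stored.
The above description is only an overview of the technical solution of the present invention. In order that the technical means of the present invention may be understood more clearly and implemented in accordance with the contents of the specification, and in order that the above and other objects, features, and advantages of the present invention may be more readily apparent, specific embodiments of the present invention are set forth below.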
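Brief Description of the Drawings
Various other advantages and benefits will become clear to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings are only for the purpose of illustrating the preferred embodiments and are not to be considered limiting of the invention. Throughout the drawings, the same reference symbols denote the same components. In the drawings:
FIG. 1 is a schematic flowchart of a method for pushing search results for variety-show queries according to an embodiment of the present invention;
FIG. 2 is a schematic interface diagram of a search result page in a method for pushing search results for variety-show queries according to a specific embodiment of the present invention;
FIG. 3 is a schematic block diagram of a device for pushing search results for variety-show queries according to an embodiment of the present invention;
FIG. 4 is a schematic block diagram of a device for pushing search results for variety-show queries according to another embodiment of the present invention;
FIG. 5 shows a flowchart of a method for searching and presenting film and television drama keywords according to an embodiment of the present invention;
FIG. 6 shows a schematic diagram of a search result page aggregating film and television drama information and/or videos according to another embodiment of the present invention;
FIG. 7 shows a schematic structural diagram of a device for searching and presenting film and television drama keywords according to an embodiment of the present invention;
FIG. 8 shows a schematic structural diagram of a device for searching and presenting film and television drama keywords according to another embodiment of the present invention;
FIG. 9 shows a flowchart of a search method for game-class search terms according to an embodiment of the present invention;
FIG. 10 shows a schematic diagram of a search result page aggregating game information according to another embodiment of the present invention;
FIG. 11 shows a schematic structural diagram of a search device for game-class search terms according to an embodiment of the present invention;
FIG. 12 shows a schematic structural diagram of a search device for game-class search terms according to another embodiment of the present invention;
FIG. 13 schematically shows a block diagram of a computing device for executing the method for pushing search results for variety-show queries, the method for searching and presenting film and television drama keywords, or the search method for game-class search terms according to the present invention; and
FIG. 14 schematically shows a storage unit for holding or carrying program code implementing the method for pushing search results for variety-show queries, the method for searching and presenting film and television drama keywords, or the search method for game-class search terms according to the present invention.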
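Detailed Description
The present invention is further described below with reference to the drawings and specific embodiments.
FIG. 1 is a schematic flowchart of a method for pushing search results for variety-show queries according to an embodiment of the present invention. As shown in FIG. 1, the method may generally include the following steps S101 to S104:
Step S101: capture variety-show UGC search result items from designated UGC websites, the designated UGC websites containing UGC search result items at big-data scale.
Step S102: filter out multiple variety-show keywords from the variety-show UGC content corresponding to the variety-show UGC search result items, and collect the multiple variety-show keywords to create a variety-show keyword vocabulary for variety-show queries.
Step S103: when a search request for any variety-show keyword in the variety-show keyword vocabulary is received, obtain, from the variety-show UGC search result items, high-quality UGC search result items matching the variety-show keyword.
Step S104: push the high-quality UGC search result items matching the variety-show keyword to the search result page in their own display forms, where the display forms of the high-quality UGC search result items include at least one of a picture display form with a variety-show video screenshot as the cover and a text display form with the title of the variety-show UGC content as the link.
Steps S101 to S104 are described in detail below.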
首先执行步骤S101,即从指定UGC网站中抓取综艺类UGC搜索结果项,指定UGC网站中包括大数据级别的UGC搜索结果项。其中,指定UGC网站可以是各类UGC网站中较为优质的一些网站,例如头条、兴趣部落、知乎等网站。
具体地,可通过以下至少一种方式来抓取综艺类UGC搜索结果项:
方式一、获取综艺类的资讯频道路径,并根据资讯频道路径从指定UGC网站中抓取综艺类UGC搜索结果项。
方式二、提取指定UGC网站中的各UGC搜索结果项对应的标题中的关键词;根据关键词确定各UGC搜索结果项的类型;根据各UGC搜索结果项的类型从UGC搜索结果项中抓取综艺类UGC搜索结果项。
例如,若UGC搜索结果项对应的标题中包含关键词“娱乐”、“游戏”等,则可确定该UGC搜索结果项为综艺类UGC搜索结果项;若UGC搜索结果项对应的标题中包含关键词“体育”、“足球”、“游泳比赛”等,则可确定该UGC搜索结果项为体育类UGC搜索结果项。
方式三、提取包含以下至少一项UGC内容对应的UGC搜索结果项为综艺类UGC搜索结果项:娱乐八卦、综艺资讯、综艺节目点评。采用方式三抓取综艺类UGC搜索结果项时,主要根据UGC内容来确定UGC搜索结果项是否为综艺类UGC搜索结果项。
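As a non-limiting illustration, the title-keyword approach of Way 2 can be sketched in Python as follows. The keyword lists reuse the examples given above; the category names, the function names and the dictionary layout of an item are assumptions introduced only for this sketch, and a real implementation could use any other classifier.

```python
# Hypothetical keyword lists, taken from the examples in the text above.
CATEGORY_KEYWORDS = {
    "variety": ["娱乐", "游戏"],
    "sports": ["体育", "足球", "游泳比赛"],
}


def classify_title(title):
    """Return the first category whose keyword occurs in the title, else None."""
    for category, keywords in CATEGORY_KEYWORDS.items():
        if any(keyword in title for keyword in keywords):
            return category
    return None


def filter_variety_items(items):
    """Keep only the UGC items whose title is classified as variety."""
    return [item for item in items if classify_title(item["title"]) == "variety"]
```

Plain substring matching is used here only because it is the simplest rule consistent with the example; the text does not prescribe how keywords are extracted from titles.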
从指定UGC网站中抓取综艺类UGC搜索结果项后,继续执行步骤S102,即从综艺类UGC搜索结果项对应的综艺类UGC内容中筛选出多个综艺类关键词,并集合多个综艺类关键词,以创建用于综艺类query的综艺类关键词词表。其中,综艺类关键词例如:娱乐、游戏、快乐大本营等。
创建综艺类关键词词表之后,继续执行步骤S103,即当接收到针对综艺类关键词词表中的任一综艺类关键词的搜索请求时,从综艺类UGC搜索结果项中获取与综艺类关键词相匹配的优质UGC搜索结果项。具体地,从综艺类UGC搜索结果项中抓取与综艺类关键词相匹配的、且符合预设条件的综艺类UGC搜索结果项,作为与综艺类关键词相匹配的优质UGC搜索结果项。
其中,预设条件包括UGC视频片段、UGC内容中包含尚未播出的综艺节目内容、UGC内容中包含与综艺类关键词相关的人物资讯中的至少一项。UGC视频片段例如任意用户自己剪辑并上传的一段视频;UGC内容中包含尚未播出的综艺节目内容例如对某一综艺节目的剧透;UGC内容中包含与综艺类关键词相关的人物资讯例如综艺类节目中的嘉宾信息等。
获取到与综艺类关键词相匹配的优质UGC搜索结果项后,继续执行步骤S104,即将与综艺类关键词相匹配的优质UGC搜索结果项按照其自身显示形式推送至搜索结果页,其中,优质UGC搜索结果项的自身显示形式包括以综艺类视频截图为封面的图片显示形式、以综艺类UGC内容的标题为链接的文字显示形式中的至少一种。在一个实施例中,将与综艺类关键词相匹配的优质UGC搜索结果项按照其自身显示形式推送至搜索结果页的相关推荐区域。
在一个实施例中,当与综艺类关键词相匹配的优质UGC搜索结果项包括多个时,可首先按照预设排序元素对多个优质UGC搜索结果项进行排序,该预设排序元素包括各优质UGC搜索结果项的发布时间、评论数、最近一次的评论时间中的至少一项;然后将排序后的多个优质UGC搜索结果项按照其自身显示形式推送至搜索结果页。由于用户通常对最新一期或最新几期的综艺类节目更加感兴趣,而对往期的综艺类节目兴趣较小,因此在排序时,还可按照各优质UGC搜索结果项对应的综艺节目的播出时间来排序,播出时间越近,则对应的优质UGC搜索结果项排序越靠前。
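The ordering described above can be sketched as follows. The text only requires that at least one of publication time, number of comments and time of the latest comment be used; the priority order chosen here (publication time first, then comments, then latest comment) and the field names are assumptions made for illustration.

```python
from datetime import datetime


def sort_items(items):
    """Sort newest-first by publish time, then comment count, then last comment time."""
    return sorted(
        items,
        key=lambda it: (it["published"], it["comments"], it["last_comment"]),
        reverse=True,
    )


if __name__ == "__main__":
    demo = [
        {"id": "a", "published": datetime(2016, 12, 1), "comments": 5,
         "last_comment": datetime(2016, 12, 2)},
        {"id": "b", "published": datetime(2016, 12, 20), "comments": 1,
         "last_comment": datetime(2016, 12, 20)},
    ]
    print([it["id"] for it in sort_items(demo)])
```

Sorting by the broadcast date of the corresponding show, as also mentioned above, would only require swapping the first element of the key for that date.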
如上所说,优质UGC搜索结果项的自身显示形式包括以综艺类视频截图为封面的图片显示形式、以综艺类UGC内容的标题为链接的文字显示形式中的至少一种。因此在显示优质UGC搜索结果项时,若优质UGC搜索结果项为视频项,则以综艺类视频截图为封面的图片显示形式进行显示;若优质UGC搜索结果项为文字链接项时,则以文字显示形式进行显示。以下分别对这两种显示形式进行说明。
在一个实施例中,优质UGC搜索结果项包括视频项。该实施例可按照如下方式显示优质UGC搜索结果项:首先,将视频项对应的封面图片推送至搜索结果页的第一位置处;其次,在封面图片上的对应位置提供用于播放视频项对应视频的第一标识;再次,当接收到对第一标识的触发操作时,进入视频项对应的视频页面,并播放视频。
当视频项包括多个时,还可在搜索结果页的第一位置处以轮播的方式显示所述多个视频项分别对应的封面图片。其中,第一位置可以是搜索结果页上相关推荐区域内任一指定位置,例如相关推荐区域内最上面的显示位置。在轮播多个视频项对应的封面图片时,可在封面图片上或者封面图片的下方显示各封面图片对应的视频项的标题。此外,还可在相关推荐区域内的指定位置添加一“更多”按钮,当用户触发该“更多”按钮时,当前页面跳转至包括更多与综艺类关键词相匹配的视频项的页面。
在一个实施例中,优质UGC搜索结果项包括文字链接项。该实施例可按照如下方式显示优质UGC搜索结果项:首先,将文字链接项推送至搜索结果页的第二位置处;其次,当接收到对文字链接项的触发操作时,进入并显示文字链接项对应的UGC内容。其中,第二位置可以是搜索结果页上相关推荐区域内任一指定位置,例如第一位置下方的位置,将第二位置设置于第一位置的下方,能够优先以图片形式为用户展现与综艺类关键词相匹配的视频项,从而更能引发用户对相关搜索区域内推荐的优质UGC搜索结果项的兴趣。当文字链接项包括多个时,可根据预设排序元素对多个优质UGC搜索结果项进行排序,其中,预设排序元素包括各优质UGC搜索结果项的发布时间、评论数、最近一次的评论时间中的至少一项。
在搜索结果页的相关推荐区域内显示与综艺类关键词相匹配的优质UGC搜索结果项之后,上述方法还包括以下步骤:统计优质UGC搜索结果项的点击率;当优质UGC搜索结果项的点击率低于预设点击率时,取消在搜索结果页上对优质UGC搜索结果项的显示操作,并持续监测被取消显示的优质UGC搜索结果项是否被更新;若监测到被取消显示的优质UGC搜索结果项被更新,则重新在搜索结果页上显示更新后的优质UGC搜索结果项。例如,预设点击率设置为70%,针对综艺类关键词“快乐大本营”进行搜索时,如果与综艺类关键词“快乐大本营”相匹配的优质UGC搜索结果项的点击率低于70%,则取消在相关推荐区域内对与综艺类关键词“快乐大本营”相匹配的优质UGC搜索结果项的显示操作,此时相关推荐区域内为用户展示原始推荐内容,例如以图文形式推荐一些娱乐圈的名人。在取消显示操作后,持续监测与综艺类关键词“快乐大本营”相匹配的优质UGC搜索结果项是否被更新,当监测到与综艺类关键词“快乐大本营”相匹配的优质UGC搜索结果项被更新(例如有用户上传新的对“快乐大本营”的剪辑视频片段)时,重新在相关推荐区域内显示与综艺类关键词“快乐大本营”相匹配的优质UGC搜索结果项。
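The click-through-rate rule above (hide an item when its rate falls below the preset value, show it again once it is updated) can be sketched as a small state holder. The class name, the method names and the choice to restart counting after an update are assumptions of this sketch rather than details fixed by the description.

```python
class RecommendationSlot:
    """Tracks impressions and clicks for one recommended UGC item."""

    def __init__(self, ctr_threshold):
        self.ctr_threshold = ctr_threshold  # e.g. 0.7 for the 70% example
        self.impressions = 0
        self.clicks = 0
        self.hidden = False

    def record(self, clicked):
        """Record one impression and whether it was clicked."""
        self.impressions += 1
        if clicked:
            self.clicks += 1

    def ctr(self):
        return self.clicks / self.impressions if self.impressions else 0.0

    def evaluate(self):
        """Hide the item when its measured CTR is below the threshold."""
        if self.impressions and self.ctr() < self.ctr_threshold:
            self.hidden = True
        return self.hidden

    def on_update(self):
        """The source item was updated: show it again and restart counting."""
        self.hidden = False
        self.impressions = 0
        self.clicks = 0
```

While an item is hidden, the related recommendation area would fall back to its original recommended content, as in the 快乐大本营 example.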
图2示出了本发明一具体实施例中对综艺类关键词“快乐大本营”进行搜索时的搜索结果页的界面图。如图2所示,搜索结果页左侧的自然搜索结果显示区域内显示与“快乐大本营”相关的自然搜索结果,右侧的相关推荐区域内则显示与“快乐大本营”相匹配的优质UGC搜索结果项,包括对综艺节目“快乐大本营”的剧透贴、嘉宾信息、用户剪辑视频片段等。
由图2可看出,在相关推荐区域的上方,以轮播图片的形式显示多个与“快乐大本营”相匹配的视频项的封面图片,并在封面图片下方显示其对应的优质UGC搜索结果项的标题“快乐大本营鬼步舞练习”。用户点击封面图片或者标题,即可触发对对应视频项的播放操作。并且,在视频项的右上方显示有“更多”按钮,用户点击该“更多”按钮,当前页面即可跳转至包括更多视频项的页面。在视频项的下方,还以文字链接形式显示有多个与“快乐大本营”相匹配的优质UGC搜索结果项:“最新:快乐大本营12月20日录制嘉宾现场”、“快乐大本营隔空同台撒狗粮”、“快乐大本营挑战高空爬行塑料膜”等等。
由上述实施例可看出,采用本发明提供的针对综艺类query的搜索结果的推送方法,搜索结果页右侧的相关推荐区域不再仅为用户推荐一些用户兴趣不大的综艺名人,而是为用户展示更优质的与综艺类query相匹配的UGC搜索结果项,例如对综艺节目的剧透贴、嘉宾信息、用户剪辑视频片段等,从而使搜索结果页的相关推荐区域能够推荐与用户需求相关并感兴趣的内容,不仅提升了用户对搜索引擎的使用体验,且很大程度上提高了相关推荐区域的点击率,最大限度地发挥了相关推荐区域的推荐作用。
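Figure 3 is a schematic block diagram of a device for pushing search results for a variety-show query according to an embodiment of the present invention. As shown in Figure 3, the device includes: a crawling module 310, adapted to crawl variety-show UGC search result items from designated UGC websites, the designated UGC websites containing UGC search result items at big-data scale; a creation module 320, coupled to the crawling module 310 and adapted to filter multiple variety-show keywords out of the variety-show UGC content corresponding to the variety-show UGC search result items and to collect them into a variety-show keyword list for variety-show queries; an acquisition module 330, coupled to the creation module 320 and adapted to, upon receiving a search request for any variety-show keyword in the keyword list, obtain from the variety-show UGC search result items the high-quality UGC search result items that match that keyword; and a first push module 340, coupled to the acquisition module 330 and adapted to push the matching high-quality UGC search result items to the search result page in their own display form, where the display form includes at least one of a picture display form that uses a variety-show video screenshot as the cover and a text display form that uses the title of the variety-show UGC content as a link.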
可选地,所述抓取模块310还适于以下至少一项:获取综艺类的资讯频道路径,并根据所述资讯频道路径从指定UGC网站中抓取综艺类UGC搜索结果项;提取所述指定UGC网站中的各UGC搜索结果项对应的标题中的关键词;根据所述关键词确定所述各UGC搜索结果项的类型;根据所述类型从所述UGC搜索结果项中抓取综艺类UGC搜索结果项;提取包含以下至少一项UGC内容对应的UGC搜索结果项为所述综艺类UGC搜索结果项:娱乐八卦、综艺资讯、综艺节目点评。
可选地,所述获取模块330还适于:从所述综艺类UGC搜索结果项中抓取与所述综艺类关键词相匹配的、且符合预设条件的综艺类UGC搜索结果项,作为与所述综艺类关键词相匹配的优质UGC搜索结果项。
可选地,所述预设条件包括以下至少一项:UGC视频片段;UGC内容中包含尚未播出的综艺节目内容;UGC内容中包含与所述综艺类关键词相关的人物资讯。
可选地,所述第一推送模块340还适于:当所述优质UGC搜索结果项包括多个时,按照预设排序元素对所述多个优质UGC搜索结果项进行排序,所述预设排序元素包括各优质UGC搜索结果项的发布时间、评论数、最近一次的评论时间中的至少一项;将所述排序后的多个优质UGC搜索结果项按照其自身显示形式推送至搜索结果页。
可选地,所述优质UGC搜索结果项包括视频项;所述第一推送模块340还适于:将所述视频项对应的封面图片推送至所述搜索结果页的第一位置处;在所述封面图片上的对应位置提供用于播放所述视频项对应视频的第一标识;当接收到对所述第一标识的触发操作时,进入所述视频项对应的视频页面,并播放所述视频。
可选地,所述第一推送模块340还适于:当所述视频项包括多个时,在所述搜索结果页的第一位置处以轮播的方式显示所述多个视频项分别对应的封面图片。
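Optionally, the high-quality UGC search result items include text link items, and the first push module 340 is further adapted to: push the text link item to a second position on the search result page; and, upon receiving a trigger operation on the text link item, open and display the UGC content corresponding to the text link item.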
可选地,如图4所示,上述装置还包括:统计模块350,与第一推送模块340相耦合,适于将与所述综艺类关键词相匹配的优质UGC搜索结果项推送至搜索结果页之后,统计所述优质UGC搜索结果项的点击率;取消模块360,与统计模块350相耦合,适于当所述优质UGC搜索结果项的点击率低于预设点击率时,取消在所述搜索结果页上对所述优质UGC搜索结果项的推送操作;监测模块370,与取消模块360相耦合,适于监测所述被取消显示的优质UGC搜索结果项是否被更新;第二推送模块380,与监测模块370相耦合,适于若监测到所述被取消显示的优质UGC搜索结果项被更新,则重新将所述更新后的优质UGC搜索结果项推送至搜索结果页。
可选地,所述第一推送模块340还适于:将与所述综艺类关键词相匹配的优质UGC搜索结果项按照其自身显示形式推送至所述搜索结果页的相关推荐区域。
本领域的技术人员应可理解,图3和图4中的针对综艺类query的搜索结果的推送装置能够用来实现前文所述的针对综艺类query的搜索结果的推送方案,其中的细节描述应与前文方法部分描述类似,为避免繁琐,此处不另赘述。
为解决上述技术问题,本发明实施例还提供了一种在搜索结果页中聚合影视剧类资讯信息的方法,该方法可以应用在个人电脑、智能手机、平板电脑等终端设备上。图5示出了根据本发明一实施例的在搜索结果页中聚合影视剧类资讯信息的方法的流程图。如图5所示,该方法至少可以包括以下步骤S502至步骤S510。
步骤S502,确定N个影视剧关键词,其中,N为整数,且N大于1。
在具体应用中,N个影视剧关键词可以根据预定数据库中各个关键词的点击率和/或搜索率确定。例如,可以为360热榜和影视站中排名或点击率和/或搜索率最靠前的N名影视剧名组成所述N个影视剧关键词,其中,N的取值可以根据具体应用确定,在本实施例中并不作限定。
步骤S504,从预定的一个或多个用户生成内容UGC网站中分别获取与所述N个影视剧关键词中的各个影视剧关键词相关的资讯信息和/或视频;
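In this step, UGC (User Generated Content), also known as UCC (User Created Content), may include text written by users, pictures taken by users, video and audio recorded by users, and so on. PGC (Professional Generated Content) is a concept derived from UGC. The advantage of UGC is that users can upload content freely and enrich a website; the drawback is that content quality is uneven. Compared with UGC, PGC is more professionally categorized and its quality is better assured, with professional content planning and editing. UGC and PGC do not conflict; they can run side by side and complement each other. A mature Internet content product, whether a website, a community, a video platform, an audio platform or a new form of media, needs both depth and breadth. UGC provides breadth of content and mainly contributes traffic and participation, while PGC maintains depth of content and mainly builds the brand and creates value; both are needed. Because PGC is a concept derived from UGC, the embodiments of the present invention treat PGC as part of UGC.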
在具体应用中,由于UGC提供的内容的质量良莠不齐,本发明实施例为了增加影视剧类资讯内容的可信度,在该步骤中从多个UGC网站中抓取影视剧类资讯信息时,可以从多个UGC网站中筛选出至少一个优质UGC网站,进而从至少一个优质UGC网站中抓取影视剧类资讯信息。
进一步地,在从多个UGC网站中筛选出至少一个优质UGC网站时,可以通过一些衡量因子来筛选。具体地,确定一个或多个衡量因子,根据确定的一个或多个衡量因子衡量出多个UGC网站的质量情况,并从中筛选出质量满足指定质量条件的至少一个UGC网站作为优质UGC网站。这里的衡量因子可以如网站的可信度、网站上注册的用户数、网站的访问量等等。
当衡量因子包括多个时,在根据多个衡量因子来衡量多个UGC网站的质量情况时,本发明实施例提供了一种可选的方案,在该方案中,可以基于权重策略确定多个衡量因子各自的权重,获取多个UGC网站的多个衡量因子各自的数值;随后将多个UGC网站的多个衡量因子各自的数值与权重进行加权求和,得到综合数值,进而根据多个UGC网站各自的综合数值衡量出多个UGC网站的质量情况。
例如,多个UGC网站为网站1、网站2、网站3、网站4和网站5,多个衡量因子为网站的可信度、网站上注册的用户数、网站的访问量,网站1的多个衡量因子各自的数值分别为p11、p12、p13,网站2的多个衡量因子各自的数值分别为p21、p22、p23,网站3的多个衡量因子各自的数值分别为p31、p32、p33,网站4的多个衡量因子各自的数值分别为p41、p42、p43,网站5的多个衡量因子各自的数值分别为p51、p52、p53。确定多个衡量因子各自的权重为w1、w2、w3,将多个UGC网站的多个衡量因子各自的数值与权重进行加权求和,得到多个UGC网站的综合数值。不妨以网站1和网站2为例,加权求和后网站1的综合数值为p11×w1+p12×w2+p13×w3,网站2的综合数值为p21×w1+p22×w2+p23×w3,网站3、网站4和网站5以此类推,此处不再一一赘述。
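The weighted sum in the example above can be sketched as follows. The function names and the `min_score` threshold, which stands in for the "specified quality condition", are assumptions introduced for this illustration; the factor values and weights would come from whatever weighting strategy is actually used.

```python
def composite_scores(factor_values, weights):
    """Map each site to p_i1*w1 + p_i2*w2 + ... over its factor values."""
    return {
        site: sum(p * w for p, w in zip(values, weights))
        for site, values in factor_values.items()
    }


def select_quality_sites(factor_values, weights, min_score):
    """Return the sites whose composite score meets the assumed threshold."""
    scores = composite_scores(factor_values, weights)
    return [site for site, score in scores.items() if score >= min_score]
```

Each list of factor values is assumed to be in the same order as the weights (credibility, registered users, visits), so that site 1 yields p11×w1+p12×w2+p13×w3 as in the text.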
另外,在本实施例中,可以针对不同类型的UGC网站,采用不同的抓取策略。
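For example, in an optional implementation of the embodiments of the present invention, for UGC websites of the professional information publishing platform type, for example video websites such as 头条号, 爱奇艺 and 优酷, obtaining information and/or videos related to each of the N film and television keywords from the predetermined one or more user generated content (UGC) websites includes:
searching for each of the N film and television keywords on the professional information publishing platform UGC website and extracting, from the search results, the information and/or videos related to the N film and television keywords; for example, each preset word in the preset film and television word list can be entered in the search box of 头条号, 爱奇艺 or 优酷, and the information and/or videos related to each preset word crawled in order of publication time; or,
tagging film and television information and videos among the information published on the professional information publishing platform UGC website, and extracting from the tagged items the information and/or videos related to the N film and television keywords. For example, film and television accounts on 头条号 can be tagged manually, data crawled from those accounts, and the crawled information and/or videos then classified by the names contained in their titles.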
又例如,在本发明实施例的另一个可选实施方案中,对于网络主题社区类的UGC网站,例如,兴趣部落或豆瓣等,从预定的一个或多个用户生成内容UGC网站中分别获取与所述N个影视剧关键词中的各个影视剧关键词相关的资讯信息和/或视频,包括:在所述网络主题社区类的UGC网站中分别确定与所述N个影视剧关键词中的每个所述影视剧关键词相关的主题社区,从所述相关的主题社区中选择最大的一个或多个主题社区,在所述一个或多个主题社区发布的资讯的名称title或正文中搜索所述N个影视剧关键词,根据搜索结果,从所述一个或多个主题社区中提取与所述N个影视剧关键词相关的资讯信息和/或视频。例如,在兴趣部落中,针对影视剧类预设词表中各个影视剧类预设词,例如,大圣归来,先定位目标影视剧有多少部落,例如,大圣归来社区,然后选择最大部落进行抓取(例如,可以依据关注度),title或文章正文包含关键字(例如,大圣归来)的资讯信息。
又例如,在本发明实施例的又一个可选实施方案中,对于网络问答社区类的UGC网站,例如,知乎网,从预定的一个或多个用户生成内容UGC网站中分别获取与所述N个影视剧关键词中的各个影视剧关键词相关的资讯信息和/或视频,包括:获取所述网络问答社区类的UGC网站中发表问题的类别为影视剧类相关的资讯;从所述发表问题的类别为影视剧类相关的资讯中分别查找名称和/或正文包含有所述N个影视剧关键词中一个或多个的资讯;从查找结果中提取的与所述N个影视剧关键词相关的资讯信息和/或视频。例如,可以先判断知乎发表问题的类别是否跟影视剧有关(例如,问题为:大圣归来的主演是谁),如果有关,则进一步判断该问题及答案中是否包含影视剧类预设词表中的影视剧类预设词(例如,大圣归来),如果包含,则抓取该问题及答案作为对应影视剧类预设词相关的资讯信息。
步骤S506,将获取的与所述各个影视剧关键词相关的资讯信息和/或视频存储到影视剧类资讯内容数据库中。
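In a specific application, when step S506 is performed, each obtained piece of information and/or video is preferably stored in the film and television information content database classified by the film and television keyword it relates to, and the items stored under each keyword are sorted by their content attributes. For example, information and/or videos related to “大圣归来” are placed together. Organizing the database this way keeps its content ordered by film and television keyword, which makes later searches easier.
That is, in this embodiment, after the information and/or videos are crawled, they are first classified by the preset film and television word that each item relates to, producing a structured film and television information content database keyed by preset word and content attributes. The database may include three attribute columns: the preset film and television word, the content attributes of the information and/or video, and the information content. The content attributes may include several items, such as the publication time and the number of comments, and the information content may include the title and the link address of the item. Table 1 shows an example of the structure of the film and television information content database in this embodiment.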
表1
Figure PCTCN2017117220-appb-000001
在本发明实施例的一个可选实施方案中,进一步,在步骤S506得到影视剧类资讯内容数据库后,还可以根据每条资讯信息的资讯信息和/或视频的内容属性进行优化排序。其中,资讯信息和/或视频的内容属性可以包括:内容的类型(例如,资讯信息或视频)、发布时间、查看数和/或评论数等,即在影视剧类资讯内容数据库可以按照资讯的时效性和/或热度进行排序,以提高后续的搜索效率。
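A minimal sketch of such a structured database, grouped by keyword and ordered by timeliness and popularity, is given below. The record fields mirror the three attribute columns described above; the field names, the ISO date strings and the tie-breaking by comment count are assumptions made for illustration and are not taken from Table 1.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class InfoRecord:
    keyword: str    # preset film and television word
    title: str      # information content: title
    url: str        # information content: link address
    kind: str       # content attribute: "article" or "video"
    published: str  # content attribute: ISO date, e.g. "2016-12-20"
    views: int      # content attribute: number of views
    comments: int   # content attribute: number of comments


def build_database(records):
    """Group records by keyword, newest first, ties broken by comment count."""
    db = defaultdict(list)
    for record in records:
        db[record.keyword].append(record)
    for rows in db.values():
        rows.sort(key=lambda r: (r.published, r.comments), reverse=True)
    return dict(db)
```

A lookup for a query that hits a keyword then reduces to reading one already-sorted list, which is the search-efficiency benefit the paragraph above refers to.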
本发明实施例提供了一种可选的方案,在该方案中,可以确定用于对抓取的影视剧类资讯信息进行处理的影视剧类关键词,进而基于确定的影视剧类关键词从抓取的影视剧类资讯信息中提取相应的属性内容。在本实施例中,影视剧类关键词可以是影视剧名、影视剧的导演或影视剧的编剧等等,本发明实施例不限于此。
步骤S508,响应用户在搜索引擎上输入的目标搜索词,从互联网中搜索与所述目标搜索词匹配的结果,并在所述影视剧类资讯内容数据库中查找与所述目标搜索词匹配的资讯信息和/或视频。
在本实施例中,在接收到用户输入的目标搜索词时,可以先判断所述目标搜索词是否命中所述N个影视剧类关键词中的一个或多个,如果是,则在从互联网中搜索与所述目标搜索词匹配的结果时,同时在所述影视剧类资讯内容数据库中查找与所述目标搜索词匹配的资讯信息和/或视频,否则,按照正常的搜索模式进行搜索,只从互联网中搜索所述目标搜索词。
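The branching described above can be sketched as follows. The two callables stand in for the Internet search and the database lookup and are hypothetical; treating a "hit" as substring containment is likewise an assumption, and the two searches are run one after the other here although the text describes them as happening at the same time.

```python
def handle_query(query, keyword_table, web_search, db_lookup):
    """Always search the web; query the UGC database only on a keyword hit."""
    web_results = web_search(query)
    hit = any(keyword in query for keyword in keyword_table)
    ugc_results = db_lookup(query) if hit else []
    return web_results, ugc_results
```

When the query hits no keyword, the second element is empty and the page is built from the ordinary search results alone.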
步骤S510,在从所述影视剧类资讯内容数据库中查找到与所述目标搜索词匹配的资讯信息和/或视频的情况下,将查找到的所述资讯信息和/或视频聚合至所述目标搜索词对应的搜索结果页展现给用户。
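In this embodiment, when a video matching the target search term is found in the film and television information content database, then in step S510, when the found information and/or video is presented to the user, as shown in Figure 6, the found video may be played on the search result page and text links to the found information and/or videos may be displayed on the search result page.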
在本发明实施例的一个可选实施方案中,如果在从所述影视剧类资讯内容数据库中没有查找到与所述目标搜索词匹配的资讯信息和/或视频的情况下,则在所述目标搜索词对应的搜索结果页展现从互联网搜索到的结果。
在本实施例中,从影视剧类资讯内容数据库中查找到的结果可以作为搜索引擎从互联网上进行搜索得到搜索结果的补充,因此,在本发明实施例的一个可选实施方案中,步骤S510可以包括以下步骤:
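Step 1: display, on the left side of the search result page, the results of searching the Internet for the target search term;
Step 2: determine whether the found information and/or videos include any item identical to a result shown on the left side of the search result page, and if so, remove the identical items from the found information and/or videos;
Step 3: aggregate the found information and/or videos, after the identical items have been removed, into the right-hand area of the search result page corresponding to the target search term and present them to the user. That is, in this optional implementation the search result page includes two areas, a left area and a right area. In this embodiment the left area presents the results that the search engine obtains by searching the Internet for the target search term, like the content shown on the left of the result pages of search engines such as baidu and google today, and the right area presents the results found in the film and television information content database. This expands the content of the right area and gives the user more complete search results. In addition, the content shown on the left and on the right does not overlap, so each result appears only once.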
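Steps 1 to 3 can be sketched as a simple de-duplication. The text does not say how two results are judged identical; comparing link addresses is an assumption of this sketch, as are the function name and the dictionary layout.

```python
def dedupe_right_column(web_results, ugc_results):
    """Drop UGC items whose URL already appears among the left-side results."""
    seen = {result["url"] for result in web_results}
    return [item for item in ugc_results if item["url"] not in seen]
```

The returned list is what would be aggregated into the right-hand area, while `web_results` is shown unchanged on the left.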
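In the above optional implementation, to make the content shown on the right better match the user's needs, in an optional implementation of the embodiments of the present invention, after the matching information and/or videos are aggregated into the search result page corresponding to the target search term and presented to the user, the method further includes: counting the user's trigger operations on the matching information and/or videos presented on the search result page to obtain a statistical result; and determining, according to the statistical result, whether to present the matching information and/or videos in the pages corresponding to subsequent search requests. The user's trigger operations may be measured as the CTR (Click-Through Rate) of the presented matching information and/or videos; that is, the click-through rate of an item from the film and television information content database after it has been presented determines whether that item is still shown on the right side of the search page when it is found again later.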
进一步地,在上述可选实施方式中,在根据所述统计结果确定在后续搜索请求对应的页面中是否展现所述匹配的资讯信息和/或视频时,可以判断所述统计结果是否为所述触发操作的数量小于指定阈值,如果是,则确定在后续搜索请求对应的页面中不再展现所述匹配的资讯信息和/或视频。
在具体应用中,可以按照指定时间(如1或2小时等)周期判断资讯信息和/或视频的CTR,并根据判断结果进行相应的处理。
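In addition, in this embodiment the crawled UGC websites may be checked periodically for updates to the information and/or videos related to each preset word in the preset film and television word list. If there are updates, the new information and/or videos are crawled into the film and television information content database. After the update, the click-through rate (CTR) of every item in the film and television information content database is cleared. That is, after an update, when an item in the database is hit it is shown on the search result page this time regardless of whether its earlier CTR was high or low; the CTR of each item is then counted again, and once the specified time period has elapsed the item's CTR is compared with the threshold to decide whether to show it in subsequent search results.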
在本发明实施例中,在接收到用户在搜索引擎上输入的与影视剧类相关的目标搜索词时,先判断目标搜索词是否命中影视剧类预设词表,如果是,则在从UGC网站中抓取的数据组成的结构化的影视剧类资讯内容数据库中查找与目标搜索词匹配的资讯信息和/或视频,并将从结构化的影视剧类资讯内容数据库中查找到的资讯信息和/或视频聚合至目标搜索词对应的搜索结果页展现给用户。由此可见,在本发明实施例提供的技术方案中,能够在搜索结果页中聚合UGC网站的影视剧类资讯信息和/或视频,从而可以为用户提供更全面的资讯信息和/或视频,扩宽内容覆盖面。并且,由于影视剧类资讯内容数据库具有影视剧类预设词和资讯信息和/或视频的内容属性的结构化特点,具有可读性,能够帮助用户快速地找到需要的信息。进一步地,影视剧类资讯内容数据库来自各个UGC网站,将各个UGC网站中的数据前置到搜索结果页中进行展现,无需用户通过多次操作去网站查找相关资讯信息和/或视频,降低了用户的检索成本。
需要说明的是,实际应用中,上述所有可选实施方式可以采用结合的方式任意组合,形成本发明的可选实施例,在此不再一一赘述。
基于上文各个实施例提供的在搜索结果页中聚合影视剧类资讯信息的方法,基于同一发明构思,本发明实施例还提供了一种在搜索结果页中聚合影视剧类资讯信息的装置。
图7示出了根据本发明一实施例的影视剧类关键词搜索展现装置的结构示意图。如图7所示,该装置至少可以包括确定模块710、获取模块720、存储模块730、搜索模块740以及展现模块750。
现介绍本发明实施例的在搜索结果页中聚合影视剧类资讯信息的装置的各组成或器件的功能以及各部分间的连接关系:
确定模块710,用于确定N个影视剧关键词,其中,N为整数,且N大于1;获取模块720,用于从预定的一个或多个用户生成内容UGC网站中分别获取与所述N个影视剧关键词中的各个影视剧关键词相关的资讯信息和/或视频;存储模块730,用于将获取的与所述各个影视剧关键词相关的资讯信息和/或视频存储到影视剧类资讯内容数据库中;搜索模块740,用于响应用户在搜索引擎上输入的目标搜索词,从互联网中搜索与所述目标搜索词匹配的结果,并在所述影视剧类资讯内容数据库中查找与所述目标搜索词匹配的资讯信息和/或视频;展现模块750,用于在从所述影视剧类资讯内容数据库中查找到与所述目标搜索词匹配的资讯信息和/或视频的情况下,将查找到的所述资讯信息和/或视频聚合至所述目标搜索词对应的搜索结果页展现给用户。
在本发明实施例的一个可选实施方案中,所述获取模块720具体用于按照以下方式将获取的与所述各个影视剧关键词相关的资讯信息和/或视频存储到影视剧类资讯内容数据库中:按照获取的每条所述资讯信息和/或视频相关的影视剧关键词进行分类存储到所述影视剧类资讯内容数据库中,并根据每条资讯信息和/或视频的内容属性对分类存储的每条资讯信息和/或视频进行排序。
其中,每条资讯信息和/或视频的内容属性可以包括内容的类型、内容发布时间、内容的评论数、以及内容的查看数,即在影视剧类资讯内容数据库可以按照资讯的时效性和/或热度进行排序,以提高搜索效率。
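In an optional implementation of the embodiments of the present invention, when a video matching the target search term is found in the film and television information content database, the presentation module 750 is specifically configured to aggregate the found information and/or videos into the search result page corresponding to the target search term and present them to the user as follows: play the found video on the search result page, and display text links to the found information and/or videos on the search result page.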
另外,在本实施例中,可以针对不同类型的UGC网站,采用不同的抓取策略。
在本发明实施例的一个可选实施方案中,
对于专业信息发布平台类的UGC网站,所述获取模块720具体用于按照以下方式获取与所述N个影视剧关键词中的各个影视剧关键词相关的资讯信息和/或视频:
在所述专业信息发布平台类的UGC网站分别搜索所述N个影视剧关键词,从搜索结果中从所述专业信息发布平台类的UGC网站提取与所述N个影视剧关键词相关的资讯信息和/或视频;或者,
在所述专业信息发布平台类的UGC网站发布的资讯信息中标注影视剧类的资讯和视频,从标注的影视剧类资讯和视频中分别提取与所述N个影视剧关键词相关的资讯信息和/或视频。
在本发明实施例的一个可选实施方案中,
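For UGC websites of the online themed community type, the acquisition module 720 is specifically configured to obtain information and/or videos related to each of the N film and television keywords as follows: determine, in the online themed community UGC website, the themed communities related to each of the N film and television keywords; select the largest one or more of those communities; search for the N film and television keywords in the titles or body text of the information published in the selected communities; and, based on the search results, extract from the selected communities the information and/or videos related to the N film and television keywords. For example, in 兴趣部落, for each preset word in the preset film and television word list, such as 大圣归来, first determine how many tribes exist for the target film or series (for example, the 大圣 community), then select the largest tribe (for example, by level of attention) and crawl the information whose title or body text contains the keyword (for example, 大圣归来).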
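In an optional implementation of the embodiments of the present invention, for UGC websites of the online Q&A community type, the acquisition module 720 is specifically configured to obtain information and/or videos related to each of the N film and television keywords as follows: obtain the information on the online Q&A community UGC website whose question category relates to film and television; find, among that information, the items whose title and/or body text contains one or more of the N film and television keywords; and extract from the items found the information and/or videos related to the N film and television keywords. For example, first determine whether the category of a question posted on 知乎 relates to film and television; if so, further determine whether the question and its answers contain a preset word from the preset film and television word list; if they do, crawl the question and its answers as information related to that preset word.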
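In an optional implementation of the embodiments of the present invention, the presentation module 750 is specifically configured to aggregate the found information and/or videos into the search result page corresponding to the target search term and present them to the user as follows: display, on the left side of the search result page, the results of searching the Internet for the target search term; determine whether the found information and/or videos include any item identical to a result shown on the left side of the search result page, and if so, remove the identical items from the found information and/or videos; and aggregate the found information and/or videos, after the identical items have been removed, into the right-hand area of the search result page corresponding to the target search term and present them to the user.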
在本发明实施例的一个可选实施方案中,如图8所示,该装置还可以包括:统计模块760,用于在将查找到的所述资讯信息和/或视频聚合至所述目标搜索词对应的搜索结果页展现给用户之后,统计用户针对所述搜索结果页上展现的各个所述查找到的资讯信息和/或视频的触发操作,得到统计结果;判断模块770,用于根据所述统计结果确定在后续搜索请求对应的页面中是否展现各个所述查找到的资讯信息和/或视频。
进一步地,在上述可选实施方式中,所述判断模块770具体用于按照以下方式确定在后续搜索请求对应的页面中是否展现各个所述查找到的资讯信息和/或视频:确定在后续搜索请求对应的页面中不再展现所述查找到的资讯信息和/或视频中,所述触发操作的数量小于指定阈值的资讯信息和/或视频。
为解决上述技术问题,本发明实施例还提供了一种在搜索结果页中聚合游戏类资讯信息的方法,该方法可以应用在个人电脑、智能手机、平板电脑等终端设备上。图9示出了根据本发明一实施例的在搜索结果页中聚合游戏类资讯信息的方法的流程图。如图9所示,该方法至少可以包括以下步骤S902至步骤S912。
步骤S902,判断预定的多个用户生成内容UGC网站中是否存在与N个预设游戏标识相关的资讯项,其中,N为整数,且N大于1。
在本实施例中,N个预设游戏标识可以是当前热门的N个游戏的名称。
在具体应用中,N个预设游戏标识可以根据预定数据库中各个关键词的点击率和/或搜索率确定。例如,可以为百度风云榜中最靠前的N名游戏名组成所述N个预设游戏标识,其中,N的取值可以根据具体应用确定,在本实施例中并不作限定。
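In this step, UGC (User Generated Content), also known as UCC (User Created Content), may include text written by users, pictures taken by users, video and audio recorded by users, and so on. PGC (Professional Generated Content) is a concept derived from UGC. The advantage of UGC is that users can upload content freely and enrich a website; the drawback is that content quality is uneven. Compared with UGC, PGC is more professionally categorized and its quality is better assured, with professional content planning and editing. UGC and PGC do not conflict; they can run side by side and complement each other. A mature Internet content product, whether a website, a community, a video platform, an audio platform or a new form of media, needs both depth and breadth. UGC provides breadth of content and mainly contributes traffic and participation, while PGC maintains depth of content and mainly builds the brand and creates value; both are needed. Because PGC is a concept derived from UGC, the embodiments of the present invention treat PGC as part of UGC.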
在具体应用中,由于UGC提供的内容的质量良莠不齐,本发明实施例为了增加游戏类资讯内容的可信度,在该步骤中从多个UGC网站中抓取游戏类资讯信息时,可以从多个UGC网站中筛选出至少一个优质UGC网站,进而从至少一个优质UGC网站中抓取游戏类资讯信息。
进一步地,在从多个UGC网站中筛选出至少一个优质UGC网站时,可以通过一些衡量因子来筛选。具体地,确定一个或多个衡量因子,根据确定的一个或多个衡量因子衡量出多个UGC网站的质量情况,并从中筛选出质量满足指定质量条件的至少一个UGC网站作为优质UGC网站。这里的衡量因子可以如网站的可信度、网站上注册的用户数、网站的访问量等等。
当衡量因子包括多个时,在根据多个衡量因子来衡量多个UGC网站的质量情况时,本发明实施例提供了一种可选的方案,在该方案中,可以基于权重策略确定多个衡量因子各自的权重,获取多个UGC网站的多个衡量因子各自的数值;随后将多个UGC网站的多个衡量因子各自的数值与权重进行加权求和,得到综合数值,进而根据多个UGC网站各自的综合数值衡量出多个UGC网站的质量情况。
例如,多个UGC网站为网站1、网站2、网站3、网站4和网站5,多个衡量因子为网站的可信度、网站上注册的用户数、网站的访问量,网站1的多个衡量因子各自的数值分别为p11、p12、p13,网站2的多个衡量因子各自的数值分别为p21、p22、p23,网站3的多个衡量因子各自的数值分别为p31、p32、p33,网站4的多个衡量因子各自的数值分别为p41、p42、p43,网站5的多个衡量因子各自的数值分别为p51、p52、p53。确定多个衡量因子各自的权重为w1、w2、w3,将多个UGC网站的多个衡量因子各自的数值与权重进行加权求和,得到多个UGC网站的综合数值。不妨以网站1和网站2为例,加权求和后网站1的综合数值为p11×w1+p12×w2+p13×w3,网站2的综合数值为p21×w1+p22×w2+p23×w3,网站3、网站4和网站5以此类推,此处不再一一赘述。
在本实施例中,在确定优质UGC网站之后,还可以进一步判断多个优质UGC网站中是否存在有与N个预设游戏标识相关的资讯项,在该步骤中,可以将N个预设游戏标识作为关键词,搜索各个优质UGC网站,如果命中率达到预定值,比如,80%,则将对应的UGC网站作为存在与所述N个预设游戏标识相关的资讯项的UGC网站,即当一个UGC网站中包含有N个预设游戏标识中80%的关键词时,将该UGC网站作为存在与所述N个预设游戏标识相关的资讯项的UGC网站。当然,具体应用中,预定值可以根据实际需要进行设置,具体本实施例不作限定。
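The hit-rate check above can be sketched as follows. Each site is represented here by the set of game identifiers for which a search on that site returned results; this representation, the function names and the default threshold of 0.8 (the 80% example) are assumptions made for illustration.

```python
def hit_rate(game_ids, found_on_site):
    """Fraction of the preset game identifiers that the site has results for."""
    hits = sum(1 for game_id in game_ids if game_id in found_on_site)
    return hits / len(game_ids)


def sites_with_coverage(game_ids, sites, threshold=0.8):
    """Return the sites whose hit rate reaches the preset value."""
    return [
        name for name, found in sites.items()
        if hit_rate(game_ids, found) >= threshold
    ]
```

Only the sites returned by `sites_with_coverage` would then be crawled in step S904.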
步骤S904,根据判断结果,从存在与所述N个预设游戏标识相关的资讯项的一个或多个UGC网站中,抓取与所述N个预设游戏标识相关的数据;
在本实施例中,可以针对不同类型的UGC网站,采用不同的抓取策略。
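For example, in an optional implementation of the embodiments of the present invention, for UGC websites of the professional information publishing platform type, for example video websites such as 头条号, 爱奇艺 and 优酷, crawling data related to the N preset game identifiers from the one or more UGC websites that contain information items related to the N preset game identifiers includes:
entering each of the N preset game identifiers in the search box of the professional information publishing platform UGC website and crawling, from the search results in order of publication time, the information related to each preset game identifier; for example, each preset game identifier can be entered in the search box of 头条号, 爱奇艺 or 优酷, and the information related to each game identifier crawled in order of publication time; or,
tagging game information among the information published on the professional information publishing platform UGC website, and crawling from the tagged game information the information related to the N preset game identifiers. For example, game accounts on 头条号 can be tagged manually, data crawled from those accounts, and the crawled information then classified by the names contained in its titles.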
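As another example, in another optional implementation of the embodiments of the present invention, for UGC websites of the online themed community type, such as 兴趣部落 or 豆瓣, crawling data related to the N preset game identifiers from the one or more UGC websites that contain information items related to the N preset game identifiers includes: for each of the N preset game identifiers, determining the themed communities related to that identifier in the online themed community UGC website, selecting M themed communities from among them, and crawling from the M communities the information whose title or body text contains that identifier. For example, in 兴趣部落, for each preset game identifier, such as “魔兽世界”, first determine how many tribes exist for the target game (for example, the 魔兽 community), then select the largest tribe (for example, by level of attention) and crawl the information whose title or body text contains the keyword (for example, 魔兽世界).
As a further example, in yet another optional implementation of the embodiments of the present invention, for UGC websites of the online Q&A community type, such as 知乎, crawling data related to the N preset game identifiers from the one or more UGC websites that contain information items related to the N preset game identifiers includes: obtaining, from the online Q&A community UGC website, the information whose question category is games; and determining whether that information contains one or more of the N preset game identifiers and, if so, crawling it as data related to the N preset game identifiers. For example, first determine whether the category of a question posted on 知乎 relates to games (for example, the question "what is the background story of 魔兽世界"); if so, further determine whether the question and its answers contain a preset game word from the preset game word list (for example, 魔兽世界); if they do, crawl the question and its answers as information related to that preset game word.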
步骤S906,对抓取的与所述N个预设游戏标识相关的数据进行处理,得到UGC游戏资讯数据库,其中,所述UGC游戏资讯数据库中每条数据至少包括:关键词、资讯内容、以及属性。
在具体应用中,在执行步骤S906时,优选地,可以存储抓取的每条数据,并按照所述抓取的每条数据的一个或多个资讯属性进行排序,得到所述UGC游戏资讯数据库。
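In an optional implementation of this embodiment, after the information is crawled, it may first be classified by the preset game word that each item relates to, producing a structured UGC game information database keyed by preset game word and information attributes. That is, the UGC game information database may include three attribute columns: the preset game identifier, the information attributes, and the information content. The information attributes may include several items, such as the publication time and the number of comments, and the information content may include the title and the link address of the item. Table 2 shows an example of the structure of the UGC game information database in this embodiment.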
表2
Figure PCTCN2017117220-appb-000002
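In an optional implementation of the embodiments of the present invention, after the UGC game information database is obtained in step S906, it may further be sorted by the information attributes of each item. The information attributes may include the type of content (for example, information or video), the publication time, the number of views and/or the number of comments; that is, the UGC game information database can be ordered by timeliness and/or popularity to make later searches more efficient.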
本发明实施例提供了一种可选的方案,在该方案中,可以确定用于对抓取的游戏类资讯信息进行处理的预设游戏标识,进而基于确定的游戏类关键词从抓取的游戏类资讯信息中提取相应的属性内容。在本实施例中,预设游戏标识可以是游戏名、游戏副本名称等等,本发明实施例不限于此。
在本发明实施例的一个可选实施方案中,由于游戏类资讯的特殊性,UGC游戏资讯数据库中包含的资讯内容可能有直播视频,由于直播视频具有一定时效性,为了避免用户检索到不存在的直播视频,在本实施例的一个可选实施方式中,得到所述UGC游戏资讯数据库之后,所述方法还包括:对于所述UGC游戏资讯数据库中资讯内容包含直播视频的数据项,周期性地检测直播视频是否结束,在检测到直播视频结束的情况下,将对应的数据项从所述UGC游戏资讯库中删除。其中,检测周期可以根据实际情况进行设置,例如,1小时或2小时等。
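The periodic clean-up of ended live videos can be sketched as follows. How a live stream is detected as ended is not specified, so `is_live_ended` is a hypothetical callback, and the `is_live` flag on each record is likewise an assumption; the function would be invoked once per detection period (for example every 1 or 2 hours) by whatever scheduler is available.

```python
def prune_ended_live(db, is_live_ended):
    """Return the records to keep: drop live-video records whose stream ended."""
    return [
        row for row in db
        if not (row.get("is_live") and is_live_ended(row))
    ]
```

Records that are not live videos are never passed to the callback, so ordinary articles and recorded videos are unaffected.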
步骤S908,响应用户在搜索引擎上输入的目标搜索词,判断所述目标搜索词是否为游戏类的搜索词,如果是,则执行步骤S910,否则,按照正常的搜索模式进行搜索,只从互联网中搜索所述目标搜索词。
步骤S910,在从互联网中搜索所述目标搜索词的同时,在UGC游戏资讯数据库查找与所述目标搜索词匹配的数据。
步骤S912,将与所述目标搜索词匹配的数据的资讯内容聚合至所述目标搜索词对应的搜索结果页展现。
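In this embodiment, when a live video matching the target search term is found in the UGC game information database in step S910, then in step S912, when the found information is presented to the user, as shown in Figure 10, the found live video may be played on the search result page and text links to the found information displayed on the search result page; the picture shown for the live video may be synchronized with the live stream at a certain frequency.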
在本发明实施例的一个可选实施方案中,如果在从所述UGC游戏资讯数据库中没有查找到与所述目标搜索词匹配的资讯信息的情况下,则在所述目标搜索词对应的搜索结果页展现从互联网搜索到的结果。
在本实施例中,从UGC游戏资讯数据库中查找到的结果可以作为搜索引擎从互联网上进行搜索得到搜索结果的补充,因此,在本发明实施例的一个可选实施方案中,步骤S912可以包括以下步骤:
步骤1,在所述搜索结果页的左侧显示从互联网上搜索所述目标搜索词的结果;
步骤2,判断所述UGC游戏资讯数据库中与所述目标搜索词匹配的数据中是否有与所述搜索结果页左侧展现的结果中相同的数据,如果有,则将所述相同的数据去除;
步骤3,将去除所述相同的数据后的与目标搜索词匹配的数据的资讯内容聚合至所述目标搜索词对应的搜索结果页的右侧区域。即,在上述可选实施方式中,搜索结果页上包括两个区域:左侧区域和右侧区域,在本实施例中,左侧区域用于展现搜索引擎在互联网搜索目标搜索词得到的结果,例如,像现在baidu、google等搜索引擎的搜索结果页左侧展现的内容,右侧区域用于展现在UGC游戏资讯数据库搜索到的结果,从而可以扩展搜索结果页右侧区域的内容,为用户提供更完整的搜索结果。并且,在该可选实施方式中,左侧和右侧显示的内容没有重合,从而可以保证检索结果的唯一性。
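In the above optional implementation, to make the content shown on the right better match the user's needs, in an optional implementation of the embodiments of the present invention, after the information content of the data matching the target search term is aggregated into the search result page corresponding to the target search term for presentation, the method further includes: counting the user's trigger operations on the information content of the matching data presented on the search result page to obtain a statistical result; and determining, according to the statistical result, whether to present the information content of the data matching the target search term in the pages corresponding to subsequent search requests. The user's trigger operations may be measured as the CTR (Click-Through Rate) of the presented information content; that is, the click-through rate of an item from the UGC game information database after it has been presented determines whether that item is still shown on the right side of the search page when it is found again later.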
进一步地,在上述可选实施方式中,在根据所述统计结果确定在后续搜索请求对应的页面中是否展现所述匹配的数据的资讯内容时,可以判断所述统计结果是否为所述触发操作的数量小于指定阈值,如果是,则确定在后续搜索请求对应的页面中不再展现所述匹配的数据的资讯内容。
在具体应用中,可以按照指定时间(如1或2小时等)周期判断数据的资讯内容的CTR,并根据判断结果进行相应的处理。
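In addition, in this embodiment the crawled UGC websites may be checked periodically for updates to the information content of the data related to each preset game word in the preset game word list. If there are updates, the new information content is crawled into the UGC game information database. After the update, the click-through rate (CTR) of every item in the UGC game information database is cleared. That is, after an update, when an item in the UGC game information database is hit it is shown on the search result page this time regardless of whether its earlier CTR was high or low; the CTR of each item is then counted again, and once the specified time period has elapsed the item's CTR is compared with the threshold to decide whether to show it in subsequent search results.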
在本发明实施例中,在接收到用户在搜索引擎上输入的与游戏类相关的目标搜索词时,先判断目标搜索词是否命中预设游戏标识,如果是,则在从UGC网站中抓取的数据组成的结构化的UGC游戏资讯数据库中查找与目标搜索词匹配的数据的资讯内容,并将从结构化的UGC游戏资讯数据库中查找到的数据的资讯内容聚合至目标搜索词对应的搜索结果页展现给用户。由此可见,在本发明实施例提供的技术方案中,能够在搜索结果页中聚合UGC网站的游戏类数据的资讯内容,从而可以为用户提供更全面的数据的资讯内容,扩宽内容覆盖面。并且,由于UGC游戏资讯数据库具有游戏类预设词和数据的资讯内容的资讯属性的结构化特点,具有可读性,能够帮助用户快速地找到需要的信息。进一步地,UGC游戏资讯数据库来自各个UGC网站,将各个UGC网站中的数据前置到搜索结果页中进行展现,无需用户通过多次操作去网站查找相关数据的资讯内容,降低了用户的检索成本。
需要说明的是,实际应用中,上述所有可选实施方式可以采用结合的方式任意组合,形成本发明的可选实施例,在此不再一一赘述。
基于上文各个实施例提供的在搜索结果页中聚合游戏类资讯信息的方法,基于同一发明构思,本发明实施例还提供了一种在搜索结果页中聚合游戏类资讯信息的装置。
图11示出了根据本发明一实施例的游戏搜索词的搜索装置的结构示意图。如图11所示,该装置至少可以包括第一判断模块1110、抓取模块1120、存储模块1130、响应模块1140、搜索模块1150以及展现模块1160。
现介绍本发明实施例的在搜索结果页中聚合游戏类资讯信息的装置的各组成或器件的功能以及各部分间的连接关系:
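The first determination module 1110 is configured to determine whether a predetermined plurality of user generated content (UGC) websites contain information items related to N preset game identifiers; the crawling module 1120 is configured to crawl, according to the determination result, data related to the N preset game identifiers from the one or more UGC websites that contain such information items; the storage module 1130 is configured to process the crawled data related to the N preset game identifiers to obtain a UGC game information database, where each record in the UGC game information database includes at least a keyword, information content and attributes; the response module 1140 is configured to respond to a target search term entered by a user in a search engine and determine whether the target search term is a game search term; the search module 1150 is configured to, when the target search term is determined to be a game search term, look up data matching the target search term in the UGC game information database while searching the Internet for the target search term; and the presentation module 1160 is configured to aggregate the information content of the data matching the target search term into the search result page corresponding to the target search term for presentation.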
在本发明实施例的一个可选实施方案中,所述存储模块1130具体用于按照以下方式对抓取的与N个预设游戏标识相关的数据进行处理,得到UGC游戏资讯数据库:存储抓取的每条数据,并按照所述抓取的每条数据的一个或多个资讯属性进行排序,得到所述UGC游戏资讯数据库。其中,每条资讯信息的资讯属性可以包括内容发布时间、内容的评论数等,即在UGC游戏资讯数据库可以按照资讯的时效性和/或热度进行排序,以提高搜索效率。
在本发明实施例的一个可选实施方案中,如图12所示,该装置还可以包括:更新模块1170,用于对于所述UGC游戏资讯数据库中资讯内容包含直播视频的数据项,周期性地检测直播视频是否结束,在检测到直播视频结束的情况下,将对应的数据项从所述UGC游戏资讯库中删除。通过该可选实施方式,可以保证UGC游戏资讯数据库中的数据的有效性。
另外,在本实施例中,可以针对不同类型的UGC网站,采用不同的抓取策略。
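In an optional implementation of the embodiments of the present invention, for UGC websites of the professional information publishing platform type, the crawling module 1120 is specifically configured to crawl data related to the N preset game identifiers from the one or more UGC websites that contain information items related to the N preset game identifiers as follows:
enter each of the N preset game identifiers in the search box of the professional information publishing platform UGC website and crawl, from the search results in order of publication time, the information related to each of the N preset game identifiers; or,
tag game information among the information published on the professional information publishing platform UGC website, and crawl from the tagged game information the information related to the N preset game identifiers.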
在本发明实施例的一个可选实施方案中,对于网络主题社区类的UGC网站,所述抓取模块1120具体用于按照以下方式从存在与N个预设游戏标识相关的资讯项的一个或多个UGC网站中,抓取与所述N个预设游戏标识相关的数据:
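For each of the N preset game identifiers, determine the themed communities related to that identifier in the online themed community UGC website, select M themed communities from among them, and crawl from the M communities the information whose title or body text contains that identifier. For example, in 兴趣部落, for each preset game word in the preset game word list, such as 魔兽世界, first determine how many tribes exist for the target game (for example, the 魔兽 community), then select the largest tribe (for example, by level of attention) and crawl the information whose title or body text contains the keyword (for example, 魔兽).
In an optional implementation of the embodiments of the present invention, for UGC websites of the online Q&A community type, the crawling module 1120 is specifically configured to crawl data related to the N preset game identifiers from the one or more UGC websites that contain information items related to the N preset game identifiers as follows: obtain, from the online Q&A community UGC website, the information whose question category is games; and determine whether that information contains one or more of the N preset game identifiers and, if so, crawl it as data related to the N preset game identifiers. For example, first determine whether the category of a question posted on 知乎 relates to games; if so, further determine whether the question and its answers contain a preset game word from the preset game word list; if they do, crawl the question and its answers as information related to that preset game word.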
在本发明实施例的一个可选实施方案中,所述展现模块1160具体用于按照以下方式将与所述目标搜索词匹配的数据的资讯内容聚合至所述目标搜索词对应的搜索结果页展现:在所述搜索结果页的左侧展现从互联网上搜索所述目标搜索词的结果;判断所述UGC游戏资讯数据库中与所述目标搜索词匹配的数据中是否有与所述搜索结果页左侧展现的结果中相同的数据,如果有,则将所述相同的数据去除;将去除所述相同的数据后的与目标搜索词匹配的数据的资讯内容聚合至所述目标搜索词对应的搜索结果页的右侧区域展现。
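That is, in this optional implementation the search result page includes two areas, a left area and a right area. In this embodiment the left area presents the results that the search engine obtains by searching the Internet for the target search term, like the content shown on the left of the result pages of search engines such as baidu and google today, and the right area presents the results found in the UGC game information database. This expands the content of the right area of the search result page and gives the user more complete search results.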
在本发明实施例的一个可选实施方案中,如图12所示,该装置还可以包括:统计模块1180,用于在将与所述目标搜索词匹配的数据的资讯内容聚合至所述目标搜索词对应的搜索结果页展现给用户之后,统计用户针对所述搜索结果页上展现的与所述目标搜索词匹配的数据的资讯内容的触发操作,得到统计结果;第二判断模块1190,用于根据所述统计结果确定在后续搜索请求对应的页面中是否展现与所述目标搜索词匹配的数据的资讯内容。
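The user's trigger operations on the matching information presented on the search result page may be measured as the CTR (Click-Through Rate) of the presented matching information; that is, the click-through rate of an item from the UGC game information database after it has been presented determines whether that item is still shown on the right side of the search page when it is found again later.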
进一步地,在上述可选实施方式中,所述第二判断模块1190具体用于按照以下方式确定在后续搜索请求对应的页面中是否展现与所述目标搜索词匹配的数据的资讯内容:在所述统计结果为所述触发操作的数量小于指定阈值的情况下,确定在后续搜索请求对应的页面中不再展现与所述目标搜索词匹配的数据的资讯内容。
在此处所提供的说明书中,说明了大量具体细节。然而,能够理解,本发明的实施例可以在没有这些具体细节的情况下实践。在一些实例中,并未详细示出公知的方法、结构和技术,以便不模糊对本说明书的理解。
类似地,应当理解,为了精简本公开并帮助理解各个发明方面中的一个或多个,在上面对本发明的示例性实施例的描述中,本发明的各个特征有时被一起分组到单个实施例、图、或者对其的描述中。然而,并不应将该公开的方法解释成反映如下意图:即所要求保护的本发明要求比在每个权利要求中所明确记载的特征更多的特征。更确切地说,如下面的权利要求书所反映的那样,发明方面在于少于前面公开的单个实施例的所有特征。因此,遵循具体实施方式的权利要求书由此明确地并入该具体实施方式,其中每个权利要求本身都作为本发明的单独实施例。
本领域那些技术人员可以理解,可以对实施例中的设备中的模块进行自适应性地改变并且把它们设置在与该实施例不同的一个或多个设备中。可以把实施例中的模块或单元或组件组合成一个模块或单元或组件,以及此外可以把它们分成多个子模块或子单元或子组件。除了这样的特征和/或过程或者单元中的至少一些是相互排斥之外,可以采用任何组合对本说明书(包括伴随的权利要求、摘要和附图)中公开的所有特征以及如此公开的任何方法或者设备的所有过程或单元进行组合。除非另外明确陈述,本说明书(包括伴随的权利要求、摘要和附图)中公开的每个特征可以由提供相同、等同或相似目的的替代特征来代替。
此外,本领域的技术人员能够理解,尽管在此所述的一些实施例包括其它实施例中所包括的某些特征而不是其它特征,但是不同实施例的特征的组合意味着处于本发明的范围之内并且形成不同的实施例。例如,在下面的权利要求书中,所要求保护的实施例的任意之一都可以以任意的组合方式来使用。
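The component embodiments of the present invention may be implemented in hardware, in software modules running on one or more processors, or in a combination of the two. Those skilled in the art will understand that a microprocessor or a digital signal processor (DSP) may be used in practice to implement some or all of the functions of some or all of the components of the device for pushing search results for a variety-show query, the film and television keyword search and presentation device, or the search device for game search terms according to the embodiments of the present invention. The present invention may also be implemented as device or apparatus programs (for example, computer programs and computer program products) for executing part or all of the methods described herein. Such programs implementing the present invention may be stored on a computer-readable medium or may take the form of one or more signals. Such signals may be downloaded from an Internet website, provided on a carrier signal, or provided in any other form.
For example, Figure 13 shows a computing device that can implement the method for pushing search results for a variety-show query, the film and television keyword search and presentation method, or the search method for game search terms according to the present invention. The computing device conventionally includes a processor 1310 and a computer program product or computer-readable medium in the form of a memory 1320. The memory 1320 may be an electronic memory such as flash memory, EEPROM (electrically erasable programmable read-only memory), EPROM, a hard disk or ROM. The memory 1320 has a storage space 1330 for program code 1331 for executing any of the method steps described above. For example, the storage space 1330 may store individual program codes 1331 that each implement one of the steps of the above methods. These program codes can be read from or written to one or more computer program products. Such computer program products include program code carriers such as hard disks, compact discs (CD), memory cards or floppy disks, and are typically portable or fixed storage units such as the one shown in Figure 14. The storage unit may have storage segments, storage spaces and the like arranged similarly to the memory 1320 in the computing device of Figure 13. The program code may, for example, be compressed in a suitable form. Typically, the storage unit stores computer-readable code 1331’ for executing the method steps of the present invention, that is, code that can be read by a processor such as 1310; when run by a computing device, this code causes the computing device to execute the steps of the methods described above.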
本文中所称的“一个实施例”、“实施例”或者“一个或者多个实施例”意味着,结合实施例描述的特定特征、结构或者特性包括在本发明的至少一个实施例中。此外,请注意,这里“在一个实施例中”的词语例子不一定全指同一个实施例。
应该注意的是上述实施例对本发明进行说明而不是对本发明进行限制,并且本领域技术人员在不脱离所附权利要求的范围的情况下可设计出替换实施例。在权利要求中,不应将位于括号之间的任何参考符号构造成对权利要求的限制。单词“包含”不排除存在未列在权利要求中的元件或步骤。位于元件之前的单词“一”或“一个”不排除存在多个这样的元件。本发明可以借助于包括有若干不同元件的硬件以及借助于适当编程的计算机来实现。在列举了若干装置的单元权利要求中,这些装置中的若干个可以是通过同一个硬件项来具体体现。单词第一、第二、以及第三等的使用不表示任何顺序。可将这些单词解释为名称。
此外,还应当注意,本说明书中使用的语言主要是为了可读性和教导的目的而选择的,而不是为了解释或者限定本发明的主题而选择的。因此,在不偏离所附权利要求书的范围和精神的情况下,对于本技术领域的普通技术人员来说许多修改和变更都是显而易见的。对于本发明的范围,对本发明所做的公开是说明性的,而非限制性的,本发明的范围由所附权利要求书限定。

Claims (62)

  1. 一种针对综艺类query的搜索结果的推送方法,包括:
    从指定UGC网站中抓取综艺类UGC搜索结果项,所述指定UGC网站中包括大数据级别的UGC搜索结果项;
    从所述综艺类UGC搜索结果项对应的综艺类UGC内容中筛选出多个综艺类关键词,并集合所述多个综艺类关键词,以创建用于综艺类query的综艺类关键词词表;
    当接收到针对所述综艺类关键词词表中的任一综艺类关键词的搜索请求时,从所述综艺类UGC搜索结果项中获取与所述综艺类关键词相匹配的优质UGC搜索结果项;
    将与所述综艺类关键词相匹配的优质UGC搜索结果项按照其自身显示形式推送至搜索结果页,所述优质UGC搜索结果项的自身显示形式包括以综艺类视频截图为封面的图片显示形式、以综艺类UGC内容的标题为链接的文字显示形式中的至少一种。
  2. 根据权利要求1所述的方法,其中,所述从指定UGC网站中抓取综艺类UGC搜索结果项,包括以下至少一项:
    获取综艺类的资讯频道路径,并根据所述资讯频道路径从指定UGC网站中抓取综艺类UGC搜索结果项;
    提取所述指定UGC网站中的各UGC搜索结果项对应的标题中的关键词;根据所述关键词确定所述各UGC搜索结果项的类型;根据所述类型从所述UGC搜索结果项中抓取综艺类UGC搜索结果项;
    提取包含以下至少一项UGC内容对应的UGC搜索结果项为所述综艺类UGC搜索结果项:娱乐八卦、综艺资讯、综艺节目点评。
  3. 根据权利要求1或2所述的方法,其中,从所述综艺类UGC搜索结果项中获取与所述综艺类关键词相匹配的优质UGC搜索结果项,包括:
    从所述综艺类UGC搜索结果项中抓取与所述综艺类关键词相匹配的、且符合预设条件的综艺类UGC搜索结果项,作为与所述综艺类关键词相匹配的优质UGC搜索结果项。
  4. 根据权利要求1-3中任一项所述的方法,其中,所述预设条件包括以下至少一项:
    UGC视频片段;
    UGC内容中包含尚未播出的综艺节目内容;
    UGC内容中包含与所述综艺类关键词相关的人物资讯。
  5. 根据权利要求1-4中任一项所述的方法,其中,当所述优质UGC搜索结果项包括多个时,将与所述综艺类关键词相匹配的优质UGC搜索结果项按照其自身显示形式推送至搜索结果页,包括:
    按照预设排序元素对所述多个优质UGC搜索结果项进行排序,所述预设排序元素包括各优质UGC搜索结果项的发布时间、评论数、最近一次的评论时间中的至少一项;
    将所述排序后的多个优质UGC搜索结果项按照其自身显示形式推送至搜索结果页。
  6. 根据权利要求1-5中任一项所述的方法,其中,所述优质UGC搜索结果项包括视频项;将与所述综艺类关键词相匹配的优质UGC搜索结果项按照其自身显示形式推送至搜索结果页,包括:
    将所述视频项对应的封面图片推送至所述搜索结果页的第一位置处;
    在所述封面图片上的对应位置提供用于播放所述视频项对应视频的第一标识;
    当接收到对所述第一标识的触发操作时,进入所述视频项对应的视频页面,并播放所述视频。
  7. 根据权利要求1-6中任一项所述的方法,其中,当所述视频项包括多个时,将所述视频项按照其自身显示形式推送至所述搜索结果页的第一位置处,包括:
    在所述搜索结果页的第一位置处以轮播的方式显示所述多个视频项分别对应的封面图片。
  8. 根据权利要求1-7中任一项所述的方法,其中,所述优质UGC搜索结果项包括文字链接项;将与所述综艺类关键词相匹配的优质UGC搜索结果项按照其自身显示形式推送至搜索结果页,包括:
    将所述文字链接项推送至所述搜索结果页的第二位置处;
    当接收到对所述文字链接项的触发操作时,进入并显示所述文字链接项对应的UGC内容。
  9. 根据权利要求1-8中任一项所述的方法,其中,将与所述综艺类关键词相匹配的优质UGC搜索结果项按照其自身显示形式推送至搜索结果页之后,所述方法还包括:
    统计所述优质UGC搜索结果项的点击率;
    当所述优质UGC搜索结果项的点击率低于预设点击率时,取消在所述搜索结果页上对所述优质UGC搜索结果项的推送操作;
    监测所述被取消显示的优质UGC搜索结果项是否被更新;
    若是,则重新将所述更新后的优质UGC搜索结果项推送至搜索结果页。
  10. 根据权利要求1-9中任一项所述的方法,其中,将与所述综艺类关键词相匹配的优质UGC搜索结果项按照其自身显示形式推送至搜索结果页,包括:
    将与所述综艺类关键词相匹配的优质UGC搜索结果项按照其自身显示形式推送至所述搜索结果页的相关推荐区域。
  11. 一种针对综艺类query的搜索结果的推送装置,包括:
    抓取模块,适于从指定UGC网站中抓取综艺类UGC搜索结果项,所述指定UGC网站中包括大数据级别的UGC搜索结果项;
    创建模块,适于从所述综艺类UGC搜索结果项对应的综艺类UGC内容中筛选出多个综艺类关键词,并集合所述多个综艺类关键词,以创建用于综艺类query的综艺类关键词词表;
    获取模块,适于当接收到针对所述综艺类关键词词表中的任一综艺类关键词的搜索请求时,从所述综艺类UGC搜索结果项中获取与所述综艺类关键词相匹配的优质UGC搜索结果项;
    第一推送模块,适于将与所述综艺类关键词相匹配的优质UGC搜索结果项按照其自身显示形式推送至搜索结果页,所述优质UGC搜索结果项的自身显示形式包括以综艺类视频截图为封面的图片显示形式、以综艺类UGC内容的标题为链接的文字显示形式中的至少一种。
  12. 根据权利要求11所述的装置,其中,所述抓取模块还适于以下至少一项:
    获取综艺类的资讯频道路径,并根据所述资讯频道路径从指定UGC网站中抓取综艺类UGC搜索结果项;
    提取所述指定UGC网站中的各UGC搜索结果项对应的标题中的关键词;根据所述关键词确定所述各UGC搜索结果项的类型;根据所述类型从所述UGC搜索结果项中抓取综艺类UGC搜索结果项;
    提取包含以下至少一项UGC内容对应的UGC搜索结果项为所述综艺类UGC搜索结果项:娱乐八卦、综艺资讯、综艺节目点评。
  13. 根据权利要求11或12所述的装置,其中,所述获取模块还适于:
    从所述综艺类UGC搜索结果项中抓取与所述综艺类关键词相匹配的、且符合预设条件的综艺类UGC搜索结果项,作为与所述综艺类关键词相匹配的优质UGC搜索结果项。
  14. 根据权利要求11-13中任一项所述的装置,其中,所述预设条件包括以下至少一项:
    UGC视频片段;
    UGC内容中包含尚未播出的综艺节目内容;
    UGC内容中包含与所述综艺类关键词相关的人物资讯。
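  15. The device according to any one of claims 11-14, wherein the first push module is further adapted to:
    when there are multiple high-quality UGC search result items, sort the multiple high-quality UGC search result items by preset sorting elements, the preset sorting elements including at least one of the publication time, the number of comments and the time of the latest comment of each high-quality UGC search result item;
    push the sorted high-quality UGC search result items to the search result page in their own display form.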
  16. 根据权利要求11-15中任一项所述的装置,其中,所述优质UGC搜索结果项包括视频项;所述第一推送模块还适于:
    将所述视频项对应的封面图片推送至所述搜索结果页的第一位置处;
    在所述封面图片上的对应位置提供用于播放所述视频项对应视频的第一标识;
    当接收到对所述第一标识的触发操作时,进入所述视频项对应的视频页面,并播放所述视频。
  17. 根据权利要求11-16中任一项所述的装置,其中,所述第一推送模块还适于:
    当所述视频项包括多个时,在所述搜索结果页的第一位置处以轮播的方式显示所述多个视频项分别对应的封面图片。
  18. 根据权利要求11-17中任一项所述的装置,其中,所述优质UGC搜索结果项包括文字链接项;所述第一推送模块还适于:
    将所述文字链接项推送至所述搜索结果页的第二位置处;
    当接收到对所述文字链接项的触发操作时,进入并显示所述文字链接项对应的UGC内容。
  19. 根据权利要求11-18中任一项所述的装置,其中,所述装置还包括:
    统计模块,适于将与所述综艺类关键词相匹配的优质UGC搜索结果项推送至搜索结果页之后,统计所述优质UGC搜索结果项的点击率;
    取消模块,适于当所述优质UGC搜索结果项的点击率低于预设点击率时,取消在所述搜索结果页上对所述优质UGC搜索结果项的推送操作;
    监测模块,适于监测所述被取消显示的优质UGC搜索结果项是否被更新;
    第二推送模块,适于若监测到所述被取消显示的优质UGC搜索结果项被更新,则重新将所述更新后的优质UGC搜索结果项推送至搜索结果页。
  20. 根据权利要求11-19中任一项所述的装置,其中,所述第一推送模块还适于:
    将与所述综艺类关键词相匹配的优质UGC搜索结果项按照其自身显示形式推送至所述搜索结果页的相关推荐区域。
  21. 一种影视剧类关键词搜索展现方法,包括:
    确定N个影视剧关键词,其中,N为整数,且N大于1;
    从预定的一个或多个用户生成内容UGC网站中分别获取与所述N个影视剧关键词中的各个影视剧关键词相关的资讯信息和/或视频;
    将获取的与所述各个影视剧关键词相关的资讯信息和/或视频存储到影视剧类资讯内容数据库中;
    响应用户在搜索引擎上输入的目标搜索词,从互联网中搜索与所述目标搜索词匹配的结果,并在所述影视剧类资讯内容数据库中查找与所述目标搜索词匹配的资讯信息和/或视频;
    在从所述影视剧类资讯内容数据库中查找到与所述目标搜索词匹配的资讯信息和/或视频的情况下,将查找到的所述资讯信息和/或视频聚合至所述目标搜索词对应的搜索结果页展现给用户。
  22. 根据权利要求21所述的方法,其中,将获取的与所述各个影视剧关键词相关的资讯信息和/或视频存储到影视剧类资讯内容数据库中,包括:
    按照获取的每条所述资讯信息和/或视频相关的影视剧关键词进行分类存储到所述影视剧类资讯内容数据库中,并根据每条资讯信息和/或视频的内容属性对分类存储的每条资讯信息和/或视频进行排序。
  23. 根据权利要求21或22所述的方法,其中,所述内容属性包括以下至少之一:内容的类型、内容发布时间、内容的评论数、以及内容的查看数。
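  24. The method according to any one of claims 21 to 23, wherein, when a video matching the target search term is found in the film and television information content database, aggregating the found information and/or videos into the search result page corresponding to the target search term and presenting them to the user comprises:
    playing the found video on the search result page, and displaying text links to the found information and/or videos on the search result page.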
  25. 根据权利要求21至24任一项所述的方法,其中,对于专业信息发布平台类的UGC网站,从预定的一个或多个用户生成内容UGC网站中分别获取与所述N个影视剧关键词中的各个影视剧关键词相关的资讯信息和/或视频,包括:
    在所述专业信息发布平台类的UGC网站分别搜索所述N个影视剧关键词,从搜索结果中从所述专业信息发布平台类的UGC网站提取与所述N个影视剧关键词相关的资讯信息和/或视频;或者,
    在所述专业信息发布平台类的UGC网站发布的资讯信息中标注影视剧类的资讯和视频,从标注的影视剧类资讯和视频中分别提取与所述N个影视剧关键词相关的资讯信息和/或视频。
  26. 根据权利要求21至25任一项所述的方法,其中,对于网络主题社区类的UGC网站,从预定的一个或多个用户生成内容UGC网站中分别获取与所述N个影视剧关键词中的各个影视剧关键词相关的资讯信息和/或视频,包括:
    在所述网络主题社区类的UGC网站中分别确定与所述N个影视剧关键词中的每个所述影视剧关键词相关的主题社区,从所述相关的主题社区中选择最大的一个或多个主题社区,在所述一个或多个主题社区发布的资讯的名称title或正文中搜索所述N个影视剧关键词,根据搜索结果,从所述一个或多个主题社区中提取与所述N个影视剧关键词相关的资讯信息和/或视频。
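  27. The method according to any one of claims 21 to 26, wherein, for UGC websites of the online Q&A community type, obtaining information and/or videos related to each of the N film and television keywords from the predetermined one or more user generated content (UGC) websites comprises:
    obtaining the information on the online Q&A community UGC website whose question category relates to film and television;
    finding, among the information whose question category relates to film and television, the items whose title and/or body text contains one or more of the N film and television keywords;
    extracting from the items found the information and/or videos related to the N film and television keywords.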
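  28. The method according to any one of claims 21 to 27, wherein aggregating the found information and/or videos into the search result page corresponding to the target search term and presenting them to the user comprises:
    displaying, on the left side of the search result page, the results of searching the Internet for the target search term;
    determining whether the found information and/or videos include any item identical to a result shown on the left side of the search result page, and if so, removing the identical items from the found information and/or videos;
    aggregating the found information and/or videos, after the identical items have been removed, into the right-hand area of the search result page corresponding to the target search term and presenting them to the user.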
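  29. The method according to any one of claims 21 to 28, wherein, after the found information and/or videos are aggregated into the search result page corresponding to the target search term and presented to the user, the method further comprises:
    counting the user's trigger operations on each item of found information and/or video presented on the search result page to obtain a statistical result;
    determining, according to the statistical result, whether to present each item of found information and/or video in the pages corresponding to subsequent search requests.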
  30. 根据权利要求21至29任一项所述的方法,其中,根据所述统计结果确定在后续搜索请求对应的页面中是否展现各个所述查找到的资讯信息和/或视频,包括:
    确定在后续搜索请求对应的页面中不再展现所述查找到的资讯信息和/或视频中,所述触发操作的数量小于指定阈值的资讯信息和/或视频。
  31. 一种影视剧类关键词搜索展现装置,包括:
    确定模块,用于确定N个影视剧关键词,其中,N为整数,且N大于1;
    获取模块,用于从预定的一个或多个用户生成内容UGC网站中分别获取与所述N个影视剧关键词中的各个影视剧关键词相关的资讯信息和/或视频;
    存储模块,用于将获取的与所述各个影视剧关键词相关的资讯信息和/或视频存储到影视剧类资讯内容数据库中;
    搜索模块,用于响应用户在搜索引擎上输入的目标搜索词,从互联网中搜索与所述目标搜索词匹配的结果,并在所述影视剧类资讯内容数据库中查找与所述目标搜索词匹配的资讯信息和/或视频;
    展现模块,用于在从所述影视剧类资讯内容数据库中查找到与所述目标搜索词匹配的资讯信息和/或视频的情况下,将查找到的所述资讯信息和/或视频聚合至所述目标搜索词对应的搜索结果页展现给用户。
  32. 根据权利要求31所述的装置,其中,所述获取模块具体用于按照以下方式将获取的与所述各个影视剧关键词相关的资讯信息和/或视频存储到影视剧类资讯内容数据库中:
    按照获取的每条所述资讯信息和/或视频相关的影视剧关键词进行分类存储到所述影视剧类资讯内容数据库中,并根据每条资讯信息和/或视频的内容属性对分类存储的每条资讯信息和/或视频进行排序。
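  33. The device according to claim 31 or 32, wherein, when a video matching the target search term is found in the film and television information content database, the presentation module is specifically configured to aggregate the found information and/or videos into the search result page corresponding to the target search term and present them to the user as follows:
    play the found video on the search result page, and display text links to the found information and/or videos on the search result page.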
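  34. The apparatus according to any one of claims 31 to 33, wherein, for a UGC website of the professional information publishing platform type, the obtaining module is specifically configured to obtain news information and/or videos related to each of the N film and television drama keywords as follows:
    searching the UGC website of the professional information publishing platform type for each of the N film and television drama keywords, and extracting, from the search results, news information and/or videos related to the N film and television drama keywords; or
    tagging film and television drama news and videos among the news information published by the UGC website of the professional information publishing platform type, and extracting, from the tagged film and television drama news and videos, news information and/or videos related to each of the N film and television drama keywords.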
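  35. The apparatus according to any one of claims 31 to 34, wherein, for a UGC website of the online topic community type, the obtaining module is specifically configured to obtain news information and/or videos related to each of the N film and television drama keywords as follows:
    determining, in the UGC website of the online topic community type, the topic communities related to each of the N film and television drama keywords; selecting the largest one or more topic communities from the related topic communities; searching for the N film and television drama keywords in the titles or body text of the news published in the one or more topic communities; and extracting, according to the search results, news information and/or videos related to the N film and television drama keywords from the one or more topic communities.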
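  36. The apparatus according to any one of claims 31 to 35, wherein, for a UGC website of the online Q&A community type, the obtaining module is specifically configured to obtain news information and/or videos related to each of the N film and television drama keywords as follows:
    obtaining, from the UGC website of the online Q&A community type, news whose question category is related to film and television dramas;
    finding, among the news whose question category is related to film and television dramas, items whose title and/or body text contains one or more of the N film and television drama keywords;
    extracting, from the items found, news information and/or videos related to the N film and television drama keywords.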
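  37. The apparatus according to any one of claims 31 to 36, wherein the presentation module is specifically configured to aggregate the found news information and/or videos into the search result page corresponding to the target search term for presentation to the user as follows:
    displaying, on the left side of the search result page, the results of searching the Internet for the target search term;
    determining whether the found news information and/or videos include any that are identical to the results presented on the left side of the search result page, and if so, removing the identical news information and/or videos from the found news information and/or videos;
    aggregating the found news information and/or videos remaining after the removal into the right-hand area of the search result page corresponding to the target search term for presentation to the user.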
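  38. The apparatus according to any one of claims 31 to 37, further comprising:
    a statistics module configured to, after the found news information and/or videos are aggregated into the search result page corresponding to the target search term and presented to the user, collect statistics on the user's trigger operations on each of the found news information and/or videos presented on the search result page, to obtain a statistical result;
    a judging module configured to determine, according to the statistical result, whether to present each of the found news information and/or videos in pages corresponding to subsequent search requests.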
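  39. The apparatus according to any one of claims 31 to 38, wherein the judging module is specifically configured to determine whether to present each of the found news information and/or videos in pages corresponding to subsequent search requests as follows:
    determining that those of the found news information and/or videos for which the number of trigger operations is less than a specified threshold are no longer presented in pages corresponding to subsequent search requests.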
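  40. The apparatus according to any one of claims 31 to 39, wherein the content attributes comprise at least one of: the type of the content, the publication time of the content, the number of comments on the content, and the number of views of the content.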
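  41. A method for searching game search terms, comprising:
    determining whether news items related to N preset game identifiers exist in a plurality of predetermined user-generated content (UGC) websites, where N is an integer greater than 1;
    crawling, according to the determination result, data related to the N preset game identifiers from the one or more UGC websites in which news items related to the N preset game identifiers exist;
    processing the crawled data related to the N preset game identifiers to obtain a UGC game news database, wherein each data entry in the UGC game news database comprises at least: a keyword, news content, and attributes;
    in response to a target search term entered by a user in a search engine, determining whether the target search term is a game search term;
    when the target search term is determined to be a game search term, looking up data matching the target search term in the UGC game news database while searching the Internet for the target search term;
    aggregating the news content of the data matching the target search term into the search result page corresponding to the target search term for presentation.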
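The query-time half of claim 41 can be outlined as below. This is a rough sketch under several assumptions: a query counts as a game search term if it contains a preset game identifier, a database entry matches if its keyword appears in the query, and `web_search` is a hypothetical callable standing in for the ordinary Internet search. The claim describes the two lookups as happening at the same time; the sketch runs them one after the other for simplicity.

```python
def handle_query(query, game_ids, ugc_db, web_search):
    """Return ordinary web results plus matching UGC game content for a query."""
    web_results = web_search(query)
    # Assumed classifier: the query mentions one of the preset game identifiers.
    is_game_query = any(game_id in query for game_id in game_ids)
    ugc_content = []
    if is_game_query:
        # Each entry carries at least a keyword, news content, and attributes.
        ugc_content = [row["content"] for row in ugc_db if row["keyword"] in query]
    return {"web": web_results, "ugc": ugc_content}
```

Non-game queries skip the database lookup entirely, so they pay no extra cost.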
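  42. The method according to claim 41, wherein processing the crawled data related to the N preset game identifiers to obtain the UGC game news database comprises:
    storing each crawled data entry, and sorting the crawled data entries according to one or more news attributes of each entry, to obtain the UGC game news database.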
  43. The method according to claim 41 or 42, wherein the news attributes comprise: publication time and/or number of comments.
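  44. The method according to any one of claims 41 to 43, wherein, after the UGC game news database is obtained, the method further comprises:
    for data entries in the UGC game news database whose news content includes a live video, periodically checking whether the live video has ended, and, when the live video is detected to have ended, deleting the corresponding data entry from the UGC game news database.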
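One pass of the live-video cleanup in claim 44 might be written as follows. The `is_live` flag and the `stream_has_ended` callback are hypothetical; how the end of a stream is detected is not described in the claim, and the periodic scheduling (a timer or cron-style job calling this function) is left to the caller.

```python
def purge_ended_streams(db, stream_has_ended):
    """Return the database without live-video entries whose stream has ended."""
    return [
        row
        for row in db
        # Only entries flagged as live video are probed; others are always kept.
        if not (row.get("is_live") and stream_has_ended(row))
    ]
```

Returning a new list rather than deleting in place keeps the pass free of mutation during iteration.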
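  45. The method according to any one of claims 41 to 44, wherein, for a UGC website of the professional information publishing platform type, crawling data related to the N preset game identifiers from the one or more UGC websites in which news items related to the N preset game identifiers exist comprises:
    entering each of the N preset game identifiers in the search box of the UGC website of the professional information publishing platform type, and crawling from the search results, by publication time, news information related to each of the N preset game identifiers; or
    tagging game news among the news information published by the UGC website of the professional information publishing platform type, and crawling, from the tagged game news, news information related to the N preset game identifiers.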
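  46. The method according to any one of claims 41 to 45, wherein, for a UGC website of the online topic community type, crawling data related to the N preset game identifiers from the one or more UGC websites in which news items related to the N preset game identifiers exist comprises:
    for each of the N preset game identifiers, determining, in the UGC website of the online topic community type, the topic communities related to that preset game identifier, selecting M topic communities from the topic communities related to that preset game identifier, and crawling, from the M topic communities, news information whose title or body text contains that preset game identifier.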
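  47. The method according to any one of claims 41 to 46, wherein, for a UGC website of the online Q&A community type, crawling data related to the N preset game identifiers from the one or more UGC websites in which news items related to the N preset game identifiers exist comprises:
    obtaining, from the UGC website of the online Q&A community type, news information whose question category is games;
    determining whether the news information whose question category is games contains one or more of the N preset game identifiers, and if so, crawling that news information as data related to the N preset game identifiers.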
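The two-stage filter of claim 47 (first by question category, then by game identifier) can be sketched like this. The dictionary layout of a question and the `"game"` category label are assumptions chosen for the example.

```python
def crawl_qa_site(questions, game_ids, category="game"):
    """Keep questions in the game category that mention a preset game identifier."""
    hits = []
    for question in questions:
        # Stage 1: keep only questions posted under the game category.
        if question["category"] != category:
            continue
        # Stage 2: look for any preset game identifier in the title or body.
        text = question["title"] + " " + question["body"]
        if any(game_id in text for game_id in game_ids):
            hits.append(question)
    return hits
```

Filtering by category first avoids false matches such as a film question that happens to mention a game's name.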
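  48. The method according to any one of claims 41 to 47, wherein aggregating the news content of the data matching the target search term into the search result page corresponding to the target search term comprises:
    presenting, on the left side of the search result page, the results of searching the Internet for the target search term;
    determining whether the data in the UGC game news database matching the target search term includes any data identical to the results presented on the left side of the search result page, and if so, removing the identical data;
    aggregating the news content of the data matching the target search term that remains after the removal into the right-hand area of the search result page corresponding to the target search term.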
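  49. The method according to any one of claims 41 to 48, wherein, after aggregating the news content of the data matching the target search term into the search result page corresponding to the target search term for presentation, the method further comprises:
    collecting statistics on the user's trigger operations on the news content of the data matching the target search term presented on the search result page, to obtain a statistical result;
    determining, according to the statistical result, whether to present the news content of the data matching the target search term in pages corresponding to subsequent search requests.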
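  50. The method according to any one of claims 41 to 49, wherein determining, according to the statistical result, whether to present the news content of the data matching the target search term in pages corresponding to subsequent search requests comprises:
    when the statistical result is that the number of trigger operations is less than a specified threshold, determining that the news content of the data matching the target search term is no longer presented in pages corresponding to subsequent search requests.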
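  51. An apparatus for searching game search terms, comprising:
    a first judging module configured to determine whether news items related to N preset game identifiers exist in a plurality of predetermined user-generated content (UGC) websites;
    a crawling module configured to crawl, according to the determination result, data related to the N preset game identifiers from the one or more UGC websites in which news items related to the N preset game identifiers exist;
    a storage module configured to process the crawled data related to the N preset game identifiers to obtain a UGC game news database, wherein each data entry in the UGC game news database comprises at least: a keyword, news content, and attributes;
    a response module configured to, in response to a target search term entered by a user in a search engine, determine whether the target search term is a game search term;
    a search module configured to, when the target search term is determined to be a game search term, look up data matching the target search term in the UGC game news database while searching the Internet for the target search term;
    a presentation module configured to aggregate the news content of the data matching the target search term into the search result page corresponding to the target search term for presentation.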
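  52. The apparatus according to claim 51, wherein the storage module is specifically configured to process the crawled data related to the N preset game identifiers to obtain the UGC game news database as follows:
    storing each crawled data entry, and sorting the crawled data entries according to one or more news attributes of each entry, to obtain the UGC game news database.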
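  53. The apparatus according to claim 51 or 52, further comprising:
    an updating module configured to, for data entries in the UGC game news database whose news content includes a live video, periodically check whether the live video has ended, and, when the live video is detected to have ended, delete the corresponding data entry from the UGC game news database.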
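  54. The apparatus according to any one of claims 51 to 53, wherein, for a UGC website of the professional information publishing platform type, the crawling module is specifically configured to crawl data related to the N preset game identifiers from the one or more UGC websites in which news items related to the N preset game identifiers exist as follows:
    entering each of the N preset game identifiers in the search box of the UGC website of the professional information publishing platform type, and crawling from the search results, by publication time, news information related to each of the N preset game identifiers; or
    tagging game news among the news information published by the UGC website of the professional information publishing platform type, and crawling, from the tagged game news, news information related to the N preset game identifiers.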
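  55. The apparatus according to any one of claims 51 to 54, wherein, for a UGC website of the online topic community type, the crawling module is specifically configured to crawl data related to the N preset game identifiers from the one or more UGC websites in which news items related to the N preset game identifiers exist as follows:
    for each of the N preset game identifiers, determining, in the UGC website of the online topic community type, the topic communities related to that game identifier, selecting M topic communities from the topic communities related to that preset game identifier, and crawling, from the M topic communities, news information whose title or body text contains that preset game identifier.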
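  56. The apparatus according to any one of claims 51 to 55, wherein, for a UGC website of the online Q&A community type, the crawling module is specifically configured to crawl data related to the N preset game identifiers from the one or more UGC websites in which news items related to the N preset game identifiers exist as follows:
    obtaining, from the UGC website of the online Q&A community type, news information whose question category is games;
    determining whether the news information whose question category is games contains one or more of the N preset game identifiers, and if so, crawling that news information as data related to the N preset game identifiers.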
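  57. The apparatus according to any one of claims 51 to 56, wherein the presentation module is specifically configured to aggregate the news content of the data matching the target search term into the search result page corresponding to the target search term for presentation as follows:
    presenting, on the left side of the search result page, the results of searching the Internet for the target search term;
    determining whether the data in the UGC game news database matching the target search term includes any data identical to the results presented on the left side of the search result page, and if so, removing the identical data;
    aggregating the news content of the data matching the target search term that remains after the removal into the right-hand area of the search result page corresponding to the target search term for presentation.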
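  58. The apparatus according to any one of claims 51 to 57, further comprising:
    a statistics module configured to, after the news content of the data matching the target search term is aggregated into the search result page corresponding to the target search term and presented to the user, collect statistics on the user's trigger operations on the news content of the data matching the target search term presented on the search result page, to obtain a statistical result;
    a second judging module configured to determine, according to the statistical result, whether to present the news content of the data matching the target search term in pages corresponding to subsequent search requests.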
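  59. The apparatus according to any one of claims 51 to 58, wherein the second judging module is specifically configured to determine whether to present the news content of the data matching the target search term in pages corresponding to subsequent search requests as follows:
    when the statistical result is that the number of trigger operations is less than a specified threshold, determining that the news content of the data matching the target search term is no longer presented in pages corresponding to subsequent search requests.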
  60. The method according to any one of claims 51 to 59, wherein the news attributes comprise: publication time and/or number of comments.
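  61. A computer program comprising computer-readable code which, when run on a computing device, causes the computing device to perform any one of the following:
    the method for pushing search results for variety-show queries according to any one of claims 1-10;
    the method for searching and presenting film and television drama keywords according to any one of claims 21-30;
    the method for searching game search terms according to any one of claims 41-50.
  62. A computer-readable medium storing the computer program according to claim 61.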
PCT/CN2017/117220 2016-12-23 2017-12-19 针对综艺类query的搜索结果的推送方法及装置 WO2018113673A1 (zh)

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
CN201611209249.9A CN106777206A (zh) 2016-12-23 2016-12-23 影视剧类关键词搜索展现方法及装置
CN201611209249.9 2016-12-23
CN201611209280.2A CN106649737B (zh) 2016-12-23 2016-12-23 针对综艺类query的搜索结果的推送方法及装置
CN201611209248.4A CN106777205A (zh) 2016-12-23 2016-12-23 游戏类搜索词的搜索方法及装置
CN201611209248.4 2016-12-23
CN201611209280.2 2016-12-23

Publications (1)

Publication Number Publication Date
WO2018113673A1 true WO2018113673A1 (zh) 2018-06-28

Family

ID=62624518

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/117220 WO2018113673A1 (zh) 2016-12-23 2017-12-19 针对综艺类query的搜索结果的推送方法及装置

Country Status (1)

Country Link
WO (1) WO2018113673A1 (zh)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103678694A (zh) * 2013-12-26 2014-03-26 乐视网信息技术(北京)股份有限公司 视频资源的倒排索引文件建立方法及其系统
CN104008139A (zh) * 2014-05-08 2014-08-27 北京奇艺世纪科技有限公司 视频索引表的创建方法和装置,视频的推荐方法和装置
CN104636398A (zh) * 2013-11-15 2015-05-20 腾讯科技(北京)有限公司 搜索用户生成内容的方法、装置、服务器和系统
CN104765885A (zh) * 2015-04-29 2015-07-08 北京奇艺世纪科技有限公司 一种ugc内容库扩充方法及装置
CN104978332A (zh) * 2014-04-04 2015-10-14 腾讯科技(深圳)有限公司 用户生成内容标签数据生成方法、装置及相关方法和装置
CN106649737A (zh) * 2016-12-23 2017-05-10 北京奇虎科技有限公司 针对综艺类query的搜索结果的推送方法及装置
CN106777205A (zh) * 2016-12-23 2017-05-31 北京奇虎科技有限公司 游戏类搜索词的搜索方法及装置
CN106777206A (zh) * 2016-12-23 2017-05-31 北京奇虎科技有限公司 影视剧类关键词搜索展现方法及装置

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109829091A (zh) * 2018-08-28 2019-05-31 上海雅高文化传播有限公司 电子作品传播程度的测评方法、计算机存储介质、及终端
CN109829091B (zh) * 2018-08-28 2023-01-03 上海雅高文化传播有限公司 电子作品传播程度的测评方法、计算机存储介质、及终端
US11923890B2 (en) * 2019-04-15 2024-03-05 Canon Kabushiki Kaisha Wireless communication apparatus, wireless communication system, and communication method
CN111225055A (zh) * 2020-01-07 2020-06-02 广州搜料信息技术有限公司 一种智能定向资讯推送系统
CN113468372A (zh) * 2020-07-15 2021-10-01 青岛海信电子产业控股股份有限公司 一种智能镜及视频推荐方法
CN112148977A (zh) * 2020-09-22 2020-12-29 北京字节跳动网络技术有限公司 动态网络资源展示及匹配方法、装置、电子设备和介质
CN112699254A (zh) * 2021-01-05 2021-04-23 北京字节跳动网络技术有限公司 一种媒体内容筛选方法、装置及计算机存储介质
CN113468402A (zh) * 2021-05-25 2021-10-01 北京达佳互联信息技术有限公司 目标对象确定方法、装置及存储介质
CN113468402B (zh) * 2021-05-25 2024-05-17 北京达佳互联信息技术有限公司 目标对象确定方法、装置及存储介质
CN113761374A (zh) * 2021-09-09 2021-12-07 北京搜狗科技发展有限公司 一种数据处理方法及装置

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17883923

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17883923

Country of ref document: EP

Kind code of ref document: A1