WO2015027909A1 - Method and apparatus for obtaining hot-topic information - Google Patents

Method and apparatus for obtaining hot-topic information

Info

Publication number
WO2015027909A1
Authority
WO
WIPO (PCT)
Prior art keywords
hot
topic
information
key phrase
relevancy
Prior art date
Application number
PCT/CN2014/085260
Other languages
French (fr)
Inventor
Bing Cai
Original Assignee
Tencent Technology (Shenzhen) Company Limited
Priority date
Filing date
Publication date
Application filed by Tencent Technology (Shenzhen) Company Limited
Publication of WO2015027909A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9538 Presentation of query results
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31 Indexing; Data structures therefor; Storage structures
    • G06F16/313 Selection or weighting of terms for indexing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/951 Indexing; Web crawling techniques
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9535 Search customisation based on user profiles and personalisation

Definitions

  • the present invention generally relates to the field of computer application technology, in particular to the field of information processing technology, and especially to a hot-topic information obtaining method and apparatus.
  • the main page may contain a hot-topic navigation bar, which includes links to the latest hot topics, such as news, entertainment, automobile, military, reading, blog, etc. The user may follow these links to jump to the latest hot topics.
  • the disclosed methods and apparatus are directed to solve one or more problems set forth above and other problems.
  • One aspect of the present invention provides a hot-topic information obtaining method.
  • the method includes obtaining a hot-topic key phrase set, extracting information within a preset time window from an information-containing information collection to be a candidate information collection, calculating respectively a relevancy between a hot-topic key phrase included in the hot-topic key phrase set and an information item in the candidate information collection, and screening the candidate information collection based on the calculated relevancy to extract a candidate information item which satisfies a preset condition to be hot-topic information.
  • the apparatus includes a hot-topic key phrase obtaining unit, a candidate information extracting unit, a relevancy calculating unit, and a hot-topic information screening unit.
  • the hot-topic key phrase obtaining unit is configured to obtain a hot-topic key phrase set.
  • the candidate information extracting unit is configured to extract information within a preset time window from an information collection to be a candidate information collection.
  • the relevancy calculating unit is configured to calculate respectively a relevancy between a hot-topic key phrase included in the hot-topic key phrase set and an information in the candidate information collection.
  • the hot-topic information screening unit is configured to screen the candidate information collection based on the calculated relevancy to extract a candidate information item which satisfies a preset condition to be a hot-topic information.
  • Figure 1 illustrates a flow chart of a hot-topic information obtaining method according to disclosed embodiments of the present invention;
  • Figure 2 illustrates a flow chart of another hot-topic information obtaining method according to disclosed embodiments of the present invention;
  • Figure 3 illustrates a block diagram of a hot-topic information obtaining apparatus according to disclosed embodiments of the present invention;
  • Figure 4 illustrates a block diagram of another hot-topic information obtaining apparatus according to disclosed embodiments of the present invention;
  • Figure 5 illustrates a schematic block diagram of an electronic terminal according to disclosed embodiments of the present invention.
  • the disclosed embodiments of the present invention are mainly applied in the information websites to provide real-time hot-topic information to users.
  • the hot-topic information described in the embodiments of the present invention refers to web pages whose search index and/or number of occurrences is higher than a certain level, or ranks within a certain number of top positions, over an interval that ends at the current time and is shorter than a preset time length.
  • FIG. 1 illustrates a flow chart of a hot-topic information obtaining method according to disclosed embodiments of the present invention for an electronic device or apparatus with Internet capability, such as personal computer, server, smart phone, tablet computer and laptop computer, etc.
  • the hot-topic information obtaining method includes the following steps.
  • the hot-topic key phrase set may be obtained from a pre-designated website (such as Sina, Sohu and other websites), or the hot-topic key phrase set may be obtained from statistics of an information-containing information collection (such as a content pool).
  • the hot-topic key phrase set may also be obtained by data mining meaningful and valuable hot-topic key phrases from virtual communities (such as micro-blogs, forums, etc.).
  • the hot-topic key phrase set may be obtained by the following methods.
  • Method one: key phrases are searched from news pages of the pre-designated website(s). Every key phrase within a preset time window (e.g., within 24 hours of the current time) is analyzed statistically based on its search index. The key phrases are sorted by number of occurrences, and the phrases at the top are extracted to be the hot-topic key phrases.
  • Method two: the hot-topic key phrases appearing in the contents of virtual communities are analyzed statistically, which may require a large amount of computation, to obtain occurrence frequencies and other parameters. With the occurrence frequencies sorted from high to low, the hot-topic key phrases are extracted from the virtual communities.
  • Method three: the hot-topic key phrases are extracted directly from a hot-topic page of the pre-designated website (such as the hot-topic key phrase page of Baidu).
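The frequency-based selection described in Methods one and two can be sketched as follows. This is only an illustrative sketch: the function name `top_key_phrases`, the `top_k` parameter, and the sample phrases are assumptions, and the text does not specify how phrases are collected or how many are kept.

```python
from collections import Counter


def top_key_phrases(observed_phrases, top_k=3):
    """Return the top_k most frequent phrases, most frequent first.

    observed_phrases is assumed to be a flat list of key phrases already
    collected from news pages or community posts within the time window.
    """
    counts = Counter(observed_phrases)
    # most_common sorts by count in descending order
    return [phrase for phrase, _ in counts.most_common(top_k)]


# Hypothetical observations gathered within the preset time window
observed = [
    "Ji Lin explosion event",
    "world cup",
    "Ji Lin explosion event",
    "new phone",
    "world cup",
    "Ji Lin explosion event",
]
hot = top_key_phrases(observed, top_k=2)
print(hot)
```

A search index or another popularity signal could replace the raw count as the sort key without changing the structure of the sketch.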
  • S102: extracting information within a preset time window from the information collection to be the candidate information collection.
  • a Really Simple Syndication (RSS) feed of a pre-designated mainstream website (such as Sina, Sohu, and other websites) is captured and analyzed in advance to obtain the corresponding uniform resource locator (URL), title, time, text, hotness index, etc., of each information item, and the obtained information is stored in the information collection.
  • the information may be captured and analyzed periodically. For example, the information is captured every two hours and the repeated information is removed.
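The time-window extraction and the removal of repeated information can be sketched as below. The record layout (dictionaries with `url` and `time` keys), the function name `extract_candidates`, and the use of the URL to detect repeats are assumptions made for illustration; the 24-hour window is the example value mentioned in the text.

```python
from datetime import datetime, timedelta


def extract_candidates(collection, now, window=timedelta(hours=24)):
    """Keep items published within `window` before `now`, dropping repeats.

    Repeats are detected by URL, which is an assumption of this sketch.
    """
    start = now - window
    seen_urls = set()
    candidates = []
    for item in collection:
        if not (start < item["time"] <= now):
            continue  # outside the preset time window
        if item["url"] in seen_urls:
            continue  # repeated information captured in an earlier cycle
        seen_urls.add(item["url"])
        candidates.append(item)
    return candidates


# Hypothetical content pool
now = datetime(2014, 8, 27, 12, 0)
pool = [
    {"url": "http://example.com/a", "time": now - timedelta(hours=1)},
    {"url": "http://example.com/a", "time": now - timedelta(hours=1)},
    {"url": "http://example.com/b", "time": now - timedelta(hours=30)},
]
candidates = extract_candidates(pool, now)
print([c["url"] for c in candidates])
```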
  • S103: calculating respectively a relevancy between the hot-topic key phrases included in the hot-topic key phrase set and the information in the candidate information collection.
  • the number of hot-topic key phrases in the hot-topic key phrase set is denoted as 'm' and the number of information items in the candidate information collection is denoted as 'n'.
  • the relevancy between each of the 'm' hot-topic key phrases and each of the 'n' information items is calculated.
  • the relevancy between the i-th hot-topic key phrase and the j-th information item may be calculated by: splitting the i-th hot-topic key phrase into at least one hot-topic keyword combination, calculating respectively a relevancy between the j-th information item and each hot-topic keyword combination split from the i-th hot-topic key phrase, and adding up these relevancies to be the relevancy between the i-th hot-topic key phrase and the j-th information item.
  • i and j are positive integers no less than 1 with i no greater than m and j no greater than n.
  • splitting the i-th hot-topic key phrase into the at least one hot-topic keyword combination includes: splitting every two adjacent words from the i-th hot-topic key phrase into a hot-topic keyword combination.
  • the hot-topic key phrase "Ji Lin explosion event" can be split into multiple 2-word hot-topic keyword combinations, such as in sequence: "Ji Lin", "Lin explosion", and "explosion event".
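The adjacent-word splitting can be sketched as a sliding window over the words of the key phrase. The function name `split_into_combinations` is hypothetical, and splitting on whitespace is an assumption; Chinese key phrases would first need a word segmenter, which is outside this sketch.

```python
def split_into_combinations(key_phrase, size=2):
    """Split a key phrase into overlapping combinations of `size` adjacent words."""
    words = key_phrase.split()
    if not words:
        return []
    if len(words) <= size:
        # Too short to slide a window over: keep the phrase as one combination
        return [" ".join(words)]
    return [" ".join(words[i:i + size]) for i in range(len(words) - size + 1)]


print(split_into_combinations("Ji Lin explosion event"))
```

For the example phrase this yields the three combinations listed above, in the same order.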
  • S104: based on the calculated relevancy, screening the candidate information collection to extract candidate information which satisfies a preset condition to be the hot-topic information.
  • the hot-topic information which satisfies the requirement can be obtained by various methods, such as the following three methods.
  • the candidate information which has the highest relevancy with each obtained hot-topic key phrase is extracted respectively from the candidate information collection to be the hot-topic information.
  • the candidate information which has the highest relevancy with the hot-topic key phrase is extracted to be the hot-topic information.
  • the candidate information whose relevancy with each obtained hot-topic key phrase is greater than a preset threshold is extracted respectively from the candidate information collection to be the hot-topic information.
  • the candidate information whose relevancy with each obtained hot-topic key phrase is within a preset top number in a relevancy queue is extracted respectively from the candidate information collection to be the hot-topic information. For example, for each hot-topic key phrase, the candidate information items with the top 3 relevancies with the hot-topic key phrase are extracted from the candidate information collection to be the hot-topic information.
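The three screening conditions (highest relevancy, relevancy above a threshold, and top N in the relevancy queue) can be sketched as follows. The nested-dictionary layout of `scores` and all function names are assumptions for illustration only.

```python
def screen_highest(scores):
    """For each key phrase, keep the single item with the highest relevancy."""
    return {p: max(s, key=s.get) for p, s in scores.items() if s}


def screen_threshold(scores, threshold):
    """For each key phrase, keep the items whose relevancy exceeds the threshold."""
    return {p: [i for i, v in s.items() if v > threshold] for p, s in scores.items()}


def screen_top_n(scores, n=3):
    """For each key phrase, keep the n items at the head of the relevancy queue."""
    return {p: sorted(s, key=s.get, reverse=True)[:n] for p, s in scores.items()}


# Hypothetical relevancies: key phrase -> {item id -> relevancy}
scores = {"Ji Lin explosion event": {"a": 0.9, "b": 0.4, "c": 0.1, "d": 0.6}}
print(screen_highest(scores))
print(screen_threshold(scores, 0.5))
print(screen_top_n(scores, 3))
```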
  • relatively fresh information can be extracted from the information collection to be the candidate information collection, and the relevancy between each hot-topic key phrase included in the obtained hot-topic key phrase set and the relatively fresh information extracted from the information collection can be calculated respectively. Based on the calculated relevancy, the extracted relatively fresh information can be screened to obtain the candidate information which satisfies the preset condition to be the hot-topic information.
  • the hot-topic information may be obtained independently and automatically through a computer, lowering the cost and increasing the hot-topic information obtaining speed.
  • FIG. 2 illustrates a flow chart of another hot-topic information obtaining method according to disclosed embodiments of the present invention.
  • the hot-topic information obtaining method may include the following steps.
  • S201 an RSS feed of a pre-designated information website is captured and analyzed periodically, and the captured information is stored in an information collection.
  • S202 it is determined whether a time to capture hot-topic information is reached. If the time is reached, the process proceeds to STEP S204. Otherwise, if the time is not reached, the process proceeds to STEP S203.
  • a capturing frequency may be set in advance.
  • the capturing interval may be set between 30 s and 1 min. After each capturing cycle is completed, it is determined whether a next cycle may start.
  • S203 the process waits, and returns to S202 after waiting a predetermined period of time.
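The check-and-wait loop of S202 and S203 can be sketched as below. The function name, the `max_cycles` stop condition, and the injectable `clock` and `sleep` parameters are assumptions added so the sketch terminates and can be exercised without real delays; the 30-second interval is taken from the range given in the text.

```python
import time


def run_capture_cycles(capture_hot_topics, interval_s=30.0, wait_s=1.0,
                       max_cycles=3, clock=time.monotonic, sleep=time.sleep):
    """Run capture_hot_topics every interval_s seconds, max_cycles times."""
    next_due = clock()
    cycles = 0
    while cycles < max_cycles:
        if clock() >= next_due:
            # S202: the capture time is reached, so run S204 onward
            capture_hot_topics()
            cycles += 1
            next_due += interval_s
        else:
            # S203: wait a predetermined period, then check again
            sleep(wait_s)
```

In a real deployment the loop would presumably run indefinitely; `max_cycles` exists only to keep the sketch bounded.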
  • S204: a hot-topic key phrase set is obtained.
  • the hot-topic key phrase set is captured from content of a dedicated hot-topic page in a web portal and search engine, such as Baidu hot list, Sina home page, etc.
  • S205 information within a preset time window is extracted from the information collection to be a candidate information collection.
  • the Really Simple Syndication (RSS) feed of the pre-designated mainstream website (such as Sina, Sohu, and other websites) is captured and analyzed in advance to obtain the corresponding uniform resource locator (URL), title, time, text, hotness index, etc., of the information.
  • the obtained information is stored in the information collection.
  • the information may be captured and analyzed periodically. For example, the information is captured every two hours and repeated information is removed.
  • a hot-topic key phrase is extracted sequentially from the hot-topic key phrase set.
  • the hot-topic key phrase "Ji Lin explosion event" is extracted from the hot-topic key phrase set.
  • the extracted hot-topic key phrase is split into hot-topic keyword combinations with a fixed or variable length or word number. For example, every two adjacent words from the hot-topic key phrase are split as a hot-topic keyword combination.
  • the hot-topic key phrase "Ji Lin explosion event" can be split into multiple 2-word hot-topic keyword combinations, such as in sequence: "Ji Lin", "Lin explosion", "explosion event".
  • the relevancy between an i-th hot-topic key phrase and a j-th information item can be calculated as follows.
  • the i-th hot-topic key phrase is split into at least one hot-topic keyword combination.
  • the relevancy between each hot-topic keyword combination split from the i-th hot-topic key phrase and the j-th information is calculated respectively according to a preset algorithm. Further, the relevancy between each hot-topic keyword combination split from the i-th hot-topic key phrase and the j-th information is added up to be the relevancy between the i-th hot-topic key phrase and the j-th information.
  • the i and j are positive integers no less than 1.
  • splitting the i-th hot-topic phrase into the at least one hot-topic keyword combination includes splitting every two adjacent words from the i-th hot-topic key phrase to be the hot-topic keyword combinations.
  • the relevancy between a hot-topic keyword combination and the information can be calculated by the following formula:
  • $\mathrm{Score}(q_i, D) = \dfrac{\lambda_1\,\mathrm{TF}(q_i) \cdot \lambda_2\,H(D)}{\lambda_3\,\mathrm{DF}(q_i) \cdot \lambda_4\,L(D)}$
  • $\mathrm{Score}(q_i, D)$ is the relevancy between the hot-topic keyword combination $q_i$ and the information $D$;
  • $q_i$ is the i-th hot-topic keyword combination of the hot-topic key phrase $Q$;
  • $\mathrm{TF}(q_i)$ is an occurrence frequency of the hot-topic keyword combination $q_i$ in the document content of $D$;
  • $H(D)$ is a hotness index of the information $D$;
  • $\mathrm{DF}(q_i)$ is a document frequency of the hot-topic keyword combination $q_i$;
  • $L(D)$ is a content length of the information $D$;
  • $\lambda_1$, $\lambda_2$, $\lambda_3$ and $\lambda_4$ are preset coefficients.
  • the relevancy between each obtained hot-topic key phrase and each extracted information item is calculated respectively.
  • the relevancy between a hot-topic key phrase and an extracted information item is the sum of the relevancies between the hot-topic keyword combinations of that key phrase and the extracted information item. The formula is given below.
  • $\mathrm{Score}(Q, D) = \sum_{i=1}^{n} \left[ \dfrac{\lambda_1\,\mathrm{TF}(q_i) \cdot \lambda_2\,H(D)}{\lambda_3\,\mathrm{DF}(q_i) \cdot \lambda_4\,L(D)} \right]$
  • $\mathrm{Score}(Q, D)$ is the relevancy between the hot-topic key phrase $Q$ and the information $D$;
  • $Q$ is the hot-topic key phrase;
  • $i$ is a sequence number of the hot-topic keyword combination included in the hot-topic key phrase;
  • $n$ is the number of hot-topic keyword combinations included in the hot-topic key phrase.
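The relevancy calculation can be sketched in code under the assumption that each keyword combination contributes (λ1·TF · λ2·H) / (λ3·DF · λ4·L) and that the contributions are summed over the key phrase. How TF, DF and L are measured is not specified in the text; here TF is a substring count, DF comes from a hypothetical precomputed dictionary `doc_freq`, L is the character length of the text, and all coefficients default to 1.

```python
def combination_score(tf, hotness, df, length, coeffs=(1.0, 1.0, 1.0, 1.0)):
    """Relevancy of one keyword combination to one information item."""
    l1, l2, l3, l4 = coeffs
    denominator = l3 * df * l4 * length
    if denominator == 0:
        return 0.0  # combination never seen, or empty document
    return (l1 * tf * l2 * hotness) / denominator


def phrase_score(key_phrase, doc, doc_freq, coeffs=(1.0, 1.0, 1.0, 1.0)):
    """Sum the combination scores over all 2-word combinations of the phrase."""
    words = key_phrase.split()
    if not words:
        return 0.0
    combos = [" ".join(words[i:i + 2]) for i in range(len(words) - 1)] or [key_phrase]
    text = doc["text"]
    total = 0.0
    for q in combos:
        tf = text.count(q)        # occurrences of q in the item text
        df = doc_freq.get(q, 0)   # number of items containing q
        total += combination_score(tf, doc["hotness"], df, len(text), coeffs)
    return total


# Hypothetical information item and document frequencies
doc = {"text": "Ji Lin explosion event shocks city; Ji Lin officials respond",
       "hotness": 10.0}
doc_freq = {"Ji Lin": 2, "Lin explosion": 1, "explosion event": 1}
score = phrase_score("Ji Lin explosion event", doc, doc_freq)
print(score)
```

Dividing by the document frequency and the content length damps combinations that are common across the collection and items that are simply long, while the hotness index boosts popular items.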
  • S210 a candidate information item which has the highest relevancy with each obtained hot-topic key phrase is extracted respectively from the candidate information collection to be the hot-topic information, and the process returns to S202.
  • the hot-topic information which satisfies the requirement may be obtained in various ways.
  • the RSS feed of the pre-designated information website may be captured and analyzed periodically, and the obtained information may be stored in a content pool formed in the information collection.
  • the hot-topic key phrase set may be obtained periodically.
  • a relatively fresh information is extracted from the content pool and the relevancy between the hot-topic key phrase in the hot-topic key phrase set and the extracted relatively fresh information is calculated respectively.
  • the extracted relatively fresh information is screened to extract the candidate information which satisfies a preset condition to be the hot-topic information.
  • the hot-topic information may be obtained independently through a computer in a preset period, which can save cost and increase the speed of obtaining hot-topic information.
  • FIG. 3 illustrates a block diagram of a hot-topic information obtaining apparatus according to disclosed embodiments of the present invention.
  • the hot-topic information obtaining apparatus includes a hot-topic key phrase obtaining unit 301, a candidate information extracting unit 302, a relevancy calculating unit 303 and a hot-topic information screening unit 304.
  • the hot-topic key phrase obtaining unit 301 is configured to obtain a hot-topic key phrase set.
  • the candidate information extracting unit 302 is configured to extract information within a preset time window from an information collection to be a candidate information collection.
  • the relevancy calculating unit 303 is configured to calculate respectively a relevancy between each hot-topic key phrase included in the hot-topic key phrase set and the information in the candidate information collection.
  • the hot-topic information screening unit 304 is configured to screen the candidate information collection based on the calculated relevancy to extract candidate information which satisfies a preset condition to be hot-topic information.
  • the hot-topic key phrase obtaining unit 301 may be further configured to obtain the hot-topic key phrase set from a pre-designated website. For the preset time window, a start time of the time window is a time less than a preset time length (e.g. 24 hours) from a current time and an end time of the time window is the current time.
  • the relevancy calculating unit 303 may calculate a relevancy between an i-th hot-topic key phrase and a j-th information item in the following steps.
  • the i and j are positive integers no less than 1.
  • the i-th hot-topic key phrase is split into at least one hot-topic keyword combination.
  • the relevancy between each hot-topic keyword combination split from the i-th hot-topic key phrase and the j-th information is calculated respectively according to a preset algorithm.
  • the relevancy between each hot-topic keyword combination split from the i-th hot-topic key phrase and the j-th information is added up to be the relevancy between the i-th hot-topic key phrase and the j-th information.
  • Splitting the i-th hot-topic phrase into the at least one hot-topic keyword combination includes splitting every two adjacent words from the i-th hot-topic key phrase to be the hot-topic keyword combinations.
  • the relevancy between each hot-topic key phrase in the hot-topic key phrase set and each information item in the candidate information collection is calculated respectively. Specifically, the relevancy between the hot-topic key phrase and the information may be calculated by the following formula:
  • $\mathrm{Score}(Q, D) = \sum_{i=1}^{n} \left[ \dfrac{\lambda_1\,\mathrm{TF}(q_i) \cdot \lambda_2\,H(D)}{\lambda_3\,\mathrm{DF}(q_i) \cdot \lambda_4\,L(D)} \right]$
  • $\mathrm{Score}(Q, D)$ is the relevancy between the hot-topic key phrase $Q$ and the information $D$;
  • $i$ is a sequence number of the hot-topic keyword combination included in the hot-topic key phrase;
  • $n$ is the number of hot-topic keyword combinations included in the hot-topic key phrase;
  • $q_i$ is the i-th hot-topic keyword combination of the hot-topic key phrase $Q$;
  • $\mathrm{TF}(q_i)$ is an occurrence frequency of the hot-topic keyword combination $q_i$ in the document content of $D$;
  • $H(D)$ is a hotness index of the information $D$;
  • $\mathrm{DF}(q_i)$ is a document frequency of the hot-topic keyword combination $q_i$;
  • $L(D)$ is a content length of the information $D$;
  • $\lambda_1$, $\lambda_2$, $\lambda_3$ and $\lambda_4$ are preset coefficients.
  • the hot-topic information screening unit 304 may be configured to extract respectively the candidate information which has the highest relevancy with each obtained hot-topic key phrase from the candidate information collection to be the hot-topic information. Further, the hot-topic information screening unit 304 may also be configured to extract respectively the candidate information whose relevancy with each obtained hot-topic key phrase is within a preset top number in the relevancy queue from the candidate information collection to be the hot-topic information. Furthermore, the hot-topic information screening unit 304 may further be configured to extract respectively a preset number of the candidate information items which have the greatest relevancy with each obtained hot-topic key phrase from the candidate information collection to be the hot-topic information.
  • the disclosed hot-topic information obtaining apparatus obtains the hot-topic key phrase set, extracts relatively fresh information from the information collection to be the candidate information collection, and calculates respectively the relevancy between each hot-topic key phrase in the obtained hot-topic key phrase set and the relatively fresh information extracted from the information collection. Based on the calculated relevancy, the disclosed hot-topic information obtaining apparatus screens the extracted relatively fresh information to obtain the candidate information which satisfies the preset condition to be the hot-topic information.
  • the disclosed hot-topic information obtaining apparatus may independently obtain the hot-topic information through a computer, which can save cost and increase the hot-topic information obtaining speed.
  • FIG. 4 illustrates a block diagram of another hot-topic information obtaining apparatus according to disclosed embodiments of the present invention.
  • the hot-topic information obtaining apparatus includes a hot-topic key phrase obtaining unit 401, an information obtaining unit 402, a candidate information extracting unit 403, a relevancy calculating unit 404, a hot-topic information screening unit 405, and a hot-topic information displaying unit 406.
  • the hot-topic key phrase obtaining unit 401 is configured to obtain a hot-topic key phrase set.
  • the information obtaining unit 402 is configured to capture and analyze an RSS feed of a pre-designated information website to obtain information and to store the obtained information in an information collection before the candidate information extracting unit 403 extracts information within a preset time window from the information collection to be a candidate information collection.
  • the candidate information extracting unit 403 is configured to extract the information within the preset time window from the information-containing information collection to be the candidate information collection.
  • the relevancy calculating unit 404 is configured to calculate respectively a relevancy between each hot-topic key phrase in the hot-topic key phrase set and the information in the candidate information collection.
  • the hot-topic information screening unit 405 is configured to screen the candidate information collection based on the calculated relevancy to extract the candidate information which satisfies a preset condition to be hot-topic information.
  • the hot-topic information displaying unit 406 is configured to display the screened hot-topic information.
  • the information obtaining unit 402 may further be configured to capture and analyze periodically the RSS-feed of the pre-designated information website.
  • the information obtaining unit is added to capture and analyze the RSS-feed of the pre-designated information website to obtain the information and to store the obtained information in the information collection before the candidate information extracting unit extracts the information within the preset time window from the information-containing information collection to be the candidate information collection.
  • the disclosed hot-topic information obtaining apparatus may improve the extracting efficiency of the candidate information extracting unit and further improve the hot-topic information obtaining efficiency.
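The cooperation of units 401 to 406 can be sketched as a small pipeline in which each unit is a pluggable callable. The class name `HotTopicApparatus`, the field names, and the choice of the highest-relevancy condition for the screening step are assumptions for illustration; the text does not prescribe a software structure.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Item = Dict[str, object]


@dataclass
class HotTopicApparatus:
    obtain_key_phrases: Callable[[], List[str]]              # unit 401
    obtain_information: Callable[[], List[Item]]             # unit 402
    extract_candidates: Callable[[List[Item]], List[Item]]   # unit 403
    relevancy: Callable[[str, Item], float]                  # unit 404
    display: Callable[[Dict[str, Item]], None]               # unit 406

    def run(self) -> Dict[str, Item]:
        phrases = self.obtain_key_phrases()
        collection = self.obtain_information()
        candidates = self.extract_candidates(collection)
        hot: Dict[str, Item] = {}
        for phrase in phrases:
            if candidates:
                # unit 405: keep the candidate with the highest relevancy
                hot[phrase] = max(
                    candidates, key=lambda item: self.relevancy(phrase, item)
                )
        self.display(hot)
        return hot
```

Keeping each unit as an injected callable mirrors the block diagram: the information obtaining unit can fill the collection on its own schedule, and the screening condition can be swapped without touching the other units.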
  • the present invention may also provide an electronic device or terminal to obtain the hot-topic information.
  • the terminal may include an RF (Radio Frequency) circuit 501, a memory 502 including one or more computer-readable storage media, an input unit 503, a display unit 504, a sensor 505, an audio circuit 506, a Wi-Fi (Wireless Fidelity) module 507, a processor 508 with one or more processing cores, a power supply 509 and other components.
  • the terminal structure as shown in Figure 5 does not constitute any limitation of the terminal, and the terminal may include more or fewer components than those as shown, or a combination of some of the components, or a different arrangement of components.
  • the RF circuit 501 may be configured to receive or transmit signals during the receiving and sending of a call or message. In particular, after receiving downlink information from a base station, the RF circuit 501 delivers the downlink information to one or more processors 508. Further, the RF circuit 501 transmits uplink data to the base station.
  • the RF circuit 501 includes, but is not limited to, an antenna, at least one amplifier, a tuner, one or more oscillators, a subscriber identity module (SIM) card, a transceiver, a coupler, a low noise amplifier (LNA), a diplexer, etc.
  • the RF circuit 501 may communicate with other devices through wireless communication and a network.
  • Wireless communication may use any communication standard or protocol, including but not limited to Global System for Mobile Communications (GSM), general packet radio service (GPRS), code division multiple access (CDMA), wideband code division multiple access (WCDMA), long term evolution (LTE), email, short messaging service (SMS), etc.
  • the memory 502 is used to store software programs and modules, and the processor 508 is used to perform various functions and data processing by running the software programs and modules stored in memory 502.
  • the memory 502 may mainly include a program storage segment and a data storage segment, wherein the program storage segment may store an operating system and an application which implements at least one function (such as playing audio, playing video, etc.), the data storage segment may store the data created according to the use of the terminal (such as audio data, phone book, etc.).
  • the memory 502 may include high-speed random access memory.
  • the memory 502 may also include non-volatile memory, such as at least one disk storage device, flash memory device, or other non-volatile solid-state memory device. Accordingly, the memory 502 may also include a memory controller to provide the memory access for the processor 508 and the input unit 503.
  • the input unit 503 may be used to receive input numeric or character information and to generate keyboard, mouse, joystick, trackball or optical signal inputs related to user settings and function control.
  • the input unit 503 may include a touch sensitive surface 5031 and other input devices 5032.
  • the touch sensitive surface 5031, also known as a touch screen or touch pad, may collect a user's touching operations on or near it (such as the touching operations of a user's finger, stylus, or any suitable object on or near the touch sensitive surface 5031), and drive the corresponding connection device based on a preset program.
  • the touch sensitive surface 5031 may include a touch-detecting device portion and a touch-controller portion.
  • the touch-detecting device detects a user's touching position, detects the signal caused by the touching operation, and sends the signal to the touch controller.
  • the touch controller after receiving the touching information from the touch-detecting device, converts the touching information into contact coordinates and sends the contact coordinates to the processor 508. Meanwhile the touch controller may also receive and execute the commands sent by the processor 508.
  • the touch sensitive surface 5031 may also be achieved in various forms, such as resistive, capacitive, infrared and surface acoustic wave, etc.
  • the input unit 503 may also include other input devices 5032.
  • other input devices 5032 may include, but are not limited to, the physical keyboard, function keys (such as volume control keys, key switches, etc.), a trackball, a mouse, one or more types of operating lever.
  • the display unit 504 may be used to display information entered by the user or to display the information provided to the user, and to provide various types of graphic user interface of a terminal. These graphical user interfaces may be constituted in the form of graphics, text, icons, video and any combination thereof.
  • the display unit 504 may include a display panel 5041. Optionally, the display panel 5041 may be configured in the form of a liquid crystal display (LCD), an organic light emitting diode (OLED) display, etc.
  • the touch sensitive surface 5031 may cover the display panel 5041. When the touch sensitive surface 5031 detects a touching operation on or near it, the touch sensitive surface 5031 sends a touch event to the processor 508 to determine the type of the touch event.
  • the processor 508 provides the corresponding visual output on the display panel 5041 according to the type of the touch event.
  • although the touch sensitive surface 5031 and the display panel 5041 are shown as two separate components to achieve the input and output functions, in some embodiments the touch sensitive surface 5031 may be integrated with the display panel 5041 to realize the input and output functions.
  • the terminal may also include at least one sensor 505, such as light sensor, motion sensor and other sensors.
  • the light sensor may include an ambient light sensor and a proximity sensor, wherein the ambient light sensor may be used to adjust the brightness of the display panel 5041 according to the ambient light magnitude and the proximity sensor may be configured to turn off the display panel 5041 and/or the backlight when the terminal is moved close to the ear.
  • the gravity acceleration sensor may detect the acceleration magnitude on all directions (typically in three axes) and may also detect the magnitude and the direction of gravity when in still mode. So the gravity acceleration sensor may be used in phone gesture-identifying applications (such as horizontal and vertical screen switch, relevant games, magnetometer posture calibration), vibration recognition related functions (such as pedometers, percussion), etc.
  • the audio circuit 506, a speaker 5061, a microphone 5062 may provide audio interface between a user and the terminal.
  • the audio circuit 506 after converting the received audio data into electrical signal, transmits the electrical signal to the speaker 5061 and the speaker 5061 converts the electrical signal into sound output.
  • the audio circuit 506 converts the received electrical signal into audio data and transmits audio data to the processor 508 to be processed.
  • the processed audio data is transmitted through the RF circuit 501 to for example another terminal or to be stored in the memory 502 for further processing.
  • the audio circuit 506 may also include an earplug jack to provide communications between a peripheral headset and the terminal.
  • Wi-Fi belongs to a short-range wireless transmission technology.
  • the terminal may help a user to send and receive email, browse web page and access streaming media, etc., through the Wi-Fi module 507. It provides the user with wireless broadband Internet access.
  • the Wi-Fi module 507 is shown in Figure 5, it is understood that it is not necessarily part of the terminal and can be omitted according to the need in the scope of not changing the nature of the invention.
  • the processor 508 is the control center of the terminal. By using a variety of interfaces and circuits to connect various parts of the entire mobile phone, running or executing the software programs and/or modules stored in the memory 502, and by calling the data stored in the memory 502, the processor 508 may perform a variety of functions and data processing of the terminal, so as to monitor the entire mobile phone.
  • the processor 508 may include one or more processing cores.
  • the processor 508 may be integrated with an application processor and a modem processor, wherein the application processor mainly processes operating system, user interface and applications, etc., and the modem processor mainly handles the wireless communication. It is understandable that the above modem processor may also not be integrated into the processor 508.
  • the terminal further includes a power supply 509 (such as battery).
  • the power supply 509 may be logically connected to the processor 508 through a power management system. So the functions such as charge, discharge and power management, etc., may be implemented through the power management system.
  • the power supply 509 may also include one or more of DC or AC power, a recharging system, a power failure detaching circuit, a power converter or inverter, power supply status indicators, etc.
  • the terminal may also include a camera, a Bluetooth module, etc., the list is not repeated hereafter.
  • the display unit is a touch screen display, and the terminal may use memory or other storage to store one or more computer programs.
  • the processor 508 of the terminal may be configured to load the one or more executable computer programs and to execute the one or more computer programs to achieve various functions as follows.
  • the processor may be configured to obtain a hot-topic key phrase set, to extract information within a preset time window from an information collection containing information to be a candidate information collection, to calculate respectively a relevancy between each hot-topic key phrase included in the hot-topic key phrase set and the information in the candidate information collection, and to screen the candidate information collection according to calculated relevancy to extract a candidate information which satisfies a preset condition to be a hot-topic information.
  • the processor 508 may be configured to obtain the hot-topic key phrase set from a pre-designated website.
  • the processor 508 may be configured to capture and analyze a RSS feed of the pre-designated information website to obtain information and to store the obtained information in the information collection.
  • the processor 508 may be configured to capture and analyze periodically the RSS feed of the pre-designated information website.
  • the start time of the preset time window precedes the current time by less than a preset time length, and the end time of the preset time window is the current time.
  • the processor 508 may be configured to calculate a relevancy between an i-th hot-topic key phrase and a j-th information in the following steps.
  • the i and j are positive integers no less than 1.
  • the i-th hot-topic key phrase is split into at least one hot-topic keyword combination.
  • a relevancy between each hot-topic keyword combination split from the i-th hot-topic key phrase and the j-th information is calculated respectively according to a preset algorithm.
  • the relevancy between each hot-topic keyword combination split from the i-th hot-topic key phrase and the j-th information is added up to be the relevancy between the i-th hot-topic key phrase and the j-th information.
  • the processor 508 is also configured to split every two adjacent words from the i-th hot-topic key phrase to be the hot-topic keyword combinations.
  • the processor 508 may be configured to calculate the relevancy between the hot-topic key phrase and the information in the following formula.
  • $$\mathrm{Score}(Q, D) = \sum_{i=1}^{n} \left[ \frac{\lambda_1\, TF(q_i) \cdot \lambda_2\, H(D)}{\lambda_3\, DF(q_i) \cdot \lambda_4\, L(D)} \right]$$
  • Score(Q,D) is the relevancy between the hot-topic key phrase Q and the information D;
  • i is a sequence number of the hot-topic keyword combination included in the hot-topic key phrase;
  • n is the number of hot-topic keyword combinations included in the hot-topic key phrase;
  • q_i is the i-th hot-topic keyword combination of the hot-topic key phrase Q;
  • TF(q_i) is an occurrence frequency of the hot-topic keyword combination q_i in the document content of D;
  • H(D) is a hotness index of the information D;
  • DF(q_i) is a document frequency of the hot-topic keyword combination q_i;
  • L(D) is a content length of the information D;
  • λ1, λ2, λ3 and λ4 are preset coefficients.
  • the processor 508 may be configured to screen the candidate information collection according to the calculated relevancy to extract the candidate information which satisfies the preset condition to be the hot-topic information. Specifically, the processor 508 may be configured to extract respectively the candidate information which has the highest relevancy with each obtained hot-topic key phrase from the candidate information collection to be the hot-topic information. Further, the processor 508 may also be configured to extract respectively the candidate information whose relevancy with each obtained hot-topic key phrase is greater than a preset threshold from the candidate information collection to be the hot-topic information. Furthermore, the processor 508 may further be configured to extract respectively the candidate information whose relevancy with each obtained hot-topic key phrase is within a preset top number in the relevancy queue from the candidate information collection to be the hot-topic information.
  • the processor 508 may be configured to display the obtained hot-topic information.
  • various hot-topic information obtaining methods and apparatus can be implemented.
  • a hot-topic key phrase set can be obtained
  • relatively fresh information can be extracted from an information collection to be a candidate information collection
  • a relevancy between each hot-topic key phrase in the obtained hot-topic key phrase set and the relatively fresh information extracted from the information collection can be calculated respectively.
  • the extracted relatively fresh information can be screened to obtain a candidate information which satisfies the preset condition to be a hot-topic information.
  • the hot-topic information may be independently obtained through a computer and the hot-topic information can be displayed to the user in a timely manner, lowering cost and increasing the hot-topic information obtaining speed.

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A hot-topic information obtaining method is provided. The method includes obtaining a hot-topic key phrase set, extracting information within a preset time window from an information-containing information collection to be a candidate information collection, calculating respectively a relevancy between a hot-topic key phrase included in the hot-topic key phrase set and an information in the candidate information collection, and screening the candidate information collection based on the calculated relevancy to extract a candidate information which satisfies a preset condition to be a hot-topic information.

Description

METHOD AND APPARATUS FOR OBTAINING HOT-TOPIC INFORMATION
CROSS-REFERENCES TO RELATED APPLICATIONS
[0001] This application claims priority of Chinese Patent Application No. 201310386577.6, filed on August 29, 2013, the entire contents of which are incorporated by reference herein.
FIELD OF THE INVENTION
[0002] The present invention generally relates to the field of computer application technology, in particular to the field of information processing technology, and especially to a hot-topic information obtaining method and apparatus.
BACKGROUND
[0003] To let users quickly browse the latest information, information websites often carry a large amount of content on the latest hot topics. For example, when a user browses the main page of a popular web portal, the main page may contain a hot-topic navigation bar, which includes links to the latest hot topics, such as news, entertainment, automobile, military, reading, blog, etc. The user may jump to the corresponding content page by clicking on a link of interest. With the rapid development of the Internet, Internet information is updated ever more frequently, and hot-topic characters and hot-topic events are generated at every moment. This makes it very difficult to obtain the relevant real-time hot-topic information (also known as hot-topic information) quickly and accurately and to present that hot-topic information to the user in a timely manner.
[0004] Existing technologies mainly adopt manual configuration methods to present the relevant information to the user by manually picking through massive information with real-time hot-topic key phrases. However, due to the frequent updating of real-time hot-topic information, the cost of manual configuration can be high. Further, because the configuration process is relatively slow, it often cannot satisfy the swiftness required to deliver the contents to the user in a timely manner.
[0005] The disclosed methods and apparatus are directed to solve one or more problems set forth above and other problems.
BRIEF SUMMARY OF THE DISCLOSURE
[0006] One aspect of the present invention provides a hot-topic information obtaining method. The method includes obtaining a hot-topic key phrase set, extracting information within a preset time window from an information-containing information collection to be a candidate information collection, calculating respectively a relevancy between a hot-topic key phrase included in the hot-topic key phrase set and an information in the candidate information collection, and screening the candidate information collection based on the calculated relevancy to extract a candidate information which satisfies a preset condition to be a hot-topic information.
[0007] Another aspect of the present invention provides a hot-topic information obtaining apparatus. The apparatus includes a hot-topic key phrase obtaining unit, a candidate information extracting unit, a relevancy calculating unit, a hot-topic information screening unit. The hot-topic key phrase obtaining unit is configured to obtain a hot-topic key phrase set. The candidate information extracting unit is configured to extract information within a preset time window from an information collection to be a candidate information collection. The relevancy calculating unit is configured to calculate respectively a relevancy between a hot-topic key phrase included in the hot-topic key phrase set and an information in the candidate information collection. The hot-topic information screening unit is configured to screen the candidate information collection based on the calculated relevancy to extract a candidate information item which satisfies a preset condition to be a hot-topic information.
[0008] Other aspects or embodiments of the present disclosure can be understood by those skilled in the art in light of the description, the claims, and the drawings of the present disclosure.
BRIEF DESCRIPTION OF THE DRAWINGS
[0009] In order to make the embodiments and technical solutions of the present invention more clearly understood, the following briefly describes the accompanying drawings. Obviously, the accompanying drawings illustrate certain embodiments, and those of ordinary skill in the art, according to the disclosed embodiments and drawings, may obtain other drawings without creative efforts.
[0010] Figure 1 illustrates a flow chart of a hot-topic information obtaining method according to disclosed embodiments of the present invention;
[0011] Figure 2 illustrates a flow chart of another hot-topic information obtaining method according to disclosed embodiments of the present invention;
[0012] Figure 3 illustrates a block diagram of a hot-topic information obtaining apparatus according to disclosed embodiments of the present invention;
[0013] Figure 4 illustrates a block diagram of another hot-topic information obtaining apparatus according to disclosed embodiments of the present invention; and
[0014] Figure 5 illustrates a schematic block diagram of an electronic terminal according to disclosed embodiments of the present invention.
DETAILED DESCRIPTION
[0015] In order to make the purposes, technical solutions and advantages of the present invention clearer, the following, together with the accompanying drawings, describes in detail certain embodiments of the present invention.
[0016] The disclosed embodiments of the present invention are mainly applied in information websites to provide real-time hot-topic information to users. It should be noted that the hot-topic information described in the embodiments of the present invention refers to web pages whose search index and/or occurrence, over an interval that ends at the current time and is shorter than a preset time length, is higher than a certain level or ranks within a certain number of top positions.
[0017] Figure 1 illustrates a flow chart of a hot-topic information obtaining method according to disclosed embodiments of the present invention for an electronic device or apparatus with Internet capability, such as personal computer, server, smart phone, tablet computer and laptop computer, etc. As shown in Figure 1, the hot-topic information obtaining method includes the following steps.
[0018] S101, obtaining a hot-topic key phrase set.
[0019] There are various ways to obtain the hot-topic key phrase set. For example, the hot-topic key phrase set may be obtained from a pre-designated website (such as Sina, Sohu and other websites), or the hot-topic key phrase set may be obtained from statistics of an information-containing information collection (such as a content pool). The hot-topic key phrase set may also be obtained by data mining meaningful and valuable hot-topic key phrases from virtual communities (such as micro-blogs, forums, etc.). Specifically, the hot-topic key phrase set may be obtained by the following methods.
[0020] Method one: key phrases are searched from news pages of the pre-designated website(s). Every key phrase within a preset time window (e.g. within 24 hours from current time) is analyzed statistically based on its search index. The key phrases are sorted based on occurrences and the phrases at the top are extracted to be the hot-topic key phrases.
[0021] Method two: through a large amount of calculation, the hot-topic key phrases in the contents of virtual communities are analyzed statistically to obtain occurrence frequencies and other parameters. The hot-topic key phrases are then extracted from the virtual communities in order of occurrence frequency, from high to low.
[0022] Method three: the hot-topic key phrases are extracted directly from a hot-topic page of the pre-designated website (such as the hot-topic key phrase page of Baidu).
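Methods one and two both come down to counting observed key phrases and keeping the most frequent ones. A minimal sketch in Python, assuming the searched or posted phrases have already been collected as a list of strings; the function name `top_key_phrases` and the parameter `top_k` are illustrative and do not come from the disclosure:

```python
from collections import Counter
from typing import Iterable, List


def top_key_phrases(observed: Iterable[str], top_k: int = 10) -> List[str]:
    """Return the top_k most frequent phrases, most frequent first."""
    # Ignore empty entries and normalise surrounding whitespace.
    counts = Counter(p.strip() for p in observed if p.strip())
    return [phrase for phrase, _ in counts.most_common(top_k)]


observed = [
    "Ji Lin explosion event", "stock market", "Ji Lin explosion event",
    "world cup", "stock market", "Ji Lin explosion event",
]
hot = top_key_phrases(observed, top_k=2)
```

Here `hot` holds the two phrases with the highest counts. A real deployment would presumably weight by search index as well as raw occurrence, which this sketch does not attempt.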
[0023] S102, extracting information within a preset time window from the information-containing information collection to be a candidate information collection.
[0024] RSS (Really Simple Syndication) subscription is a simple way for websites to share contents. For example, an RSS feed of the pre-designated main stream website (such as Sina, Sohu, and other websites) is captured and analyzed in advance to obtain corresponding uniform resource locator URL, title, time, text and hotness index, etc., of an information and to store the obtained information in the information collection.
[0025] In order to obtain latest information, the information may be captured and analyzed periodically. For example, the information is captured every two hours and the repeated information is removed.
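The capture-and-store step can be sketched as follows, under the assumption that the feed is plain RSS 2.0 and that an item's URL identifies it for the purpose of removing repeats. The helper names `parse_rss` and `store` are hypothetical, and the hotness index is left out because standard RSS has no such field; it would have to come from another source.

```python
import xml.etree.ElementTree as ET
from email.utils import parsedate_to_datetime
from typing import Dict, List


def parse_rss(xml_text: str) -> List[dict]:
    """Extract URL, title, publication time and text from RSS 2.0 <item> elements."""
    root = ET.fromstring(xml_text)
    items = []
    for item in root.iter("item"):
        pub = item.findtext("pubDate")
        items.append({
            "url": item.findtext("link", default="").strip(),
            "title": item.findtext("title", default="").strip(),
            "time": parsedate_to_datetime(pub) if pub else None,
            "text": item.findtext("description", default="").strip(),
        })
    return items


def store(collection: Dict[str, dict], items: List[dict]) -> int:
    """Add items keyed by URL; return how many were new (repeats are skipped)."""
    added = 0
    for it in items:
        if it["url"] and it["url"] not in collection:
            collection[it["url"]] = it
            added += 1
    return added


SAMPLE = """<rss version="2.0"><channel>
<item><title>A</title><link>http://example.com/a</link>
<pubDate>Thu, 29 Aug 2013 08:00:00 +0000</pubDate><description>text a</description></item>
<item><title>B</title><link>http://example.com/b</link>
<pubDate>Thu, 29 Aug 2013 09:00:00 +0000</pubDate><description>text b</description></item>
</channel></rss>"""

collection: Dict[str, dict] = {}
first = store(collection, parse_rss(SAMPLE))   # first capture
second = store(collection, parse_rss(SAMPLE))  # same feed captured again
```

Capturing the same feed twice adds nothing the second time, which is the "repeated information is removed" behaviour described above.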
[0026] S103, calculating respectively a relevancy between the hot-topic key phrases included in the hot-topic key phrase set and the information in the candidate information collection.
[0027] The number of hot-topic key phrases in the hot-topic key phrase set is denoted as 'm' and the number of information items in the candidate information collection is denoted as 'n'. The relevancy between each of the 'm' hot-topic key phrases and each of the 'n' information items is calculated.
[0028] For example, the relevancy between the i-th hot-topic key phrase and the j-th information item may be calculated by: splitting the i-th hot-topic key phrase into at least one hot-topic keyword combination, calculating respectively a relevancy between the j-th information and each hot-topic keyword combination split from the i-th hot-topic key phrase, and adding up the relevancy between the j-th information and each hot-topic keyword combination split from the i-th hot-topic key phrase to be the relevancy between the i-th hot-topic key phrase and the j-th information. Here i and j are positive integers no less than 1, with i no greater than m and j no greater than n.
[0029] Further, splitting the i-th hot-topic key phrase into the at least one hot-topic keyword combination includes: splitting every two adjacent words from the i-th hot-topic key phrase into a hot-topic keyword combination. For example, the hot-topic key phrase "Ji Lin explosion event" can be split into multiple 2-word hot-topic keyword combinations, such as in sequence: "Ji Lin", "Lin explosion", and "explosion event".
[0030] S104, based on the calculated relevancy, screening the candidate information collection to extract candidate information which satisfies a preset condition to be the hot-topic information.
[0031] Based on a display requirement of the hot-topic information, the hot-topic information which satisfies the requirement can be obtained by various methods, such as the following three methods.
[0032] 1. The candidate information which has the highest relevancy with each obtained hot-topic key phrase is extracted respectively from the candidate information collection to be the hot-topic information. Thus, for each hot-topic key phrase, the candidate information which has the highest relevancy with the hot-topic key phrase is extracted to be the hot-topic information.
[0033] 2. The candidate information whose relevancy with each obtained hot-topic key phrase is greater than a preset threshold is extracted respectively from the candidate information collection to be the hot-topic information.
[0034] 3. The candidate information whose relevancy with each obtained hot-topic key phrase is within a preset top number in a relevancy queue is extracted respectively from the candidate information collection to be the hot-topic information. For example, for each hot-topic key phrase, the candidate information items with the top 3 relevancies with the hot-topic key phrase are extracted from the candidate information collection to be the hot-topic information.
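The three screening methods differ only in the selection rule applied to the relevancy scores of one key phrase. A small sketch, assuming the scores for a single hot-topic key phrase are held in a dictionary from a candidate identifier to its relevancy; the function names and sample values are illustrative:

```python
from typing import Dict, List

Scores = Dict[str, float]  # candidate id -> relevancy to one key phrase


def best_one(scores: Scores) -> List[str]:
    """Method 1: the single candidate with the highest relevancy."""
    return [max(scores, key=scores.get)] if scores else []


def above_threshold(scores: Scores, threshold: float) -> List[str]:
    """Method 2: every candidate whose relevancy exceeds the threshold."""
    return [c for c, s in scores.items() if s > threshold]


def top_n(scores: Scores, n: int) -> List[str]:
    """Method 3: the n candidates at the head of the relevancy queue."""
    return sorted(scores, key=scores.get, reverse=True)[:n]


scores = {"doc1": 0.12, "doc2": 0.40, "doc3": 0.05, "doc4": 0.31}
```

With these sample scores, `best_one(scores)` selects `doc2`, `above_threshold(scores, 0.1)` keeps `doc1`, `doc2` and `doc4`, and `top_n(scores, 3)` returns `doc2`, `doc4`, `doc1` in that order.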
[0035] Thus, according to the disclosed embodiments, by obtaining the hot-topic key phrase set, relatively fresh information can be extracted from the information collection to be the candidate information collection, and the relevancy between each hot-topic key phrase included in the obtained hot-topic key phrase set and the relatively fresh information extracted from the information collection can be calculated respectively. Based on the calculated relevancy, the extracted relatively fresh information can be screened to obtain the candidate information which satisfies the preset condition to be the hot-topic information. Thus, the hot-topic information may be obtained independently and automatically through a computer, lowering the cost and increasing the hot-topic information obtaining speed.
[0036] Figure 2 illustrates a flow chart of another hot-topic information obtaining method according to disclosed embodiments of the present invention. As shown in Figure 2, the hot-topic information obtaining method may include the following steps.
[0037] S201, an RSS feed of a pre-designated information website is captured and analyzed periodically, and the captured information is stored in an information collection.
[0038] S202, it is determined whether a time to capture hot-topic information is reached. If the time is reached, the process proceeds to STEP S204. Otherwise, if the time is not reached, the process proceeds to STEP S203.
[0039] A capturing frequency may be set in advance. For example, the capturing interval may be set between 30 s and 1 min. After each capturing cycle is completed, it is determined whether a next cycle may start.
[0040] S203, the process waits, and returns to S202 after waiting a predetermined period of time.
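The check-and-wait loop of S202 and S203 can be sketched as a simple polling scheduler. This is only one possible reading of the flow chart: the function `run_cycles`, its parameters, and the injectable `clock` and `sleep` callables are assumptions made so the loop can be exercised without real waiting.

```python
import time
from typing import Callable


def run_cycles(capture: Callable[[], None], interval_s: float, cycles: int,
               clock: Callable[[], float] = time.monotonic,
               sleep: Callable[[float], None] = time.sleep,
               poll_s: float = 1.0) -> None:
    """Run `capture` once per interval, polling the clock in between."""
    next_due = clock()
    done = 0
    while done < cycles:
        if clock() >= next_due:   # S202: has the capture time been reached?
            capture()             # S204 onward: obtain key phrases, score, screen
            done += 1
            next_due += interval_s
        else:
            sleep(poll_s)         # S203: wait, then return to S202
```

Passing a fake clock and a fake sleep makes the timing deterministic; in production the defaults (`time.monotonic` and `time.sleep`) would be used with an interval such as 30 seconds.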
[0041] S204, a hot-topic key phrase set is obtained. For example, the hot-topic key phrase set is captured from content of a dedicated hot-topic page in a web portal and search engine, such as Baidu hot list, Sina home page, etc.
[0042] S205, information within a preset time window is extracted from the information collection to be a candidate information collection.
[0043] RSS (Really Simple Syndication) subscription is a simple way for websites to share contents. For example, the RSS feed of the pre-designated main stream website (such as Sina, Sohu, and other websites) is captured and analyzed in advance to obtain the corresponding uniform resource locator URL, title, time, text, hotness index, etc., of the information. The obtained information is stored in the information collection.
[0044] In order to obtain latest information, the information may be captured and analyzed periodically. For example, the information is captured every two hours and repeated information is removed.
[0045] S206, a hot-topic key phrase is extracted sequentially from the hot-topic key phrase set. For example, the hot-topic key phrase "Ji Lin explosion event" is extracted from the hot-topic key phrase set.
[0046] S207, the extracted hot-topic key phrase is split into hot-topic keyword combinations with a fixed or variable length or word number. For example, every two adjacent words from the hot-topic key phrase are split as a hot-topic keyword combination. The hot-topic key-phrase "Ji Lin explosion event" can be split into multiple 2-word hot-topic keyword combinations, such as in sequence: "Ji Lin", "Lin explosion", "explosion event".
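The 2-word split in S207 is a sliding window over the words of the phrase. A minimal sketch, assuming whitespace-separated words (Chinese text would need a word segmenter first) and assuming that a phrase shorter than the window is kept whole as its only combination; `split_into_combinations` is an illustrative name:

```python
from typing import List


def split_into_combinations(phrase: str, size: int = 2) -> List[str]:
    """Slide a window of `size` words over the phrase, one word at a time."""
    words = phrase.split()
    if len(words) <= size:
        # Too short to slide: the whole phrase is the single combination.
        return [" ".join(words)] if words else []
    return [" ".join(words[i:i + size]) for i in range(len(words) - size + 1)]


combos = split_into_combinations("Ji Lin explosion event")
# combos == ["Ji Lin", "Lin explosion", "explosion event"]
```

The `size` parameter covers the "fixed or variable length" wording: passing 3 would yield 3-word combinations instead.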
[0047] S208, relevancy between each hot-topic keyword combination and each information in the candidate information collection is calculated to obtain the relevancy between each hot-topic key phrase and each information item.
[0048] The relevancy between an i-th hot-topic key phrase and a j-th information item can be calculated as follows.
[0049] The i-th hot-topic key phrase is split into at least one hot-topic keyword combination. The relevancy between each hot-topic keyword combination split from the i-th hot-topic key phrase and the j-th information is calculated respectively according to a preset algorithm. Further, the relevancy between each hot-topic keyword combination split from the i-th hot-topic key phrase and the j-th information is added up to be the relevancy between the i-th hot-topic key phrase and the j-th information. The i and j are positive integers no less than 1. And splitting the i-th hot-topic phrase into the at least one hot-topic keyword combination includes splitting every two adjacent words from the i-th hot-topic key phrase to be the hot-topic keyword combinations.
[0050] Further, the relevancy between the hot-topic key phrase and the information can be calculated in the following algorithm.
$$\mathrm{Score}(q_i, D) = \frac{\lambda_1\, TF(q_i) \cdot \lambda_2\, H(D)}{\lambda_3\, DF(q_i) \cdot \lambda_4\, L(D)}$$
where:
D is the information;
Score(q_i, D) is the relevancy between the hot-topic keyword combination q_i and the information D;
q_i is the i-th hot-topic keyword combination of the hot-topic key phrase Q;
TF(q_i) is an occurrence frequency of the hot-topic keyword combination q_i in the document content of D;
H(D) is a hotness index of the information D;
DF(q_i) is a document frequency of the hot-topic keyword combination q_i;
L(D) is a content length of the information D;
and λ1, λ2, λ3 and λ4 are preset coefficients.
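As a quick arithmetic check of the per-combination relevancy, which multiplies the weighted occurrence frequency by the weighted hotness index and divides by the weighted document frequency times the weighted content length, take illustrative values that are not from the disclosure: all four coefficients equal to 1, TF(q_i) = 3, H(D) = 10, DF(q_i) = 5 and L(D) = 200.

$$\mathrm{Score}(q_i, D) = \frac{(1 \cdot 3)(1 \cdot 10)}{(1 \cdot 5)(1 \cdot 200)} = \frac{30}{1000} = 0.03$$

The score grows with the in-document frequency and the hotness index, and shrinks for keyword combinations that appear in many documents and for long documents.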
[0051] S209, the relevancy between each obtained hot-topic key phrase and each extracted information item is calculated respectively. The relevancy between a hot-topic key phrase and an extracted information item is the sum of the relevancies between the hot-topic keyword combinations of that hot-topic key phrase and the extracted information item. The formula is given below.
$$\mathrm{Score}(Q, D) = \sum_{i=1}^{n} \left[ \frac{\lambda_1\, TF(q_i) \cdot \lambda_2\, H(D)}{\lambda_3\, DF(q_i) \cdot \lambda_4\, L(D)} \right]$$
where:
Score(Q,D) is the relevancy between the hot-topic key phrase Q and the information D;
Q is the hot-topic key phrase;
i is a sequence number of the hot-topic keyword combination included in the hot-topic key phrase;
n is the number of hot-topic keyword combinations included in the hot-topic key phrase.
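The summation over keyword combinations can be sketched in Python. Several choices here are assumptions rather than part of the disclosure: words are split on whitespace, TF is counted by non-overlapping substring matching, L(D) is measured in characters, document frequencies are supplied in a precomputed dictionary, and combinations with zero TF or DF contribute nothing (which also avoids division by zero).

```python
from typing import Dict, List, Sequence


def bigrams(phrase: str) -> List[str]:
    """Every two adjacent words of the phrase, in order."""
    words = phrase.split()
    if len(words) < 2:
        return list(words)  # zero or one word: nothing to pair
    return [" ".join(words[i:i + 2]) for i in range(len(words) - 1)]


def score(phrase: str, text: str, hotness: float, doc_freq: Dict[str, int],
          lambdas: Sequence[float] = (1.0, 1.0, 1.0, 1.0)) -> float:
    """Sum λ1·TF(q)·λ2·H(D) / (λ3·DF(q)·λ4·L(D)) over all combinations q."""
    l1, l2, l3, l4 = lambdas
    length = len(text)                 # L(D): content length in characters
    total = 0.0
    for q in bigrams(phrase):
        tf = text.count(q)             # TF(q): occurrences of q in the content
        df = doc_freq.get(q, 0)        # DF(q): number of documents containing q
        if tf == 0 or df == 0 or length == 0:
            continue                   # no contribution; avoids dividing by zero
        total += (l1 * tf * l2 * hotness) / (l3 * df * l4 * length)
    return total


text = "Ji Lin explosion event. Ji Lin officials respond."
doc_freq = {"Ji Lin": 4, "Lin explosion": 1, "explosion event": 2}
relevancy = score("Ji Lin explosion event", text, hotness=10.0, doc_freq=doc_freq)
```

In the sample, "Ji Lin" occurs twice and the other two combinations once each, so with unit coefficients the three terms sum to 10 × (2/4 + 1/1 + 1/2) divided by the length of `text`.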
[0052] S210, a candidate information item which has the highest relevancy with each obtained hot-topic key phrase is extracted respectively from the candidate information collection to be the hot-topic information, and the process returns to S202.
[0053] Thus, according to disclosed embodiments, based on a display requirement of the hot-topic information, the hot-topic information which satisfies the requirement may be obtained in various ways. The RSS feed of the pre-designated information website may be captured and analyzed periodically, and the obtained information may be stored in a content pool formed in the information collection. Further, the hot-topic key phrase set may be obtained periodically. Each time after the hot-topic key phrase set is obtained, relatively fresh information is extracted from the content pool and the relevancy between the hot-topic key phrase in the hot-topic key phrase set and the extracted relatively fresh information is calculated respectively. Based on the calculated relevancy, the extracted relatively fresh information is screened to extract the candidate information which satisfies a preset condition to be the hot-topic information. Thus, the hot-topic information may be independently obtained through a computer in a preset period, which can save the cost and increase the hot-topic information obtaining speed.
[0054] Figure 3 illustrates a block diagram of a hot-topic information obtaining apparatus according to disclosed embodiments of the present invention. As shown in Figure 3, the hot-topic information obtaining apparatus includes a hot-topic key phrase obtaining unit 301, a candidate information extracting unit 302, a relevancy calculating unit 303 and a hot-topic information screening unit 304.
[0055] The hot-topic key phrase obtaining unit 301 is configured to obtain a hot-topic key phrase set. The candidate information extracting unit 302 is configured to extract information within a preset time window from an information collection to be a candidate information collection.
[0056] The relevancy calculating unit 303 is configured to calculate respectively a relevancy between each hot-topic key phrase included in the hot-topic key phrase set and the information in the candidate information collection. The hot-topic information screening unit 304 is configured to screen the candidate information collection based on the calculated relevancy to extract a candidate information which satisfies a preset condition to be a hot-topic information.
[0057] The hot-topic key phrase obtaining unit 301 may be further configured to obtain the hot-topic key phrase set from a pre-designated website. For the preset time window, the start time of the time window precedes the current time by less than a preset time length (e.g. 24 hours) and the end time of the time window is the current time.
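The time-window rule amounts to keeping items published within the preset time length before the current time. A minimal sketch, assuming each stored item carries a publication time; the `Item` record, the function `within_window` and the sample URLs are hypothetical:

```python
from datetime import datetime, timedelta
from typing import Iterable, List, Tuple

Item = Tuple[str, datetime]  # (url, publication time); a hypothetical record


def within_window(items: Iterable[Item], now: datetime,
                  max_age: timedelta = timedelta(hours=24)) -> List[Item]:
    """Keep items published after now - max_age and no later than now."""
    start = now - max_age
    return [it for it in items if start < it[1] <= now]


now = datetime(2013, 8, 29, 12, 0)
items = [
    ("http://example.com/old", datetime(2013, 8, 27, 9, 0)),
    ("http://example.com/new", datetime(2013, 8, 29, 7, 30)),
]
candidates = within_window(items, now)
```

Only the item published within the last 24 hours survives, and it becomes part of the candidate information collection.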
[0058] Further, the relevancy calculating unit 303 may calculate a relevancy between an i-th hot-topic key phrase and a j-th information in the following steps. The i and j are positive integers no less than 1.
[0059] The i-th hot-topic key phrase is split into at least one hot-topic keyword combination. The relevancy between each hot-topic keyword combination split from the i-th hot-topic key phrase and the j-th information is calculated respectively according to a preset algorithm. The relevancy between each hot-topic keyword combination split from the i-th hot-topic key phrase and the j-th information is added up to be the relevancy between the i-th hot-topic key phrase and the j-th information. Splitting the i-th hot-topic phrase into the at least one hot-topic keyword combination includes splitting every two adjacent words from the i-th hot-topic key phrase to be the hot-topic keyword combinations.
[0060] Further, the relevancy between each hot-topic key phrase in the hot-topic key phrase set and each information in the candidate information collection is calculated respectively. Specifically, the relevancy between the hot-topic key phrase and the information is calculated by the following formula.
$$\mathrm{Score}(Q, D) = \sum_{i=1}^{n} \left[ \frac{\lambda_1\, TF(q_i) \cdot \lambda_2\, H(D)}{\lambda_3\, DF(q_i) \cdot \lambda_4\, L(D)} \right]$$
where:
Q is the hot-topic key phrase;
D is the information;
Score(Q,D) is the relevancy between the hot-topic key phrase Q and the information D;
i is a sequence number of the hot-topic keyword combination included in the hot-topic key phrase;
n is the number of hot-topic keyword combinations included in the hot-topic key phrase;
q_i is the i-th hot-topic keyword combination of the hot-topic key phrase Q;
TF(q_i) is an occurrence frequency of the hot-topic keyword combination q_i in the document content of D;
H(D) is a hotness index of the information D;
DF(q_i) is a document frequency of the hot-topic keyword combination q_i;
L(D) is a content length of the information D;
and λ1, λ2, λ3 and λ4 are preset coefficients.
[0061] Specifically, the hot-topic information screening unit 304 may be configured to extract respectively the candidate information which has the highest relevancy with each obtained hot-topic key phrase from the candidate information collection to be the hot-topic information. Further, the hot-topic information screening unit 304 may also be configured to extract respectively the candidate information whose relevancy with each obtained hot-topic key phrase is within a preset top number in the relevancy queue from the candidate information collection to be the hot-topic information. Furthermore, the hot-topic information screening unit 304 may further be configured to extract respectively a preset number of the candidate information which has the greatest relevancy with each obtained hot-topic key phrase from the candidate information collection to be the hot-topic information.
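The highest-relevancy and top-number screening options can be sketched with one helper. This is a hedged illustration: the nested-dictionary layout (key phrase, then information identifier, then relevancy) and the function name are assumptions made for the example.

```python
import heapq
from typing import Dict, List


def screen_top_n(
    relevancy: Dict[str, Dict[str, float]], n: int = 1
) -> Dict[str, List[str]]:
    """For each key phrase, keep the n candidates with the greatest relevancy.

    With n=1 this selects the single highest-relevancy candidate per phrase.
    """
    selected = {}
    for phrase, scores in relevancy.items():
        # nlargest returns identifiers ordered from most to least relevant.
        selected[phrase] = heapq.nlargest(n, scores, key=scores.get)
    return selected
```

Selection is done per key phrase, so the same piece of information may be chosen for more than one phrase.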
[0062] Thus, the disclosed hot-topic information obtaining apparatus obtains the hot-topic key phrase set, extracts relatively fresh information from the information collection to be the candidate information collection, and calculates respectively the relevancy between each hot-topic key phrase in the obtained hot-topic key phrase set and the relatively fresh information extracted from the information collection. Based on the calculated relevancy, the disclosed hot-topic information obtaining apparatus screens the extracted relatively fresh information to obtain the candidate information which satisfies the preset condition to be the hot-topic information. Thus, the disclosed hot-topic information obtaining apparatus may independently obtain the hot-topic information through a computer, which can save cost and increase the hot-topic information obtaining speed.
[0063] Figure 4 illustrates a block diagram of another hot-topic information obtaining apparatus according to disclosed embodiments of the present invention. As shown in Figure 4, the hot-topic information obtaining apparatus includes a hot-topic key phrase obtaining unit 401, an information obtaining unit 402, a candidate information extracting unit 403, a relevancy calculating unit 404, a hot-topic information screening unit 405, and a hot-topic information displaying unit 406.
[0064] The hot-topic key phrase obtaining unit 401 is configured to obtain a hot-topic key phrase set. The information obtaining unit 402 is configured to capture and analyze an RSS feed of a pre-designated information website to obtain information and to store the obtained information in an information collection before the candidate information extracting unit 403 extracts information within a preset time window from the information collection to be a candidate information collection.
[0065] The candidate information extracting unit 403 is configured to extract the information within the preset time window from the information-containing information collection to be the candidate information collection.
[0066] The relevancy calculating unit 404 is configured to calculate respectively a relevancy between each hot-topic key phrase in the hot-topic key phrase set and the information in the candidate information collection.
[0067] The hot-topic information screening unit 405 is configured to screen the candidate information collection based on the calculated relevancy to extract the candidate information which satisfies a preset condition to be the hot-topic information. The hot-topic information displaying unit 406 is configured to display the screened hot-topic information.
[0068] In addition, the information obtaining unit 402 may further be configured to capture and analyze periodically the RSS-feed of the pre-designated information website.
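The capture-and-analyze step can be sketched for a standard RSS 2.0 feed using only the Python standard library. The sample feed, field names, and the choice to de-duplicate stored items by link are all assumptions for illustration. Fetching the feed over the network and scheduling the periodic capture are left out.

```python
import xml.etree.ElementTree as ET
from email.utils import parsedate_to_datetime
from typing import Dict, List

# Hypothetical feed content standing in for a pre-designated website.
SAMPLE_FEED = """<rss version="2.0"><channel>
<title>Example news</title>
<item><title>Cup final tonight</title><link>http://example.com/1</link>
<description>Preview of the final.</description>
<pubDate>Wed, 27 Aug 2014 08:00:00 GMT</pubDate></item>
<item><title>Rate cut announced</title><link>http://example.com/2</link>
<description>Central bank lowers rates.</description>
<pubDate>Tue, 26 Aug 2014 09:30:00 GMT</pubDate></item>
</channel></rss>"""


def parse_rss(xml_text: str) -> List[dict]:
    """Analyze an RSS 2.0 document into a list of information items."""
    root = ET.fromstring(xml_text)
    items = []
    for item in root.iter("item"):
        published = item.findtext("pubDate")
        items.append({
            "title": item.findtext("title", default=""),
            "link": item.findtext("link", default=""),
            "content": item.findtext("description", default=""),
            # RSS 2.0 dates use the RFC 822 format.
            "published": parsedate_to_datetime(published) if published else None,
        })
    return items


def store(collection: Dict[str, dict], items: List[dict]) -> None:
    """Store items in the information collection, keyed by link."""
    for item in items:
        # Assumption: an item captured again on a later run is not duplicated.
        collection.setdefault(item["link"], item)
```

Keying the collection by link means a periodic capture can call `store` repeatedly on the same feed without growing the collection.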
[0069] That is, compared with the hot-topic information obtaining apparatus shown in Figure 3, the information obtaining unit is added to capture and analyze the RSS feed of the pre-designated information website to obtain the information and to store the obtained information in the information collection before the candidate information extracting unit extracts the information within the preset time window from the information-containing information collection to be the candidate information collection. Thus, the disclosed hot-topic information obtaining apparatus may improve the extracting efficiency of the candidate information extracting unit and further improve the hot-topic information obtaining efficiency.
[0070] Further, the present invention may also provide an electronic device or terminal to obtain the hot-topic information. As shown in Figure 5, the terminal may include an RF (Radio Frequency) circuit 501, a memory 502 including one or more computer-readable storage media, an input unit 503, a display unit 504, a sensor 505, an audio circuit 506, a Wi-Fi (Wireless Fidelity) module 507, a processor 508 with one or more processing cores, a power supply 509 and other components. Those skilled in the art may understand that the terminal structure as shown in Figure 5 does not constitute any limitation of the terminal, and the terminal may include more or fewer components than those shown, or a combination of some of the components, or a different arrangement of components.
[0071] The RF circuit 501 may be configured to receive or transmit signals during the receiving and sending of a call or message. In particular, after receiving downlink information from a base station, the RF circuit 501 delivers the downlink information to the one or more processors 508. Further, the RF circuit 501 transmits uplink data to the base station. In general, the RF circuit 501 includes, but is not limited to, an antenna, at least one amplifier, a tuner, one or more oscillators, a subscriber identity module (SIM) card, a transceiver, a coupler, a low noise amplifier (LNA), a diplexer, etc. In addition, the RF circuit 501 may communicate with other devices through wireless communication and a network. Wireless communication may use any communication standard or protocol, including but not limited to Global System for Mobile Communications (GSM), General Packet Radio Service (GPRS), Code Division Multiple Access (CDMA), Wideband Code Division Multiple Access (WCDMA), Long Term Evolution (LTE), email, Short Message Service (SMS), etc.
[0072] The memory 502 is used to store software programs and modules, and the processor 508 is used to perform various functions and data processing by running the software programs and modules stored in memory 502. The memory 502 may mainly include a program storage segment and a data storage segment, wherein the program storage segment may store an operating system and an application which implements at least one function (such as playing audio, playing video, etc.), the data storage segment may store the data created according to the use of the terminal (such as audio data, phone book, etc.). In addition, the memory 502 may include high-speed random access memory. The memory 502 may also include non-volatile memory, such as at least one disk storage device, flash memory device, or other non-volatile solid-state memory device. Accordingly, the memory 502 may also include a memory controller to provide the memory access for the processor 508 and the input unit 503.
[0073] The input unit 503 may be used to receive input numbers and character information and to generate keyboard, mouse, joystick, trackball or optical signal input related to user settings and function control. Specifically, in one particular embodiment, the input unit 503 may include a touch sensitive surface 5031 and other input devices 5032. The touch sensitive surface 5031, also known as a touch screen or touch pad, may collect a user's touching operations on or near it (such as touching operations of a user's finger, stylus, or any suitable object on or near the touch sensitive surface 5031), and drive the corresponding connection device based on a preset program. Optionally, the touch sensitive surface 5031 may include a touch-detecting device portion and a touch-controller portion. The touch-detecting device detects a user's touching position, detects the signal caused by the touching operation, and sends the signal to the touch controller. The touch controller, after receiving the touching information from the touch-detecting device, converts the touching information into contact coordinates and sends the contact coordinates to the processor 508. Meanwhile, the touch controller may also receive and execute the commands sent by the processor 508. Further, the touch sensitive surface 5031 may be implemented in various types, such as resistive, capacitive, infrared and surface acoustic wave, etc. In addition to the touch sensitive surface 5031, the input unit 503 may also include other input devices 5032. Specifically, the other input devices 5032 may include, but are not limited to, one or more of a physical keyboard, function keys (such as volume control keys, key switches, etc.), a trackball, a mouse, and a joystick.
[0074] The display unit 504 may be used to display information entered by the user or information provided to the user, and to provide various graphical user interfaces of the terminal. These graphical user interfaces may be composed of graphics, text, icons, video and any combination thereof. The display unit 504 may include a display panel 5041. Optionally, the display panel 5041 may be configured in the form of an LCD (Liquid Crystal Display), an OLED (organic light emitting diode) display, etc. Further, the touch sensitive surface 5031 may cover the display panel 5041. When the touch sensitive surface 5031 detects a touching operation on or near it, the touch sensitive surface 5031 sends a touch event to the processor 508 to determine the type of the touch event. Then the processor 508 provides the corresponding visual output on the display panel 5041 according to the type of the touch event. Although, in Figure 5, the touch sensitive surface 5031 and the display panel 5041 are shown as two separate components to achieve the input and output function, in some embodiments, the touch sensitive surface 5031 may be integrated with the display panel 5041 to realize the input and output function.
[0075] The terminal may also include at least one sensor 505, such as a light sensor, a motion sensor and other sensors. Specifically, the light sensor may include an ambient light sensor and a proximity sensor, wherein the ambient light sensor may be used to adjust the brightness of the display panel 5041 according to the ambient light magnitude and the proximity sensor may be configured to turn off the display panel 5041 and/or the backlight when the terminal is moved close to the ear. As a motion sensor, the gravity acceleration sensor may detect the acceleration magnitude in all directions (typically in three axes) and may also detect the magnitude and the direction of gravity when at rest. The gravity acceleration sensor may therefore be used in phone gesture-identifying applications (such as switching between horizontal and vertical screen, relevant games, magnetometer posture calibration), vibration recognition related functions (such as pedometers, percussion), etc. The terminal may also be configured with a gyroscope, barometer, hygrometer, thermometer, infrared sensor and other sensors, which are not described further here.
[0076] The audio circuit 506, a speaker 5061 and a microphone 5062 may provide an audio interface between a user and the terminal. On one hand, the audio circuit 506, after converting the received audio data into an electrical signal, transmits the electrical signal to the speaker 5061, and the speaker 5061 converts the electrical signal into sound output. On the other hand, after the microphone 5062 converts the collected sound signal into an electrical signal, the audio circuit 506 converts the received electrical signal into audio data and transmits the audio data to the processor 508 to be processed. The processed audio data is transmitted through the RF circuit 501 to, for example, another terminal, or is stored in the memory 502 for further processing. The audio circuit 506 may also include an earphone jack to provide communications between a peripheral headset and the terminal.
[0077] Wi-Fi is a short-range wireless transmission technology. Through the Wi-Fi module 507, the terminal may help a user to send and receive email, browse web pages and access streaming media, etc. The Wi-Fi module 507 provides the user with wireless broadband Internet access. Although the Wi-Fi module 507 is shown in Figure 5, it is understood that it is not necessarily part of the terminal and can be omitted as needed without changing the nature of the invention.
[0078] The processor 508 is the control center of the terminal. By using a variety of interfaces and circuits to connect various parts of the entire mobile phone, running or executing the software programs and/or modules stored in the memory 502, and by calling the data stored in the memory 502, the processor 508 may perform a variety of functions and data processing of the terminal, so as to monitor the entire mobile phone. Alternatively, the processor 508 may include one or more processing cores. Preferably, the processor 508 may be integrated with an application processor and a modem processor, wherein the application processor mainly processes operating system, user interface and applications, etc., and the modem processor mainly handles the wireless communication. It is understandable that the above modem processor may also not be integrated into the processor 508.
[0079] The terminal further includes a power supply 509 (such as a battery). Preferably, the power supply 509 may be logically connected to the processor 508 through a power management system, so that functions such as charging, discharging and power management may be implemented through the power management system. The power supply 509 may also include one or more of DC or AC power, a recharging system, a power failure detection circuit, a power converter or inverter, power supply status indicators, etc.
[0080] Although not shown, the terminal may also include a camera, a Bluetooth module, etc., which are not described further here. In certain embodiments, the display unit is a touch screen display, and the terminal may use memory or other storage to store one or more computer programs. The processor 508 of the terminal may be configured to load the one or more executable computer programs and to execute the one or more computer programs to achieve various functions as follows.
[0081] That is, the processor may be configured to obtain a hot-topic key phrase set, to extract information within a preset time window from an information collection containing information to be a candidate information collection, to calculate respectively a relevancy between each hot-topic key phrase included in the hot-topic key phrase set and the information in the candidate information collection, and to screen the candidate information collection according to calculated relevancy to extract a candidate information which satisfies a preset condition to be a hot-topic information.
[0082] Further, the processor 508 may be configured to obtain the hot-topic key phrase set from a pre-designated website.
[0083] Further, the processor 508 may be configured to capture and analyze an RSS feed of the pre-designated information website to obtain information and to store the obtained information in the information collection.
[0084] Further, the processor 508 may be configured to capture and analyze periodically the RSS feed of the pre-designated information website.
[0085] Further, a start time of the preset time window from a current time is less than a preset time length and an end time of the preset time window is the current time.
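The time window described here ends at the current time and starts less than a preset time length before it. A minimal sketch of extracting the candidate information collection under that definition follows. The item layout (a dictionary with a `published` timestamp) and the function name are assumptions for illustration.

```python
from datetime import datetime, timedelta, timezone
from typing import Iterable, List


def extract_candidates(
    collection: Iterable[dict], now: datetime, window: timedelta
) -> List[dict]:
    """Keep items published within the preset time window ending at `now`."""
    start = now - window
    # Strictly after `start`: the window start is less than `window` before now.
    return [item for item in collection if start < item["published"] <= now]
```

For example, with `window=timedelta(hours=24)` only information published in the last 24 hours is kept as candidates, which is one way to realize "relatively fresh information".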
[0086] Further, the processor 508 may be configured to calculate a relevancy between an i-th hot-topic key phrase and a j-th information in the following steps. The i and j are positive integers no less than 1.
[0087] The i-th hot-topic key phrase is split into at least one hot-topic keyword combination. A relevancy between each hot-topic keyword combination split from the i-th hot-topic key phrase and the j-th information is calculated respectively according to a preset algorithm. The relevancy between each hot-topic keyword combination split from the i-th hot-topic key phrase and the j-th information is added up to be the relevancy between the i-th hot-topic key phrase and the j-th information. The processor 508 is also configured to split every two adjacent words from the i-th hot-topic key phrase to be the hot-topic keyword combinations.
[0088] Further, the processor 508 may be configured to calculate the relevancy between the hot-topic key phrase and the information by the following formula.
$$\mathrm{Score}(Q,D) = \sum_{i=1}^{n} \frac{\lambda_1\,TF(q_i) \cdot \lambda_2\,H(D)}{\lambda_3\,DF(q_i) \cdot \lambda_4\,L(D)}$$
where:
Q is the hot-topic key phrase;
D is the information;
Score(Q,D) is the relevancy between the hot-topic key phrase Q and the information D;
i is a sequence number of the hot-topic keyword combination included in the hot-topic key phrase;
n is the number of hot-topic keyword combinations included in the hot-topic key phrase;
q_i is the i-th hot-topic keyword combination of the hot-topic key phrase Q;
TF(q_i) is an occurrence frequency of the hot-topic keyword combination q_i in the document content of D;
H(D) is a hotness index of the information D;
DF(q_i) is a document frequency of the hot-topic keyword combination q_i;
L(D) is a content length of the information D;
λ1, λ2, λ3 and λ4 are preset coefficients.
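A worked example may help make the formula concrete. The numbers below are purely illustrative assumptions: a key phrase with n = 2 keyword combinations, all four coefficients set to 1, TF(q_1) = 1, TF(q_2) = 2, DF(q_1) = 2, DF(q_2) = 4, H(D) = 3 and L(D) = 6. The formula is read as each term being λ1·TF(q_i)·λ2·H(D) divided by λ3·DF(q_i)·λ4·L(D).

```latex
\begin{aligned}
\mathrm{Score}(Q,D)
  &= \frac{\lambda_1\,TF(q_1)\cdot\lambda_2\,H(D)}{\lambda_3\,DF(q_1)\cdot\lambda_4\,L(D)}
   + \frac{\lambda_1\,TF(q_2)\cdot\lambda_2\,H(D)}{\lambda_3\,DF(q_2)\cdot\lambda_4\,L(D)} \\
  &= \frac{1 \cdot 3}{2 \cdot 6} + \frac{2 \cdot 3}{4 \cdot 6}
   = 0.25 + 0.25 = 0.5
\end{aligned}
```

In this example the second combination occurs twice as often in D but also appears in twice as many documents, so both combinations contribute equally.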
[0089] Further, the processor 508 may be configured to screen the candidate information collection according to calculated relevancy to extract the candidate information which satisfies the preset condition to be the hot-topic information. Specifically, the processor 508 may be configured to extract respectively the candidate information which has the highest relevancy with each obtained hot-topic key phrase from the candidate information collection to be the hot-topic information. Further, the processor 508 may also be configured to extract respectively the candidate information whose relevancy with each obtained hot-topic key phrase is greater than a preset threshold from the candidate information collection to be the hot-topic information. Furthermore, the processor 508 may further be configured to extract respectively the candidate information whose relevancy with each obtained hot-topic key phrase is within a preset top number in the relevancy queue from the candidate information collection to be the hot-topic information.
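The threshold-based screening option can be sketched as below. As before, the nested-dictionary layout (key phrase, then information identifier, then relevancy) and the function name are assumptions for illustration, and ordering the kept candidates by descending relevancy is a presentation choice rather than something the text requires.

```python
from typing import Dict, List


def screen_by_threshold(
    relevancy: Dict[str, Dict[str, float]], threshold: float
) -> Dict[str, List[str]]:
    """For each key phrase, keep candidates whose relevancy exceeds the threshold."""
    return {
        phrase: sorted(
            (doc_id for doc_id, value in scores.items() if value > threshold),
            key=scores.get,
            reverse=True,  # most relevant first
        )
        for phrase, scores in relevancy.items()
    }
```

Unlike a fixed top number, a threshold may return no hot-topic information at all for a key phrase when nothing in the candidate collection is relevant enough.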
[0090] Further, the processor 508 may be configured to display the obtained hot-topic information.
[0091] It is understood that all or part of the disclosed embodiments may be implemented by a software program, and the software program may be stored in a readable storage medium, such as a hard disk, an optical disk or a floppy disk, etc.
[0092] It should be noted that the above disclosure only describes certain embodiments of the present invention, but the scope of the present invention is not limited thereto. Those skilled in the art may easily think of variations, changes, modifications or replacements of the disclosed embodiments within the disclosed technical scope. Any of those variations, changes, modifications or replacements should fall within the protection scope of the present invention. Accordingly, the scope of the present invention should be determined by the accompanying claims.
INDUSTRIAL APPLICABILITY AND ADVANTAGEOUS EFFECTS
[0093] Without limiting the scope of any claim and/or the specification, examples of industrial applicability and certain advantageous effects of the disclosed embodiments are listed for illustrative purposes. Various alterations, modifications, or equivalents to the technical solutions of the disclosed embodiments can be obvious to those skilled in the art and can be included in this disclosure.
[0094] According to the disclosed embodiments of the present invention, various hot-topic information obtaining methods and apparatus can be implemented. Using the disclosed hot-topic information obtaining methods and apparatus, a hot-topic key phrase set can be obtained, relatively fresh information can be extracted from an information collection to be a candidate information collection, and a relevancy between each hot-topic key phrase in the obtained hot-topic key phrase set and the relatively fresh information extracted from the information collection can be calculated respectively. Based on the calculated relevancy, the extracted relatively fresh information can be screened to obtain a candidate information which satisfies the preset condition to be a hot-topic information. Thus, the hot-topic information may be independently obtained through a computer and the hot-topic information can be displayed to the user in a timely manner, lowering cost and increasing the hot-topic information obtaining speed.

Claims

1. A hot-topic information obtaining method, comprising:
obtaining a hot-topic key phrase set;
extracting information within a preset time window from an information collection containing information to form a candidate information collection;
calculating respectively a relevancy between each hot-topic key phrase included in the hot-topic key phrase set and an information item in the candidate information collection; and
screening the candidate information collection according to the calculated relevancy to extract a candidate information item which satisfies a preset condition to be a hot-topic information.
2. The method according to claim 1, wherein obtaining the hot-topic key phrase set includes:
obtaining the hot-topic key phrase set from a pre-designated website.
3. The method according to claim 1, wherein, before extracting the information within the preset time window from the information collection to be the candidate information collection, the method further includes:
capturing and analyzing an RSS feed of the pre-designated information website to obtain information; and
storing the obtained information in the information collection.
4. The method according to claim 3, wherein, capturing and analyzing the RSS-feed of the pre-designated information website includes:
capturing and analyzing periodically the RSS-feed of the pre-designated information website.
5. The method according to claim 1, wherein a start time of the preset time window from a current time is less than a preset time length and an end time of the preset time window is the current time.
6. The method according to claim 1, wherein calculating respectively the relevancy between each hot-topic key phrase included in the hot-topic key phrase set and the information item in the candidate information collection includes:
provided that i and j are positive integers no less than 1, calculating a relevancy between an i-th hot-topic key phrase and a j-th information item by:
splitting the i-th hot-topic key phrase into at least one hot-topic keyword combination;
calculating respectively in a preset algorithm a relevancy between the j-th information item and each hot-topic keyword combination split from the i-th hot-topic key phrase; and
adding up the relevancy between the j-th information item and each hot-topic keyword combination split from the i-th hot-topic key phrase to be the relevancy between the i-th hot-topic key phrase and the j-th information item.
7. The method according to claim 6, wherein splitting the i-th hot-topic key phrase into the at least one hot-topic keyword combination includes:
splitting every two adjacent words from the i-th hot-topic key phrase into a hot-topic keyword combination.
8. The method according to claim 6, wherein calculating respectively in the preset algorithm the relevancy between each hot-topic key phrase included in the hot-topic key phrase set and the information item in the candidate information collection includes:
calculating the relevancy between each hot-topic key phrase and the information in the candidate information collection by:
$$\mathrm{Score}(Q,D) = \sum_{i=1}^{n} \frac{\lambda_1\,TF(q_i) \cdot \lambda_2\,H(D)}{\lambda_3\,DF(q_i) \cdot \lambda_4\,L(D)}$$
wherein:
Q is the hot-topic key phrase;
D is the information;
Score(Q,D) is the relevancy between the hot-topic key phrase Q and the information D;
i is a sequence number of the hot-topic keyword combination included in the hot-topic key phrase;
n is the number of hot-topic keyword combinations included in the hot-topic key phrase;
q_i is the i-th hot-topic keyword combination of the hot-topic key phrase Q;
TF(q_i) is an occurrence frequency of the hot-topic keyword combination q_i in the document content of D;
H(D) is a hotness index of the information D;
DF(q_i) is a document frequency of the hot-topic keyword combination q_i;
L(D) is a content length of the information D;
λ1, λ2, λ3 and λ4 are preset coefficients.
9. The method according to claim 1, wherein screening the candidate information collection based on the calculated relevancy to extract the candidate information which satisfies the preset condition to be the hot-topic information includes:
extracting respectively the candidate information which has highest relevancy with each obtained hot-topic key phrase from the candidate information collection to be the hot-topic information.
10. The method according to claim 1, wherein screening the candidate information collection based on the calculated relevancy to extract the candidate information which satisfies the preset condition to be the hot-topic information further includes:
extracting respectively the candidate information whose relevancy with each obtained hot-topic key phrase is greater than a preset threshold from the candidate information collection to be the hot-topic information.
11. The method according to claim 1, wherein screening the candidate information collection based on the calculated relevancy to extract the candidate information which satisfies the preset condition to be the hot-topic information further includes:
extracting respectively the information whose relevancy with each obtained hot-topic key phrase is within a preset top number in a relevancy queue from the candidate information collection to be the hot-topic information.
12. The method according to claim 1, wherein, after extracting the screened candidate information to be the hot-topic information, the method further comprises:
displaying the screened candidate hot-topic information.
13. A hot-topic information obtaining apparatus, comprising:
a hot-topic key phrase obtaining unit configured to obtain a hot-topic key phrase set;
a candidate information extracting unit configured to extract information within a preset time window from an information collection containing information to be a candidate information collection;
a relevancy calculating unit configured to calculate respectively a relevancy between a hot-topic key phrase included in the hot-topic key phrase set and an information item in the candidate information collection; and
a hot-topic information screening unit configured to screen the candidate information collection based on the calculated relevancy to extract a candidate information item which satisfies a preset condition to be a hot-topic information.
14. The apparatus according to claim 13, wherein the hot-topic key phrase obtaining unit is further configured to obtain the hot-topic key phrase set from a pre-designated website.
15. The apparatus according to claim 13, further comprising:
an information obtaining unit configured to capture and analyze an RSS feed of the pre-designated information website to obtain information and to store the obtained information in an information collection before the candidate information extracting unit extracts the information within the preset time window from the information collection to form the candidate information collection.
16. The apparatus according to claim 15, wherein the information obtaining unit is further configured to capture and analyze periodically the RSS feed of the pre-designated information website.
17. The apparatus according to claim 13, wherein a start time of the preset time window from a current time is less than a preset time length and an end time of the preset time window is the current time.
18. The apparatus according to claim 13, wherein the relevancy calculating unit is configured to: provided that i and j are positive integers no less than 1, calculate a relevancy between an i-th hot-topic key phrase and a j-th information item by:
splitting the i-th hot-topic key phrase into at least one hot-topic keyword combination;
calculating respectively in a preset algorithm a relevancy between the j-th information item and each hot-topic keyword combination split from the i-th hot-topic key phrase; and
adding up the relevancy between the j-th information item and each hot-topic keyword combination split from the i-th hot-topic key phrase to be the relevancy between the i-th hot-topic key phrase and the j-th information item.
19. The apparatus according to claim 18, wherein splitting the i-th hot-topic key phrase into the at least one hot-topic keyword combination includes splitting every two adjacent words from the i-th hot-topic key phrase into a hot-topic keyword combination.
20. The apparatus according to claim 18, wherein calculating respectively in the preset algorithm the relevancy between each hot-topic key phrase included in the hot-topic key phrase set and the information in the candidate information collection includes: calculating the relevancy between each hot-topic key phrase and the information item by:
$$\mathrm{Score}(Q,D) = \sum_{i=1}^{n} \frac{\lambda_1\,TF(q_i) \cdot \lambda_2\,H(D)}{\lambda_3\,DF(q_i) \cdot \lambda_4\,L(D)}$$
wherein:
Q is the hot-topic key phrase;
D is the information;
Score(Q,D) is the relevancy between the hot-topic key phrase Q and the information D;
i is a sequence number of the hot-topic keyword combination included in the hot-topic key phrase;
n is the number of hot-topic keyword combinations included in the hot-topic key phrase;
q_i is the i-th hot-topic keyword combination of the hot-topic key phrase Q;
TF(q_i) is an occurrence frequency of the hot-topic keyword combination q_i in the document content of D;
H(D) is a hotness index of the information D;
DF(q_i) is a document frequency of the hot-topic keyword combination q_i;
L(D) is a content length of the information D;
λ1, λ2, λ3 and λ4 are preset coefficients.
21. The apparatus according to claim 13, wherein the hot-topic information screening unit is further configured to extract respectively the candidate information which has the highest relevancy with each obtained hot-topic key phrase from the candidate information collection to be the hot-topic information.
22. The apparatus according to claim 13, wherein the hot-topic information screening unit is further configured to extract respectively the candidate information whose relevancy with each obtained hot-topic key phrase is within a preset top number in a relevancy queue from the candidate information collection to be the hot-topic information.
23. The apparatus according to claim 13, wherein the hot-topic information screening unit is further configured to extract respectively a preset number of the information which has the greatest relevancy with each obtained hot-topic key phrase from the candidate information collection to be the hot-topic information.
24. The apparatus according to claim 13, further including a hot-topic information displaying unit configured to display the screened hot-topic information.
PCT/CN2014/085260 2013-08-29 2014-08-27 Method and apparatus for obtaining hot-topic information WO2015027909A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201310386577.6A CN104424278B (en) 2013-08-29 2013-08-29 A kind of method and device obtaining hot spot information
CN201310386577.6 2013-08-29

Publications (1)

Publication Number Publication Date
WO2015027909A1 (en) 2015-03-05

Family

ID=52585593

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2014/085260 WO2015027909A1 (en) 2013-08-29 2014-08-27 Method and apparatus for obtaining hot-topic information

Country Status (2)

Country Link
CN (1) CN104424278B (en)
WO (1) WO2015027909A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109918656A (en) * 2019-02-28 2019-06-21 武汉斗鱼鱼乐网络科技有限公司 A kind of live streaming hot spot acquisition methods, device, server and storage medium
CN110472013A (en) * 2019-08-06 2019-11-19 湖南蚁坊软件股份有限公司 A kind of hot topic update method, device and computer storage medium
US11159458B1 (en) 2020-06-10 2021-10-26 Capital One Services, Llc Systems and methods for combining and summarizing emoji responses to generate a text reaction from the emoji responses

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108228898A (en) * 2018-02-06 2018-06-29 广州市西美信息科技有限公司 Searching method, device and the server of customs's data
CN109977315A (en) * 2019-03-29 2019-07-05 厦门铠甲网络股份有限公司 A kind of article recommended method, device, equipment and storage medium
CN109977316A (en) * 2019-03-29 2019-07-05 厦门铠甲网络股份有限公司 A kind of parallel type article recommended method, device, equipment and storage medium
CN113656695A (en) * 2021-08-18 2021-11-16 北京奇艺世纪科技有限公司 Hot data generation method and device, data processing method and electronic equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101661513A (en) * 2009-10-21 2010-03-03 上海交通大学 Detection method of network focus and public sentiment
CN101986298A (en) * 2010-10-28 2011-03-16 浙江大学 Information real-time recommendation method for online forum
US8010545B2 (en) * 2008-08-28 2011-08-30 Palo Alto Research Center Incorporated System and method for providing a topic-directed search
CN102968439A (en) * 2012-10-11 2013-03-13 微梦创科网络科技(中国)有限公司 Method and device for sending microblogs

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100570611C (en) * 2008-08-22 2009-12-16 清华大学 A kind of methods of marking of the information retrieval document based on viewpoint searching
CN101923544B (en) * 2009-06-15 2012-08-08 北京百分通联传媒技术有限公司 Method for monitoring and displaying Internet hot spots
CN103218410A (en) * 2013-03-26 2013-07-24 亿赞普(北京)科技有限公司 Internet event analysis method and device

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109918656A (en) * 2019-02-28 2019-06-21 武汉斗鱼鱼乐网络科技有限公司 A kind of live streaming hot spot acquisition methods, device, server and storage medium
CN109918656B (en) * 2019-02-28 2022-12-23 武汉斗鱼鱼乐网络科技有限公司 Live broadcast hotspot acquisition method and device, server and storage medium
CN110472013A (en) * 2019-08-06 2019-11-19 湖南蚁坊软件股份有限公司 A kind of hot topic update method, device and computer storage medium
CN110472013B (en) * 2019-08-06 2023-03-24 湖南蚁坊软件股份有限公司 Hot topic updating method and device and computer storage medium
US11159458B1 (en) 2020-06-10 2021-10-26 Capital One Services, Llc Systems and methods for combining and summarizing emoji responses to generate a text reaction from the emoji responses
US11444894B2 (en) 2020-06-10 2022-09-13 Capital One Services, Llc Systems and methods for combining and summarizing emoji responses to generate a text reaction from the emoji responses

Also Published As

Publication number Publication date
CN104424278B (en) 2019-02-26
CN104424278A (en) 2015-03-18

Similar Documents

Publication Publication Date Title
WO2015027909A1 (en) Method and apparatus for obtaining hot-topic information
US9241242B2 (en) Information recommendation method and apparatus
US20170091335A1 (en) Search method, server and client
US10095666B2 (en) Method and terminal for adding quick link
CN108156508B (en) Barrage information processing method and device, mobile terminal, server and system
CN106708496B (en) Processing method and device for label page in graphical interface
US10643021B2 (en) Method and device for processing web page content
CN104978115A (en) Content display method and device
US10956653B2 (en) Method and apparatus for displaying page and a computer storage medium
CN110019840B (en) Method, device and server for updating entities in knowledge graph
CN104102419A (en) Page display method and device and terminal equipment
TW201512865A (en) Method for searching web page digital data, device and system thereof
CN104750730B (en) Browser display method and device
CN110633438B (en) News event processing method, terminal, server and storage medium
CN104281621A (en) Method and device for browsing web page
CN104182429A (en) Web page processing method and terminal
CN105630846A (en) Head portrait updating method and apparatus
CN107992615B (en) Website recommendation method, server and terminal
CN104267882A (en) Page suspension frame display method and device
CN103336838A (en) Method and device for processing webpage and terminal equipment
CN110688497A (en) Resource information searching method and device, terminal equipment and storage medium
CN104239369A (en) Method, device and system for filtering out webpage advertisements
CN105095161B (en) Method and device for displaying rich text information
CN108595107B (en) Interface content processing method and mobile terminal
CN103455601A (en) Webpage processing method and device, and terminal equipment

Legal Events

Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application (Ref document number: 14838997; Country of ref document: EP; Kind code of ref document: A1)
NENP Non-entry into the national phase (Ref country code: DE)
32PN EP: public notification in the EP bulletin as address of the addressee cannot be established (Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 170616))
122 EP: PCT application non-entry in European phase (Ref document number: 14838997; Country of ref document: EP; Kind code of ref document: A1)