CN111460289A - News information pushing method and device - Google Patents

News information pushing method and device Download PDF

Info

Publication number
CN111460289A
CN111460289A CN202010228073.1A CN202010228073A CN111460289A CN 111460289 A CN111460289 A CN 111460289A CN 202010228073 A CN202010228073 A CN 202010228073A CN 111460289 A CN111460289 A CN 111460289A
Authority
CN
China
Prior art keywords
event
news information
news
cluster
name
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010228073.1A
Other languages
Chinese (zh)
Other versions
CN111460289B (en
Inventor
潘禄
陈玉光
李法远
韩翠云
刘远圳
黄佳艳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN202010228073.1A priority Critical patent/CN111460289B/en
Publication of CN111460289A publication Critical patent/CN111460289A/en
Application granted granted Critical
Publication of CN111460289B publication Critical patent/CN111460289B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/55Push-based network services
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The application discloses a news information pushing method and a news information pushing device, and relates to the field of knowledge graphs. The specific implementation scheme is as follows: acquiring a news title of current news information, and extracting an event name in the news title according to a preset extraction strategy; if at least one event name is obtained through extraction, matching each event name in the at least one event name with a plurality of event clusters in a preset event library, wherein each event cluster in the event clusters comprises a plurality of news information under the same event; judging whether a target event cluster to which news information belongs exists in a preset event library or not according to the matching results of all event names; if the target event cluster exists, the news information is merged into the target event cluster so as to provide a push service of the news information according to the target event cluster. Therefore, news information is combed by taking the event as granularity, and the news information service with high relevance is conveniently pushed to the user.

Description

News information pushing method and device
Technical Field
The present application relates to the field of knowledge graph technology in the field of computer technology, and in particular, to a method and an apparatus for pushing news information.
Background
With the rapid popularization of the internet, network information is explosively increased, and everyone needs to spend a great deal of energy to screen the information. When a user wants to know a recent event or pay attention to a person or an organization, important information needs to be selected from a large amount of news information which is not sorted and arranged.
In the related art, in order to improve the correlation between news information and a search request of a user, keyword matching is performed between the search request of the user and the content of the news information, and the news information is sorted according to a matching result.
Disclosure of Invention
The application provides a news information pushing method and device. The method and the device realize the combing of news information by taking events as granularity, and are convenient for pushing news information service with high relevance to users.
According to a first aspect, a method for pushing news information is provided, which includes: acquiring a news title of current news information, and extracting an event name in the news title according to a preset extraction strategy; if at least one event name is obtained through extraction, matching each event name in the at least one event name with a plurality of event clusters in a preset event library, wherein each event cluster in the event clusters comprises a plurality of news information under the same event; judging whether a target event cluster to which the news information belongs exists in the preset event library or not according to the matching results of all event names; and if the target event cluster exists, merging the news information into the target event cluster so as to provide a push service of the news information according to the target event cluster.
Optionally, the extracting the event name in the news headline according to a preset extraction policy includes: and inputting the news headline into a preset event name extraction model, wherein the event name extraction model learns in advance to obtain the corresponding relation between the news headline and the event name.
Optionally, if at least one event name is obtained through extraction, matching each event name in the at least one event name with a plurality of event clusters in a preset event library, including: obtaining a subject event name of each event cluster in the plurality of event clusters; and performing semantic matching on each event name in the at least one event name and the topic event name of each event cluster to obtain semantic similarity.
Optionally, after merging the news information into the target event cluster if the target event cluster exists, the method further includes: acquiring a news reading request, and determining a reading event corresponding to the news reading request; judging whether the reading event is matched with an event corresponding to the target event cluster; if the news information is matched with the target event cluster, determining the release time of each news information in the target event cluster; and selecting a preset number of recommended news from the target event cluster according to the releasing time in the order from near to far, and displaying the recommended news according to a preset display strategy.
Optionally, after merging the news information into the target event cluster if the target event cluster exists, the method further includes: counting the number of news information contained in the target event cluster; judging whether the quantity is greater than a preset threshold value, if so, determining a hotspot event name according to the event name of each news information in the target event cluster; and pushing the hotspot event according to the hotspot event name.
Optionally, the merging the news information into the target event cluster includes: and storing the corresponding relation between the at least one event name and the news information in the target event cluster.
Optionally, after the determining whether the target event cluster to which the news information belongs exists in the preset event library, the method further includes: and if the target event cluster does not exist, newly adding an event cluster corresponding to the news information.
According to a second aspect, there is provided a news information push apparatus, including: the extraction module is used for acquiring a news title of current news information and extracting an event name in the news title according to a preset extraction strategy; the matching module is used for matching each event name in the at least one event name with a plurality of event clusters in a preset event library when the at least one event name is obtained through extraction, wherein each event cluster in the event clusters comprises a plurality of news information under the same event; the first judgment module is used for judging whether a target event cluster to which the news information belongs exists in the preset event library or not according to the matching results of all event names; and the merging module is used for merging the news information into the target event cluster when the target event cluster exists so as to provide the push service of the news information according to the target event cluster.
Optionally, the method further includes: the device comprises a first determining module, a second determining module and a third determining module, wherein the first determining module is used for acquiring a news reading request and determining a reading event corresponding to the news reading request; the second judging module is used for judging whether the reading event is matched with the event corresponding to the target event cluster; the second determining module is used for determining the release time of each news information in the target event cluster during matching; and the display module is used for selecting a preset number of recommended news from the target event cluster according to the release time in a sequence from near to far and displaying the recommended news according to a preset display strategy.
Optionally, the method further includes: the counting module is used for counting the number of news information contained in the target event cluster; the third determining module is used for judging whether the number is greater than a preset threshold value or not, and determining a hot event name according to the event name of each news information in the target event cluster when the number is greater than the preset threshold value; and the pushing module is used for pushing the hotspot event according to the hotspot event name.
According to a third aspect, there is provided an electronic device comprising: at least one processor; and
a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor, and the instructions are executed by the at least one processor to enable the at least one processor to execute the method for pushing news information described in the above embodiment of the first aspect.
According to a fourth aspect, there is provided a non-transitory computer-readable storage medium storing computer instructions for causing a computer to perform the method for pushing news information as described in the embodiment of the first aspect.
The technical scheme provided by the application at least has the following technical effects:
the method comprises the steps of obtaining news titles of current news information, extracting event names in the news titles according to a preset extraction strategy, matching each event name in at least one event name with a plurality of event clusters in a preset event library if at least one event name is obtained through extraction, judging whether a target event cluster to which the news information belongs exists in the preset event library or not according to matching results of all the event names, and finally merging the news information into the target event cluster when the target event cluster exists so as to provide push service of the news information according to the target event cluster. Therefore, news information is combed by taking the event as granularity, and the news information service with high relevance is conveniently pushed to the user.
It should be understood that the statements in this section do not necessarily identify key or critical features of the embodiments of the present disclosure, nor do they limit the scope of the present disclosure. Other features of the present disclosure will become apparent from the following description.
Drawings
The drawings are included to provide a better understanding of the present solution and are not intended to limit the present application. Wherein:
fig. 1 is a flowchart illustrating a method for pushing news information according to a first embodiment of the present application;
FIG. 2 is a diagram of an event name extraction model training application according to a second embodiment of the present application;
FIG. 3 is a schematic diagram of event cluster composition according to a third embodiment of the present application;
FIG. 4 is a diagram illustrating an application of event classification discriminant model training according to a fourth embodiment of the present application;
FIG. 5-1 is a schematic diagram of a target event cluster determination process according to a fifth embodiment of the present application;
FIG. 5-2 is a schematic diagram of a target event cluster determination process according to a sixth embodiment of the present application;
FIG. 6 is a diagram illustrating a structure of a target event cluster according to a seventh embodiment of the present application;
fig. 7 is a flowchart illustrating a news information push method according to an eighth embodiment of the present application;
FIG. 8-1 is a diagram illustrating a news information push scenario according to a ninth embodiment of the present application;
FIG. 8-2 is a diagram illustrating a news information push scenario according to a tenth embodiment of the present application;
fig. 9 is a flowchart illustrating a news information push method according to an eleventh embodiment of the present application;
FIG. 10 is a diagram illustrating a news information push scenario according to a twelfth embodiment of the present application;
fig. 11 is a schematic structural diagram of a news information pushing apparatus according to a thirteenth embodiment of the present application;
fig. 12 is a schematic structural diagram of a news information pushing apparatus according to a fourteenth embodiment of the present application;
fig. 13 is a schematic structural diagram of a news information pushing apparatus according to a fifteenth embodiment of the present application;
fig. 14 is a block diagram of an electronic device for implementing a method for pushing news information according to an embodiment of the present application.
Detailed Description
The following description of the exemplary embodiments of the present application, taken in conjunction with the accompanying drawings, includes various details of the embodiments of the application for the understanding of the same, which are to be considered exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the present application. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.
In order to solve the problem that in the related technology, news information cannot be sorted based on events, so that the news information pushed to a user is not really intended to be obtained by the user, and the search experience of the user is not high, the news information method provided by the application sorts the news information based on event granularity, and sorts a large amount of news information into different event clusters, so that the pushing service is provided for the user in a targeted manner, and the purity of the news information pushed for the user is improved. The events in the embodiment of the present application include topics described by news information, including parent events and corresponding child events, and lower-level events, and the events may include "health", "advertisement", and the like.
Specifically, fig. 1 is a flowchart of a method for pushing news information according to an embodiment of the present application, as shown in fig. 1, the method includes:
step 101, obtaining a news title of current news information, and extracting an event name in the news title according to a preset extraction strategy.
It can be understood that the news headline usually shows the event specifically described by the corresponding news information in a concise and complete manner, and therefore, in order to improve the combing efficiency, in the present application, the news headline of the current news information is obtained, for example, a headline extraction model may be trained in advance, and the headline of the current news information is obtained according to the headline extraction model, and for example, the news headline may be identified according to special parameters such as the format and font of the news headline.
And then, after the news headline is obtained, extracting the event name in the news headline according to a preset extraction strategy.
It should be noted that, in different application scenarios, the manner of extracting event names in news titles according to a preset extraction policy is different, and examples are as follows:
example one:
in this example, an event name extraction model is trained in advance, the event name extraction model learns in advance to obtain a corresponding relationship between a news title and an event name, in the training process, as shown in fig. 2, a large number of sample data are obtained in advance, each sample data includes a labeled < title, event name > pair, an event name extraction model is generated according to the labeled < title, event name > pair, and further, in this embodiment, in order to further improve the determination efficiency of the event name, as shown in fig. 2, a plurality of candidate event names in the news title may be extracted according to a preset event name extraction policy (for example, a plurality of participles are obtained by segmenting a news title, and a part-word of a noun and the like is taken as a candidate event name according to the part-word of the plurality of participles), the plurality of candidate event names are input to the event name extraction model, and a probability value output by the event name extraction model is greater than a, to determine whether each candidate event name is an event name, thereby extracting event names in news headlines.
Example two:
in the example, a neural network is trained according to a large amount of sample data in advance, the segmentation feature of each segmentation in a news title is extracted, the segmentation feature of each segmentation is input into an event name extraction neural network, and whether each segmentation is an event name or not is determined according to the output probability value of the event name extraction neural network.
Step 102, if at least one event name is obtained by extraction, matching each event name in the at least one event name with a plurality of event clusters in a preset event library, wherein each event cluster in the plurality of event clusters comprises a plurality of news information under the same event.
In one embodiment of the present application, if the event name cannot be extracted, the news information is sent to the artificial platform for processing.
It can be understood that each generated news information is subjected to the combing of the affiliated events, so that a plurality of event clusters are finally obtained according to the affiliated events for the published historical news information, wherein each event cluster in the plurality of event clusters comprises a plurality of news information under the same event, wherein each event cluster can also be provided with a subject event name, the subject event name can be manually labeled, or the event name with the highest frequency can be used as the subject event name according to the frequency of occurrence of the event name in a title comprising the news information, for example, as shown in fig. 3, for a "pneumonia epidemic situation" event cluster comprising 100 news information, 100 news information are all described as a "pneumonia epidemic situation", wherein the subject event name of the event cluster is "2020 pneumonia epidemic situation".
In an embodiment of the present application, if at least one event name is extracted, each event name of the at least one event name is matched with a plurality of event clusters in a preset event library, i.e., it is determined whether current news information is a brand new event or latest news information of a historical event.
It should be noted that, in different application scenarios, the manner of matching each event name in the at least one event name with a plurality of event clusters in the preset event library is different, and the following example is given:
example one:
in this example, a topic event of each event cluster in the plurality of event clusters is obtained, and semantic matching is performed between each event name in the at least one event name and the topic event name of each event cluster to obtain semantic similarity.
Example two:
in this example, as shown in fig. 4, an event classification and judgment model is obtained by training in advance according to a large number of samples, and the event classification and judgment model can judge whether an input event name and an event cluster in a corresponding event library belong to the same event according to the input event name and a subject event name of the event cluster. Therefore, in the actual execution process, the topic event name of the event cluster in the event library is retrieved, and the event classification and judgment model matches the topic event name with the event name corresponding to the news title and judges whether the topic event name and the event name belong to the same event.
Step 103, judging whether a target event cluster to which the news information belongs exists in the preset event library according to the matching result of all event names.
Specifically, in the matching process in the above example, as shown in fig. 5-1, all event names corresponding to news titles may participate in matching with the event clusters in the event library at the same time, the total matching result of all event names is obtained at one time, and the event cluster with the highest probability is obtained according to the matching probability in the total matching result and is used as the target event cluster; or, as shown in fig. 5-2, each event name may be respectively matched with an event cluster in the event library to obtain a matching result corresponding to each event name, and in this embodiment, when all event clusters with matched event names are inconsistent, the event cluster with the largest matching success frequency is taken as the target event cluster.
And 104, if the target event cluster exists, merging the news information into the target event cluster so as to provide a push display service of the news information according to the target event cluster.
Specifically, if the target event cluster exists, it indicates that the current news information is the news information of the historical event, and at this time, the news information is merged into the target event cluster, so as to provide a push display service of the news information according to the target event cluster.
As a possible implementation manner, a corresponding relationship between at least one event name and news information may be stored in the target event cluster, so as to obtain an event corresponding to each news information at a glance, wherein, when there are a plurality of at least one event name, an event name with the lowest rank may be randomly extracted from the at least one event name as an event name when the corresponding relationship is stored, for example, when the event name includes both a parent event name and a child event name, a child event name may be stored, and the like.
As shown in fig. 6, when the target event cluster is "pneumonia epidemic", the event name showing each news information is stored in the target event cluster, and of course, different news information can be sorted according to the distribution time, so as to further facilitate management, wherein, taking the example shown in fig. 6 as an example, the event name corresponding to each news information stored in the target event cluster is "infection source", "disease place", "safeguard", "infection route", and the like.
In the actual execution process, advertisement news information without events can be filtered according to needs, reference is continuously made to fig. 4, after news information is obtained, whether the current news information contains events or not is analyzed according to an information event judgment model of preselection training, if yes, the titles of the news information are input into an event name extraction model, event names in news titles are extracted according to the event name extraction model, then the event names are input into an event classification judgment model, the event classification judgment model recalls the subject event names of event clusters in an event library as candidate events, then whether the event names are matched with the subject event names is judged, if yes, a target event cluster is determined, and the news information is classified into the target event cluster.
In an embodiment of the present application, if there is no target event cluster, an event cluster corresponding to news information is newly added, for example, one event name is extracted from the at least one event name as a subject event name of the newly added event cluster, and the first news information in the newly added event cluster is current news information, and the current news information is added to subsequently published news information.
It should be emphasized that the above embodiments describe in detail how to comb the news information into the event clusters to which the news information belongs, and the pushing of the news information after the combing is significant, and the following description is made in conjunction with the two embodiments.
The first embodiment is as follows:
in this embodiment, as shown in fig. 7, after the step 104, the method further includes:
step 201, a news reading request is obtained, and a reading event corresponding to the news reading request is determined.
In practical applications, a user may search news in a browser or the like, and in order to provide news information with higher relevance to the user, in this embodiment, a news reading request is obtained, and a reading event corresponding to the news reading request is determined, and the extracting manner of the reading event may be to extract an event name in the reading request, and the extracting manner of the event name may refer to the extracting manner of the event name in the above embodiment, and in this embodiment, the event name may be taken as the reading event.
Step 202, determining whether the reading event matches with an event corresponding to the target event cluster.
As a possible implementation manner, the event name may be matched with a topic event name in the event cluster to determine whether an event corresponding to a target event cluster where the news information is located is an event, for example, the event name of the reading request may be matched with the event name of the target event cluster, and whether the reading event is matched with the event corresponding to the target event cluster is determined according to a matching result.
In step 203, if the news information matches, the time of distribution of each news information in the target event cluster is determined.
The time for releasing each news information can be determined according to the time information marked in each news information, and the time for receiving the news information by the internet when each news information is pushed can be the corresponding release time.
And 204, selecting a preset number of recommended news from the target event cluster according to the releasing time in the sequence from near to far, and displaying the recommended news according to a preset display strategy.
It can be understood that, in order to improve the reading experience of the user, the latest news information corresponding to the reading event is provided for the user in principle, and a preset number of recommended news are selected from the target event cluster according to the releasing time and the sequence from near to far, wherein the preset number can be calibrated by the system or determined by the storage space of the device for displaying the news information.
Furthermore, recommended news is displayed according to a preset display strategy, for example, as shown in fig. 8-1, after 20 pieces of news information are determined, the news can be pushed in a theme folder form, and when a user opens the corresponding theme folder, the cover of the theme folder can display the theme event name of the corresponding event cluster, and then the 20 pieces of news information are displayed (not shown in the figure), so that the neatness of display is improved.
For another example, as shown in fig. 8-2, after 20 news information are determined, the news information may be pushed in a time axis manner, where each time node of the time axis displays an event name of the news information, and a user may intuitively select news information of new interest as needed. And each event name is linked with corresponding news information, and after the user clicks the corresponding event name, the corresponding news information is jumped to.
Example two:
in this embodiment, as shown in fig. 9, after the step 104, the method further includes:
step 301, counting the number of news information contained in the target event cluster.
Specifically, the number of news information contained in the target event cluster is counted, so that the hot spot events can be actively mined according to the number.
Step 302, judging whether the number is larger than a preset threshold value, and if so, determining a hotspot event name according to the event name of each news information in the target event cluster.
Specifically, whether the number is larger than a preset threshold value or not is judged, if so, a hotspot event name is determined according to the event name of each news information in the target event cluster, for example, the subject event name of the event cluster is used as the hotspot event name, and for example, one or more event names with the highest occurrence frequency are selected from the corresponding event clusters as the hotspot event names.
Of course, in order to further ensure the timeliness of the hot spots, the number of the news information within a preset time from the current time can be counted during the number counting.
Step 303, pushing the hot event according to the hot event name.
Specifically, the hotspot event is pushed according to the hotspot event name, and the hotspot event is actively discovered and pushed, in some possible examples, as shown in fig. 10, the hotspot event name is displayed in the form of a floating control, and a user clicks the corresponding floating control to jump to a corresponding event cluster or news information.
To sum up, the method for pushing news information according to the embodiment of the present application obtains a news title of current news information, extracts event names in the news title according to a preset extraction strategy, matches each event name in the at least one event name with a plurality of event clusters in a preset event library if at least one event name is obtained by extraction, wherein each event cluster in the event clusters contains a plurality of news information under the same event, further determines whether a target event cluster to which the news information belongs exists in the preset event library according to matching results of all event names, and finally merges the news information into the target event cluster when the target event cluster exists, so as to provide a pushing service of the news information according to the target event cluster. Therefore, news information is combed by taking the event as granularity, and the news information service with high relevance is conveniently pushed to the user.
In order to implement the above embodiment, the present application further provides a device for pushing news information. Fig. 11 is a schematic structural diagram of a news information pushing apparatus according to an embodiment of the present application, and as shown in fig. 11, the news information pushing apparatus includes: an extraction module 10, a matching module 20, a first judgment module 30 and a merging module 40, wherein,
the extraction module 10 is configured to acquire a news title of current news information, and extract an event name in the news title according to a preset extraction strategy;
a matching module 20, configured to match each event name in the at least one event name with multiple event clusters in a preset event library when the at least one event name is obtained through extraction, where each event cluster in the multiple event clusters includes multiple news information under the same event;
the first judging module 30 is configured to judge whether a target event cluster to which news information belongs exists in the preset event library according to matching results of all event names;
the merging module 10 is configured to merge the news information into the target event cluster when the target event cluster exists, so as to provide a push service of the news information according to the target event cluster.
In one embodiment of the present application, as shown in fig. 12, on the basis of that shown in fig. 11, the apparatus further comprises: a first determination module 50, a second determination module 60, a second determination module 70, and a display module 80, wherein,
a first determining module 50, configured to obtain a news reading request, and determine a reading event corresponding to the news reading request;
a second judging module 60, configured to judge whether the reading event matches an event corresponding to the target event cluster;
a second determining module 70, configured to determine, during matching, a distribution time of each news item in the target event cluster;
and the display module 80 is configured to select a preset number of recommended news from the target event cluster according to the distribution time in the order from near to far, and display the recommended news according to a preset display strategy.
In one embodiment of the present application, as shown in fig. 13, on the basis of that shown in fig. 11, the apparatus further comprises: the event cluster pushing system comprises a counting module 90, a third determining module 100 and a pushing module 110, wherein the counting module 90 is used for counting the number of news information contained in a target event cluster;
a third determining module 100, configured to determine whether the number is greater than a preset threshold, and determine a hotspot event name according to an event name of each piece of news information in the target event cluster when the number is greater than the preset threshold;
the pushing module 110 is configured to push the hotspot event according to the hotspot event name.
It should be noted that the explanation of the method for pushing news information is also applicable to the device for pushing news information in the embodiment of the present application, and the implementation principle is similar, and details of the related technology are not repeated herein.
To sum up, the device for pushing news information in the embodiment of the present application obtains a news title of current news information, extracts event names in the news title according to a preset extraction strategy, matches each event name in the at least one event name with a plurality of event clusters in a preset event library if at least one event name is obtained by extraction, wherein each event cluster in the event clusters contains a plurality of news information under the same event, further determines whether a target event cluster to which the news information belongs exists in the preset event library according to matching results of all event names, and finally merges the news information into the target event cluster when the target event cluster exists, so as to provide a pushing service of the news information according to the target event cluster. Therefore, news information is combed by taking the event as granularity, and the news information service with high relevance is conveniently pushed to the user.
According to an embodiment of the present application, an electronic device and a readable storage medium are also provided.
Fig. 14 is a block diagram of an electronic device according to an embodiment of the present application. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. The electronic device may also represent various forms of mobile devices, such as personal digital processing, cellular phones, smart phones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are meant to be examples only, and are not meant to limit implementations of the present application that are described and/or claimed herein.
As shown in fig. 14, the electronic apparatus includes: one or more processors 1401, a memory 1402, and interfaces for connecting the various components, including a high-speed interface and a low-speed interface. The various components are interconnected using different buses and may be mounted on a common motherboard or in other manners as desired. The processor may process instructions for execution within the electronic device, including instructions stored in or on the memory to display graphical information of a GUI on an external input/output apparatus (such as a display device coupled to the interface). In other embodiments, multiple processors and/or multiple buses may be used, along with multiple memories and multiple memories, as desired. Also, multiple electronic devices may be connected, with each device providing portions of the necessary operations (e.g., as a server array, a group of blade servers, or a multi-processor system). Fig. 14 illustrates an example of a processor 1401.
Memory 1402 is a non-transitory computer readable storage medium as provided herein. The memory stores instructions executable by at least one processor to cause the at least one processor to perform a method for pushing news information provided herein. The non-transitory computer readable storage medium of the present application stores computer instructions for causing a computer to perform a method of pushing news information provided by the present application.
The memory 1402, which is a non-transitory computer readable storage medium, may be used to store non-transitory software programs, non-transitory computer executable programs, and modules, such as program instructions/modules corresponding to the method for pushing news information in the embodiment of the present application (for example, the extraction module 10, the matching module 20, the first judgment module 30, and the merging module 40 shown in fig. 11). The processor 1401 executes various functional applications of the server and data processing by running non-transitory software programs, instructions, and modules stored in the memory 1402, so as to implement the method for pushing news information in the above-described method embodiments.
The memory 1402 may include a program storage area and a data storage area, wherein the program storage area may store an operating system, an application program required for at least one function; the storage data area may store data created according to use of the electronic device of the push method of the news information, and the like. Further, the memory 1402 may include high-speed random access memory, and may also include non-transitory memory, such as at least one disk storage device, flash memory device, or other non-transitory solid state storage device. In some embodiments, the memory 1402 may optionally include a memory remotely disposed from the processor 1401, and the remote memories may be connected to an electronic device performing a push method of news information through a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The electronic device performing the method of pushing news information may further include: an input device 1403 and an output device 1404. The processor 1401, the memory 1402, the input device 1403, and the output device 1404 may be connected by a bus or other means, as exemplified by the bus connection in fig. 14.
The input device 1403 may receive input numeric or character information and generate key signal inputs related to user settings and function control of an electronic device performing a push method of news information, such as a touch screen, a keypad, a mouse, a track pad, a touch pad, a pointing stick, one or more mouse buttons, a track ball, a joystick, etc. the output device 1404 may include a display device, an auxiliary lighting device (e.g., L ED), a tactile feedback device (e.g., a vibration motor), etc.
Various implementations of the systems and techniques described here can be realized in digital electronic circuitry, integrated circuitry, application specific ASICs (application specific integrated circuits), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include: implemented in one or more computer programs that are executable and/or interpretable on a programmable system including at least one programmable processor, which may be special or general purpose, receiving data and instructions from, and transmitting data and instructions to, a storage system, at least one input device, and at least one output device.
As used herein, the terms "machine-readable medium" and "computer-readable medium" refer to any computer program product, apparatus, and/or device (e.g., magnetic discs, optical disks, memory, programmable logic devices (P L D)) used to provide machine instructions and/or data to a programmable processor, including a machine-readable medium that receives machine instructions as a machine-readable signal.
The systems and techniques described here can be implemented on a computer having a display device (e.g., a CRT (cathode ray tube) or L CD (liquid crystal display) monitor) for displaying information to the user and a keyboard and a pointing device (e.g., a mouse or a trackball) by which the user can provide input to the computer for providing interaction with the user.
The systems and techniques described here can be implemented in a computing system that includes a back-end component (e.g., as a data server), or that includes a middleware component (e.g., AN application server), or that includes a front-end component (e.g., a user computer having a graphical user interface or a web browser through which a user can interact with AN implementation of the systems and techniques described here), or any combination of such back-end, middleware, or front-end components.
The computer system may include clients and servers. A client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.
It should be understood that various forms of the flows shown above may be used, with steps reordered, added, or deleted. For example, the steps described in the present application may be executed in parallel, sequentially, or in different orders, and the present invention is not limited thereto as long as the desired results of the technical solutions disclosed in the present application can be achieved.
The above-described embodiments should not be construed as limiting the scope of the present application. It should be understood by those skilled in the art that various modifications, combinations, sub-combinations and substitutions may be made in accordance with design requirements and other factors. Any modification, equivalent replacement, and improvement made within the spirit and principle of the present application shall be included in the protection scope of the present application.

Claims (12)

1. A method for pushing news information is characterized by comprising the following steps:
acquiring a news title of current news information, and extracting an event name in the news title according to a preset extraction strategy;
if at least one event name is obtained through extraction, matching each event name in the at least one event name with a plurality of event clusters in a preset event library, wherein each event cluster in the event clusters comprises a plurality of news information under the same event;
judging whether a target event cluster to which the news information belongs exists in the preset event library or not according to the matching results of all event names;
and if the target event cluster exists, merging the news information into the target event cluster so as to provide a push service of the news information according to the target event cluster.
2. The method of claim 1, wherein the extracting event names in the news headlines according to a preset extraction strategy comprises:
and inputting the news headline into a preset event name extraction model, wherein the event name extraction model learns in advance to obtain the corresponding relation between the news headline and the event name.
3. The method of claim 1, wherein the matching each event name of the at least one event name with a plurality of event clusters in a preset event library if the at least one event name is extracted comprises:
obtaining a subject event name of each event cluster in the plurality of event clusters;
and performing semantic matching on each event name in the at least one event name and the topic event name of each event cluster to obtain semantic similarity.
4. The method of claim 1, wherein after said merging the news information into the target event cluster if the target event cluster exists, further comprising:
acquiring a news reading request, and determining a reading event corresponding to the news reading request;
judging whether the reading event is matched with an event corresponding to the target event cluster;
if the news information is matched with the target event cluster, determining the release time of each news information in the target event cluster;
and selecting a preset number of recommended news from the target event cluster according to the releasing time in the order from near to far, and displaying the recommended news according to a preset display strategy.
5. The method of claim 1, wherein after said merging the news information into the target event cluster if the target event cluster exists, further comprising:
counting the number of news information contained in the target event cluster;
judging whether the quantity is greater than a preset threshold value, if so, determining a hotspot event name according to the event name of each news information in the target event cluster;
and pushing the hotspot event according to the hotspot event name.
6. The method of any of claims 1-5, wherein said merging the news information into the target event cluster comprises:
and storing the corresponding relation between the at least one event name and the news information in the target event cluster.
7. The method of claim 1, wherein after the determining whether the target event cluster to which the news information belongs exists in the preset event library, the method further comprises:
and if the target event cluster does not exist, newly adding an event cluster corresponding to the news information.
8. A news information pushing device is characterized by comprising:
the extraction module is used for acquiring a news title of current news information and extracting an event name in the news title according to a preset extraction strategy;
the matching module is used for matching each event name in the at least one event name with a plurality of event clusters in a preset event library when the at least one event name is obtained through extraction, wherein each event cluster in the event clusters comprises a plurality of news information under the same event;
the first judgment module is used for judging whether a target event cluster to which the news information belongs exists in the preset event library or not according to the matching results of all event names;
and the merging module is used for merging the news information into the target event cluster when the target event cluster exists so as to provide the push service of the news information according to the target event cluster.
9. The apparatus of claim 8, further comprising:
the device comprises a first determining module, a second determining module and a third determining module, wherein the first determining module is used for acquiring a news reading request and determining a reading event corresponding to the news reading request;
the second judging module is used for judging whether the reading event is matched with the event corresponding to the target event cluster;
the second determining module is used for determining the release time of each news information in the target event cluster during matching;
and the display module is used for selecting a preset number of recommended news from the target event cluster according to the release time in a sequence from near to far and displaying the recommended news according to a preset display strategy.
10. The apparatus of claim 8, further comprising:
the counting module is used for counting the number of news information contained in the target event cluster;
the third determining module is used for judging whether the number is greater than a preset threshold value or not, and determining a hot event name according to the event name of each news information in the target event cluster when the number is greater than the preset threshold value;
and the pushing module is used for pushing the hotspot event according to the hotspot event name.
11. An electronic device, comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein the content of the first and second substances,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of pushing newsletter information as set forth in any one of claims 1 to 7.
12. A non-transitory computer readable storage medium storing computer instructions for causing a computer to execute the method of pushing news information according to any one of claims 1 to 7.
CN202010228073.1A 2020-03-27 2020-03-27 News information pushing method and device Active CN111460289B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010228073.1A CN111460289B (en) 2020-03-27 2020-03-27 News information pushing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010228073.1A CN111460289B (en) 2020-03-27 2020-03-27 News information pushing method and device

Publications (2)

Publication Number Publication Date
CN111460289A true CN111460289A (en) 2020-07-28
CN111460289B CN111460289B (en) 2024-03-29

Family

ID=71683305

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010228073.1A Active CN111460289B (en) 2020-03-27 2020-03-27 News information pushing method and device

Country Status (1)

Country Link
CN (1) CN111460289B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112580355A (en) * 2020-12-30 2021-03-30 中科院计算技术研究所大数据研究院 News information topic detection and real-time aggregation method
CN113343687A (en) * 2021-05-25 2021-09-03 北京奇艺世纪科技有限公司 Method, device and equipment for determining event name and storage medium
CN113836448A (en) * 2021-09-22 2021-12-24 北京字节跳动网络技术有限公司 Information display method and device, computer equipment and storage medium
CN114491102A (en) * 2022-04-14 2022-05-13 深圳格隆汇信息科技有限公司 Database monitoring method and system based on big data
WO2024061060A1 (en) * 2022-09-23 2024-03-28 花瓣云科技有限公司 Information pushing method, electronic device, and storage medium

Citations (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8051088B1 (en) * 2010-04-07 2011-11-01 The Boeing Company Document analysis
US20130290232A1 (en) * 2012-04-30 2013-10-31 Mikalai Tsytsarau Identifying news events that cause a shift in sentiment
CN103793418A (en) * 2012-10-31 2014-05-14 珠海富讯网络科技有限公司 Search method of real-time vertical search engine for security industry
CN104573054A (en) * 2015-01-21 2015-04-29 杭州朗和科技有限公司 Information pushing method and equipment
CN105677894A (en) * 2016-02-02 2016-06-15 清华大学 Network event model based news event monitoring method and device
US20160328482A1 (en) * 2015-05-08 2016-11-10 Sameena Shah Social Media Events Detection and Verification
WO2017020451A1 (en) * 2015-08-03 2017-02-09 百度在线网络技术(北京)有限公司 Information push method and device
CN106446179A (en) * 2016-09-28 2017-02-22 东软集团股份有限公司 Hot topic generation method and device
US20170235820A1 (en) * 2016-01-29 2017-08-17 Jack G. Conrad System and engine for seeded clustering of news events
CN107798147A (en) * 2017-12-05 2018-03-13 李贺满 A kind of news client and its information push method
CN108829699A (en) * 2018-04-19 2018-11-16 北京奇艺世纪科技有限公司 A kind of polymerization and device of focus incident
CN109857859A (en) * 2018-12-24 2019-06-07 北京百度网讯科技有限公司 Processing method, device, equipment and the storage medium of news information
CN109947935A (en) * 2018-08-17 2019-06-28 麒麟合盛网络技术股份有限公司 The generation method and device of media event
CN109960756A (en) * 2019-03-19 2019-07-02 国家计算机网络与信息安全管理中心 Media event information inductive method
CN110134787A (en) * 2019-05-15 2019-08-16 北京信息科技大学 A kind of news topic detection method
CN110399478A (en) * 2018-04-19 2019-11-01 清华大学 Event finds method and apparatus

Patent Citations (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8051088B1 (en) * 2010-04-07 2011-11-01 The Boeing Company Document analysis
US20130290232A1 (en) * 2012-04-30 2013-10-31 Mikalai Tsytsarau Identifying news events that cause a shift in sentiment
CN103793418A (en) * 2012-10-31 2014-05-14 珠海富讯网络科技有限公司 Search method of real-time vertical search engine for security industry
CN104573054A (en) * 2015-01-21 2015-04-29 杭州朗和科技有限公司 Information pushing method and equipment
US20160328482A1 (en) * 2015-05-08 2016-11-10 Sameena Shah Social Media Events Detection and Verification
WO2017020451A1 (en) * 2015-08-03 2017-02-09 百度在线网络技术(北京)有限公司 Information push method and device
US20170235820A1 (en) * 2016-01-29 2017-08-17 Jack G. Conrad System and engine for seeded clustering of news events
CN105677894A (en) * 2016-02-02 2016-06-15 清华大学 Network event model based news event monitoring method and device
CN106446179A (en) * 2016-09-28 2017-02-22 东软集团股份有限公司 Hot topic generation method and device
CN107798147A (en) * 2017-12-05 2018-03-13 李贺满 A kind of news client and its information push method
CN108829699A (en) * 2018-04-19 2018-11-16 北京奇艺世纪科技有限公司 A kind of polymerization and device of focus incident
CN110399478A (en) * 2018-04-19 2019-11-01 清华大学 Event finds method and apparatus
CN109947935A (en) * 2018-08-17 2019-06-28 麒麟合盛网络技术股份有限公司 The generation method and device of media event
CN109857859A (en) * 2018-12-24 2019-06-07 北京百度网讯科技有限公司 Processing method, device, equipment and the storage medium of news information
CN109960756A (en) * 2019-03-19 2019-07-02 国家计算机网络与信息安全管理中心 Media event information inductive method
CN110134787A (en) * 2019-05-15 2019-08-16 北京信息科技大学 A kind of news topic detection method

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
M NAUGHTON: "Event extraction from heterogeneous news sources", 《RESEARCHGATE》, 31 January 2006 (2006-01-31) *
乌吉斯古愣;刘晓影;鄢楚平;: "基于事件关联网络的用户兴趣话题发现算法", 现代电子技术, no. 06, 15 March 2015 (2015-03-15) *
牛振东;王帅;王诗航;陈杰;: "新闻事件的分布式混合推荐算法", 北京理工大学学报, no. 07, 15 July 2017 (2017-07-15) *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112580355A (en) * 2020-12-30 2021-03-30 中科院计算技术研究所大数据研究院 News information topic detection and real-time aggregation method
CN113343687A (en) * 2021-05-25 2021-09-03 北京奇艺世纪科技有限公司 Method, device and equipment for determining event name and storage medium
CN113343687B (en) * 2021-05-25 2023-09-05 北京奇艺世纪科技有限公司 Event name determining method, device, equipment and storage medium
CN113836448A (en) * 2021-09-22 2021-12-24 北京字节跳动网络技术有限公司 Information display method and device, computer equipment and storage medium
CN113836448B (en) * 2021-09-22 2023-10-20 抖音视界有限公司 Information display method, device, computer equipment and storage medium
CN114491102A (en) * 2022-04-14 2022-05-13 深圳格隆汇信息科技有限公司 Database monitoring method and system based on big data
WO2024061060A1 (en) * 2022-09-23 2024-03-28 花瓣云科技有限公司 Information pushing method, electronic device, and storage medium

Also Published As

Publication number Publication date
CN111460289B (en) 2024-03-29

Similar Documents

Publication Publication Date Title
CN111460289B (en) News information pushing method and device
CN103279513B (en) The method of generation content tab is, provide the method and device of multimedia content information
CN112052397B (en) User characteristic generation method and device, electronic equipment and storage medium
US11748429B2 (en) Indexing native application data
CN105930527A (en) Searching method and device
CN110727668A (en) Data cleaning method and device
CN112000863B (en) Analysis method, device, equipment and medium of user behavior data
CN112818230A (en) Content recommendation method and device, electronic equipment and storage medium
CN103164425B (en) Method and device for pushing application program in application program platform system
CN103631796A (en) Website sort management method and electronic device
CN111310044B (en) Page element information extraction method, device, equipment and storage medium
JP2022524670A (en) Applet material processing methods, appliances, electronic devices, computer-readable storage media and computer programs
CN112416212A (en) Program access method, device, electronic equipment and readable storage medium
CN112650919A (en) Entity information analysis method, apparatus, device and storage medium
CN111460257A (en) Thematic generation method and device, electronic equipment and storage medium
CN114519153B (en) Webpage history record display method, device, equipment and storage medium
CN114491232A (en) Information query method and device, electronic equipment and storage medium
EP2026216A1 (en) Data processing method, computer program product and data processing system
CN112101012A (en) Interactive domain determining method and device, electronic equipment and storage medium
CN111475721A (en) Information pushing method, device, equipment and storage medium
CN111339399A (en) Object processing method, object processing apparatus, object processing device, and medium
CN112764604B (en) Display method, display device, electronic equipment and readable storage medium
Mohiuddin et al. NEWSD: A Realtime News Classification Engine for Web Streaming Data
CN111552878A (en) Data processing method and device
CN114564554A (en) Data retrieval method, device and equipment and readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant