CN111291176A - Hot event mining method and device - Google Patents
Hot event mining method and device Download PDFInfo
- Publication number
- CN111291176A CN111291176A CN201811484821.1A CN201811484821A CN111291176A CN 111291176 A CN111291176 A CN 111291176A CN 201811484821 A CN201811484821 A CN 201811484821A CN 111291176 A CN111291176 A CN 111291176A
- Authority
- CN
- China
- Prior art keywords
- preset
- preset area
- events
- hot
- unit
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 62
- 238000005065 mining Methods 0.000 title claims abstract description 21
- 238000012216 screening Methods 0.000 claims abstract description 21
- 238000009412 basement excavation Methods 0.000 claims abstract 2
- 238000003860 storage Methods 0.000 claims description 16
- 238000000605 extraction Methods 0.000 claims description 6
- 238000012163 sequencing technique Methods 0.000 claims description 6
- 238000001914 filtration Methods 0.000 claims 1
- 238000012545 processing Methods 0.000 abstract description 12
- 238000004590 computer program Methods 0.000 description 10
- 238000010586 diagram Methods 0.000 description 10
- 230000008901 benefit Effects 0.000 description 5
- 230000006870 function Effects 0.000 description 5
- 230000008569 process Effects 0.000 description 5
- 239000000284 extract Substances 0.000 description 3
- 238000003058 natural language processing Methods 0.000 description 3
- 230000003287 optical effect Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 2
- 238000000354 decomposition reaction Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000013075 data extraction Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 230000006855 networking Effects 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 235000019633 pungent taste Nutrition 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000007480 spreading Effects 0.000 description 1
- 238000003892 spreading Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
Images
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a mining method and device of a hot event, relates to the technical field of data processing, and aims to solve the problem that the existing hot event generation is determined based on news events in the nationwide range, and a user cannot learn about the locally occurring hot event, so that the hot event determination is limited. The method of the invention comprises the following steps: collecting news data; calling a preset interface to extract news data in a preset area range from the news data; classifying the news data in the preset area range; and screening out hot events in the preset area range according to the classification result. The invention is suitable for being applied to excavation of hot events.
Description
Technical Field
The invention relates to the technical field of data processing, in particular to a method and a device for mining hot events.
Background
With the continuous development of the internet, a network platform has become a main approach for spreading news events, and social platforms such as forums, microblog WeChats and the like, news websites and the like are all main media for news release or user speech. The mass text information brings commercial value and brings inconvenience to users, and how to dig out focus news events from the mass information enables the users not to miss the current news worth paying attention to become a research point of wide attention.
Currently, most websites are determined according to an algorithm or a user search browsing amount based on news events across the country when the hot events are generated, that is, the hot events learned by users across the country are the same. In the process of implementing the invention, the inventor finds that in the prior art, a user cannot browse a locally-occurring hot event, so that the generation of the hot event has a limited problem.
Disclosure of Invention
In view of the above problems, the present invention provides a method and an apparatus for mining a hot event, and a main object of the present invention is to mine hot events occurring in each area to improve comprehensiveness of hot event determination.
In order to solve the above technical problem, in a first aspect, the present invention provides a method for mining a hot event, including:
collecting news data;
calling a preset interface to extract news data in a preset area range from the news data, wherein the preset interface is packaged with field information corresponding to the preset area in advance;
classifying the news data in the preset area range;
and screening out hot events in the preset area range according to the classification result.
Optionally, the method further includes:
the method comprises the steps that the region is divided into multiple levels according to a preset rule to obtain a plurality of region units of different levels, and each region unit of the previous level comprises a plurality of sub-region units of the next level;
and generating field information corresponding to the preset area according to the region unit of the previous level and any one contained sub-region unit of the next level.
Optionally, the screening out the hot events in the preset area range according to the classification result includes:
sequencing a plurality of events in the preset area range obtained after classification according to data volume, and determining the events in a preset ranking as the hot events; and/or the presence of a gas in the gas,
and determining the event that the data volume increase rate exceeds a preset threshold as the hot event.
Optionally, the method further includes:
acquiring geographical position information of users and creating a user set corresponding to the preset area range, wherein the user set comprises all users in the preset area;
pushing the hit event in the user set.
In a second aspect, the present invention also provides a digging implement for a hot event, the implement comprising:
the acquisition unit is used for acquiring news data;
the extraction unit is used for calling a preset interface to extract the news data in a preset area range from the news data, and the preset interface is packaged with field information corresponding to the preset area in advance;
the classification unit is used for classifying the news data in the preset area range;
and the screening unit is used for screening out the hot events in the preset area range according to the classification result.
Optionally, the apparatus further comprises:
the dividing unit is used for carrying out multi-level division on the region according to a preset rule to obtain a plurality of region units of different levels, and each region unit of the previous level comprises a plurality of sub-region units of the next level;
and the generating unit is used for generating the field information corresponding to the preset area according to the region unit of the upper hierarchy and any one contained sub-region unit of the lower hierarchy.
Optionally, the screening unit includes:
the first determining module is used for sequencing the plurality of events in the preset area range obtained after classification according to data volume and determining the events in a preset ranking as the hot events;
and the second determination module is used for determining the event that the data volume increase rate exceeds a preset threshold as the hot event.
Optionally, the apparatus further comprises:
the acquisition unit is used for acquiring the geographical position information of a user and creating a user set corresponding to the preset area range, wherein the user set comprises all users in the preset area;
and the pushing unit is used for pushing the hot event in the user set.
In order to achieve the above object, according to a third aspect of the present invention, there is provided a storage medium including a stored program, wherein when the program runs, a device on which the storage medium is located is controlled to execute the above mining method for the hot event.
In order to achieve the above object, according to a fourth aspect of the present invention, there is provided a processor for executing a program, wherein the program executes to execute the mining method for hot events described above.
By the technical scheme, the method and the device for mining the hot events provided by the invention have the advantages that the hot events are determined based on the news events in the national range when the hot events are generated in the prior art, and the user cannot learn the locally occurring hot events, so that the hot event determination has limitation, the method and the device call the preset interface which is packaged with the field information of the preset area in advance after the news data are collected, extract the news data in the preset area range from the news data, classify the regional news data and screen out the hot events in the preset area according to the classification result, so compared with the prior art, the hot events occurring in each area can be mined when the hot events are determined, the problem that the hot event determination has limitation because the user cannot learn the locally occurring hot events is avoided, thereby improving the comprehensiveness of the hot event determination.
The foregoing description is only an overview of the technical solutions of the present invention, and the embodiments of the present invention are described below in order to make the technical means of the present invention more clearly understood and to make the above and other objects, features, and advantages of the present invention more clearly understandable.
Drawings
Various other advantages and benefits will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings are only for purposes of illustrating the preferred embodiments and are not to be construed as limiting the invention. Also, like reference numerals are used to refer to like parts throughout the drawings. In the drawings:
FIG. 1 is a flow chart of a mining method for hot events according to an embodiment of the present invention;
FIG. 2 is a flow chart of another mining method for hot events according to an embodiment of the present invention;
FIG. 3 is a block diagram illustrating components of a mining device for a hot event according to an embodiment of the present invention;
fig. 4 is a block diagram illustrating a digging device for another hot event according to an embodiment of the present invention.
Detailed Description
Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.
In order to improve the pushing comprehensiveness of the hot event, an embodiment of the present invention provides a mining method of the hot event, as shown in fig. 1, where the method includes:
101. and collecting news data.
The news data may be text, image, voice, video and other data published from a plurality of network media, social networking sites and the like, for example, content data published by a user on a personal website, or comment data of the user on other articles and the like, and the step may be obtained by capturing from a large database containing all news data. In particular, news data may be collected at different time intervals for the settings, for example, news data collected every other hour over the past hour.
For the embodiment of the invention, after the news data are collected, each group of news data can be stored according to the time identifier, so that comparison can be performed according to each group of data to determine the change trend of the news data in a period of time.
102. And calling a preset interface to extract news data in a preset area range from the news data.
And the preset interface is pre-packaged with field information corresponding to the preset area. Specifically, the interfaces containing the field information of different areas can be respectively packaged in the interface layer in advance, so that each interface can be called in the step to be sequentially extracted from the collected news data, and a plurality of groups of news data containing the field information of different areas are obtained, so that the news data can be classified according to regions.
It should be noted that, in practical applications, for a local event, the regional name word is usually carried in a news article, a speech published by a user, and the like, so that when a regional event is monitored, capturing news data of each region can be realized by encapsulating different interfaces. In addition, for news data which does not contain field information, the embodiment of the invention can filter the data and does not perform subsequent processing, so as to avoid the problem of time and resource waste caused by processing the news data which does not contain regional field information.
103. And classifying the news data in the preset area range.
In the embodiment of the present invention, the classification of the news data in this step may utilize NLP
Natural Language Processing (Natural Language Processing) real-time extraction method for classification, that is, firstly, a preset model is used to disassemble each news data to obtain a plurality of participles and label each analysis, then, a clustering model is used to cluster the labeled participles, and the clustered data is output according to a preset structured format to complete the classification of the news data, but the specific implementation mode is not limited thereto.
104. And screening out hot events in the preset area range according to the classification result.
Specifically, in this step, the data volume of the events in the multiple preset regions obtained after the classification in the above steps may be counted, and N events with the data volume of the event ranked at the top are used as the hot events in the preset regions, which is of course also possible to calculate the data volume increase rate of each event in the preset regions, and determine the event with the increase rate exceeding the preset threshold as the hot event in the preset regions, which is not specifically limited in the embodiment of the present invention.
The method for mining the hot events provided by the embodiment of the invention has the advantages that the hot events are determined based on the news events in the national range when the hot events are generated in the prior art, and the user can not learn the locally occurred hot events, so that the problem that the hot event determination has limitation is solved, the invention calls the preset interface in which the field information of the preset area is packaged in advance after the news data are collected, extracts the news data in the preset area range from the news data, classifies the regional news data and screens the hot events in the preset area according to the classification result, so compared with the prior art, the method can mine the hot events in each area when the hot events are determined, and avoids the problem that the hot event determination has limitation because the user can not learn the locally occurred hot events, thereby improving the comprehensiveness of the hot event determination.
Further, as a refinement and an extension of the embodiment shown in fig. 1, another mining method for hot events is provided in the embodiment of the present invention, as shown in fig. 2.
201. And carrying out multi-level division on the region according to a preset rule.
Furthermore, a plurality of region units of different levels are obtained, and each region unit of the previous level comprises a plurality of sub-region units of the next level. For example, when the region is china, the region may be divided according to the province, then each province is divided according to the city, and the city may be divided according to the county, and the like, so as to obtain a plurality of region units of different levels.
202. And generating field information corresponding to the preset area according to the region unit of the previous level and any one contained sub-region unit of the next level.
When the number of the hierarchy levels of the domain division exceeds two hierarchy levels, the region units of each hierarchy level and the sub-region units of the hierarchy level can be sequentially extracted to form the region field information containing the multi-hierarchy region information. For example, the region to be divided is the Liaoning province, and the region is divided according to provinces and counties (local cities) in sequence to obtain: { Shenyang city: new city, kangping county, french county }, { dalian city: ware city, banker city }, … …, { Anshan city: sea city, taian county }, then the region units of each hierarchy and the sub-region unit generation field information under the region units are sequentially extracted in this step as follows: liaoning province + Shenyang city + Xinmin city | Liaoning province + Shenyang city + Kangping county | Liaoning province + Shenyang city + Fakuo county | Liaoning province + Dalian city + Wakuan city | … … Liaoning province + Anshan city + Taian county |.
In addition, in this step, only the regions of two or three of the plurality of levels may be extracted to generate corresponding regional field information, and as in the above example, only the regional field information generated in shenyang city, louing ning province | gangyin city, etc. may be extracted. For the embodiment of the invention, the regional field information is obtained by sequentially carrying out multi-level combination on each region and the sub-regions under the region, so that the data in each preset region can be extracted from the news data according to the regional field information, the accuracy of regional division can be ensured, and the accuracy of data extraction in the region can be further ensured.
203. And collecting news data.
The conceptual explanation of the news data and the specific implementation manner of this step may refer to the corresponding description in step 101, and are not repeated here.
204. And calling a preset interface to extract news data in a preset area range from the news data.
And the preset interface is pre-packaged with field information corresponding to the preset area. As described above in step 202, the obtained field information in the plurality of preset areas may be sequentially encapsulated in different interfaces, so that news data in the preset areas respectively corresponding to shenyang city kangping county in liaoning province, shenyang city in liaoning province, new people city in shenyang city in liaoning province, and the like may be extracted in this step.
205. And classifying the news data in the preset area range.
The detailed description of this step may refer to the corresponding description in step 103, which is not described in detail in this embodiment of the present invention.
206. And screening out hot events in the preset area range according to the classification result.
For the embodiment of the present invention, the step 206 may specifically include: sequencing a plurality of events in the preset area range obtained after classification according to data volume, and determining the events in a preset ranking as the hot events; and/or determining the event that the data volume increase rate exceeds a preset threshold as the hot event. The events in the preset ranking may be top five, top ten, and the like news events, and specifically, the data volumes corresponding to the multiple events obtained after the classification in the above steps are counted, and then the events are sorted according to the data volumes, or the data volume increase rate of the regional events is calculated according to the event number in a preset region included in two sets of news data in adjacent time intervals, and the regional events whose data volume increase rate exceeds a preset threshold are determined as hot events in the preset region. For example, if the data volume of the event M captured in a preset area from 9 to 10 points in 10, 15 and 15 months in 2018 is 500, and the data volume of the event M captured in 10 to 11 points in 10, 15 and 15 months in 2018 is ten thousand, the data volume increase rate of the event M is 1900%, and exceeds the preset threshold value by 50%, the event M is determined as a hot event in the preset area.
Further, in order to push the hot event in the preset area to the user in time, the embodiment of the present invention may further include: acquiring geographical position information of users and creating a user set corresponding to the preset area range, wherein the user set comprises all users in the preset area; pushing the hit event in the user set. The geographic location information of the user may be obtained by positioning in each application program, or may be selected by the user to pay attention to, which is not specifically limited in the embodiment of the present invention. For the embodiment of the invention, after different area hot events in each preset area are determined, the user set belonging to each area is created and the hot events in the area are pushed to all users under the area, so that each user in the area can receive the hot events in the area in time, the timeliness of the user for obtaining the hot events in the area is ensured, and the use experience of the user is improved.
Further, according to the method described in step 201-206, the embodiment of the present invention may further provide an implementation manner of digging hot door events in combination with specific application scenarios, specifically, as follows:
the method comprises the steps of firstly, capturing news data such as all articles and comments issued by users in a plurality of platforms such as network media, microblogs, WeChats and forums, and storing captured big data in a database.
And secondly, respectively packaging the field information corresponding to each preset area in different preset interfaces, and calling the preset interfaces to respectively extract the news data in each preset area from the big data captured in the step.
And thirdly, carrying out news clustering, semantic decomposition, hot word decomposition and other processing on the news data in each group of preset areas to obtain area events or characters contained in each group of data.
And fourthly, carrying out hotness ranking on the area events according to the event data volume and the event data growth rate, and pushing the area events ranked from the first to the fifth in the users in the area.
However, it should be noted that the specific implementation described in the above application scenarios is only an example, and is not the only specific implementation of the embodiment of the present invention, and is only one of the optimized implementations of the method according to the present invention.
Further, as an implementation of the method shown in fig. 1, an embodiment of the present invention further provides a device for excavating a hot event, which is used for implementing the method shown in fig. 1. The embodiment of the apparatus corresponds to the embodiment of the method, and for convenience of reading, details in the embodiment of the apparatus are not repeated one by one, but it should be clear that the apparatus in the embodiment can correspondingly implement all the contents in the embodiment of the method. As shown in fig. 3, the apparatus includes: an acquisition unit 31, an extraction unit 32, a classification unit 33, a screening unit 34, wherein
The collecting unit 31 may be used to collect news data.
The extracting unit 32 may be configured to invoke a preset interface to extract the news data in the preset area range from the news data acquired by the acquiring unit 31, where the preset interface is pre-packaged with field information corresponding to the preset area.
The classifying unit 33 may be configured to classify the news data extracted by the extracting unit 32 within the preset region.
The screening unit 34 may be configured to screen out the hot events in the preset area range according to the classification result obtained by the classification unit 33.
Further, as an implementation of the method shown in fig. 2, another mining device for hot events is further provided in the embodiment of the present invention, and is used for implementing the method shown in fig. 2. The embodiment of the apparatus corresponds to the embodiment of the method, and for convenience of reading, details in the embodiment of the apparatus are not repeated one by one, but it should be clear that the apparatus in the embodiment can correspondingly implement all the contents in the embodiment of the method. As shown in fig. 4, the apparatus includes: an acquisition unit 41, an extraction unit 42, a classification unit 43, a screening unit 44, wherein
The collecting unit 41 may be configured to collect news data.
The extracting unit 42 may be configured to invoke a preset interface to extract the news data in the preset area range from the news data acquired by the acquiring unit 41, where the preset interface is pre-packaged with field information corresponding to the preset area.
The classifying unit 43 may be configured to classify the news data extracted by the extracting unit 42 in the preset region.
The screening unit 44 may be configured to screen out the hot events in the preset area range according to the classification result obtained by the classification unit 43.
Further, the apparatus further comprises:
the dividing unit 45 may be configured to perform multi-level division on the region according to a preset rule to obtain a plurality of region units of different levels, where each region unit of a previous level includes a plurality of sub-region units of a next level.
The generating unit 46 may be configured to generate the field information corresponding to the preset area according to the region unit of the previous hierarchy and any one of the included sub-region units of the next hierarchy.
Further, the screening unit 44 includes:
the first determining module 4401 may be configured to sort the multiple events in the preset area range obtained after the classification according to a data amount, and determine an event in a preset ranking as the hot event.
The second determining module 4402 may be configured to determine, as the hot event, an event that the data volume increase rate exceeds a preset threshold.
Further, the apparatus further comprises: an acquisition unit 47 and a pushing unit 48.
The obtaining unit 47 may be configured to obtain geographic location information of a user and create a user set corresponding to the preset area range, where the user set includes all users in the preset area.
The pushing unit 48 may be configured to push the trending event in the user set.
The embodiment of the invention provides another digging device for hot events. The device comprises: the device comprises a collecting unit, an extracting unit, a classifying unit and a screening unit. The method comprises the steps that after news data are collected, a preset interface which is packaged with preset area field information in advance is called, the news data in the preset area range are extracted from the news data, then the area news data are classified, and hot events in the preset area are screened out according to the classification result, so that compared with the prior art, the hot events in each area can be mined when the hot events are determined, the problem that the hot event determination is limited due to the fact that the user cannot learn the locally-occurring hot events is solved, and the comprehensiveness of the hot event determination is improved; in addition, after the geographical hot events of different geographical areas are determined, the user sets belonging to the geographical areas are created, and the hot events in the preset geographical areas are pushed to all users in the geographical areas, so that the users can receive the hot events in the geographical areas in time, the timeliness of the users obtaining the hot events in the preset geographical areas is ensured, and the use experience of the users is improved.
The text processing device comprises a processor and a memory, wherein the acquisition unit 31, the extraction unit 32, the classification unit 33, the screening unit 34 and the like are stored in the memory as program units, and the processor executes the program units stored in the memory to realize corresponding functions.
The processor comprises a kernel, and the kernel calls the corresponding program unit from the memory. The kernel can be set to be one or more, and the comprehensiveness of hot event pushing is improved by adjusting kernel parameters.
The memory may include volatile memory in a computer readable medium, Random Access Memory (RAM) and/or nonvolatile memory such as Read Only Memory (ROM) or flash memory (flash RAM), and the memory includes at least one memory chip.
An embodiment of the present invention provides a storage medium on which a program is stored, the program implementing the mining method of the hot event when being executed by a processor.
The embodiment of the invention provides a processor, which is used for running a program, wherein the mining method of the hot event is executed when the program runs.
The embodiment of the invention provides equipment, which comprises a processor, a memory and a program which is stored on the memory and can run on the processor, wherein the processor executes the program and realizes the following steps: collecting news data; calling a preset interface to extract regional news data from the news data, wherein regional field information is packaged in the preset interface in advance; processing the regional news data to obtain a plurality of regional events; a geo-hit of the plurality of geo-events is determined.
Further, the method further comprises:
the method comprises the steps that a preset area is divided into multiple levels to obtain a plurality of area units of different levels, and each area unit of the previous level comprises a plurality of sub-area units of the next level;
and generating the region field information according to the region unit of the previous layer and any one contained sub-region unit of the next layer.
Further, the determining a geo-hit of the plurality of geo-events comprises:
sorting the region events according to data volume and determining the region events in a preset rank as the region hot events; and/or the presence of a gas in the gas,
and determining the region event with the abnormal data volume increase as the region hot event.
Further, the method further comprises:
acquiring user region information and creating a region user set, wherein the region user set comprises all users in the same region;
and pushing the regional hot event in the user set.
An embodiment of the present invention further provides a computer program product, which, when executed on a data processing apparatus, is adapted to execute a program that initializes the following method steps: collecting news data; calling a preset interface to extract news data in a preset area range from the news data, wherein the preset interface is packaged with field information corresponding to the preset area in advance; classifying the news data in the preset area range; and screening out hot events in the preset area range according to the classification result.
Further, the method further comprises:
the method comprises the steps that the region is divided into multiple levels according to a preset rule to obtain a plurality of region units of different levels, and each region unit of the previous level comprises a plurality of sub-region units of the next level;
and generating field information corresponding to the preset area according to the region unit of the previous level and any one contained sub-region unit of the next level.
Further, the screening out the hot events in the preset area range according to the classification result includes:
sequencing a plurality of events in the preset area range obtained after classification according to data volume, and determining the events in a preset ranking as the hot events; and/or the presence of a gas in the gas,
and determining the event that the data volume increase rate exceeds a preset threshold as the hot event.
Further, the method further comprises:
acquiring geographical position information of users and creating a user set corresponding to the preset area range, wherein the user set comprises all users in the preset area;
pushing the hit event in the user set.
As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
In a typical configuration, a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
The memory may include forms of volatile memory in a computer readable medium, Random Access Memory (RAM) and/or non-volatile memory, such as Read Only Memory (ROM) or flash memory (flash RAM). The memory is an example of a computer-readable medium.
Computer-readable media, including both non-transitory and non-transitory, removable and non-removable media, may implement information storage by any method or technology. The information may be computer readable instructions, data structures, modules of a program, or other data. Examples of computer storage media include, but are not limited to, phase change memory (PRAM), Static Random Access Memory (SRAM), Dynamic Random Access Memory (DRAM), other types of Random Access Memory (RAM), Read Only Memory (ROM), Electrically Erasable Programmable Read Only Memory (EEPROM), flash memory or other memory technology, compact disc read only memory (CD-ROM), Digital Versatile Discs (DVD) or other optical storage, magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information that can be accessed by a computing device. As defined herein, a computer readable medium does not include a transitory computer readable medium such as a modulated data signal and a carrier wave.
It should also be noted that the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in the process, method, article, or apparatus that comprises the element.
As will be appreciated by one skilled in the art, embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The above are merely examples of the present application and are not intended to limit the present application. Various modifications and changes may occur to those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present application should be included in the scope of the claims of the present application.
Claims (10)
1. A method of mining a trending event, the method comprising:
collecting news data;
calling a preset interface to extract news data in a preset area range from the news data, wherein the preset interface is packaged with field information corresponding to the preset area in advance;
classifying the news data in the preset area range;
and screening out hot events in the preset area range according to the classification result.
2. The method of claim 1, further comprising:
the method comprises the steps that the region is divided into multiple levels according to a preset rule to obtain a plurality of region units of different levels, and each region unit of the previous level comprises a plurality of sub-region units of the next level;
and generating field information corresponding to the preset area according to the region unit of the previous level and any one contained sub-region unit of the next level.
3. The method of claim 1, wherein the filtering out the hot events in the preset area range according to the classification result comprises:
sequencing a plurality of events in the preset area range obtained after classification according to data volume, and determining the events in a preset ranking as the hot events; and/or the presence of a gas in the gas,
and determining the event that the data volume increase rate exceeds a preset threshold as the hot event.
4. The method according to any one of claims 1 to 3, further comprising:
acquiring geographical position information of users and creating a user set corresponding to the preset area range, wherein the user set comprises all users in the preset area;
pushing the hit event in the user set.
5. An excavation apparatus for a hot door event, the apparatus comprising:
the acquisition unit is used for acquiring news data;
the extraction unit is used for calling a preset interface to extract the news data in a preset area range from the news data, and the preset interface is packaged with field information corresponding to the preset area in advance;
the classification unit is used for classifying the news data in the preset area range;
and the screening unit is used for screening out the hot events in the preset area range according to the classification result.
6. The apparatus of claim 5, further comprising:
the dividing unit is used for carrying out multi-level division on the region according to a preset rule to obtain a plurality of region units of different levels, and each region unit of the previous level comprises a plurality of sub-region units of the next level;
and the generating unit is used for generating the field information corresponding to the preset area according to the region unit of the upper hierarchy and any one contained sub-region unit of the lower hierarchy.
7. The apparatus of claim 5, wherein the screening unit comprises:
the first determining module is used for sequencing the plurality of events in the preset area range obtained after classification according to data volume and determining the events in a preset ranking as the hot events;
and the second determination module is used for determining the event that the data volume increase rate exceeds a preset threshold as the hot event.
8. The apparatus of any one of claims 5 to 7, further comprising:
the acquisition unit is used for acquiring the geographical position information of a user and creating a user set corresponding to the preset area range, wherein the user set comprises all users in the preset area;
and the pushing unit is used for pushing the hot event in the user set.
9. A storage medium comprising a stored program, wherein the program, when executed, controls a device on which the storage medium is located to perform the mining method for the trending event according to any one of claims 1 to 4.
10. A processor, configured to run a program, wherein the program is configured to execute the mining method for the hit event according to any one of claims 1 to 4 when the program is run.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811484821.1A CN111291176A (en) | 2018-12-06 | 2018-12-06 | Hot event mining method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811484821.1A CN111291176A (en) | 2018-12-06 | 2018-12-06 | Hot event mining method and device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111291176A true CN111291176A (en) | 2020-06-16 |
Family
ID=71022878
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811484821.1A Pending CN111291176A (en) | 2018-12-06 | 2018-12-06 | Hot event mining method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111291176A (en) |
Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090019085A1 (en) * | 2007-07-10 | 2009-01-15 | Fatdoor, Inc. | Hot news neighborhood banter in a geo-spatial social network |
CN102722567A (en) * | 2012-05-30 | 2012-10-10 | 杭州遥指科技有限公司 | Method and device for screening in-station information |
CN103294712A (en) * | 2012-02-29 | 2013-09-11 | 三星电子(中国)研发中心 | System and method for recommending hot spot area in real time |
CN103617298A (en) * | 2013-12-19 | 2014-03-05 | 金蝶软件(中国)有限公司 | Data connecting method and data connector |
CN103714132A (en) * | 2013-12-17 | 2014-04-09 | 北京本果信息技术有限公司 | Method and equipment used for mining hot events based on regions and industries |
CN104516961A (en) * | 2014-12-18 | 2015-04-15 | 北京牡丹电子集团有限责任公司数字电视技术中心 | Topic digging and topic trend analysis method and system based on region |
CN104615715A (en) * | 2015-02-05 | 2015-05-13 | 北京航空航天大学 | Social network event analyzing method and system based on geographic positions |
CN106326496A (en) * | 2016-09-30 | 2017-01-11 | 广州特道信息科技有限公司 | Cloud platform-based news reading system |
CN106789584A (en) * | 2016-12-27 | 2017-05-31 | 上海景域文化传播股份有限公司 | A kind of FTP client FTP for travelling products sale |
CN107231570A (en) * | 2017-06-13 | 2017-10-03 | 中国传媒大学 | News data content characteristic obtains system and application system |
CN108153860A (en) * | 2017-12-25 | 2018-06-12 | 中译语通科技(青岛)有限公司 | A kind of geolocation analysis method based on multilingual news |
CN108170773A (en) * | 2017-12-26 | 2018-06-15 | 百度在线网络技术(北京)有限公司 | Media event method for digging, device, computer equipment and storage medium |
-
2018
- 2018-12-06 CN CN201811484821.1A patent/CN111291176A/en active Pending
Patent Citations (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090019085A1 (en) * | 2007-07-10 | 2009-01-15 | Fatdoor, Inc. | Hot news neighborhood banter in a geo-spatial social network |
CN103294712A (en) * | 2012-02-29 | 2013-09-11 | 三星电子(中国)研发中心 | System and method for recommending hot spot area in real time |
CN102722567A (en) * | 2012-05-30 | 2012-10-10 | 杭州遥指科技有限公司 | Method and device for screening in-station information |
CN103714132A (en) * | 2013-12-17 | 2014-04-09 | 北京本果信息技术有限公司 | Method and equipment used for mining hot events based on regions and industries |
CN103617298A (en) * | 2013-12-19 | 2014-03-05 | 金蝶软件(中国)有限公司 | Data connecting method and data connector |
CN104516961A (en) * | 2014-12-18 | 2015-04-15 | 北京牡丹电子集团有限责任公司数字电视技术中心 | Topic digging and topic trend analysis method and system based on region |
CN104615715A (en) * | 2015-02-05 | 2015-05-13 | 北京航空航天大学 | Social network event analyzing method and system based on geographic positions |
CN106326496A (en) * | 2016-09-30 | 2017-01-11 | 广州特道信息科技有限公司 | Cloud platform-based news reading system |
CN106789584A (en) * | 2016-12-27 | 2017-05-31 | 上海景域文化传播股份有限公司 | A kind of FTP client FTP for travelling products sale |
CN107231570A (en) * | 2017-06-13 | 2017-10-03 | 中国传媒大学 | News data content characteristic obtains system and application system |
CN108153860A (en) * | 2017-12-25 | 2018-06-12 | 中译语通科技(青岛)有限公司 | A kind of geolocation analysis method based on multilingual news |
CN108170773A (en) * | 2017-12-26 | 2018-06-15 | 百度在线网络技术(北京)有限公司 | Media event method for digging, device, computer equipment and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10242121B2 (en) | Automatic browser tab groupings | |
CN102831248B (en) | Network focus method for digging and device | |
CN106383887A (en) | Environment-friendly news data acquisition and recommendation display method and system | |
CN103139256B (en) | A kind of many tenant network public sentiment method for supervising and system | |
CN103294712A (en) | System and method for recommending hot spot area in real time | |
CN112434168B (en) | Knowledge graph construction method and fragmented knowledge generation method based on library | |
CN103279476B (en) | The detection method of a kind of WEB application system sensitive word and system | |
CN112463859B (en) | User data processing method and server based on big data and business analysis | |
CN106202126A (en) | A kind of data analysing method for logistics monitoring and device | |
CN109582548B (en) | Page element circle selection method and device based on non-buried points | |
Banujan et al. | Twitter and online news analytics for enhancing post-natural disaster management activities | |
KR102124935B1 (en) | Disaster Monitoring System, Method Using Crowd Sourcing, and Computer Program therefor | |
CN104598561A (en) | Text-based intelligent agricultural video classification method and text-based intelligent agricultural video classification system | |
CN109492401B (en) | Content carrier risk detection method, device, equipment and medium | |
Sukel et al. | Multimodal classification of urban micro-events | |
CN111291176A (en) | Hot event mining method and device | |
WO2024007032A1 (en) | Generating article polygons within newspaper images for extracting actionable data | |
KR101568800B1 (en) | Real-time issue search word sorting method and system | |
CN104462448B (en) | A kind of packet name classification method and device | |
CN110955760B (en) | Evaluation method of judgment result and related device | |
CN106874354A (en) | A kind of daily record data screening technique and device | |
CN106776654B (en) | Data searching method and device | |
CN106933849A (en) | The method and device that keyword is pushed | |
CN110968993A (en) | Information processing method and device, storage medium and processor | |
KR20200000208A (en) | Social data collection analysis system and method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20200616 |
|
RJ01 | Rejection of invention patent application after publication |