CN112487306A - Automatic event marking and classifying method based on knowledge graph - Google Patents

Automatic event marking and classifying method based on knowledge graph Download PDF

Info

Publication number
CN112487306A
CN112487306A CN202011417045.0A CN202011417045A CN112487306A CN 112487306 A CN112487306 A CN 112487306A CN 202011417045 A CN202011417045 A CN 202011417045A CN 112487306 A CN112487306 A CN 112487306A
Authority
CN
China
Prior art keywords
burst
phrase
text
phrases
time window
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202011417045.0A
Other languages
Chinese (zh)
Other versions
CN112487306B (en
Inventor
王晓玲
赵鑫
袁佳豪
王韵弘
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
East China Normal University
Original Assignee
East China Normal University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by East China Normal University filed Critical East China Normal University
Priority to CN202011417045.0A priority Critical patent/CN112487306B/en
Publication of CN112487306A publication Critical patent/CN112487306A/en
Application granted granted Critical
Publication of CN112487306B publication Critical patent/CN112487306B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9536Search customisation based on social or collaborative filtering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/355Class or cluster creation or modification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/367Ontology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9537Spatial or temporal dependent retrieval, e.g. spatiotemporal queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking

Abstract

The invention discloses an automatic event marking and classifying method based on a knowledge graph, which comprises the steps of constructing the knowledge graph of a required field category, crawling each text data published in social media in a preset time window, extracting key phrases and screening to obtain burst phrases, clustering the burst phrases to obtain burst phrase clusters, calculating TF-IDF scores of the corresponding burst phrases of the burst events on each knowledge graph, summing to obtain the TF-IDF scores of the burst events on each knowledge graph, and marking the corresponding events as the field category if the TF-IDF scores are larger than a preset threshold value, thereby determining the marking and classification of the events. According to the method, the emergency is automatically determined by screening and clustering the emergency phrases of the text data in the social media, and then TF-IDF scores of the emergency on knowledge maps of various field categories are calculated, so that the automatic accurate marking and classification of the social media events are realized.

Description

Automatic event marking and classifying method based on knowledge graph
Technical Field
The invention belongs to the technical field of event marking and classification, and particularly relates to an automatic event marking and classifying method based on a knowledge graph.
Background
In recent years, with the rapid development of social media, social media such as twitter and microblog gradually become important ways for people to acquire news information. Therefore, more and more work is focused on and analyzes social media information, wherein one important type of work is event extraction work of social media data, namely, events described by the social media information are extracted according to the social media data. However, for extracted events (event key phrases, abstracts, etc.), some events which are not concerned are inevitably generated, so that the extracted events need to be marked and classified (several categories such as military, politics, geography, etc.) to obtain the category to which each event belongs, so as to filter out the events which are not concerned about the category and screen out the events which are interested. However, how to accurately obtain the categories of events according to a small amount of information describing the events and how to solve the problem that a certain event may belong to multiple categories, and a better solution is not available, all of which need to be further researched and solved.
Disclosure of Invention
The invention aims to overcome the defects of the prior art and provide an automatic event marking and classifying method based on a knowledge graph, so that automatic and accurate marking and classification of social media events are realized.
In order to achieve the above object, the automatic event marking and classifying method based on knowledge graph of the present invention comprises the following steps:
s1: setting N field categories according to actual needs, respectively collecting text data of each field category and constructing a knowledge graph Gn,n=1,2…,N;
S2: presetting a time window T, crawling social media in the time windowAnd extracting key phrases from each piece of document data of the cloth, and forming a key phrase set A by using the extracted key phrases. Respectively calculating the burst degree W of each key phrase s in the key phrase set AsThe calculation formula is as follows:
Ws=ps×log(us)×log(rs)×log(log(fs))
wherein p issRepresenting the probability of a burst, u, of a key phrase s within a time window TsRepresenting the number of users, r, who used the key phrase s within the time window TsRepresenting the number of times a text containing a key phrase s is forwarded within a time window T, fsRepresents the sum of the number of interests of the user using the key phrase s within the time window T;
sorting all key phrases according to the burst degree from high to low, and selecting the first K key phrases as burst phrases to be added into a burst phrase set B;
s3: averagely dividing the time window T into M continuous disjoint sub-time windows, and recording the mth sub-time window as TmFor each burst phrase e, e ∈ B, each sub-time window T is countedmText set text (e, m) and number of texts f containing the burst phrase e1(e, m), and the amount of text f containing the burst phrase e over the time window T2(e) Calculating the time window T of each burst phrase e in the sub-time windowmThe ratio d (e, m) is f1(e,m)/f2(e);
Let two burst phrases be e respectivelya、ebFirst, text sets text (e) are calculated separatelyaM) and text set text (e)bM) similarity sim (text (e)a,m),text(ebM)), and then the similarity S (e) of the two burst phrases is calculated using the following formulaa,eb):
Figure BDA0002820446500000021
Clustering burst phrases according to similarity between burst phrasesGet K burst phrase clusters CkK is 1,2, …, K, each burst phrase cluster CkI.e. an emergency event within the time window T;
s4: for burst phrase cluster CkCorresponding emergencies are constructed according to the knowledge graph G of each field category constructed in the step S1nCovered text data, calculating a burst phrase cluster CkKnowledge graph G of each burst phrase v in each domain classnThe score of the TF-IDF is score (v, n), and then the sum is carried out to obtain the burst phrase cluster CkKnowledge graph G of corresponding emergency in each field categorynUpper TF-IDF score (k, n):
Figure BDA0002820446500000022
s5: presetting TF-IDF score threshold
Figure BDA0002820446500000023
For burst phrase cluster CkCorresponding emergency, if it is in domain class knowledge map GnA TF-IDF score of score (k, n) above a threshold value
Figure BDA0002820446500000024
The incident is marked as the domain category to determine the marking and classification of the incident.
The invention relates to an automatic event marking and classifying method based on a knowledge graph, which comprises the steps of constructing the knowledge graph of a required field category, crawling each text data published in social media in a preset time window, extracting key phrases and screening to obtain burst phrases, clustering the burst phrases to obtain burst phrase clusters, calculating TF-IDF scores of the corresponding burst phrases of the burst events on each knowledge graph, summing to obtain the TF-IDF scores of the burst events on each knowledge graph, and marking the corresponding events as the field category if the TF-IDF scores are larger than a preset threshold value, thereby determining the marking and classification of the events.
According to the method, the emergency is automatically determined by screening and clustering the emergency phrases of the text data in the social media, and then TF-IDF scores of the emergency on knowledge maps of various field categories are calculated, so that the automatic accurate marking and classification of the social media events are realized.
Drawings
FIG. 1 is a flow diagram of an embodiment of a method for knowledge-graph based automated event tagging and classification in accordance with the present invention.
Detailed Description
The following description of the embodiments of the present invention is provided in order to better understand the present invention for those skilled in the art with reference to the accompanying drawings. It is to be expressly noted that in the following description, a detailed description of known functions and designs will be omitted when it may obscure the subject matter of the present invention.
Examples
FIG. 1 is a flow diagram of an embodiment of a method for knowledge-graph based automated event tagging and classification in accordance with the present invention. As shown in FIG. 1, the automatic event labeling and classifying method based on knowledge graph of the present invention comprises the following specific steps:
s101: constructing knowledge graphs of various field categories:
setting N field categories according to actual needs, respectively collecting text data of each field category and constructing a knowledge graph Gn,n=1,2…,N。
Knowledge Graph (Knowledge Graph) is a series of different graphs displaying the relation between the Knowledge development process and the structure, and uses visualization technology to describe Knowledge resources and carriers thereof, and excavates, analyzes, constructs, draws and displays Knowledge and the mutual relation between the Knowledge resources and the carriers. For internet social media, the domain categories set in the present embodiment include Military (milery), People (peoples), Industry (Industry), security (Safety), weather (meteorology), and Geography (geographics), relevant data of these domain categories on wikipedia is crawled, a knowledge graph is constructed based on these data, and the event is labeled and classified by means of these knowledge graph information.
S102: screening burst phrases:
and presetting a time window T, crawling each text data published in social media in the time window, extracting key phrases of each file data, and forming a key phrase set A by the extracted key phrases. Respectively calculating the burst degree W of each key phrase s in the key phrase set AsThe calculation formula is as follows:
Ws=ps×log(us)×log(rs)×log(log(fs))
wherein p issRepresenting the probability of a burst of a key phrase s within a time window T, i.e.
Figure BDA0002820446500000041
ts、ts′Respectively representing the times of occurrence of key phrase s and key phrase s 'in the time window T, s, s' being equal to A and usRepresenting the number of users, r, who used the key phrase s within the time window TsRepresenting the number of times a text containing a key phrase s is forwarded within a time window T, fsRepresenting the sum of the number of interests of the user using the key phrase s within the time window T.
And sorting all key phrases according to the burst degree from high to low, selecting the first K key phrases as burst phrases to be added into a burst phrase set B, and setting the value of K according to the requirement.
Table 1 is a list of the degree of burstiness of some key phrases in this example.
Key phrases Degree of burst
Iran 20.1310
Zarif 6.91061
foreign minister 3.68816
Human rights 3.21209
president 2.54122
resignation 2.53455
fellow diplomats 1.22547
foreign policy 0.32457
hinting at 0.30289
condemns 0.02536
In front of 0.01785
except for 0.00566
TABLE 1
In this embodiment, the first 3 key phrases are selected as burst phrases to be added to the burst phrase set B, i.e., B ═ Iran, Zarif, forign minimum }.
S103: constructing an emergency based on the burst phrase clustering:
averagely dividing the time window T into M continuous disjoint sub-time windows, and recording the mth sub-time window as TmM1, 2, …, M, for each burst phrase e, e ∈ B, each sub-time window T is countedmText set text (e, m) and number of texts f containing the burst phrase e1(e, m), and the amount of text f containing the burst phrase e over the time window T2(e) Calculating the time window T of each burst phrase e in the sub-time windowmThe ratio d (e, m) is f1(e,m)f2(e)。
And calculating the similarity of the burst phrases in the burst phrase set B pairwise, wherein the calculation method comprises the following steps:
let two burst phrases be e respectivelya、ebFirst, text sets text (e) are calculated separatelyaM) and text set text (e)bM) similarity sim (text (e)a,m),text(ebM)), and then the similarity S (e) of the two burst phrases is calculated using the following formulaa,eb):
Figure BDA0002820446500000051
Clustering the burst phrases according to the similarity between the burst phrases to obtain K burst phrase clusters CkK is 1,2, …, K, each burst phrase cluster CkI.e. an emergency event of the time window T.
In this embodiment, the similarity between text sets is TF-IDF (term frequency-inverse document frequency) similarity. TF-IDF is a commonly used weighting technique for information retrieval (information retrieval) and text mining (text mining). TF-IDF is a statistical method to assess how important a word is for one of a set of documents or a corpus. The importance of a word increases in proportion to the number of times it appears in a document, but at the same time decreases in inverse proportion to the frequency with which it appears in the corpus. The method for calculating the TF-IDF similarity of the text set in the embodiment comprises the following steps:
1) and respectively segmenting the two text sets, and combining the two obtained word sets to obtain a word set phi.
2) For each word in the set of words phi, the TF-IDF value p of the word in the two text sets is calculated separatelyi,1、pi,2I ═ 1,2, …, | φ |, | φ | represents the number of words in the set of words φ. The TF-IDF value is calculated by the following method: for the word i, the word frequency TF (i) and the inverse text frequency IDF (i) log (D/D) of the word i in the text set are countedi) D denotes the number of texts in the text set, DiRepresenting the number of texts in the text set containing word i, the TF-IDF value is TF (i) x IDF (i).
3) A TF-IDF vector P corresponding to each text set is constructed and obtained according to the TF-IDF value of each word1=(p1,1,p2,1,…p|φ|,1)、P2=(p1,2,p2,2,…p|φ|,2)。
4) And calculating cosine similarity between TF-IDF vectors corresponding to the two text sets, namely taking the cosine similarity as the similarity between the text sets. In this embodiment, the clustering of the burst phrases adopts a Jarvis-Patrick clustering algorithm, and the algorithm can perform clustering based on the similarity between burst knowledge, and the method can be briefly described as follows: drawing an SNN (shared neighbor similarity) similarity graph according to the burst phrase similarity, thinning the SNN similarity graph by using a similarity threshold, and finding out a communication branch of the thinned SNN similarity graph to obtain a clustering result.
Table 2 shows the burst phrase clusters obtained by clustering the burst phrases in this embodiment.
Figure BDA0002820446500000061
TABLE 2
S104: calculating the TF-IDF score of the emergency:
for burst phrase cluster CkCorresponding events are constructed according to the knowledge graph G of each field category constructed in the step S101nCovered text data, calculating a burst phrase cluster CkKnowledge graph G of each burst phrase v in each domain classnThe score of the TF-IDF is score (v, n), and then the sum is carried out to obtain the burst phrase cluster CkKnowledge graph G of corresponding emergency in each field categorynUpper TF-IDF score (k, n):
Figure BDA0002820446500000062
table 3 shows TF-IDF scores of events corresponding to the burst phrase clusters on the domain category knowledge graph in the embodiment.
Figure BDA0002820446500000063
Figure BDA0002820446500000071
TABLE 3
S105: event tagging and classification:
presetting TF-IDF score threshold
Figure BDA0002820446500000072
For burst phrase cluster CkCorresponding emergency, if it is in domain class knowledge map GnA TF-IDF score of score (k, n) above a threshold value
Figure BDA0002820446500000073
The incident is marked as the domain category to determine the marking and classification of the incident.
Setting the score threshold in the present embodiment
Figure BDA0002820446500000074
The event can be derivedThe labels and classifications are "Military (Military)" and "People (People)", consistent with human observation.
Although illustrative embodiments of the present invention have been described above to facilitate the understanding of the present invention by those skilled in the art, it should be understood that the present invention is not limited to the scope of the embodiments, and various changes may be made apparent to those skilled in the art as long as they are within the spirit and scope of the present invention as defined and defined by the appended claims, and all matters of the invention which utilize the inventive concepts are protected.

Claims (3)

1. An automatic event marking and classifying method based on knowledge graph is characterized by comprising the following steps:
s1: setting N field categories according to actual needs, respectively collecting text data of each field category and constructing a knowledge graph Gn,n=1,2…,N;
S2: and presetting a time window T, crawling each text data published in social media in the time window, extracting key phrases of each file data, and forming a key phrase set A by the extracted key phrases. Respectively calculating the burst degree W of each key phrase s in the key phrase set AsThe calculation formula is as follows:
Ws=ps×log(us)×log(rs)×log(log(fs))
wherein p issRepresenting the probability of a burst, u, of a key phrase s within a time window TsRepresenting the number of users, r, who used the key phrase s within the time window TsRepresenting the number of times a text containing a key phrase s is forwarded within a time window T, fsRepresents the sum of the number of interests of the user using the key phrase s within the time window T;
sorting all key phrases according to the burst degree from high to low, and selecting the first K key phrases as burst phrases to be added into a burst phrase set B;
s3: dividing the time window T into M consecutive disjoint sub-time windows on averagem sub-time windows are TmFor each burst phrase e, e ∈ B, each sub-time window T is countedmText set text (e, m) and number of texts f containing the burst phrase e1(e, m), and the amount of text f containing the burst phrase e over the time window T2(e) Calculating the time window T of each burst phrase e in the sub-time windowmThe ratio d (e, m) is f1(e,m)/f2(e);
Let two burst phrases be e respectivelya、ebFirst, text sets text (e) are calculated separatelyaM) and text set text (e)bM) similarity sim (text (e)a,m),text(ebM)), and then the similarity S (e) of the two burst phrases is calculated using the following formulaa,eb):
Figure FDA0002820446490000011
Clustering the burst phrases according to the similarity between the burst phrases to obtain K burst phrase clusters CkK is 1,2, …, K, each burst phrase cluster CkI.e. an emergency event within the time window T;
s4: for burst phrase cluster CkCorresponding emergencies are constructed according to the knowledge graph G of each field category constructed in the step S101nCovered text data, calculating a burst phrase cluster CkKnowledge graph G of each burst phrase v in each domain classnThe score of the TF-IDF is score (v, n), and then the sum is carried out to obtain the burst phrase cluster CkKnowledge graph G of corresponding emergency in each field categorynUpper TF-IDF score (k, n):
Figure FDA0002820446490000021
s5: presetting TF-IDF score threshold
Figure FDA0002820446490000022
For burst phrase cluster CkCorresponding emergency, if it is in domain class knowledge map GnA TF-IDF score of score (k, n) above a threshold value
Figure FDA0002820446490000023
The incident is marked as the domain category to determine the marking and classification of the incident.
2. The automated event labeling and classification method according to claim 1, wherein the text set similarity in step S3 is TF-IDF similarity, and the calculation method comprises the following steps:
1) respectively segmenting the two text sets, and combining the two obtained word sets to obtain a word set phi;
2) for each word in the set of words phi, the TF-IDF value p of the word in the two text sets is calculated separatelyi,1、pi,2I ═ 1,2, …, | φ |, | φ | represents the number of words in the set of words φ;
3) a TF-IDF vector P corresponding to each text set is constructed and obtained according to the TF-IDF value of each word1=(p1,1,p2,1,…p|φ|,1)、P2=(p1,2,p2,2,…p|φ|,2);
4) And calculating cosine similarity between TF-IDF vectors corresponding to the two text sets, namely taking the cosine similarity as the similarity between the text sets.
3. The automated event labeling and classification method according to claim 1, wherein the clustering of the burst phrases in step S3 adopts a Jarvis-Patrick clustering algorithm.
CN202011417045.0A 2020-12-07 2020-12-07 Automatic event marking and classifying method based on knowledge graph Active CN112487306B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011417045.0A CN112487306B (en) 2020-12-07 2020-12-07 Automatic event marking and classifying method based on knowledge graph

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011417045.0A CN112487306B (en) 2020-12-07 2020-12-07 Automatic event marking and classifying method based on knowledge graph

Publications (2)

Publication Number Publication Date
CN112487306A true CN112487306A (en) 2021-03-12
CN112487306B CN112487306B (en) 2023-01-17

Family

ID=74939897

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011417045.0A Active CN112487306B (en) 2020-12-07 2020-12-07 Automatic event marking and classifying method based on knowledge graph

Country Status (1)

Country Link
CN (1) CN112487306B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113010689A (en) * 2021-03-22 2021-06-22 平安科技(深圳)有限公司 Buddhism knowledge discrimination method, device, equipment and storage medium
CN117076596A (en) * 2023-10-16 2023-11-17 微网优联科技(成都)有限公司 Data storage method, device and server applying artificial intelligence

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104239436A (en) * 2014-08-27 2014-12-24 南京邮电大学 Network hot event detection method based on text classification and clustering analysis
CN107633044A (en) * 2017-09-14 2018-01-26 国家计算机网络与信息安全管理中心 A kind of public sentiment knowledge mapping construction method based on focus incident
CN107704637A (en) * 2017-11-20 2018-02-16 中国人民解放军国防科技大学 Knowledge graph construction method for emergency
CN107908633A (en) * 2017-07-26 2018-04-13 北京师范大学珠海分校 A kind of finance and economics reasoning method of knowledge based collection of illustrative plates
CN108052576A (en) * 2017-12-08 2018-05-18 国家计算机网络与信息安全管理中心 A kind of reason knowledge mapping construction method and system
CN108897857A (en) * 2018-06-28 2018-11-27 东华大学 The Chinese Text Topic sentence generating method of domain-oriented
CN110543574A (en) * 2019-08-30 2019-12-06 北京百度网讯科技有限公司 knowledge graph construction method, device, equipment and medium
CN111241278A (en) * 2020-01-06 2020-06-05 北京明略软件系统有限公司 Public opinion early warning method and device based on knowledge graph

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104239436A (en) * 2014-08-27 2014-12-24 南京邮电大学 Network hot event detection method based on text classification and clustering analysis
CN107908633A (en) * 2017-07-26 2018-04-13 北京师范大学珠海分校 A kind of finance and economics reasoning method of knowledge based collection of illustrative plates
CN107633044A (en) * 2017-09-14 2018-01-26 国家计算机网络与信息安全管理中心 A kind of public sentiment knowledge mapping construction method based on focus incident
CN107704637A (en) * 2017-11-20 2018-02-16 中国人民解放军国防科技大学 Knowledge graph construction method for emergency
CN108052576A (en) * 2017-12-08 2018-05-18 国家计算机网络与信息安全管理中心 A kind of reason knowledge mapping construction method and system
CN108897857A (en) * 2018-06-28 2018-11-27 东华大学 The Chinese Text Topic sentence generating method of domain-oriented
CN110543574A (en) * 2019-08-30 2019-12-06 北京百度网讯科技有限公司 knowledge graph construction method, device, equipment and medium
CN111241278A (en) * 2020-01-06 2020-06-05 北京明略软件系统有限公司 Public opinion early warning method and device based on knowledge graph

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
KEVAL MORABIA ET AL.: ""SEDTWik: Segmentation-based Event Detection from Tweets Using Wikipedia"", 《PROCEEDINGS OF THE 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: STUDENT RESEARCH WORKSHOP》 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113010689A (en) * 2021-03-22 2021-06-22 平安科技(深圳)有限公司 Buddhism knowledge discrimination method, device, equipment and storage medium
CN117076596A (en) * 2023-10-16 2023-11-17 微网优联科技(成都)有限公司 Data storage method, device and server applying artificial intelligence
CN117076596B (en) * 2023-10-16 2023-12-26 微网优联科技(成都)有限公司 Data storage method, device and server applying artificial intelligence

Also Published As

Publication number Publication date
CN112487306B (en) 2023-01-17

Similar Documents

Publication Publication Date Title
CN110297988B (en) Hot topic detection method based on weighted LDA and improved Single-Pass clustering algorithm
Bansal et al. On predicting elections with hybrid topic based sentiment analysis of tweets
US20230222366A1 (en) Systems and methods for semantic analysis based on knowledge graph
CN107766585B (en) Social network-oriented specific event extraction method
Kestemont et al. Cross-genre authorship verification using unmasking
US7526462B2 (en) Method and apparatus for generating time-series data from web pages
Suresh An unsupervised fuzzy clustering method for twitter sentiment analysis
TW201839628A (en) Method, system and apparatus for discovering and tracking hot topics from network media data streams
Nguyen et al. Real-time event detection using recurrent neural network in social sensors
Dang et al. Framework for retrieving relevant contents related to fashion from online social network data
El Ali et al. Measuring, understanding, and classifying news media sympathy on twitter after crisis events
CN112487306B (en) Automatic event marking and classifying method based on knowledge graph
Noel et al. Applicability of Latent Dirichlet Allocation to multi-disk search
Patil et al. Machine learning techniques for the classification of fake news
US11609959B2 (en) System and methods for generating an enhanced output of relevant content to facilitate content analysis
Andoh et al. Statistical analysis of public sentiment on the ghanaian government: a machine learning approach
CN107908749B (en) Character retrieval system and method based on search engine
Kumar et al. Fake news detection of Indian and United States election data using machine learning algorithm
Fabian et al. Privacy on Reddit? Towards Large-scale User Classification.
CN112434126B (en) Information processing method, device, equipment and storage medium
Kotevska et al. Sentiment analysis of Social Sensors for local services improvement
Hartanto et al. Classifying User Personality Based on Media Social Posts Using Support Vector Machine Algorithm Based on DISC Approach
Kotevska et al. Automatic Categorization of Social Sensor Data
Huang Research on feature classification method of network text data based on association rules
CN112527960A (en) Emergency detection method based on keyword clustering

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant