CN115481313A - News recommendation method based on text semantic mining - Google Patents

News recommendation method based on text semantic mining Download PDF

Info

Publication number
CN115481313A
CN115481313A CN202110668465.4A CN202110668465A CN115481313A CN 115481313 A CN115481313 A CN 115481313A CN 202110668465 A CN202110668465 A CN 202110668465A CN 115481313 A CN115481313 A CN 115481313A
Authority
CN
China
Prior art keywords
news
user
vector
topic
items
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110668465.4A
Other languages
Chinese (zh)
Inventor
王海艳
胡阳
骆健
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing University of Posts and Telecommunications
Original Assignee
Nanjing University of Posts and Telecommunications
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing University of Posts and Telecommunications filed Critical Nanjing University of Posts and Telecommunications
Priority to CN202110668465.4A priority Critical patent/CN115481313A/en
Publication of CN115481313A publication Critical patent/CN115481313A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to a news recommendation method based on text semantic mining, which comprises the following steps: acquiring news item information and log information of user reading history from a database; establishing a word embedding matrix for the titles of the news items through the pre-trained word vectors; obtaining a news title embedded expression vector through feature extraction; performing topic modeling on news text content based on a neural topic model to obtain a news content topic embedding vector, and forming a final news characteristic representation; analyzing the behavior data in the user reading history log, and extracting user characteristic representation at the user side according to the user reading record; and introducing a time decay function into the model to generate a recommended news candidate set of the top N items. According to the invention, the word-level characteristics and the theme characteristics of the news items are analyzed through the bidirectional cyclic neural network and the neural theme model, so that abundant semantic information in news texts can be effectively mined, the characteristics of the news items can be more accurately represented, and the recommendation effect is improved.

Description

News recommendation method based on text semantic mining
Technical Field
The invention belongs to the technical field of news recommendation, and particularly relates to a news recommendation method based on text semantic mining.
Background
The increasingly prosperous World Wide Web (WWW) has gradually changed the way people look for and read news, from traditional print media to online portals. In order to alleviate the problem of information overload, recommendation systems are widely used in modern online services, which can help users quickly find out the relevant content needed by themselves. The same user typically has multiple interests that are reflected in the different news items that they browse through. Meanwhile, important semantic features of news are hidden in text segments with different granularities, and news contents are full of different types of subject information, which is very important for learning accurate news and user representation forms of news recommendation. However, the existing news recommendation method usually ignores that the fine-grained modeling of the news item can enhance the news recommendation effect.
CN2014104033786 discloses a news recommendation system, in which data correlation analysis builds a new personalized news recommendation hypergraph model by mining the internal relations between phrases, in the hypergraph model, nodes represent phrases, edges represent some internal relations between the phrases, and the weight of the edge is used to represent the contribution of the relation, but this news recommendation system obtains high-quality clients through historical data and recommends news which is newly published or does not have enough access records to the users, and the news image recognition effect is not good.
CN201510242541X discloses a news recommendation method based on user interests, which can quickly and efficiently store and process news data by using an interactive user side and a recommendation system, but the recommendation method has long data processing time, and the calculation overhead for calculating the news similarity is very large due to quick news updating.
Disclosure of Invention
In order to solve the problems, the invention provides a news recommendation method based on text semantic mining, which comprises the steps of extracting word-level semantic information of news titles by using a bidirectional recurrent neural network to obtain news title expression vectors, modeling topics of news text contents by using a neural topic model to obtain topic semantic information, and splicing the topic semantic information and the title embedded expression vectors to jointly express news item characteristics; distributing weights to different interests in the historical reading records of the user by using an attention network to obtain a characteristic expression vector of the user side; and introducing a time attenuation function into the model, and calculating the characteristic representation of the user side and the characteristic representation of the target news item by using a score function to generate the news item which is more in line with the reading interest of the user so as to realize news recommendation. The method specifically comprises the following steps:
s10, acquiring news item information and log information of reading history of a user from a database, and specifically comprising the following steps:
s11, obtaining news item information including news numbers, news titles, news contents and release date and time stamps from a database;
s12, obtaining news items browsed by a user from a user database, wherein the news items comprise user numbers, news numbers and user reading timestamps;
and S13, the acquired information is sorted and preprocessed to obtain a training set and a testing set.
S20, establishing a word embedding matrix for the titles of the news items through the pre-trained word vectors, wherein the specific method comprises the following steps: for each news article the headline is represented as a matrix consisting of word-embedding, i.e. a n-sized vocabulary in the headline is mapped into a d-dimensional vector, with the emphasis on meaning of the words, and word-embedding converts the news headline from a sequence of words into a semantic vector matrix X 1:m =[x 1 ,x 2 ,...,x n ]。
S30, extracting the characteristics of the news headline embedding matrix by using a bidirectional recurrent neural network to obtain a news headline embedding vector;
the method specifically comprises the following steps:
s31, extracting the title features of the news based on the bidirectional recurrent neural network, and capturing the context information of the word sequence:
i t =σ(W i x t +U i h t-1 +b i )
f t =σ(W f x t +U f h t-1 +b f )
Figure BDA0003117878760000031
o t =σ(W o x t +U o h t-1 +b o )
Figure BDA0003117878760000032
Figure BDA0003117878760000033
where σ (-) is the activation function, i t Is an input gate, f t Is a forgetting door o t Is an output gate of the optical fiber,
Figure BDA0003117878760000036
candidate information indicating that the current time t needs to be updated to the current cell state, and b is a bias vector;
s32, learning the header sequence by utilizing forward propagation and backward propagation, and calculating a hidden state H:
Figure BDA0003117878760000034
s33, extracting more information features in the news headlines based on the attention network, selecting key words in the headlines, carrying out weighted summation on context representations of the words through attention weights, and obtaining feature representation vectors v of the news headlines t The following were used:
Figure BDA0003117878760000035
s40, starting a neural topic model to perform topic modeling on news text content to obtain a news content topic embedding vector, and integrating the news title embedding vector and the news content topic embedding vector to form a final news characteristic representation vector, wherein the method for performing topic modeling on the news text content comprises the following steps:
s41, a variational-based self-encoder framework that learns potential topics by encoding-decoding, let x be the word bag representation of a given news text, where v is the vocabulary, and in the encoder, u = f u (x),logσ=f σ (x) Where u and σ are prior parameters of the parameterized topic model distribution in the decoder network, f u ,f σ Is a linear transformation with a Relu activation function;
s42, generating document themes by using a decoder, and drawing theme distribution by using a Gaussian softmax function, namely z-N (mu, sigma) 2 ),θ=softmax(z)
Where z is the underlying topic variable, σ is the subject distribution, and k is the predefined number of topics, by
Figure BDA0003117878760000041
Learning predicted words
Figure BDA0003117878760000042
The probability of (a) of (b) being,
Figure BDA0003117878760000043
similar to the topic word distribution matrix of the LDA topic model,
Figure BDA0003117878760000044
representing the correlation between the ith word and the jth topic, extracting each word from p to reconstruct the input x, and further using an intermediate parameter w 0 And θ to construct a topic representation, as follows:
Figure BDA0003117878760000045
Figure BDA0003117878760000046
wherein
Figure BDA0003117878760000047
Representing a set of topic representations having a predefined d dimension,
Figure BDA0003117878760000048
is a linear transformation with a Relu activation function,
Figure BDA0003117878760000049
is a weighted sum of each topic representation, considered as the overall topic representation of the news item;
s43, integrating the news headline embedding vector and the news content theme embedding vector to form a final news characteristic representation vector v, wherein the final news characteristic representation vector v is as follows: v = Concat (v) t ,v c )。
S50, analyzing the behavior data in the user reading history log, and extracting a user preference feature expression vector at the user side according to the user reading record;
the method for analyzing the behavior data in the user reading history log and extracting the user feature expression vector according to the user reading record at the user side comprises the following steps:
the news read by the user is compared with the target news by using the attention network to obtain the final embedded vector of the user. Denote the historical reading record of user u as d 1 ,d 2 ,...d n Its embedding can be expressed as: { v 1 ,v 2 ,...v n }. Different weights are assigned to news items clicked by the user by adopting the attention network so as to learn different interests of the user in the various news items. For a target news item, similarity between news embedded representations read by a user and the target news item is calculated, and a user representation vector of the news embedded representations in a historical reading record of the user as the target news item is calculated, wherein the interest weight of each news item read by the user is determined by the similarity. In the attention mechanism, the query is the target news, and the user's history reading news items are keys and values. The user characteristics obtained are represented as follows:
v u =Attention(q,k,v)
=softmax(qk T )v。
and S60, introducing the time attenuation factor into the model, and calculating the similarity of the user preference characteristic expression vector and the news characteristic expression vector by using a score function to generate a recommended news candidate set of the previous N items.
The time decay function is defined here as:
Figure BDA0003117878760000051
where λ is a parameter that needs to be adjusted during training to control the decay rate of the news, t and t 0 The reading time and the news release time at a certain moment are represented, and the top N candidate news items are searched according to a score function, wherein the score function is as follows:
Figure BDA0003117878760000052
where N is a predefined number of items to be retrieved during the matching phase.
The invention has the beneficial effects that:
(1) The invention uses the bidirectional circulation neural network and the neural topic model to learn the news headlines and the news contents with different granularities, can effectively extract the word-level semantic information and the topic semantic information of the news items, and enriches the feature representation of the news items.
(2) According to the method and the device, the attention network is used for simulating the influence of the news items in the historical reading records of the user on the target news, so that different interests of the user are represented, and the accuracy of the preference of the user side can be improved.
(3) The time decay function is introduced, timeliness of news recommendation is met to a certain extent, and the news recommendation can be carried out in real time.
Drawings
FIG. 1 is an overall flow diagram of an embodiment of the present invention.
Detailed Description
In the following description, for purposes of explanation, numerous implementation details are set forth in order to provide a thorough understanding of the embodiments of the invention. It should be understood, however, that these implementation details should not be taken to limit the invention. That is, in some embodiments of the invention, such implementation details are not necessary.
As shown in fig. 1, the present invention is a news recommendation method based on text semantic mining, which specifically includes the following steps:
and S10, acquiring news item information and log information of reading history of the user from the database.
First, news item information including news numbers, news headlines, news contents, and release date and time stamps is obtained from a database.
Then, the news items browsed by the user are obtained from the user database, wherein the news items comprise user numbers, news numbers and user reading time stamps.
And finally, arranging and preprocessing the acquired information to obtain a training set and a testing set.
And S20, establishing a word embedding matrix for the titles of the news items through the pre-trained word vectors.
The headline for each news article can be represented as a matrix consisting of word embedding, i.e., mapping n-sized words in the headline into a d-dimensional vector with emphasis on the meaning of the words, which converts the news headline from a sequence of words into a semantic vector matrix X 1:n =[x 1 ,x 2 ,...,x n ). Word embedding can be any pre-trained word embedding model, such as fastText, word2Vec, or Glove, which uses the word vector pre-trained by the fastText model in this example, and the dimension d of the word vector is 100.
S30, extracting the characteristics of the news headline embedding matrix by using a bidirectional recurrent neural network to obtain a news headline embedding vector;
firstly, extracting the title features of news based on a bidirectional recurrent neural network, and capturing the context information of a word sequence:
i t =σ(W i x t +U i h t-1 +b i )
f t =σ(W f x t +U f h t-1 +b f )
Figure BDA0003117878760000071
o t =σ(W o x t +U o h t-1 +b o )
Figure BDA0003117878760000072
Figure BDA0003117878760000073
where σ (·) is an activation function, i t Is an input gate, f t Is a forgetting door o t Is an output gate of the optical fiber,
Figure BDA0003117878760000074
candidate information indicating that the current time t needs to be updated to the current cell state, and b is a bias vector;
then, learning the header sequence by utilizing forward propagation and backward propagation respectively, and calculating a hidden state H:
Figure BDA0003117878760000075
finally, extracting more information features in the news headline based on the attention network so as to select key words in the headline, weighting and summing the context representation of the words through attention weight to obtain a feature representation vector v of the news headline t The following were used:
Figure BDA0003117878760000076
and S40, starting a neural topic model to perform topic modeling on news text content to obtain a news content topic embedding vector, and integrating the news title embedding vector and the news content topic embedding vector to form a final news characteristic representation vector.
S41, a variational-based self-encoder framework that learns potential topics by encoding-decoding, let x be the word bag representation of a given news text, where v is the vocabulary, and in the encoder, u = f u (x),logσ=f σ (x) Where u and σ are prior parameters of the parameterized topic model distribution in the decoder network, f u ,f σ Is a linear transformation with a Relu activation function;
s42, generating document themes by using a decoder, and drawing theme distribution by using a Gaussian softmax function, namely z-N (mu, sigma) 2 ),θ=sofmax(z)
Where z is the underlying topic variable, σ is the subject distribution, and k is the predefined number of topics, by
Figure BDA0003117878760000081
Learning predicted words
Figure BDA0003117878760000082
The probability of (a) of (b) being,
Figure BDA0003117878760000083
similar to the topic word distribution matrix of the LDA topic model,
Figure BDA0003117878760000084
representing the correlation between the ith word and the jth topic, extracting each word from p to reconstruct the input x, and further using intermediate parameters
Figure BDA00031178787600000810
And θ to construct a topic representation, as follows:
Figure BDA0003117878760000085
Figure BDA0003117878760000086
wherein
Figure BDA0003117878760000087
Representing a set of topic representations having a predefined d dimension,
Figure BDA0003117878760000088
is a linear transformation with a Relu activation function,
Figure BDA0003117878760000089
is a weighted sum of each topic representation, which can be considered as an overall topic representation of the news item;
s43, integrating the news headline embedding vector and the news content theme embedding vector to form a final news characteristic representation vector v, wherein the final news characteristic representation vector v is as follows: v = Concat (v) t ,v c )。
S50, analyzing the behavior data in the user reading history log, and extracting user preference characteristic representation according to the user reading record at the user side;
comparing the news read by the user with the target news by using the attention network to obtain a final embedded vector of the user, and expressing the historical reading record of the user u as { d } 1 ,d 2 ,...d n Its embedding can be expressed as: { v) 1 ,v 2 ,...,v n }; the attention network is adopted to distribute different weights for news items clicked by users so as to learn different interests of the users for each news item, for a target news, the similarity between news embedded representations read by the users and the target news items is calculated, and a user representation vector taking the news embedded representations in the history reading records of the users as the target news items is calculated, wherein the interest weight of each news item read by the users is determined by the similarity; in the attention mechanism, the query is target news, the historical reading news items of the user are keys and values, and the obtained user preference characteristics are represented as follows:
v u =Attention(q,k,v)
=softmax(qk T )v。
and S60, introducing the time attenuation factor into the model, and calculating the similarity of the user preference characteristic expression vector and the news characteristic expression vector by using a score function to generate a recommended news candidate set of the previous N items.
The time decay function is defined as:
Figure BDA0003117878760000091
where λ is a parameter that needs to be adjusted during training to control the decay rate of the news, t and t 0 The reading time and the news release time at a certain moment are represented, and the top N candidate news items are searched according to a score function, wherein the score function is as follows:
Figure BDA0003117878760000092
where N is the predefined number of items to be retrieved in the matching phase, which in this example is 50.
According to the invention, the word-level characteristics and the theme characteristics of the news items are analyzed through the bidirectional cyclic neural network and the neural theme model, so that abundant semantic information in news texts can be effectively mined, the characteristics of the news items can be more accurately represented, and the recommendation effect is improved.
The above description is only an embodiment of the present invention, and is not intended to limit the present invention. Various modifications and alterations to this invention will become apparent to those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention should be included in the scope of the claims of the present invention.

Claims (7)

1. A news recommendation method based on text semantic mining is characterized in that: the recommendation method comprises the following steps:
s10, acquiring news item information and log information of reading history of a user from a database;
s20, establishing a word embedding matrix for the titles of the news items through the pre-trained word vectors;
s30, extracting features of the news headline embedding matrix by using a bidirectional recurrent neural network to obtain a news headline embedding vector;
s40, starting a neural topic model to perform topic modeling on news text content to obtain a news content topic embedding vector, and integrating the news title embedding vector and the news content topic embedding vector to form a final news characteristic representation vector;
s50, analyzing the behavior data in the user reading history log, and extracting a user preference feature expression vector at the user side according to the user reading record;
and S60, introducing the time attenuation factor into the model, and calculating the similarity of the user preference characteristic expression vector and the news characteristic expression vector by using a score function to generate a recommended news candidate set of the previous N items.
2. The news recommendation method based on text semantic mining as claimed in claim 1, wherein: the step S30 of extracting features of the news headline embedding matrix by using the bidirectional recurrent neural network to obtain a news headline embedding vector specifically includes the following steps:
s31, extracting the title features of the news based on the bidirectional recurrent neural network, and capturing the context information of the word sequence:
i t =σ(W i x t +U i h t-1 +b i )
f t =σ(W f x t +U f h t-1 +b f )
Figure FDA0003117878750000011
o t =σ(W o x t +U o h t-1 +b o )
Figure FDA0003117878750000012
Figure FDA0003117878750000013
where σ (-) is the activation function, i t Is an input gate, f t Is forgetting to gate o t Is an output gate of the optical fiber,
Figure FDA0003117878750000028
candidate information indicating that the current time t needs to be updated to the current cell state, and b is a bias vector;
s32, learning the header sequence by utilizing forward propagation and backward propagation, and calculating a hidden state H:
Figure FDA0003117878750000021
s33, extracting more information features in the news headlines based on the attention network so as to select key words in the headlines, and performing weighted summation on the context representation of the words through attention weight to obtain a feature representation vector v of the news headlines t The following were used:
Figure FDA0003117878750000022
3. the news recommendation method based on text semantic mining as claimed in claim 1, wherein: the method for topic modeling of news text content in the step S40 includes the following steps:
s41, a variational-based self-encoder framework that learns potential topics by encoding-decoding, letting x be a word bag representation of a given news textWhere v is a vocabulary, in the encoder, u = f u (x),logσ=f σ (x) Where u and σ are prior parameters of the parameterized topic model distribution in the decoder network, f u ,f σ Is a linear transformation with a Relu activation function;
s42, generating document themes by using a decoder, and drawing theme distribution by using a Gaussian softmax function, namely z-N (mu, sigma) 2 ),θ=softmax(z)
Where z is the underlying topic variable, σ is the subject distribution, k is the predefined number of topics, and
Figure FDA0003117878750000023
learning predicted words
Figure FDA0003117878750000024
The probability of (a) of (b) being,
Figure FDA0003117878750000025
similar to the topic word distribution matrix of the LDA topic model,
Figure FDA0003117878750000026
representing the correlation between the ith word and the jth topic, extracting each word from p to reconstruct the input x, and further using intermediate parameters
Figure FDA0003117878750000027
And θ to construct a topic representation, as follows:
Figure FDA0003117878750000031
Figure FDA0003117878750000032
wherein
Figure FDA0003117878750000033
Representing a set of topic representations having a predefined d dimension,
Figure FDA0003117878750000034
is a linear transformation with a Relu activation function,
Figure FDA0003117878750000035
is a weighted sum of each topic representation, considered as the overall topic representation of the news item;
s43, integrating the news headline embedding vector and the news content theme embedding vector to form a final news characteristic representation vector v, wherein the final news characteristic representation vector v is as follows: v = Concat (v) t ,v c )。
4. The news recommendation method based on text semantic mining as claimed in claim 1, wherein: the method for analyzing the behavior data in the user reading history log in the step S50 and extracting the user preference feature expression vector according to the user reading record at the user side includes the following steps:
s51, comparing the news read by the user with the target news by using the attention network to obtain a final embedded vector of the user, and expressing the historical reading record of the user u as { d } 1 ,d 2 ,…d n Its embedding can be expressed as: { v 1 ,v 2 ,...,v n };
S52, distributing different weights to news items clicked by users by adopting an attention network to learn different interests of the users to the news items, calculating the similarity between news embedded representations read by the users and the target news items for a target news, and calculating user representation vectors taking the news embedded representations in the history reading records of the users as the target news items, wherein the interest weight of each news item read by the users is determined by the similarity;
s53, in the attention mechanism, the query is target news, the historical reading news items of the user are keys and values, and the obtained user preference features are represented as follows:
v u =Attention(q,k,v)
=softmax(qk T )v。
5. the news recommendation method based on text semantic mining as claimed in claim 1, wherein: the time decay function in said step S60 is defined as:
Figure FDA0003117878750000041
where λ is a parameter that needs to be adjusted during training to control the decay rate of the news, t and t 0 The reading time and the news release time at a certain moment are represented, and the top N candidate news items are searched according to a score function, wherein the score function is as follows:
Figure FDA0003117878750000042
where N is a predefined number of items to be retrieved during the matching phase.
6. The news recommendation method based on text semantic mining as claimed in claim 1, wherein: the step S10 specifically includes the steps of:
s11, obtaining news item information from a database, wherein the news item information comprises news numbers, news titles, news contents and release date and time stamps;
s12, obtaining news items browsed by a user from a user database, wherein the news items comprise user numbers, news numbers and user reading timestamps;
and S13, the acquired information is sorted and preprocessed to obtain a training set and a testing set.
7. The news recommendation method based on text semantic mining as claimed in claim 1, wherein: the method for establishing the news headline embedding matrix in the step S20Comprises the following steps: for each news article the headline is represented as a matrix consisting of word-embedding, i.e. a n-sized vocabulary in the headline is mapped into a d-dimensional vector, with the emphasis on meaning of the words, and word-embedding converts the news headline from a sequence of words into a semantic vector matrix X 1:n =[x 1 ,x 2 ,...,x n ]。
CN202110668465.4A 2021-06-16 2021-06-16 News recommendation method based on text semantic mining Pending CN115481313A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110668465.4A CN115481313A (en) 2021-06-16 2021-06-16 News recommendation method based on text semantic mining

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110668465.4A CN115481313A (en) 2021-06-16 2021-06-16 News recommendation method based on text semantic mining

Publications (1)

Publication Number Publication Date
CN115481313A true CN115481313A (en) 2022-12-16

Family

ID=84420351

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110668465.4A Pending CN115481313A (en) 2021-06-16 2021-06-16 News recommendation method based on text semantic mining

Country Status (1)

Country Link
CN (1) CN115481313A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116383521A (en) * 2023-05-19 2023-07-04 苏州浪潮智能科技有限公司 Subject word mining method and device, computer equipment and storage medium
CN117312542A (en) * 2023-11-29 2023-12-29 泰山学院 Reading recommendation method and system based on artificial intelligence

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116383521A (en) * 2023-05-19 2023-07-04 苏州浪潮智能科技有限公司 Subject word mining method and device, computer equipment and storage medium
CN116383521B (en) * 2023-05-19 2023-08-29 苏州浪潮智能科技有限公司 Subject word mining method and device, computer equipment and storage medium
CN117312542A (en) * 2023-11-29 2023-12-29 泰山学院 Reading recommendation method and system based on artificial intelligence
CN117312542B (en) * 2023-11-29 2024-02-13 泰山学院 Reading recommendation method and system based on artificial intelligence

Similar Documents

Publication Publication Date Title
CN109948165B (en) Fine granularity emotion polarity prediction method based on mixed attention network
CN109829104B (en) Semantic similarity based pseudo-correlation feedback model information retrieval method and system
Qiu et al. DGeoSegmenter: A dictionary-based Chinese word segmenter for the geoscience domain
CN112131350B (en) Text label determining method, device, terminal and readable storage medium
CN109376222B (en) Question-answer matching degree calculation method, question-answer automatic matching method and device
CN110489523B (en) Fine-grained emotion analysis method based on online shopping evaluation
CN109726745B (en) Target-based emotion classification method integrating description knowledge
Cai et al. Intelligent question answering in restricted domains using deep learning and question pair matching
CN111414461A (en) Intelligent question-answering method and system fusing knowledge base and user modeling
CN113591483A (en) Document-level event argument extraction method based on sequence labeling
CN113761218A (en) Entity linking method, device, equipment and storage medium
CN113569001A (en) Text processing method and device, computer equipment and computer readable storage medium
Li et al. UD_BBC: Named entity recognition in social network combined BERT-BiLSTM-CRF with active learning
CN115481313A (en) News recommendation method based on text semantic mining
CN112287090A (en) Financial question asking back method and system based on knowledge graph
CN116680363A (en) Emotion analysis method based on multi-mode comment data
CN113821587B (en) Text relevance determining method, model training method, device and storage medium
CN117034921B (en) Prompt learning training method, device and medium based on user data
CN113961666A (en) Keyword recognition method, apparatus, device, medium, and computer program product
CN112749566B (en) Semantic matching method and device for English writing assistance
CN113535949A (en) Multi-mode combined event detection method based on pictures and sentences
CN116932736A (en) Patent recommendation method based on combination of user requirements and inverted list
CN116958677A (en) Internet short video classification method based on multi-mode big data
CN113780832B (en) Public opinion text scoring method, public opinion text scoring device, computer equipment and storage medium
CN116257618A (en) Multi-source intelligent travel recommendation method based on fine granularity emotion analysis

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination