CN112100517A - Method for relieving cold start problem of recommendation system based on content feature extraction - Google Patents

Method for relieving cold start problem of recommendation system based on content feature extraction Download PDF

Info

Publication number
CN112100517A
CN112100517A CN202010977547.2A CN202010977547A CN112100517A CN 112100517 A CN112100517 A CN 112100517A CN 202010977547 A CN202010977547 A CN 202010977547A CN 112100517 A CN112100517 A CN 112100517A
Authority
CN
China
Prior art keywords
word
similarity
content feature
service
content
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010977547.2A
Other languages
Chinese (zh)
Inventor
陈佳雯
张宏国
马超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Harbin University of Science and Technology
Original Assignee
Harbin University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Harbin University of Science and Technology filed Critical Harbin University of Science and Technology
Priority to CN202010977547.2A priority Critical patent/CN112100517A/en
Publication of CN112100517A publication Critical patent/CN112100517A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9536Search customisation based on social or collaborative filtering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/211Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates

Abstract

The invention relates to a method for relieving a cold start problem of a recommendation system based on content feature extraction. With the rapid development of the internet technology, the problem of information overload is more obvious, and a proper user group is difficult to find for recommendation for a new service without user history scores. A method for alleviating the cold start problem of a recommendation system based on content feature extraction is characterized in that when content features are extracted, a method of dependency syntax analysis in natural language processing is adopted to extract the description information features of items, and the extracted content features are converted into word vectors; secondly, considering that the importance degree of each Word is different in practical situation, a Weighted Word Distance algorithm (WWMD) optimized based on TF-IDF is used to improve the accuracy of calculating the Word Distance of the content feature vector, so that after the accuracy of the similarity between the articles is improved, recommendation is performed by combining the similarity calculated by using the Word Distance with a traditional similarity calculation method. The invention is applied to the field of Internet.

Description

Method for relieving cold start problem of recommendation system based on content feature extraction
Technical Field
The invention relates to a method for relieving a cold start problem of a recommendation system based on content feature extraction.
Background
At present, with the rapid development of the internet technology, the problem of information overload is more obvious, emerging services are difficult to find a proper user group for recommendation without user history scores, and users are difficult to contact the latest online service with a high probability of being suitable for the users in a plurality of services, which is the common cold start problem accompanying the application of the most extensive collaborative filtering recommendation algorithm in the field of recommendation systems.
Disclosure of Invention
The invention aims to provide a method for relieving the cold start problem of a recommendation system based on content feature extraction, which mainly solves the problem of recommending articles without historical scores and calculates the similarity between the articles according to the word distance between the extracted content features of the articles.
The above purpose is realized by the following technical scheme:
a method for alleviating the cold start problem of a recommendation system based on content feature extraction is characterized in that when content features are extracted, a method of dependency syntax analysis in natural language processing is adopted to extract the description information features of items, and the extracted content features are converted into word vectors; secondly, considering that the importance degree of each word is different under the actual condition, a TF-IDF method is used for optimizing a word distance algorithm so as to improve the accuracy of calculating the word distance of the content feature vector and further improve the accuracy of the similarity between the articles; and finally, recommending by combining the similarity calculated by using the word distance and a traditional similarity calculation method.
The method for alleviating the cold start problem of the recommendation system based on the content feature extraction comprises the following steps of: performing content characteristic analysis on the description information and the updating function description information in the article, and extracting the content characteristics of the article information through part-of-speech tagging and dependency syntax analysis in natural language processing; firstly, segmenting words of an article characteristic text, and performing part-of-speech tagging on each word after segmentation; secondly, according to the part of speech tagging of each word after word segmentation, dependency syntax analysis is carried out on the characteristic text of the article, and the relation between the lyrics in the text is analyzed.
The method for relieving the cold start problem of the recommendation system based on the content feature extraction comprises the following steps of:
(1) is provided with a trained characteristic word vector matrix
Figure BDA0002686301020000011
A total of n words, column i xiRdA d-dimensional word vector representing the ith word, the euclidean distance between word i and word j being c (i, j) | | xi-xj||2
(2) A piece of content feature text for describing service a is processed by a skip-gram model and can be used as a sparse vector da∈RnAs the expression of the word bag, the word i appears n in the content characteristic texti,aNext, at daThe sum of the number of times of all the words appearing in (a) is ∑knk,aThen the TF value of the word i is
Figure BDA0002686301020000021
(3) Assuming that the total number of content feature texts in the corpus is | D |, the number of texts containing the word i is | { a: i is e daI, the IDF value of the word i is
Figure BDA0002686301020000022
The TF-IDF value of the word i in the content feature description text of the service a is tfidfi,a=tfi,a×idfi
(4) There are two services a and b, order daAnd dbBag-of-words representations, each representing two pieces of content characteristic text to be computed, daEach word i in (a) may be wholly or partially transferred to dbIn (1), a sparse transfer matrix T epsilon R is definedn×nThen T isijIndicates how many slaves daWord i in (1) is transferred to dbThe word j, T inijIs greater than or equal to 0, so the sum of the transfer costs is sigmai,jTijc(i,j);
(5) Considering the idea of word distance algorithm, when the transfer cost sum is larger, the similarity between the two content feature texts participating in the calculation is lower, namely the similarity between the two content feature texts is inversely proportional to the minimum transfer cost sum between the texts, and the above steps are taken as followsAfter the problem is converted into a linear programming problem, the content feature text similarity between the services a and b is
Figure BDA0002686301020000023
So that
Figure BDA0002686301020000024
And is
Figure BDA0002686301020000025
The method for relieving the cold start problem of the recommendation system based on the content feature extraction comprises the following steps of:
(1) let two sets in the recommendation system: a user set U and an item set I; wherein U is { U ═1,u2,u3,...um},I={i1,i2,i3,...inAll articles can be scored by each user individual, most services are scored by the users, and user-service (U-S) scoring matrixes are obtained after the users correspond to the services one by one
Figure BDA0002686301020000033
(2) Let ru,aRepresenting the rating, r, of user u for service au,bRepresents the rating of the user u for the service b,
Figure BDA0002686301020000034
average rating for rating services on behalf of the user; the similarity between service a and service b is
Figure BDA0002686301020000031
(3) Setting two services a and b, calculating the service similarity based on the content feature text based on the content feature extraction method of claim 2 and the content feature similarity calculation method of claim 3 based on the TF-IDF and word distance algorithmDegree is simchar(a, b) calculating the mixed similarity of the services a and b as sim (a, b) lambda.sim by combining with the traditional similarity calculation methodchar(a,b)+(1-λ)·simrating(a, b), wherein lambda is a weight factor occupied by the two similarity values;
(4) let pred (u, p) be the predicted score of user u for service p, sim (i, p) be the mixed similarity between scored service i and predicted service p, and the predicted score is calculated as
Figure BDA0002686301020000032
The method for relieving the cold start problem of the recommendation system based on the content feature extraction comprises the following steps:
(1) establishing a user-service (U-S) scoring matrix according to the existing scoring data of the user in the recommending system;
(2) extracting content characteristics of services in a recommendation system and performing word vectorization processing;
(3) according to the obtained content feature word vectors, under a content feature similarity calculation method based on TF-IDF and word distance algorithm, calculating similarity sim between articleschar(si,sj);
(4) Calculating the service similarity sim based on the user score by using the traditional goods-based collaborative filtering recommendation algorithm according to the user-service (U-S) score matrixrating(si,sj);
(5) For each service skAnd calculating the user u according to a mixed recommendation algorithm based on the content characteristics and the article similarityiFor service skPrediction score pred (u, s) ofk);
(6) All the services to be recommended are sorted in a descending way according to the predicted score values of the services, wherein the first N services are used as target users uiAnd (4) completing service recommendation.
The invention has the following beneficial effects:
1. according to the method, the problem of cold start of the article in the conventional collaborative filtering recommendation algorithm is solved by combining the similarity of the article content characteristics and the user scoring similarity, so that the average error value is further reduced, and the performance of a recommendation system is improved. The recommendation algorithm adopted by the invention ensures that the recommendation result is more reliable and accurate and the coverage of the recommended articles is more perfect
2. The method for relieving the cold start problem of the recommendation system based on the content feature extraction considers the features of the description information of the service, can accurately find the features of the articles and strengthen the description of the articles, thereby improving the accuracy of recommendation
3. According to the content feature similarity calculation method based on the TF-IDF and the word distance algorithm, in consideration of actual conditions, the importance degree of each word in the feature text is different, so that the accuracy of the similarity is improved by using the word distance algorithm after the TF-IDF is optimized
4. The mixed recommendation algorithm based on the content features and the article similarity combines the similarity calculated by using the word distance with the traditional similarity calculation method for recommendation, and relieves the problem of cold start of the articles.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention and not to limit the invention. In the drawings:
fig. 1 is a flow chart of a service characteristic text content characteristic extraction method of the present invention.
FIG. 2 is a flow chart of content feature text similarity between computing services of the present invention.
Figure 3 is an overall architecture diagram of the present invention.
Detailed Description
The preferred embodiments of the present invention will be described in conjunction with the accompanying drawings, and it will be understood that they are described herein for the purpose of illustration and explanation and not limitation.
Example 1:
the invention provides a method for relieving a cold start problem of a recommendation system based on content feature extraction, which is shown in figure 1:
performing content characteristic analysis on the description information and the updating function description information in the article, and extracting the content characteristics of the article information through part-of-speech tagging and dependency syntax analysis in natural language processing;
firstly, segmenting words of an article characteristic text, and performing part-of-speech tagging on each word after segmentation;
secondly, performing dependency syntax analysis on the characteristic text of the article according to part-of-speech tagging performed on each word after word segmentation, and analyzing the relation between the lyrics in the text;
the relationships that exist in dependency parsing are as follows:
main and predicate relation SBV
Moving object relationship VOB
Word-object relationship IOB
Front object FOB
Bilingual DBL
Centering relationship ATT
Middle structure ADV
Dynamic compensation structure CMD
Parallel relation COD
Mediate relation POB
Left additional relationship LAD
Right additive relationship RAD
Independent structure IS
Core relationship HED
Finally, according to the relationship existing in the dependency syntax analysis, the text is subjected to feature extraction, and the extracted relationship vocabulary comprises the following components:
the major-minor relationship phrase: business services, take-away offerings, etc.;
move guest relation phrase: providing phrases such as channel, tracking location, etc.;
centering the relational phrase: air quality, urban landscape, etc.
Example 2:
in another aspect, the present invention further provides a hybrid recommendation algorithm based on content features and item similarities, including:
(1) let two sets in the recommendation system: a user set U and an item set I. Wherein U is { U ═1,u2,u3,...um},I={i1,i2,i3,...inAll articles can be scored by each user individual, most services are scored by the users, and user-service (U-S) scoring matrixes are obtained after the users correspond to the services one by one
Figure BDA0002686301020000051
(2) Let ru,aRepresenting the rating, r, of user u for service au,bRepresents the rating of the user u for the service b,
Figure BDA0002686301020000052
representing the average rating of the user for the rating service. The similarity between service a and service b is
Figure BDA0002686301020000053
(3) Setting two services a and b, calculating the service similarity sim based on the content feature text by a content feature extraction method based on a requirement 2 and a content feature similarity calculation method based on a TF-IDF and word distance algorithm based on a requirement 3char(a, b) calculating the mixed similarity of the services a and b as sim (a, b) lambda.sim by combining with the traditional similarity calculation methodchar(a,b)+(1-λ)·simrating(a, b), wherein lambda is a weight factor occupied by the two similarity values;
(4) let pred (u, p) be the predicted score of user u for service p, sim (i, p) be the mixed similarity between scored service i and predicted service p, and the predicted score is calculated as
Figure BDA0002686301020000061
Finally, the invention provides a mixed recommendation algorithm based on content features and item similarity, which comprises the following steps:
(1) let two sets in the recommendation system: a user set U and an item set I, where U ═ U1,u2,u3,...um},I={i1,i2,i3,...inAnd (4) scoring all items by each user individual, and scoring each item individual by all users. Setting most services to be scored by users, and obtaining a user-service (U-S) scoring matrix after the users and the services are in one-to-one correspondence
Figure BDA0002686301020000062
(2) Let ru,aRepresenting the rating, r, of user u for service au,bRepresents the rating of the user u for the service b,
Figure BDA0002686301020000063
representing the average rating of the user for the rating service. The similarity between service a and service b is
Figure BDA0002686301020000064
(3) Setting two services a and b, calculating the service similarity sim based on the content feature text by a content feature extraction method based on a requirement 2 and a content feature similarity calculation method based on a TF-IDF and word distance algorithm based on a requirement 3char(a, b). Combining with the traditional similarity calculation method, the mixed similarity of the services a and b is calculated to be sim (a, b) ═ lambda · simchar(a,b)+(1-λ)·simrating(a, b), wherein lambda is a weight factor occupied by the two similarity values;
(4) let pred (u, p) be the predicted score of user u for service p, sim (i, p) be the mixed similarity between scored service i and predicted service p, and the predicted score is calculated
Figure BDA0002686301020000065

Claims (5)

1. A method for relieving the cold start problem of a recommendation system based on content feature extraction is characterized by comprising the following steps: when extracting the content characteristics, extracting the description information characteristics of the project by adopting a method of dependency syntax analysis in natural language processing, and converting the extracted content characteristics into word vectors;
secondly, considering that the importance degree of each Word is different under the actual condition, a Weighted Word Distance algorithm (WWMD) optimized based on TF-IDF is used so as to improve the accuracy of calculating the Distance of the content feature vector words, thereby improving the accuracy of the similarity between the articles;
and finally, recommending by combining the similarity calculated by using the word distance and a traditional similarity calculation method.
2. The method for alleviating a recommendation system cold start problem based on content feature extraction as claimed in claim 1, wherein: the content feature extraction method based on natural language processing comprises the following steps: performing content characteristic analysis on the description information and the updating function description information in the article, and extracting the content characteristics of the article information through part-of-speech tagging and dependency syntax analysis in natural language processing;
firstly, segmenting words of an article characteristic text, and performing part-of-speech tagging on each word after segmentation; secondly, performing dependency syntax analysis on the characteristic text of the article according to part-of-speech tagging performed on each word after word segmentation, and analyzing the relation among the words in the text.
3. The method for alleviating the cold start problem of the recommendation system based on the content feature extraction as claimed in claim 1 or 2, wherein: the content feature similarity calculation method based on the TF-IDF optimized weighted word distance algorithm comprises the following steps of:
(1) is provided with a trained characteristic word vector matrix
Figure 58086DEST_PATH_IMAGE001
A total of n words, column i
Figure 957909DEST_PATH_IMAGE002
A d-dimensional word vector representing the ith word, the euclidean distance between word i and word j being,
Figure 524020DEST_PATH_IMAGE003
(2) a strip for describing services
Figure 747060DEST_PATH_IMAGE004
The content feature text of (2) is processed by a skip-gram model, and can be used as a sparse vector
Figure 373213DEST_PATH_IMAGE005
As the expression of the word bag, the words are arranged in the content characteristic text
Figure 760332DEST_PATH_IMAGE006
Appear to
Figure 864554DEST_PATH_IMAGE007
Next, in
Figure 958412DEST_PATH_IMAGE008
The sum of the number of times of all the words appearing in
Figure 755467DEST_PATH_IMAGE009
Then word
Figure 364303DEST_PATH_IMAGE006
Has a TF value of
Figure 724746DEST_PATH_IMAGE010
(3) Let the total number of content feature texts in the corpus be
Figure 735427DEST_PATH_IMAGE011
Including words
Figure 703383DEST_PATH_IMAGE006
Number of texts of
Figure 799515DEST_PATH_IMAGE012
Then word
Figure 183223DEST_PATH_IMAGE006
Has an IDF value of
Figure 313990DEST_PATH_IMAGE013
Service of
Figure 452847DEST_PATH_IMAGE004
In the content feature description text, word
Figure 220296DEST_PATH_IMAGE006
Has a TF-IDF value of
Figure 470012DEST_PATH_IMAGE014
(4) Is provided with two services a and b, order
Figure 455286DEST_PATH_IMAGE008
And
Figure 765044DEST_PATH_IMAGE015
bag-of-words representations representing the two pieces of content feature text to be computed respectively,
Figure 773452DEST_PATH_IMAGE008
each word in (1)
Figure 826858DEST_PATH_IMAGE006
Can be wholly or partially transferred to
Figure 666638DEST_PATH_IMAGE015
In (1), a sparse transfer matrix is defined
Figure 334249DEST_PATH_IMAGE016
Then, then
Figure 626690DEST_PATH_IMAGE017
Indicates how many slaves are
Figure 483788DEST_PATH_IMAGE008
Chinese word
Figure 178074DEST_PATH_IMAGE006
Is transferred to
Figure 767318DEST_PATH_IMAGE015
Chinese word
Figure 812635DEST_PATH_IMAGE018
Figure 207844DEST_PATH_IMAGE019
Thus it shifts the cost sum of
Figure 943588DEST_PATH_IMAGE020
(5) Considering the idea of word distance algorithm, when the transfer cost sum is larger, the similarity between two content feature texts participating in calculation is lower, namely the similarity between the two content feature texts is in inverse proportion to the minimum transfer cost sum between the texts, and after the problems are converted into a linear programming problem, the service is carried out
Figure 31629DEST_PATH_IMAGE004
And
Figure 298663DEST_PATH_IMAGE021
content feature text similarity between them is
Figure 231984DEST_PATH_IMAGE022
So that
Figure 838546DEST_PATH_IMAGE023
And is and
Figure 97489DEST_PATH_IMAGE024
4. a method for mitigating recommendation system cold start problems based on content feature extraction as claimed in claim 1 or 2 or 3, characterized by: the mixed recommendation algorithm based on the content features and the item similarity comprises the following steps:
(1) let two sets in the recommendation system: user collection
Figure 586239DEST_PATH_IMAGE025
And article collections
Figure 775780DEST_PATH_IMAGE026
(ii) a Wherein the content of the first and second substances,
Figure 299166DEST_PATH_IMAGE027
Figure 463431DEST_PATH_IMAGE028
wherein each user individual can score all articles, each article individual can be scored by all users, most services are set to be scored by the users, and the users and the services are in one-to-one correspondence to obtain user-services (
Figure 705056DEST_PATH_IMAGE029
) Scoring matrix
Figure 183442DEST_PATH_IMAGE030
(2) Is provided with
Figure 561334DEST_PATH_IMAGE031
Representing a user
Figure 162079DEST_PATH_IMAGE032
For service
Figure 815302DEST_PATH_IMAGE004
The score of (a) is determined,
Figure 159696DEST_PATH_IMAGE033
representing a user
Figure 657673DEST_PATH_IMAGE032
For service
Figure 163741DEST_PATH_IMAGE021
The score of (a) is determined,
Figure 317642DEST_PATH_IMAGE034
average rating for rating services on behalf of the user; then service
Figure 200147DEST_PATH_IMAGE004
And service
Figure 552631DEST_PATH_IMAGE021
Has a similarity of
Figure 416551DEST_PATH_IMAGE035
(3) Setting two services
Figure 854485DEST_PATH_IMAGE004
And
Figure 540682DEST_PATH_IMAGE021
the content feature extraction method based on claim 2 and the content feature similarity calculation method based on TF-IDF and word distance algorithm in claim 3 calculate the service similarity based on the content feature text as
Figure 747672DEST_PATH_IMAGE036
Computing services in combination with conventional similarity computing methods
Figure 798805DEST_PATH_IMAGE004
And
Figure 724035DEST_PATH_IMAGE021
has a mixed similarity of
Figure 948343DEST_PATH_IMAGE037
Figure 462370DEST_PATH_IMAGE038
The weight factors occupied by the two similarity values;
(4) is provided with
Figure 746721DEST_PATH_IMAGE039
For the user
Figure 893668DEST_PATH_IMAGE032
For service
Figure 921667DEST_PATH_IMAGE040
The prediction score of (a) is determined,
Figure 40933DEST_PATH_IMAGE041
serving scored services
Figure 496185DEST_PATH_IMAGE006
And predictive service
Figure 396008DEST_PATH_IMAGE040
The mixed similarity between the predicted scores is calculated as
Figure 149069DEST_PATH_IMAGE042
5. The method for alleviating the cold start problem of the recommendation system based on the content feature extraction as claimed in claim 1 or 2 or 3 or 4, wherein: the method comprises the following steps:
(1) establishing a user-service (U-S) scoring matrix according to the existing scoring data of the user in the recommending system;
(2) extracting content characteristics of services in a recommendation system and performing word vectorization processing;
(3) according to the obtained content feature word vectors, under a content feature similarity calculation method based on TF-IDF and word distance algorithm, similarity between articles is calculated
Figure 185158DEST_PATH_IMAGE043
(4) Calculating the service similarity based on the user score by using the traditional goods-based collaborative filtering recommendation algorithm according to the user-service (U-S) score matrix
Figure 811312DEST_PATH_IMAGE044
(5) For each service
Figure 932852DEST_PATH_IMAGE045
And calculating the user according to a mixed recommendation algorithm based on the content characteristics and the article similarity
Figure 240336DEST_PATH_IMAGE046
For service
Figure 396511DEST_PATH_IMAGE045
Predictive scoring of
Figure 193566DEST_PATH_IMAGE047
(6) All the services to be recommended are sorted in a descending way according to the predicted score values of the services, wherein the first N services are taken as target users
Figure 986423DEST_PATH_IMAGE046
And (4) completing service recommendation.
CN202010977547.2A 2020-09-17 2020-09-17 Method for relieving cold start problem of recommendation system based on content feature extraction Pending CN112100517A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010977547.2A CN112100517A (en) 2020-09-17 2020-09-17 Method for relieving cold start problem of recommendation system based on content feature extraction

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010977547.2A CN112100517A (en) 2020-09-17 2020-09-17 Method for relieving cold start problem of recommendation system based on content feature extraction

Publications (1)

Publication Number Publication Date
CN112100517A true CN112100517A (en) 2020-12-18

Family

ID=73758846

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010977547.2A Pending CN112100517A (en) 2020-09-17 2020-09-17 Method for relieving cold start problem of recommendation system based on content feature extraction

Country Status (1)

Country Link
CN (1) CN112100517A (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108287916A (en) * 2018-02-11 2018-07-17 北京方正阿帕比技术有限公司 A kind of resource recommendation method
CN108573411A (en) * 2018-04-17 2018-09-25 重庆理工大学 Depth sentiment analysis and multi-source based on user comment recommend the mixing of view fusion to recommend method
CN108776940A (en) * 2018-06-04 2018-11-09 南京邮电大学盐城大数据研究院有限公司 A kind of intelligent food and drink proposed algorithm excavated based on text comments
CN109063147A (en) * 2018-08-06 2018-12-21 北京航空航天大学 Online course forum content recommendation method and system based on text similarity
CN110851731A (en) * 2019-09-25 2020-02-28 浙江工业大学 Collaborative filtering recommendation method for user attribute coupling similarity and interest semantic similarity
KR102155768B1 (en) * 2019-10-02 2020-09-14 한경훈 Method for providing question and answer data set recommendation service using adpative learning from evoloving data stream for shopping mall

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108287916A (en) * 2018-02-11 2018-07-17 北京方正阿帕比技术有限公司 A kind of resource recommendation method
CN108573411A (en) * 2018-04-17 2018-09-25 重庆理工大学 Depth sentiment analysis and multi-source based on user comment recommend the mixing of view fusion to recommend method
CN108776940A (en) * 2018-06-04 2018-11-09 南京邮电大学盐城大数据研究院有限公司 A kind of intelligent food and drink proposed algorithm excavated based on text comments
CN109063147A (en) * 2018-08-06 2018-12-21 北京航空航天大学 Online course forum content recommendation method and system based on text similarity
CN110851731A (en) * 2019-09-25 2020-02-28 浙江工业大学 Collaborative filtering recommendation method for user attribute coupling similarity and interest semantic similarity
KR102155768B1 (en) * 2019-10-02 2020-09-14 한경훈 Method for providing question and answer data set recommendation service using adpative learning from evoloving data stream for shopping mall

Similar Documents

Publication Publication Date Title
CN109977413B (en) Emotion analysis method based on improved CNN-LDA
CN106599029B (en) Chinese short text clustering method
CN101231634B (en) Autoabstract method for multi-document
CN109948143B (en) Answer extraction method of community question-answering system
CN105183833B (en) Microblog text recommendation method and device based on user model
CN110162591B (en) Entity alignment method and system for digital education resources
CN111125349A (en) Graph model text abstract generation method based on word frequency and semantics
CN110008309B (en) Phrase mining method and device
CN111797898B (en) Online comment automatic reply method based on deep semantic matching
CN103678329B (en) Recommend method and device
CN109002473A (en) A kind of sentiment analysis method based on term vector and part of speech
CN110134799B (en) BM25 algorithm-based text corpus construction and optimization method
Liu et al. Using collaborative filtering algorithms combined with Doc2Vec for movie recommendation
CN110705247A (en) Based on x2-C text similarity calculation method
Zhang et al. Research on keyword extraction of Word2vec model in Chinese corpus
CN112949713A (en) Text emotion classification method based on ensemble learning of complex network
CN110110220A (en) Merge the recommended models of social networks and user's evaluation
CN110083676B (en) Short text-based field dynamic tracking method
CN105354184A (en) Method for using optimized vector space model to automatically classify document
CN112434533B (en) Entity disambiguation method, entity disambiguation device, electronic device, and computer-readable storage medium
CN112131341A (en) Text similarity calculation method and device, electronic equipment and storage medium
CN108427769B (en) Character interest tag extraction method based on social network
CN112100517A (en) Method for relieving cold start problem of recommendation system based on content feature extraction
CN116756303A (en) Automatic generation method and system for multi-topic text abstract
CN115203589A (en) Vector searching method and system based on Trans-dssm model

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination