CN113076478A - Technical resource and service recommendation system based on hybrid recommendation algorithm - Google Patents

Technical resource and service recommendation system based on hybrid recommendation algorithm Download PDF

Info

Publication number
CN113076478A
CN113076478A CN202110399613.7A CN202110399613A CN113076478A CN 113076478 A CN113076478 A CN 113076478A CN 202110399613 A CN202110399613 A CN 202110399613A CN 113076478 A CN113076478 A CN 113076478A
Authority
CN
China
Prior art keywords
user
enterprise
module
technical
service
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110399613.7A
Other languages
Chinese (zh)
Other versions
CN113076478B (en
Inventor
洪佳升
贾宁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tongji University
Original Assignee
Tongji University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tongji University filed Critical Tongji University
Priority to CN202110399613.7A priority Critical patent/CN113076478B/en
Publication of CN113076478A publication Critical patent/CN113076478A/en
Application granted granted Critical
Publication of CN113076478B publication Critical patent/CN113076478B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a technical resource and service recommendation system based on a hybrid recommendation algorithm based on a network collaborative manufacturing technical resource service platform, which comprises a data preprocessing module, a user semantic description module, an individualized technical resource recommendation module, an enterprise similarity calculation module, an individualized service recommendation module and a technical resource and service potential combination mining module. The data preprocessing module is used for preprocessing data such as user browsing and purchasing records, enterprise registration information and the like; the user semantic description module models the domain information which may be interested by the user; the personalized technical resource recommending module recommends technical resources to a user; the enterprise similarity calculation module comprises two similarity calculation methods. The invention fully utilizes platform data, and recommends technical resources and enterprise services to users in all directions by mixing various data analysis algorithms, thereby achieving the purpose of improving the network collaborative manufacturing degree.

Description

Technical resource and service recommendation system based on hybrid recommendation algorithm
Technical Field
The invention relates to the field of online services, in particular to a technical resource and service recommendation system based on a hybrid recommendation algorithm.
Background
With the development of communication technology, information technology, cloud computing and other technologies, the enterprise manufacturing model is changed greatly. A network collaborative manufacturing model with characteristics of agile manufacturing, information sharing, best utilization of resources, and the like, has gained more and more attention and applications.
The foundation for realizing the network collaborative manufacturing mode is a network collaborative manufacturing technology resource service platform. On the platform, a large number of technical resources uploaded by users or enterprises exist, including literature resources, article resources, document resources, software resources, manufacturing resources, talent resources and the like. There are also services released by enterprises for users to purchase. The user can look up technical resources and purchase related enterprise services according to the actual project requirements, and therefore project implementation progress is accelerated.
Due to the fact that the technical resources and services on the platform are large in types and quantity, users cannot obtain interested technical resources and services at the first time. In order to increase the user experience, a platform needs to develop a set of recommendation system to recommend technical resources and services to the user.
Disclosure of Invention
The invention provides a technical resource and service recommendation system based on a hybrid recommendation algorithm, which is used for recommending technical resources and services released on a platform.
Drawings
Fig. 1 is a system structure block diagram of a technical resource and service recommendation system based on a hybrid recommendation algorithm according to the present invention.
FIG. 2 is a flow diagram of a technology resource and service potential portfolio mining module.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1, a technology resource and service recommendation system based on a hybrid recommendation algorithm includes a data preprocessing module, a user semantic description module, a personalized technology resource recommendation module, an enterprise similarity calculation module, a personalized service recommendation module, and a technology resource and service potential combination mining module.
Each module is specifically described below in detail.
The data preprocessing module takes technical resource records browsed by a user, enterprise service records purchased by the user and enterprise registration information as input. Wherein, technical resource records are browsed by a user, enterprise service records purchased by the user are extracted from a server log file through a regular expression; the enterprise registration information is obtained from a database. And then clustering by using a density-based clustering algorithm DBSCAN according to the time generated by the user behavior record to obtain a user behavior cluster.
Wherein:
the user browsing technical resource records and the enterprise information vectors are respectively used for the user semantic description module and the enterprise similarity calculation module, and the obtained user browsing technical resource record cluster and the user purchasing enterprise service record cluster are provided for the technical resource and service potential combination mining module.
The data preprocessing module firstly reads in technical resource records browsed by a user, enterprise service records purchased by the user and enterprise registration information, wherein the technical resource records browsed by the user comprise information such as user ID, technical resource ID, recording time and technical resource browsing amount; the user purchase enterprise service record includes information such as a user ID, a service ID, a recording time, and the like.
According to the project requirements of users in a period of time or the assumption that interested contents are concentrated in a certain limited field, the records are clustered based on the density according to the time to obtain a cluster.
The enterprise registration information includes the enterprise-oriented domain, the registered fund of the enterprise, the number of employees of the enterprise, the establishment time of the enterprise, the business turnover, the profit margin of the enterprise, and the like.
One-hot coding is carried out on characteristics such as enterprise-oriented fields, and characteristics such as registered fund of enterprises, number of employees of enterprises, establishment time of enterprises, business turnover and profit rate of enterprises are standardized to obtain enterprise information vectors.
When the user semantic description module calculates the weight of a certain technical resource, a certain punishment is carried out according to the browsing amount of the technical resource and is used for providing the user semantic description module for the personalized technical resource recommendation module. Because many users browse popular technical resources only because of the popular technical resources, the individuality of the users cannot be reflected; conversely, if a user browses cold technical resources, a general probability may determine the area of interest to the user.
The user semantic description module models the user in the form of tags. Firstly, the top N keywords with the highest scores are extracted by a TextRank algorithm when each technical resource is uploaded and are used as tags of the technical resource. After the user browses the technical resource, the same label as the technical resource label in the user label is added with a certain weight (if the label is not in the user label, the label is added).
The method comprises the following specific steps:
and extracting the technical resource key words by adopting a TextRank algorithm because the technical resources are stored and presented in a text form. Specifically, the top N keywords with the highest score are obtained as the tags of the technical resource, and the score of each keyword is normalized as the weight of the corresponding tag.
Assume that user u has a label set of LuThe label set of the technical resource i is LiThe browsing volume of the technical resource i is SiThe weight of the label a in the technical resource i is wiaThen the weight of tag a in user u is increased by a value Δ wuaIs composed of
Figure BDA0003019923010000031
If the user does not browse the technical resources related to a certain label within a period of time, the weight of the label is attenuated, so as to reflect the recent interest of the user. Assuming that a user has t time units and does not browse technical resources of a certain label, the weight before attenuation of the label is w, the weight attenuation factor is alpha, and the weight after attenuation is w
Figure BDA0003019923010000032
Because some users like a large amount of browsing technical resources, and the number of user labels is large, the first K labels with the maximum user label weight form a semantic description vector of the user.
The personalized technical resource recommendation module takes the user semantic description vector as input, calculates M technical resources most similar to the user semantic description vector through cosine similarity, and adds the M technical resources into a technical resource recommendation list of the user.
The input of the personalized technical resource recommendation module is a semantic description vector V of a user uuLet the weight of tag a in the vector be wuaThe weight of the label a in the technical resource i is wiaThen the preference degree p (u, i) of the user u to the technical resource i is
Figure BDA0003019923010000033
And after the preference degrees of the user to all the technical resources are obtained through calculation, recommending the top M technical resources with the highest preference degrees to the user.
The enterprise similarity calculation module comprises two similarity calculation methods which are respectively used for calculating the enterprise similarity by using enterprise registration information; and calculating the enterprise similarity by using the contact ratio of the enterprise user group. When the first method is used for recommending the system to be started in a cold state, the newly added enterprises cannot calculate the similarity of the enterprises by using the second method because the newly added enterprises do not have client groups.
The first method inputs enterprise information vectors obtained by a data preprocessing module, and the similarity of two enterprises is obtained through cosine similarity calculation. Assume that the information vector of enterprise e1 is Ve1The information vector of enterprise e2 is Ve2Then degree of similarity
Figure BDA0003019923010000041
And calculating the user group contact ratio between the two enterprises as the enterprise similarity, wherein the user group contact ratio can be calculated by using the Jaccard similarity. Specifically, the user group set input into the enterprise, that is, the users who have used the service provided by the enterprise, obtains the similarity of the two enterprises through the Jaccard similarity calculation. Suppose the user population of enterprise e1 is Ue1The user group of the enterprise e2 is Ue2Then degree of similarity
Figure BDA0003019923010000042
The personalized service recommendation module realizes the functions based on the similar enterprise calculation module. And according to the enterprise to which the service historically purchased by the user belongs, obtaining N enterprises closest to the enterprise through an enterprise similarity calculation module, and recommending hot services of the N enterprises to the user.
As shown in fig. 2:
the technical resource and service potential combination mining module firstly searches a cluster of user browsing technical resources near the time point according to the time of purchasing enterprise service by the user, and stores the service ID and the technical resource ID in the cluster into a set.
After a large number of sets consisting of the service IDs and the technical resource IDs are obtained, a frequent item set is obtained through calculation of an association rule algorithm FPgrowth (existing algorithm), the sets indicate technical resources which can be browsed by a plurality of users before and after a certain service is purchased, and this indicates that the service and the technical resources have strong correlation, namely, the service and the technical resources can be combined.
On the basis of obtaining a plurality of frequent itemses, if a user browses technical resources in the frequent itemsets, recommending corresponding enterprise services to the user; and if the user purchases the enterprise service, recommending the corresponding technical resource to the user.

Claims (6)

1. A technical resource and service recommendation system based on a hybrid recommendation algorithm is characterized by comprising a data preprocessing module, a user semantic description module, a personalized technical resource recommendation module, an enterprise similarity calculation module, a personalized service recommendation module and a technical resource and service potential combination mining module;
the data preprocessing module takes technical resource records browsed by a user, enterprise service records purchased by the user and enterprise registration information as input, wherein:
preprocessing the technical resource records browsed by the user to obtain vectorized recorded data and clustering clusters, wherein the vectorized recorded data is used for a user semantic description module, and the vectorized recorded data is used for a technical resource and service potential combination mining module;
preprocessing a service record of a user purchasing enterprise to obtain a clustering cluster used for a technical resource and service potential combination mining module;
the enterprise registration information is preprocessed and used for an enterprise similarity calculation module;
the user semantic description module takes the technical resource record browsed by the user as input, models the field information which is possibly interested by the user through the technical resource browsed by the user, and outputs a user semantic description vector to be used for the personalized technical resource recommendation module;
the personalized technical resource recommendation module takes the user semantic description vector as input and outputs a personalized technical resource recommendation list of each user;
the enterprise similarity calculation module comprises two similarity calculation methods; the first is enterprise similarity calculation based on content, and the input of the enterprise similarity calculation is an enterprise information vector obtained by a data preprocessing module and used when a recommendation system is in cold start; the second is data-driven enterprise similarity calculation, which is mainly used after the enterprise has a certain customer group; the similarity between enterprises can be obtained by the two algorithms and is used for the personalized service recommendation module;
the personalized service recommendation module takes the enterprise similarity as input and outputs an enterprise service recommendation list personalized by each user;
the input of the technical resource and service potential combination mining module is a user browsing technical resource record data cluster and a user purchasing enterprise service record obtained by the data preprocessing module; and mining a certain possible combination relationship between the technical resources and the enterprise services through an association rule algorithm, recommending the technical resources and the enterprise services to the users on the basis, and outputting a personalized technical resource recommendation list and an enterprise service recommendation list of each user.
2. The system according to claim 1, wherein the recording time of the data such as the technical resource record browsed by the user and the enterprise service record purchased by the user is clustered based on density, and the enterprise registration information is digitized and encoded.
3. The technical resource and service recommendation system based on the hybrid recommendation algorithm as claimed in claim 1, wherein the tag vector is automatically extracted by a TextRank algorithm.
4. The system of claim 3, wherein the tag vector is used as semantic description of the technical resource and the user.
5. The system for recommending technical resources and services based on a hybrid recommendation algorithm according to claim 1, wherein in the cold start phase, the enterprise similarity is calculated using enterprise registration information; and after the customer groups are accumulated, calculating the enterprise similarity by using the contact ratio of the enterprise user groups.
6. The system of claim 1, wherein an association rules algorithm (FPgrowth) is used to mine potential combinations of technical resources and services, and recommend technical resources and services to users based on the combinations.
CN202110399613.7A 2021-04-14 2021-04-14 Technical resource and service recommendation system based on hybrid recommendation algorithm Active CN113076478B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110399613.7A CN113076478B (en) 2021-04-14 2021-04-14 Technical resource and service recommendation system based on hybrid recommendation algorithm

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110399613.7A CN113076478B (en) 2021-04-14 2021-04-14 Technical resource and service recommendation system based on hybrid recommendation algorithm

Publications (2)

Publication Number Publication Date
CN113076478A true CN113076478A (en) 2021-07-06
CN113076478B CN113076478B (en) 2022-06-07

Family

ID=76618372

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110399613.7A Active CN113076478B (en) 2021-04-14 2021-04-14 Technical resource and service recommendation system based on hybrid recommendation algorithm

Country Status (1)

Country Link
CN (1) CN113076478B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017128433A1 (en) * 2016-01-31 2017-08-03 胡明祥 Information pushing method during recommendation update, and pushing system
CN110188208A (en) * 2019-06-04 2019-08-30 河海大学 A kind of the information resources inquiry recommended method and system of knowledge based map
CN112015987A (en) * 2020-08-28 2020-12-01 青岛格兰德信用管理咨询有限公司 Potential customer recommendation system and method based on enterprise tags
CN112100512A (en) * 2020-04-10 2020-12-18 南京邮电大学 Collaborative filtering recommendation method based on user clustering and project association analysis

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017128433A1 (en) * 2016-01-31 2017-08-03 胡明祥 Information pushing method during recommendation update, and pushing system
CN110188208A (en) * 2019-06-04 2019-08-30 河海大学 A kind of the information resources inquiry recommended method and system of knowledge based map
CN112100512A (en) * 2020-04-10 2020-12-18 南京邮电大学 Collaborative filtering recommendation method based on user clustering and project association analysis
CN112015987A (en) * 2020-08-28 2020-12-01 青岛格兰德信用管理咨询有限公司 Potential customer recommendation system and method based on enterprise tags

Also Published As

Publication number Publication date
CN113076478B (en) 2022-06-07

Similar Documents

Publication Publication Date Title
US11734725B2 (en) Information sending method, apparatus and system, and computer-readable storage medium
CN112836130B (en) Context-aware recommendation system and method based on federated learning
US20220382814A1 (en) Systems and Methods for Generating Real-Time Recommendations
US10824806B2 (en) Counterintuitive recommendations based upon temporary conditions
Dhingra et al. Spam analysis of big reviews dataset using Fuzzy Ranking Evaluation Algorithm and Hadoop
Chang et al. Application of neural networks and Kano’s method to content recommendation in web personalization
Buckley et al. Social media and customer behavior analytics for personalized customer engagements
US20190019217A1 (en) Group formation and recommendations based on trigger events
CN109829593B (en) Credit determining method and device for target object, storage medium and electronic device
Nguyen et al. Ontology-based recommender system for sport events
An et al. Discover customers’ gender from online shopping behavior
CN117609612A (en) Resource recommendation method and device, storage medium and electronic equipment
CN113076478B (en) Technical resource and service recommendation system based on hybrid recommendation algorithm
Sun Music Individualization Recommendation System Based on Big Data Analysis
Renjith et al. SemRec–An efficient ensemble recommender with sentiment based clustering for social media text corpus
CN109710853A (en) A kind of artificial intelligence classified matching method and system
CN114996579A (en) Information pushing method and device, electronic equipment and computer readable medium
Liu Integration Path Analysis of Traditional Media and New Media Based on Internet of Things Data Mining
CN115114519A (en) Artificial intelligence based recommendation method and device, electronic equipment and storage medium
Hung Using Cloud Services to Develop Marketing Information System Applications
CN116150470A (en) Content recommendation method, device, apparatus, storage medium and program product
Wang English news text recommendation method based on hypergraph random walk label expansion
US11100535B2 (en) Group recommendations based on external factors
Vaganov et al. A comparative study of social data similarity measures related to financial behavior
Guo et al. Short video audience identification data recommended by multiple neural network algorithms

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant