CN103886073A - Coal mine information recommendation system based on collaborative filtering - Google Patents

Coal mine information recommendation system based on collaborative filtering Download PDF

Info

Publication number
CN103886073A
CN103886073A CN201410110180.9A CN201410110180A CN103886073A CN 103886073 A CN103886073 A CN 103886073A CN 201410110180 A CN201410110180 A CN 201410110180A CN 103886073 A CN103886073 A CN 103886073A
Authority
CN
China
Prior art keywords
user
webpage
collaborative filtering
information
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410110180.9A
Other languages
Chinese (zh)
Other versions
CN103886073B (en
Inventor
刘永利
赵珊
刘静
孙江峰
杨合超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Henan University of Technology
Original Assignee
Henan University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Henan University of Technology filed Critical Henan University of Technology
Priority to CN201410110180.9A priority Critical patent/CN103886073B/en
Publication of CN103886073A publication Critical patent/CN103886073A/en
Application granted granted Critical
Publication of CN103886073B publication Critical patent/CN103886073B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/335Filtering based on additional data, e.g. user or group profiles
    • G06F16/337Profile generation, learning or modification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Abstract

The invention provides a coal mine information recommendation system based on collaborative filtering. The system comprises a coal mine information collaborative filtering module, a community user group access log module and a recommendation module, wherein the coal mine information collaborative filtering module is used for analyzing access logs of users and finding out user neighbors by means of a collaborative filtering algorithm; the community user group access log module is used for managing the logs; the recommendation module is used for calculating recommended information and recommending the information to the users. By the adoption of the system, items or information, according with requirements of the users, can be recommended to the users according to hobbies and interests of the users, and personalized service is achieved. The coal mine information recommendation system provides targeted recommendations according with specific users on the basis of the hobbies and interests of the users. Because of the recommendations, user experience is enhanced, user usage convenience is improved, and work efficiency of the users is improved.

Description

Coal information commending system based on collaborative filtering
Technical field
The invention belongs to Coal Information System.
Background technology
Due to the develop rapidly of Internet, the information capacity of every field is all very huge, comprises field, colliery.Under this background, the search engine system in field, colliery has obtained studying widely and applying, and provides convenience for people find useful information.But along with improving constantly of people's Search Requirement, the deep processing of result for retrieval becomes an emphasis of this area research gradually.
The search modes of traditional " One-Size-Fit-All " can not allow user satisfied, the substitute is the personalization of result for retrieval, and replaces " people looks for information " with the pattern of " information is looked for people ".So it is very necessary that coal information commending system just seems.
Commending system general reference guides user as the system of selecting using personalized recommendation as the system of output or in extensive optional object according to personal like.The coal information commending system that the present invention proposes is according to the log content of having browsed before user, and new, that do not browse, user may be pushed to user by interested coal information automatically.
Collaborative filtering is one of the method in Technologies of Recommendation System in E-Commerce that is early used in, its core concept is that user tends to buy and has commodity or the content that the customer group of similar interests hobby is bought, and basic starting point is: user is can be by categorize interests; The interest-degree that user has comprised user to the evaluation of different information; User to the evaluation of a unknown message by similar with its similar interests user's evaluation.
In the environment of community, between user, often have identical interest, demand, expectation and motivation and go to select similar information, this situation is that the recommendation based on collaborative filtering provides the foundation.
Summary of the invention
In order to realize the present invention, a kind of coal information commending system based on collaborative filtering, Community-oriented has been proposed, this system comprises coal information collaborative filtering module, community users colony access log module and recommending module; Coal information collaborative filtering module is responsible for the access log of analysis user, utilizes the algorithm of collaborative filtering to find out user neighbour; Community users colony access log module is responsible for the management of daily record; Recommending module is responsible for the information of calculated recommendation and is recommended user; Wherein, in coal information collaborative filtering module, adopt following methods to filter:
(1) system initialization: based on the access log of user in the community of colliery, build coal mine user-colliery webpage rating matrix A, matrix A comprises s user u 1, u 2..., u sto t webpage p 1, p 2..., p tscoring, A is the matrix on s × t rank, s and t are natural number, the element a in matrix ijrepresent the scoring of user i to webpage j, i and j are natural number, and 1≤i≤s, 1≤j≤t,
Figure DEST_PATH_GDA0000497978530000021
(2) rating matrix optimization: a t webpage obtains k bunch after K-Means clustering algorithm, is designated as c 1, c 2..., c k, k is number of clusters order, k is natural number; Bunch c mthe webpage number comprising is N m, this N mthe sequence number of individual webpage in original t webpage is respectively f (m, 1), f (m, 2) ..., f (m, N m), f function is for calculating the original number of bunch webpage after cluster, i.e. sequence number in original t webpage, f (m, N m) expression bunch c min N mthe original number of individual webpage, m is natural number, 1≤m≤k, 1≤N m≤ t, the rating matrix after being optimized
Figure DEST_PATH_GDA0000497978530000022
wherein b imthe element of representing matrix B,
b im = 1 N m Σ n = 1 N m a if ( m , n ) ;
(3) neighbor searching: user u va bunch sequence number set of marking is M v, user u wa bunch sequence number set of marking is M w, user u vand u wa bunch sequence number set of all marking is M, and v and w are natural number, 1≤v≤s, 1≤w≤s, user u vand u wbetween similarity be sim ( u v , u w ) = Σ z ∈ M ( b vz - b ‾ v ) ( b ‾ wz - b ‾ w ) Σ z ∈ M v ( b vz - b ‾ v ) Σ z ∈ M w ( b wz - b ‾ w ) , Wherein with
Figure DEST_PATH_GDA0000497978530000026
represent respectively user u vand u wto the average score of webpage, try to achieve each user u ineighbour user in user's space, that sort from high to low by similarity gathers U i;
(4) recommend: according to U idraw set U iin all users, by scoring from high to low, the collections of web pages P that browses i, user u ithe collections of web pages of having browsed is P' i, set (P i-P' i) in before L coal information webpage be user u iinterested content.
Adopt after above system, can, according to user's hobby, recommend to meet project or the information of its requirement, can realize personalized service.After introducing commending system, can, according to the hobby of coal mine user, user may interested coal information formation be recommended to user with the form of list.Can allow like this user in less range of information, search own interested content, whole information browse process is more quick, becomes " information is looked for people " from the mode of original " people looks for information ".The interest preference of coal information commending system based on each user, provides the recommendation targetedly that meets specific user, and this recommendation has strengthened the convenience that the user of user's experience, raising uses, and has also improved user's work efficiency.
Brief description of the drawings
Fig. 1 is system architecture schematic diagram of the present invention.
Embodiment
Coal information commending system coal information collaborative filtering module based on collaborative filtering, Community-oriented, community users colony access log module, recommending module.Wherein coal information collaborative filtering module is responsible for the access log of analysis user, utilizes the algorithm of collaborative filtering to find out user neighbour; Community users colony access log module is responsible for the management of daily record; Recommending module is responsible for the information of calculated recommendation and is recommended user.Coal information collaborative filtering module is the emphasis of native system, and its key step is:
(1) system initialization: this step is the access log based on user in the community of colliery mainly, builds coal mine user-colliery webpage rating matrix, is designated as A, supposes that this matrix comprises s user u 1, u 2..., u sto t webpage p 1, p 2..., p tscoring, A is the matrix on s × t rank, s and t are natural number, the element a in matrix ijrepresent the scoring of user i to webpage j, i and j are natural number, and 1≤i≤s, 1≤j≤t.Matrix A can be expressed as:
Figure DEST_PATH_GDA0000497978530000031
(2) rating matrix optimization: under normal circumstances, user's score data is fewer, and along with the continuous growth of webpage quantity t, cause rating matrix A extremely sparse, the precision that final impact is recommended.In view of this, first the present invention does further optimization to rating matrix A, and the method for optimization is that webpage is carried out to cluster, and cluster process adopts K-Means algorithm.
Suppose that t webpage can obtain k bunch after using K-Means clustering algorithm, is designated as c 1, c 2..., c k, k is number of clusters order, k is natural number.Suppose a bunch c mthe webpage number comprising is N m, this N mthe sequence number of individual webpage in original t webpage is respectively f (m, 1), f (m, 2) ..., f (m, N m), f function is for calculating the original number of bunch webpage after cluster, i.e. sequence number in original t webpage, f (m, N m) expression bunch c min N mthe original number of individual webpage, m is natural number, 1≤m≤k, 1≤N m≤ t.
Through above-mentioned optimizing process, the rating matrix B after can being optimized.
Wherein b imthe element of representing matrix B,
b im = 1 N m Σ n = 1 N m a if ( m , n ) ;
(3) neighbor searching: based on the rating matrix B after optimizing, can find out the user of same or similar interest, between user, the calculating of similarity adopts improved cosine similarity based method to carry out.If user is u va bunch sequence number set of marking is M v, user u wa bunch sequence number set of marking is M w, user u vand u wa bunch sequence number set of all marking is M, and v and w are natural number, 1≤v≤s, 1≤w≤s, user u vand u wbetween similarity be:
sim ( u v , u w ) = Σ z ∈ M ( b vz - b ‾ v ) ( b ‾ wz - b ‾ w ) Σ z ∈ M v ( b vz - b ‾ v ) Σ z ∈ M w ( b wz - b ‾ w ) ,
Wherein
Figure DEST_PATH_GDA0000497978530000042
with
Figure DEST_PATH_GDA0000497978530000043
represent respectively user u vand u wto the average score of webpage.
According to above-mentioned similarity calculating method, can be in the hope of each user u ineighbour user in user's space, that sort from high to low by similarity gathers U i.
(4) recommend: according to the user u obtaining ineighbour user gather U i, can draw set U iin all users, by scoring from high to low, the collections of web pages P that browses i, suppose user u ithe collections of web pages of having browsed is P' i, set (P i-P' i) in before L coal information webpage may be user u iinterested content, can be used as content recommendation and recommends user u i.
This system can, according to user's hobby, recommend to meet project or the information of its requirement, is a kind of individuation service system.After introducing commending system, can, according to the hobby of coal mine user, user may interested coal information formation be recommended to user with the form of list.So just can allow user in less range of information, search own interested content, whole information browse process is more quick, becomes " information is looked for people " from the mode of original " people looks for information ".The interest preference of coal information commending system based on each user, provides the recommendation targetedly that meets specific user, and this recommendation has strengthened the convenience that the user of user's experience, raising uses, and has also improved user's work efficiency.

Claims (1)

1. the coal information commending system based on collaborative filtering, this system comprises coal information collaborative filtering module, community users colony access log module and recommending module; Coal information collaborative filtering module is responsible for the access log of analysis user, utilizes the algorithm of collaborative filtering to find out user neighbour; Community users colony access log module is responsible for the management of daily record; Recommending module is responsible for the information of calculated recommendation and is recommended user; Wherein, in coal information collaborative filtering module, adopt following methods to filter:
(1) system initialization: based on the access log of user in the community of colliery, build coal mine user-colliery webpage rating matrix A, matrix A comprises s user u 1, u 2..., u sto t webpage p 1, p 2..., p tscoring, A is the matrix on s × t rank, s and t are natural number, the element a in matrix ijrepresent the scoring of user i to webpage j, i and j are natural number, and 1≤i≤s, 1≤j≤t,
(2) rating matrix optimization: a t webpage obtains k bunch after K-Means clustering algorithm, is designated as c 1, c 2..., c k, k is number of clusters order, k is natural number; Bunch c mthe webpage number comprising is N m, this N mthe sequence number of individual webpage in original t webpage is respectively f (m, 1), f (m, 2) ..., f (m, N m), f function is for calculating the original number of bunch webpage after cluster, i.e. sequence number in original t webpage, f (m, N m) expression bunch c min N mthe original number of individual webpage, m is natural number, 1≤m≤k, 1≤N m≤ t, the rating matrix after being optimized
Figure DEST_PATH_FDA0000497978520000012
wherein b imthe element of representing matrix B,
Figure DEST_PATH_FDA0000497978520000013
(3) neighbor searching: user u va bunch sequence number set of marking is M v, user u wa bunch sequence number set of marking is M w, user u vand u wa bunch sequence number set of all marking is M, and v and w are natural number, 1≤v≤s, 1≤w≤s, user u vand u wbetween similarity be
Figure DEST_PATH_FDA0000497978520000014
wherein
Figure DEST_PATH_FDA0000497978520000015
with
Figure DEST_PATH_FDA0000497978520000016
represent respectively user u vand u wto the average score of webpage, try to achieve each user u ineighbour user in user's space, that sort from high to low by similarity gathers U i;
(4) recommend: according to U idraw set U iin all users, by scoring from high to low, the collections of web pages P that browses i, user u ithe collections of web pages of having browsed is P' i, set (P i-P' i) in before L coal information webpage be user u iinterested content.
CN201410110180.9A 2014-03-24 2014-03-24 Coal information commending system based on collaborative filtering Expired - Fee Related CN103886073B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410110180.9A CN103886073B (en) 2014-03-24 2014-03-24 Coal information commending system based on collaborative filtering

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410110180.9A CN103886073B (en) 2014-03-24 2014-03-24 Coal information commending system based on collaborative filtering

Publications (2)

Publication Number Publication Date
CN103886073A true CN103886073A (en) 2014-06-25
CN103886073B CN103886073B (en) 2017-03-29

Family

ID=50954965

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410110180.9A Expired - Fee Related CN103886073B (en) 2014-03-24 2014-03-24 Coal information commending system based on collaborative filtering

Country Status (1)

Country Link
CN (1) CN103886073B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104615683A (en) * 2015-01-21 2015-05-13 上海交通大学 Time and location sensing collaborative filtering technology with high expandability
CN106899668A (en) * 2017-02-23 2017-06-27 同济大学 Information Push Service processing method in car networking

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101685458A (en) * 2008-09-27 2010-03-31 华为技术有限公司 Recommendation method and system based on collaborative filtering
CN103412948A (en) * 2013-08-27 2013-11-27 北京交通大学 Cluster-based collaborative filtering commodity recommendation method and system

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101685458A (en) * 2008-09-27 2010-03-31 华为技术有限公司 Recommendation method and system based on collaborative filtering
CN103412948A (en) * 2013-08-27 2013-11-27 北京交通大学 Cluster-based collaborative filtering commodity recommendation method and system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
史玉珍等: ""基于协同过滤技术的个性化推荐系统研究,2012年6月", 《电子设计工程》 *
翁小兰等: "基于项目特征聚类的协同过滤推荐算法", 《计算机应用与软件》 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104615683A (en) * 2015-01-21 2015-05-13 上海交通大学 Time and location sensing collaborative filtering technology with high expandability
CN106899668A (en) * 2017-02-23 2017-06-27 同济大学 Information Push Service processing method in car networking
CN106899668B (en) * 2017-02-23 2019-12-03 同济大学 Information Push Service processing method in car networking

Also Published As

Publication number Publication date
CN103886073B (en) 2017-03-29

Similar Documents

Publication Publication Date Title
CN103106285B (en) Recommendation algorithm based on information security professional social network platform
CN101641697B (en) Related search queries for a webpage and their applications
CN102591995B (en) Processing method and device based on user information of cloud data center
CN106339502A (en) Modeling recommendation method based on user behavior data fragmentation cluster
CN103678672A (en) Method for recommending information
CN105045931A (en) Video recommendation method and system based on Web mining
CN103886048B (en) Cluster-based increment digital book recommendation method
CN103778260A (en) Individualized microblog information recommending system and method
Yadav et al. Mining the customer behavior using web usage mining in e-commerce
CN104166668A (en) News recommendation system and method based on FOLFM model
CN103886090A (en) Content recommendation method and device based on user favorites
CN105095433A (en) Recommendation method and device for entities
CN103823844A (en) Question forwarding system and question forwarding method on the basis of subjective and objective context and in community question-and-answer service
CN102929928A (en) Multidimensional-similarity-based personalized news recommendation method
CN103425763B (en) User based on SNS recommends method and device
CN103823893A (en) User comment-based product search method and system
CN103514239A (en) Recommendation method and system integrating user behaviors and object content
CN104484431A (en) Multi-source individualized news webpage recommending method based on field body
CN104298785A (en) Searching method for public searching resources
Zhang Language in our time: An empirical analysis of hashtags
CN103455487A (en) Extracting method and device for search term
CN102542489A (en) Recommendation method based on user interest association
CN102609465A (en) Information recommendation method based on potential communities
CN104751354A (en) Advertisement cluster screening method
CN105678590A (en) topN recommendation method for social network based on cloud model

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20170329

Termination date: 20210324

CF01 Termination of patent right due to non-payment of annual fee