CN103886073B - Coal information commending system based on collaborative filtering - Google Patents

Coal information commending system based on collaborative filtering Download PDF

Info

Publication number
CN103886073B
CN103886073B CN201410110180.9A CN201410110180A CN103886073B CN 103886073 B CN103886073 B CN 103886073B CN 201410110180 A CN201410110180 A CN 201410110180A CN 103886073 B CN103886073 B CN 103886073B
Authority
CN
China
Prior art keywords
user
webpage
collaborative filtering
module
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201410110180.9A
Other languages
Chinese (zh)
Other versions
CN103886073A (en
Inventor
刘永利
赵珊
刘静
孙江峰
杨合超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Henan University of Technology
Original Assignee
Henan University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Henan University of Technology filed Critical Henan University of Technology
Priority to CN201410110180.9A priority Critical patent/CN103886073B/en
Publication of CN103886073A publication Critical patent/CN103886073A/en
Application granted granted Critical
Publication of CN103886073B publication Critical patent/CN103886073B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/335Filtering based on additional data, e.g. user or group profiles
    • G06F16/337Profile generation, learning or modification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Information Transfer Between Computers (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present invention proposes a kind of coal information commending system based on collaborative filtering, and the system includes coal information collaborative filtering module, community users colony access log module and recommending module;Coal information collaborative filtering module is responsible for the access log for analyzing user, finds out user neighbour using the algorithm of collaborative filtering;Community users colony access log module is responsible for the management of daily record;Recommending module is responsible for calculating the information of recommendation and recommending user.After using the system, can recommend to meet the project or information of its requirement, realize personalized service according to the hobby of user.Interest preference of the coal information commending system based on each user, there is provided meet the targetedly recommendation of specific user, the convenience that this recommendation enhances the experience of user, the user for improving uses also improves the work efficiency of user.

Description

Coal information commending system based on collaborative filtering
Technical field
The invention belongs to Coal Information System.
Background technology
Due to developing rapidly for Internet, the information capacity of every field is very huge, including colliery field.At this Under background, the search engine system in colliery field has obtained extensive research and application, finds useful information for people and provides It is convenient.However, with the continuous improvement of people's Search Requirement, the deep processing of retrieval result is increasingly becoming one of the area research Emphasis.
The search modes of traditional " One-Size-Fit-All " can not allow user to be satisfied with, the substitute is retrieval knot The personalization of fruit, and " people looks for information " is replaced with the pattern of " information looks for people ".Then, coal information commending system just seems ten Divide necessity.
Commending system refers to be liked personalized recommendation according to personal as the system of output or in extensive optional object Guide well user's system alternatively.Coal information commending system proposed by the present invention is according to the daily record for having been browsed before user Content, by new, not browsed, user may coal information interested be automatically pushed to user.
Collaborative filtering is one of relatively early method being used in Technologies of Recommendation System in E-Commerce, and its core concept is that user inclines To the commodity or content bought in customer group of the purchase with similar interests hobby, basic starting point is:User can be Classify by interest;Evaluation of the user to different information contains the interest-degree of user;Evaluation of the user to a unknown message Will be similar with the evaluation of its similar interests user.
In the environment of community, often possess identical interest, demand, expectation between user and go selection similar with motivation Information, such case are to be provided the foundation based on the recommendation of collaborative filtering.
The content of the invention
In order to realize the present invention, it is proposed that a kind of based on collaborative filtering, the coal information commending system of Community-oriented, The system includes coal information collaborative filtering module, community users colony access log module and recommending module;Coal information is assisted Be responsible for analyzing the access log of user with filtering module, user neighbour is found out using the algorithm of collaborative filtering;Community users colony Access log module is responsible for the management of daily record;Recommending module is responsible for calculating the information of recommendation and recommending user;Wherein in colliery In Collaborative Filtering module, filtered using following methods:
(1) system initialization:Based on the access log of user in the community of colliery, coal mine user-colliery webpage scoring is built Matrix A, matrix A include s user u1,u2,...,usTo t webpage p1,p2,...,ptScoring, A for s × t ranks matrix, s Natural number is with t, the element a in matrixijScorings of the user i to webpage j is represented, i and j is natural number, and 1≤i≤s, 1 ≤ j≤t,
(2) rating matrix optimization:T webpage obtains k cluster after K-Means clustering algorithms, is designated as c1,c2,..., ck, k is number of clusters mesh, and k is natural number;Cluster cmComprising webpage number be Nm, this NmSequence number of the individual webpage in original t webpage Respectively f (m, 1), f (m, 2) ..., f (m, Nm), f function is used for the original number of webpage in cluster after calculating is clustered, i.e., in original The sequence number come in t webpage, f (m, Nm) represent cluster cmIn NmThe original number of individual webpage, m is natural number, 1≤m≤k, 1≤ Nm≤ t, the rating matrix after being optimizedWherein bimThe element of representing matrix B,
(3) neighbor searching:User uvThe cluster sequence number collection for scoring is combined into Mv, user uwThe cluster sequence number collection for scoring is combined into Mw, User uvAnd uwThe cluster sequence number collection for all scoring is combined into M, v and w and is natural number, 1≤v≤s, 1≤w≤s, then user uvAnd uw Between similarity beWhereinWithUser is represented respectively uvAnd uwAverage score to webpage, tries to achieve each user uiIn the user space, by similarity sort from high to low it is near Adjacent user's set Ui
(4) recommend:According to UiDraw set UiInterior all users, by scoring from high to low, the collections of web pages that browses Pi, user uiThe collections of web pages for having browsed is P'i, then gather (Pi-P'i) in before L coal information webpage be user uiIt is interested Content.
After using system above, can recommend to meet the project or information of its requirement according to the hobby of user, can be real Existing personalized service.After introducing commending system, can be believed according to the hobby of coal mine user, colliery that may be interested user Breath is formed and recommends user in the form of a list.User can be so allowed to search oneself in less range of information interested Content, whole information browse process are more quick, are changed into " information looks for people " from the mode of original " people looks for information ".Coal information Interest preference of the commending system based on each user, there is provided meet the targetedly recommendation of specific user, this recommendation strengthens The convenience that the experience of user, the user for improving use, also improves the work efficiency of user.
Description of the drawings
Fig. 1 is the system structure diagram of the present invention.
Specific embodiment
Based on collaborative filtering, the coal information commending system coal information collaborative filtering module of Community-oriented, community User group's access log module, recommending module.Wherein coal information collaborative filtering module is responsible for the access log for analyzing user, User neighbour is found out using the algorithm of collaborative filtering;Community users colony access log module is responsible for the management of daily record;Recommend mould Block is responsible for calculating the information of recommendation and recommending user.Coal information collaborative filtering module is the emphasis of the system, and which mainly walks Suddenly it is:
(1) system initialization:The step is based primarily upon the access log of user in the community of colliery, builds coal mine user-coal Ore deposit webpage rating matrix, is designated as A, it is assumed that the matrix includes s user u1,u2,...,usTo t webpage p1,p2,...,ptComment Point, i.e. matrixes of the A for s × t ranks, s and t are natural number, the element a in matrixijRepresent scorings of the user i to webpage j, i and j It is natural number, and 1≤i≤s, 1≤j≤t.Matrix A can be expressed as:
(2) rating matrix optimization:Under normal circumstances, user's score data is fewer, and continuous with webpage quantity t Increase, cause rating matrix A extremely sparse, the final precision for affecting to recommend.In view of this, the present invention is first to rating matrix A Do and further optimize, the method for optimization is that webpage is clustered, cluster process adopts K-Means algorithms.
Assume that t webpage is obtained k cluster after using K-Means clustering algorithms, be designated as c1,c2,...,ck, k is cluster Number, k are natural number.Assume cluster cmComprising webpage number be Nm, this NmSequence number of the individual webpage in original t webpage point Not Wei f (m, 1), f (m, 2) ..., f (m, Nm), f function is used for the original number of webpage in cluster after calculating is clustered, i.e., in original t Sequence number in individual webpage, f (m, Nm) represent cluster cmIn NmThe original number of individual webpage, m is natural number, 1≤m≤k, 1≤Nm≤ t。
Through above-mentioned optimization process, the rating matrix B after being optimized.
Wherein bimThe element of representing matrix B,
(3) neighbor searching:Based on the rating matrix B after optimization, the user of same or similar interest can be found out, user it Between the calculating of similarity carried out using improved cosine similarity method.If user is uvThe cluster sequence number collection for scoring is combined into Mv, user uwThe cluster sequence number collection for scoring is combined into Mw, user uvAnd uwThe cluster sequence number collection for all scoring is combined into M, v and w and is natural number, 1≤v ≤ s, 1≤w≤s, then user uvAnd uwBetween similarity be:
WhereinWithUser u is represented respectivelyvAnd uwAverage score to webpage.
According to above-mentioned similarity calculating method, can be in the hope of each user uiIn the user space, by similarity from The neighbour user set U of high to Low sequencei
(4) recommend:According to the user u for obtainingiNeighbour user set Ui, it can be deduced that set UiInterior all users, press Scoring from high to low, collections of web pages P that browsesi, it is assumed that user uiThe collections of web pages for having browsed is P'i, then gather (Pi-P 'i) in before L coal information webpage possibly user uiContent interested, can recommend user u as content recommendationi
The system can be recommended to meet the project or information of its requirement according to the hobby of user, be a kind of personalized Service system.After introducing commending system, can be according to the hobby of coal mine user, user possible coal information shape interested Into recommending user in the form of a list.Can thus allow user search in less range of information oneself it is interested in Hold, whole information browse process is more quick, is changed into " information looks for people " from the mode of original " people looks for information ".Coal information is pushed away Recommend interest preference of the system based on each user, there is provided meet the targetedly recommendation of specific user, this recommendation is enhanced The convenience that the experience of user, the user for improving use, also improves the work efficiency of user.

Claims (1)

1. a kind of coal information commending system based on collaborative filtering, the system include coal information collaborative filtering module, community User group's access log module and recommending module;Coal information collaborative filtering module is responsible for the access log for analyzing user, profit User neighbour is found out with the algorithm of collaborative filtering;Community users colony access log module is responsible for the management of daily record;Recommending module It is responsible for calculating the information of recommendation and recommending user;Wherein in coal information collaborative filtering module, carried out using following methods Filter:
(1) system initialization:Based on the access log of user in the community of colliery, coal mine user-colliery webpage rating matrix is built A, matrix A include s user u1,u2,...,usTo t webpage p1,p2,...,ptScoring, A for s × t ranks matrix, s and t Natural number is, the element a in matrixijScorings of the user i to webpage j is represented, i and j is natural number, and 1≤i≤s, 1≤j ≤ t,
(2) rating matrix optimization:T webpage obtains k cluster after K-Means clustering algorithms, is designated as c1,c2,...,ck, k is Number of clusters mesh, k are natural number;Cluster cmComprising webpage number be Nm, this NmSequence number of the individual webpage in original t webpage is respectively f (m,1),f(m,2),...,f(m,Nm), f function is used for the original number of webpage in cluster after calculating is clustered, i.e., in original t net Sequence number in page, f (m, Nm) represent cluster cmIn NmThe original number of individual webpage, m is natural number, 1≤m≤k, 1≤Nm≤ t, obtains Rating matrix to after optimizationWherein bimThe element of representing matrix B,
b im = 1 N m Σ n = 1 N m a if ( m , n ) ;
(3) neighbor searching:User uvThe cluster sequence number collection for scoring is combined into Mv, user uwThe cluster sequence number collection for scoring is combined into Mw, user uvAnd uwThe cluster sequence number collection for all scoring is combined into M, v and w and is natural number, 1≤v≤s, 1≤w≤s, then user uvAnd uwBetween Similarity be sim ( u v , u w ) = Σ z ∈ M ( b vz - b ‾ v ) ( b ‾ wz - b ‾ w ) Σ z ∈ M v ( b vz - b ‾ v ) Σ z ∈ M w ( b wz - b ‾ w ) , WhereinWithUser u is represented respectivelyvAnd uw Average score to webpage, tries to achieve each user uiIn the user space, the neighbour that sorted by similarity from high to low uses Family set Ui
(4) recommend:According to UiDraw set UiInterior all users, by scoring from high to low, collections of web pages P that browsesi, User uiThe collections of web pages for having browsed is P'i, then gather (Pi-P'i) in before L coal information webpage be user uiInterested is interior Hold.
CN201410110180.9A 2014-03-24 2014-03-24 Coal information commending system based on collaborative filtering Expired - Fee Related CN103886073B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410110180.9A CN103886073B (en) 2014-03-24 2014-03-24 Coal information commending system based on collaborative filtering

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410110180.9A CN103886073B (en) 2014-03-24 2014-03-24 Coal information commending system based on collaborative filtering

Publications (2)

Publication Number Publication Date
CN103886073A CN103886073A (en) 2014-06-25
CN103886073B true CN103886073B (en) 2017-03-29

Family

ID=50954965

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410110180.9A Expired - Fee Related CN103886073B (en) 2014-03-24 2014-03-24 Coal information commending system based on collaborative filtering

Country Status (1)

Country Link
CN (1) CN103886073B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104615683A (en) * 2015-01-21 2015-05-13 上海交通大学 Time and location sensing collaborative filtering technology with high expandability
CN106899668B (en) * 2017-02-23 2019-12-03 同济大学 Information Push Service processing method in car networking

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101685458A (en) * 2008-09-27 2010-03-31 华为技术有限公司 Recommendation method and system based on collaborative filtering
CN103412948A (en) * 2013-08-27 2013-11-27 北京交通大学 Cluster-based collaborative filtering commodity recommendation method and system

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101685458A (en) * 2008-09-27 2010-03-31 华为技术有限公司 Recommendation method and system based on collaborative filtering
CN103412948A (en) * 2013-08-27 2013-11-27 北京交通大学 Cluster-based collaborative filtering commodity recommendation method and system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
"基于协同过滤技术的个性化推荐系统研究,2012年6月;史玉珍等;《电子设计工程》;20120630;第20卷(第11期);第41-44页 *
基于项目特征聚类的协同过滤推荐算法;翁小兰等;《计算机应用与软件》;20090731;第26卷(第7期);第260-262页 *

Also Published As

Publication number Publication date
CN103886073A (en) 2014-06-25

Similar Documents

Publication Publication Date Title
CN103106285B (en) Recommendation algorithm based on information security professional social network platform
CN106339502A (en) Modeling recommendation method based on user behavior data fragmentation cluster
CN104111941B (en) The method and apparatus that information is shown
CN105787068B (en) The academic recommended method and system analyzed based on citation network and user's proficiency
CN103455487B (en) The extracting method and device of a kind of search term
CN107894998B (en) Video recommendation method and device
CN105488233A (en) Reading information recommendation method and system
CN106250526A (en) A kind of text class based on content and user behavior recommends method and apparatus
CN103810162B (en) The method and system of recommendation network information
CN102831234A (en) Personalized news recommendation device and method based on news content and theme feature
CN106777051A (en) A kind of many feedback collaborative filtering recommending methods based on user's group
CN106250545A (en) A kind of multimedia recommendation method and system searching for content based on user
CN105045931A (en) Video recommendation method and system based on Web mining
CN102193936A (en) Data classification method and device
CN104615779A (en) Method for personalized recommendation of Web text
CN104809243A (en) Mixed recommendation method based on excavation of user behavior compositing factor
CN104484431A (en) Multi-source individualized news webpage recommending method based on field body
CN104462383A (en) Movie recommendation method based on feedback of users' various behaviors
CN104298785A (en) Searching method for public searching resources
CN107341199A (en) A kind of recommendation method based on documentation & info general model
CN105740387B (en) A kind of scientific and technical literature recommended method based on author's frequent mode
CN106227866A (en) A kind of hybrid filtering film based on data mining recommends method
CN106777359A (en) A kind of text services based on limited Boltzmann machine recommend method
CN103886073B (en) Coal information commending system based on collaborative filtering
CN104298702A (en) Method and system for electronic reading material recommendation on basis of social network information

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20170329

Termination date: 20210324

CF01 Termination of patent right due to non-payment of annual fee