CN107368521B - Knowledge recommendation method and system based on big data and deep learning - Google Patents

Knowledge recommendation method and system based on big data and deep learning Download PDF

Info

Publication number
CN107368521B
CN107368521B CN201710417583.1A CN201710417583A CN107368521B CN 107368521 B CN107368521 B CN 107368521B CN 201710417583 A CN201710417583 A CN 201710417583A CN 107368521 B CN107368521 B CN 107368521B
Authority
CN
China
Prior art keywords
maintenance personnel
knowledge
similarity
scene
concept
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710417583.1A
Other languages
Chinese (zh)
Other versions
CN107368521A (en
Inventor
孔祥明
蔡禹
贾义动
朱容虎
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Guangye Kaiyuan Technology Co ltd
Original Assignee
Guangdong Guangye Kaiyuan Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Guangye Kaiyuan Technology Co ltd filed Critical Guangdong Guangye Kaiyuan Technology Co ltd
Priority to CN201710417583.1A priority Critical patent/CN107368521B/en
Publication of CN107368521A publication Critical patent/CN107368521A/en
Application granted granted Critical
Publication of CN107368521B publication Critical patent/CN107368521B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Abstract

The invention discloses a knowledge recommendation method and a system based on big data and deep learning, wherein the method comprises the following steps: performing scene analysis according to the scene data information; processing the data information of the operation and maintenance personnel by adopting a deep learning method to generate an operation and maintenance personnel portrait; selecting knowledge according to a scene analysis result and the operation and maintenance personnel portrait to obtain a knowledge recommendation set; performing knowledge recommendation according to the knowledge recommendation set; the system comprises a scene analysis module, an operation and maintenance personnel portrait generation module, a knowledge selection module and a knowledge recommendation module. The invention integrates the scene analysis result and the operation and maintenance personnel image to select knowledge and recommend knowledge, thus improving the accuracy of knowledge recommendation from two dimensions of real-time scene use and user use; a deep learning method is adopted to learn data information such as operation behaviors of operation and maintenance personnel, so that the operation and maintenance personnel portray more and more accurately. The invention can be widely applied to the field of computer application.

Description

Knowledge recommendation method and system based on big data and deep learning
Technical Field
The invention relates to the field of computer application, in particular to a knowledge recommendation method and a knowledge recommendation system based on big data and deep learning.
Background
Today, knowledge is promoted to strategic resources by enterprises, and the enterprises are imperative to adopt knowledge management. With the increasingly large service systems, the increasingly complex service logics, the more frequent system changes and the higher working requirements, the difficulty of the operation management work of the service support network is also increased. By constructing a unified knowledge base with rich contents and human participation, the aims of establishing a channel for uploading and issuing knowledge for enterprises, creating a learning type business support team and assisting companies in long-term sustainable health development can be fulfilled.
Knowledge Management (KM, Knowledge Management) is an emerging Management thinking and method in the new economic era of networks, and Management scholars predicted by durac as early as one, nine, six, and five years: "knowledge will replace land, labor, capital and machinery equipment, becoming the most important production factor. The concept of knowledge management combined with the entrance website, database and application computer software system constructed by the internet is a new century benefit device for accumulating knowledge wealth and creating more competitiveness due to the vigorous development of informatization (informatization) in the 90 s of the 20 th century.
So-called knowledge management, which is defined as: a knowledge system with both humanity and technology is established in an organization, so that the final aim of continuous innovation of knowledge is achieved through the processes of obtaining, creating, sharing, integrating, recording, accessing, updating and the like of information and knowledge in the organization, the knowledge is fed back to the knowledge system, the knowledge of individuals and the organization can be accumulated forever, the wisdom capital of the organization is formed by thinking from the perspective of the system, and the enterprise can make correct decisions to adapt to market transition.
In knowledge management, how knowledge is applied is also an important topic. At present, the application of knowledge is mainly based on manual retrieval and directory tree display, and the application modes all require more manpower and time investment and have low accuracy. The advent of knowledge recommendation technology changes the acquisition mode of knowledge from "search" to "recommendation", solving the problem.
At present, part of knowledge application can analyze the flow or the place and recommend knowledge according to the analysis result. Compared with the manual retrieval and directory tree display mode, the mode not only greatly reduces the manpower and time investment, but also can improve the accuracy of knowledge recommendation. However, this method fails to consider the key information of the users (such as the knowledge operation and maintenance personnel) of the knowledge application, and the accuracy is not ideal. Meanwhile, the method cannot continuously learn and self-correct to adapt to the changing conditions, and the accuracy rate of the method is lower and lower as time goes on.
Disclosure of Invention
To solve the above technical problems, the present invention aims to: the knowledge recommendation method based on big data and deep learning is high in accuracy.
Another object of the present invention is to: the knowledge recommendation system based on big data and deep learning is high in accuracy.
The technical scheme adopted by the invention is as follows:
a knowledge recommendation method based on big data and deep learning comprises the following steps:
performing scene analysis according to the scene data information;
processing the data information of the operation and maintenance personnel by adopting a deep learning method to generate an operation and maintenance personnel portrait;
selecting knowledge according to a scene analysis result and the operation and maintenance personnel portrait to obtain a knowledge recommendation set;
and carrying out knowledge recommendation according to the knowledge recommendation set.
Further, the step of performing scene analysis according to the scene data information includes:
acquiring scene data information in real time, and acquiring scene data operated by current operation and maintenance personnel;
carrying out data cleaning on the acquired scene data;
and carrying out real-time scene analysis and labeling operation on the scene data after data cleaning to obtain a scene mark.
Further, the step of processing the data information of the operation and maintenance personnel by using the deep learning method to generate the operation and maintenance personnel portrait comprises the following steps:
acquiring operation and maintenance personnel data information, wherein the operation and maintenance personnel data information comprises data acquired from information registered by an operation and maintenance personnel account and operation behavior data of the operation and maintenance personnel acquired in real time;
carrying out data cleaning on the collected operation and maintenance personnel data information;
carrying out model training on the operation and maintenance personnel by adopting a method of fusing weak model training and Boosting on the data information of the operation and maintenance personnel after data cleaning to obtain an operation and maintenance personnel portrait;
and calculating the similarity between the operation and maintenance personnel and the knowledge label according to the operation and maintenance personnel portrait to obtain the operation and maintenance personnel-knowledge similarity.
Further, the operation and maintenance personnel model training is carried out on the operation and maintenance personnel data information after data cleaning by adopting a method of fusing weak model training and Boosting to obtain an operation and maintenance personnel portrait, and the method comprises the following steps:
performing text modeling, so as to divide the operation and maintenance personnel data information after data cleaning into static information data and dynamic information data;
performing weak model training on a given training sample according to the requirement of text modeling to obtain a plurality of weak models;
the accuracy of the weak models is improved by adopting a Boosting method, and a result classifier of the operation and maintenance personnel model is obtained;
performing model verification on a result classifier of the operation and maintenance personnel model by adopting a given test sample;
and storing the operation and maintenance personnel model after the model verification is passed, and acquiring new operation and maintenance personnel data information after data cleaning in real time to continuously correct the operation and maintenance personnel model and the corresponding operation and maintenance personnel portrait.
Further, the step of calculating the similarity between the operation and maintenance personnel and the knowledge tag according to the operation and maintenance personnel portrait to obtain the operation and maintenance personnel-knowledge similarity includes:
and calculating the name similarity according to the operation and maintenance personnel image, wherein the calculation formula of the name similarity is as follows:
Figure BDA0001314086590000031
wherein S isname(U, C) is the name similarity between the concept U in the operation and maintenance personnel figure and the concept C in the knowledge label, Ui(i is more than or equal to 1 and less than or equal to n) is a character string semantic word segmentation result of the name U in the concept U, cj(j is more than or equal to 1 and less than or equal to m) is the result of semantic segmentation of the character string of the name C in the concept C, n is the total number of the character strings of the name U in the concept U, m is the total number of the character strings of the name C in the concept C, and Sim (Ui,cj) Is uiAnd cjSimilarity between them;
belongs according to the operation and maintenance personnel figureAnd calculating the similarity, wherein the attribute similarity calculation formula is as follows:
Figure BDA0001314086590000032
wherein S isattri(U, C) is the attribute similarity between the concept U in the operation and maintenance personnel portrait and the concept C in the knowledge tag, UaAnd CaRepresenting the set of attributes of U and C, respectively, f being a given non-negative metric function, Ua∩CaRepresenting a set of two concepts, U and C, having the same attribute, Ua-CaRepresenting an attribute set, C, that is only present in the representation of the operation and maintenance person but not in the knowledge taga-UaRepresenting the attribute set which is only contained in the knowledge tag but not contained in the operation and maintenance personnel image, wherein lambda and mu are given weight coefficients;
and calculating example similarity according to the operation and maintenance personnel image, wherein the example similarity calculation formula is as follows:
Figure BDA0001314086590000033
wherein S isinst(U, C) is the example similarity between the concept U in the operation and maintenance personnel image and the concept C in the knowledge label, P (U, C) represents the probability that an example randomly extracted from the example space belongs to the concepts U and C at the same time,
Figure BDA0001314086590000034
representing the probability that an instance randomly drawn from the instance space belongs only to concept U and not to concept C,
Figure BDA0001314086590000035
represents the probability that an instance randomly drawn from the instance space belongs only to concept C and not to concept U;
calculating the relation similarity according to the operation and maintenance personnel portrait to obtain the relation similarity S between the concept U in the operation and maintenance personnel portrait and the concept C in the knowledge labelrelat(U, C), wherein the relationship comprises a synonymy relationship, an inheritance relationship, and an inclusion relationship, the weight of the synonymy relationship is greater than the weight of the inheritance relationship, and the weight of the synonymy relationship is greater than the weight of the inclusion relationship;
according toName similarity Sname(U, C), Attribute similarity Sattri(U, C), example similarity Sinst(U, C) and relationship similarity Srelat(U, C) calculating the similarity of the operation and maintenance personnel and the knowledge, wherein the calculation formula of the similarity of the operation and maintenance personnel and the knowledge Sim (U, C) is that Sim (U, C) is α Sname(U,C)+βSattri(U,C)+εSinst(U,C)+δSrelat(U, C), wherein α, β, epsilon, and delta are given name similarity coefficient, attribute similarity coefficient, instance similarity coefficient, and relationship similarity coefficient, respectively.
Further, the step of selecting knowledge according to the result of the scene analysis and the portrait of the operation and maintenance personnel to obtain a knowledge recommendation set includes:
searching the characteristics of the operation and maintenance personnel according to the operation and maintenance personnel images, and calculating an operation and maintenance similar knowledge set by combining the operation and maintenance personnel images;
acquiring scene characteristics in real time according to the scene analysis result, and calculating a scene similarity knowledge set according to the scene characteristics;
and judging whether the scene similar knowledge set and the operation and maintenance similar knowledge set have intersection, if so, forming a recommendation knowledge set according to the intersection, otherwise, re-acquiring scene characteristics, and re-training the operation and maintenance personnel model to obtain a new operation and maintenance personnel portrait.
Further, the step of selecting knowledge according to the result of the scene analysis and the portrait of the operation and maintenance personnel to obtain a knowledge recommendation set includes:
carrying out user scene retrieval on operation and maintenance personnel, wherein the user scene U ismContains a scene u, and
Figure BDA0001314086590000041
wherein, UTIs a full user scenario;
calculating a user scene U by adopting a set similarity calculation methodmSimilar knowledge scenes are sequenced to obtain a similar scene set Cn
Figure BDA0001314086590000042
Wherein, Um≈CuUser scene U representing operation and maintenance personnelmAnd knowledge scene CuA is an intersection symbol, CTIs a full knowledge scene set;
retrieving a given scene knowledge relationship model ORTo obtain a scene CnKnowledge-scene relationship pair P ofi,j
Figure BDA0001314086590000043
Wherein, PTAs a full-knowledge-scene-relationship pair, CiFor knowledge scenarios, R1As scene CiThe properties of (a) to (b) are,
Figure BDA0001314086590000044
represents Pi,jBy R1And CiConnecting;
retrieving a given set of knowledge domains ODTo obtain a compound containingi,jAssociated domain knowledge set Dk
Figure BDA0001314086590000045
Wherein D isTIn the field of full-scale knowledge, R2For domain knowledge DkThe properties of (a) to (b) are,
Figure BDA0001314086590000046
represents Pi,jBy R2And DkConnecting;
from a full-scale domain knowledge recommendation set
Figure BDA0001314086590000047
To obtain knowledge recommendation set KS
Figure BDA0001314086590000048
The other technical scheme adopted by the invention is as follows:
a knowledge recommendation system based on big data and deep learning, comprising:
the scene analysis module is used for carrying out scene analysis according to the scene data information;
the operation and maintenance personnel portrait generation module is used for processing the data information of the operation and maintenance personnel by adopting a deep learning method to generate an operation and maintenance personnel portrait;
the knowledge selection module is used for selecting knowledge according to the result of the scene analysis and the operation and maintenance personnel portrait to obtain a knowledge recommendation set;
and the knowledge recommendation module is used for recommending knowledge according to the knowledge recommendation set.
Further, the operation and maintenance personnel portrait generation module comprises:
the information acquisition unit is used for acquiring operation and maintenance personnel data information, and the operation and maintenance personnel data information comprises data acquired from information registered by an operation and maintenance personnel account and operation behavior data of the operation and maintenance personnel acquired in real time;
the data cleaning unit is used for cleaning the data of the collected operation and maintenance personnel data information;
the model training unit is used for carrying out model training on the operation and maintenance personnel by adopting a method of fusing weak model training and Boosting on the operation and maintenance personnel data information after data cleaning to obtain an operation and maintenance personnel portrait;
and the similarity calculation unit is used for calculating the similarity between the operation and maintenance personnel and the knowledge label according to the operation and maintenance personnel portrait to obtain the operation and maintenance personnel-knowledge similarity.
Further, the similarity calculation unit includes:
the name similarity calculation operator unit is used for calculating name similarity according to the operation and maintenance personnel image, and the calculation formula of the name similarity is as follows:
Figure BDA0001314086590000051
wherein S isname(U, C) is the name similarity between the concept U in the operation and maintenance personnel figure and the concept C in the knowledge label, Ui(i is more than or equal to 1 and less than or equal to n) is a character string semantic word segmentation result of the name U in the concept U, cj(j is more than or equal to 1 and less than or equal to m) is the result of semantic segmentation of the character string of the name C in the concept C, n is the total number of the character strings of the name U in the concept U, m is the total number of the character strings of the name C in the concept C, and Sim (Ui,cj) Is uiAnd cjSimilarity between them;
the attribute similarity calculation operator unit is used for calculating the attribute similarity according to the operation and maintenance personnel image, and the attribute similarity calculation formula is as follows:
Figure BDA0001314086590000052
wherein S isattri(U, C) is the attribute similarity between the concept U in the operation and maintenance personnel portrait and the concept C in the knowledge tag, UaAnd CaRepresenting the set of attributes of U and C, respectively, f being a given non-negative metric function, Ua∩CaRepresenting a set of two concepts, U and C, having the same attribute, Ua-CaRepresenting an attribute set, C, that is only present in the representation of the operation and maintenance person but not in the knowledge taga-UaRepresenting the attribute set which is only contained in the knowledge tag but not contained in the operation and maintenance personnel image, wherein lambda and mu are given weight coefficients;
the example similarity operator unit is used for calculating the example similarity according to the operation and maintenance personnel image, and the example similarity calculation formula is as follows:
Figure BDA0001314086590000061
wherein S isinst(U, C) is the example similarity between the concept U in the operation and maintenance personnel image and the concept C in the knowledge label, P (U, C) represents the probability that an example randomly extracted from the example space belongs to the concepts U and C at the same time,
Figure BDA0001314086590000062
representing the probability that an instance randomly drawn from the instance space belongs only to concept U and not to concept C,
Figure BDA0001314086590000063
represents the probability that an instance randomly drawn from the instance space belongs only to concept C and not to concept U;
the relational similarity calculation operator unit is used for calculating the relational similarity according to the operation and maintenance personnel image to obtain the relational similarity S between the concept U in the operation and maintenance personnel image and the concept C in the knowledge labelrelat(U,C),The relationships comprise a synonymy relationship, an inheritance relationship and an inclusion relationship, wherein the weight of the synonymy relationship is greater than that of the inheritance relationship, and the weight of the synonymy relationship is greater than that of the inclusion relationship;
an operation and maintenance personnel-knowledge similarity calculation operator unit for calculating the similarity S according to the namename(U, C), Attribute similarity Sattri(U, C), example similarity Sinst(U, C) and relationship similarity Srelat(U, C) calculating the similarity of the operation and maintenance personnel and the knowledge, wherein the calculation formula of the similarity of the operation and maintenance personnel and the knowledge Sim (U, C) is that Sim (U, C) is α Sname(U,C)+βSattri(U,C)+εSinst(U,C)+δSrelat(U, C), wherein α, β, epsilon, and delta are given name similarity coefficient, attribute similarity coefficient, instance similarity coefficient, and relationship similarity coefficient, respectively.
The method of the invention has the beneficial effects that: the method comprises the steps of selecting knowledge according to a scene analysis result and an operation and maintenance person portrait to obtain a knowledge recommendation set and recommending the knowledge according to the knowledge recommendation set, wherein the scene analysis result and the operation and maintenance person portrait are integrated to select and recommend the knowledge, so that not only can knowledge recommendation be performed according to a real-time use scene of the knowledge application, but also user information of the knowledge application, namely the operation and maintenance person portrait, is considered during the knowledge recommendation, and the accuracy of the knowledge recommendation is improved from the two dimensions of the real-time use scene and the use user; the method comprises the steps of processing data information of the operation and maintenance personnel by adopting a deep learning method and generating the portrait of the operation and maintenance personnel, wherein the operation and maintenance personnel are learned by adopting the deep learning method, so that the portrait of the operation and maintenance personnel can be trained, and the portrait model of the operation and maintenance personnel can be continuously corrected by combining with subsequent operation behavior data of the operation and maintenance personnel, so that the portrait of the operation and maintenance personnel is more and more accurate.
The system of the invention has the advantages that: the knowledge recommendation system comprises a knowledge selection module for selecting knowledge according to a scene analysis result and an operation and maintenance person portrait to obtain a knowledge recommendation set and a knowledge recommendation module for recommending knowledge according to the knowledge recommendation set, wherein the scene analysis result and the operation and maintenance person portrait are integrated to select and recommend knowledge, so that not only can knowledge be recommended according to a real-time use scene of knowledge application, but also user information of the knowledge application, namely the operation and maintenance person portrait, is considered during the knowledge recommendation, and the accuracy of the knowledge recommendation is improved from the real-time use scene and the use user; the operation and maintenance person portrait generation module learns data information such as operation behaviors of the operation and maintenance person by adopting a deep learning method, can train the operation and maintenance person portrait, and can continuously correct an operation and maintenance person portrait model by combining with subsequent operation behavior data of the operation and maintenance person, so that the operation and maintenance person portrait is more and more accurate.
Drawings
FIG. 1 is an overall flow chart of a knowledge recommendation method based on big data and deep learning according to the present invention;
FIG. 2 is a flowchart illustrating a knowledge recommendation method according to an embodiment of the present invention;
FIG. 3 is a flowchart of an operation and maintenance personnel image generation process according to an embodiment of the present invention;
FIG. 4 is a flow chart of a knowledge selection process according to an embodiment of the invention.
Detailed Description
Referring to fig. 1, a knowledge recommendation method based on big data and deep learning includes the following steps:
performing scene analysis according to the scene data information;
processing the data information of the operation and maintenance personnel by adopting a deep learning method to generate an operation and maintenance personnel portrait;
selecting knowledge according to a scene analysis result and the operation and maintenance personnel portrait to obtain a knowledge recommendation set;
and carrying out knowledge recommendation according to the knowledge recommendation set.
Further as a preferred embodiment, the step of performing scene analysis according to scene data information includes:
acquiring scene data information in real time, and acquiring scene data operated by current operation and maintenance personnel;
carrying out data cleaning on the acquired scene data;
and carrying out real-time scene analysis and labeling operation on the scene data after data cleaning to obtain a scene mark.
Further preferably, the step of processing the data information of the operation and maintenance personnel by using the deep learning method to generate the operation and maintenance personnel portrait includes:
acquiring operation and maintenance personnel data information, wherein the operation and maintenance personnel data information comprises data acquired from information registered by an operation and maintenance personnel account and operation behavior data of the operation and maintenance personnel acquired in real time;
carrying out data cleaning on the collected operation and maintenance personnel data information;
carrying out model training on the operation and maintenance personnel by adopting a method of fusing weak model training and Boosting on the data information of the operation and maintenance personnel after data cleaning to obtain an operation and maintenance personnel portrait;
and calculating the similarity between the operation and maintenance personnel and the knowledge label according to the operation and maintenance personnel portrait to obtain the operation and maintenance personnel-knowledge similarity.
Further, as a preferred embodiment, the step of performing model training on the operation and maintenance personnel data information after data cleaning by using a method of fusing weak model training and Boosting to obtain an operation and maintenance personnel portrait includes:
performing text modeling, so as to divide the operation and maintenance personnel data information after data cleaning into static information data and dynamic information data;
performing weak model training on a given training sample according to the requirement of text modeling to obtain a plurality of weak models;
the accuracy of the weak models is improved by adopting a Boosting method, and a result classifier of the operation and maintenance personnel model is obtained;
performing model verification on a result classifier of the operation and maintenance personnel model by adopting a given test sample;
and storing the operation and maintenance personnel model after the model verification is passed, and acquiring new operation and maintenance personnel data information after data cleaning in real time to continuously correct the operation and maintenance personnel model and the corresponding operation and maintenance personnel portrait.
Further, as a preferred embodiment, the step of calculating the similarity between the operation and maintenance person and the knowledge tag according to the operation and maintenance person portrait to obtain the operation and maintenance person-knowledge similarity includes:
and calculating the name similarity according to the operation and maintenance personnel image, wherein the calculation formula of the name similarity is as follows:
Figure BDA0001314086590000081
wherein S isname(U, C) is the name similarity between the concept U in the operation and maintenance personnel figure and the concept C in the knowledge label, Ui(i is more than or equal to 1 and less than or equal to n) is a character string semantic word segmentation result of the name U in the concept U, cj(j is more than or equal to 1 and less than or equal to m) is the result of semantic segmentation of the character string of the name C in the concept C, n is the total number of the character strings of the name U in the concept U, m is the total number of the character strings of the name C in the concept C, and Sim (Ui,cj) Is uiAnd cjSimilarity between them;
and performing attribute similarity calculation according to the operation and maintenance personnel image, wherein the attribute similarity calculation formula is as follows:
Figure BDA0001314086590000082
wherein S isattri(U, C) is the attribute similarity between the concept U in the operation and maintenance personnel portrait and the concept C in the knowledge tag, UaAnd CaRepresenting the set of attributes of U and C, respectively, f being a given non-negative metric function, Ua∩CaRepresenting a set of two concepts, U and C, having the same attribute, Ua-CaRepresenting an attribute set, C, that is only present in the representation of the operation and maintenance person but not in the knowledge taga-UaRepresenting the attribute set which is only contained in the knowledge tag but not contained in the operation and maintenance personnel image, wherein lambda and mu are given weight coefficients;
and calculating example similarity according to the operation and maintenance personnel image, wherein the example similarity calculation formula is as follows:
Figure BDA0001314086590000091
wherein S isinst(U, C) are drawn by operation and maintenance personnelThe similarity of the instances between the concept U in the image and the concept C in the knowledge tag, P (U, C) represents the probability that an instance randomly drawn from the instance space belongs to both concepts U and C,
Figure BDA0001314086590000092
representing the probability that an instance randomly drawn from the instance space belongs only to concept U and not to concept C,
Figure BDA0001314086590000093
represents the probability that an instance randomly drawn from the instance space belongs only to concept C and not to concept U;
calculating the relation similarity according to the operation and maintenance personnel portrait to obtain the relation similarity S between the concept U in the operation and maintenance personnel portrait and the concept C in the knowledge labelrelat(U, C), wherein the relationship comprises a synonymy relationship, an inheritance relationship, and an inclusion relationship, the weight of the synonymy relationship is greater than the weight of the inheritance relationship, and the weight of the synonymy relationship is greater than the weight of the inclusion relationship;
according to the name similarity Sname(U, C), Attribute similarity Sattri(U, C), example similarity Sinst(U, C) and relationship similarity Srelat(U, C) calculating the similarity of the operation and maintenance personnel and the knowledge, wherein the calculation formula of the similarity of the operation and maintenance personnel and the knowledge Sim (U, C) is that Sim (U, C) is α Sname(U,C)+βSattri(U,C)+εSinst(U,C)+δSrelat(U, C), wherein α, β, epsilon, and delta are given name similarity coefficient, attribute similarity coefficient, instance similarity coefficient, and relationship similarity coefficient, respectively.
Further, as a preferred embodiment, the step of obtaining a knowledge recommendation set by selecting knowledge according to a result of the scene analysis and the operation and maintenance person portrait includes:
searching the characteristics of the operation and maintenance personnel according to the operation and maintenance personnel images, and calculating an operation and maintenance similar knowledge set by combining the operation and maintenance personnel images;
acquiring scene characteristics in real time according to the scene analysis result, and calculating a scene similarity knowledge set according to the scene characteristics;
and judging whether the scene similar knowledge set and the operation and maintenance similar knowledge set have intersection, if so, forming a recommendation knowledge set according to the intersection, otherwise, re-acquiring scene characteristics, and re-training the operation and maintenance personnel model to obtain a new operation and maintenance personnel portrait.
Further, as a preferred embodiment, the step of obtaining a knowledge recommendation set by selecting knowledge according to a result of the scene analysis and the operation and maintenance person portrait includes:
carrying out user scene retrieval on operation and maintenance personnel, wherein the user scene U ismContains a scene u, and
Figure BDA0001314086590000094
wherein, UTIs a full user scenario;
calculating a user scene U by adopting a set similarity calculation methodmSimilar knowledge scenes are sequenced to obtain a similar scene set Cn
Figure BDA0001314086590000101
Wherein, Um≈CuUser scene U representing operation and maintenance personnelmAnd knowledge scene CuA is an intersection symbol, CTIs a full knowledge scene set;
retrieving a given scene knowledge relationship model ORTo obtain a scene CnKnowledge-scene relationship pair P ofi,j
Figure BDA0001314086590000102
Wherein, PTAs a full-knowledge-scene-relationship pair, CiFor knowledge scenarios, R1As scene CiThe properties of (a) to (b) are,
Figure BDA0001314086590000103
represents Pi,jBy R1And CiConnecting;
retrieving a given set of knowledge domains ODTo obtain a compound containingi,jAssociated domain knowledge set Dk
Figure BDA0001314086590000104
Wherein D isTIn the field of full-scale knowledge, R2For domain knowledge DkThe properties of (a) to (b) are,
Figure BDA0001314086590000105
represents Pi,jBy R2And DkConnecting;
from a full-scale domain knowledge recommendation set
Figure BDA0001314086590000106
To obtain knowledge recommendation set KS
Figure BDA0001314086590000107
Wherein, the scene knowledge relation model ORAnd field of knowledge set ODCalculated or pre-given before knowledge selection. The knowledge recommendation method and the knowledge recommendation system sequentially search the user scene, match the similarity of the user scene, match the knowledge-scene relationship and match the knowledge field when selecting the knowledge, and finally find out the knowledge recommendation set.
The invention relates to a knowledge recommendation system based on big data and deep learning, which comprises:
the scene analysis module is used for carrying out scene analysis according to the scene data information;
the operation and maintenance personnel portrait generation module is used for processing the data information of the operation and maintenance personnel by adopting a deep learning method to generate an operation and maintenance personnel portrait;
the knowledge selection module is used for selecting knowledge according to the result of the scene analysis and the operation and maintenance personnel portrait to obtain a knowledge recommendation set;
and the knowledge recommendation module is used for recommending knowledge according to the knowledge recommendation set.
Further preferably, the operation and maintenance person representation generating module includes:
the information acquisition unit is used for acquiring operation and maintenance personnel data information, and the operation and maintenance personnel data information comprises data acquired from information registered by an operation and maintenance personnel account and operation behavior data of the operation and maintenance personnel acquired in real time;
the data cleaning unit is used for cleaning the data of the collected operation and maintenance personnel data information;
the model training unit is used for carrying out model training on the operation and maintenance personnel by adopting a method of fusing weak model training and Boosting on the operation and maintenance personnel data information after data cleaning to obtain an operation and maintenance personnel portrait;
and the similarity calculation unit is used for calculating the similarity between the operation and maintenance personnel and the knowledge label according to the operation and maintenance personnel portrait to obtain the operation and maintenance personnel-knowledge similarity.
Further preferably, the similarity calculation unit includes:
the name similarity calculation operator unit is used for calculating name similarity according to the operation and maintenance personnel image, and the calculation formula of the name similarity is as follows:
Figure BDA0001314086590000111
wherein S isname(U, C) is the name similarity between the concept U in the operation and maintenance personnel figure and the concept C in the knowledge label, Ui(i is more than or equal to 1 and less than or equal to n) is a character string semantic word segmentation result of the name U in the concept U, cj(j is more than or equal to 1 and less than or equal to m) is the result of semantic segmentation of the character string of the name C in the concept C, n is the total number of the character strings of the name U in the concept U, m is the total number of the character strings of the name C in the concept C, and Sim (Ui,cj) Is uiAnd cjSimilarity between them;
the attribute similarity calculation operator unit is used for calculating the attribute similarity according to the operation and maintenance personnel image, and the attribute similarity calculation formula is as follows:
Figure BDA0001314086590000112
wherein S isattri(U, C) is the attribute similarity between the concept U in the operation and maintenance personnel portrait and the concept C in the knowledge tag, UaAnd CaRepresenting the set of attributes of U and C, respectively, f being a given non-negative metric function, Ua∩CaRepresenting a set of two concepts, U and C, having the same attribute, Ua-CaRepresentation is operation and maintenance onlyAttribute sets, C, present in person representations but absent in knowledge tagsa-UaRepresenting the attribute set which is only contained in the knowledge tag but not contained in the operation and maintenance personnel image, wherein lambda and mu are given weight coefficients;
the example similarity operator unit is used for calculating the example similarity according to the operation and maintenance personnel image, and the example similarity calculation formula is as follows:
Figure BDA0001314086590000113
wherein S isinst(U, C) is the example similarity between the concept U in the operation and maintenance personnel image and the concept C in the knowledge label, P (U, C) represents the probability that an example randomly extracted from the example space belongs to the concepts U and C at the same time,
Figure BDA0001314086590000114
representing the probability that an instance randomly drawn from the instance space belongs only to concept U and not to concept C,
Figure BDA0001314086590000115
represents the probability that an instance randomly drawn from the instance space belongs only to concept C and not to concept U;
the relational similarity calculation operator unit is used for calculating the relational similarity according to the operation and maintenance personnel image to obtain the relational similarity S between the concept U in the operation and maintenance personnel image and the concept C in the knowledge labelrelat(U, C), wherein the relationship comprises a synonymy relationship, an inheritance relationship, and an inclusion relationship, the weight of the synonymy relationship is greater than the weight of the inheritance relationship, and the weight of the synonymy relationship is greater than the weight of the inclusion relationship;
an operation and maintenance personnel-knowledge similarity calculation operator unit for calculating the similarity S according to the namename(U, C), Attribute similarity Sattri(U, C), example similarity Sinst(U, C) and relationship similarity Srelat(U, C) calculating the similarity of the operation and maintenance personnel and the knowledge, wherein the calculation formula of the similarity of the operation and maintenance personnel and the knowledge Sim (U, C) is that Sim (U, C) is α Sname(U,C)+βSattri(U,C)+εSinst(U,C)+δSrelat(U, C) wherein α, β,. epsilon.and.delta.are eachGiven name similarity coefficient, attribute similarity coefficient, instance similarity coefficient, and relationship similarity coefficient.
The invention will be further explained and explained with reference to the drawings and the embodiments in the description.
Example one
Aiming at the problem of low accuracy of knowledge recommendation in the prior art, the invention provides an efficient knowledge recommendation technology applied to an operation and maintenance environment. The knowledge recommendation technology models the use scene of the knowledge base and the use user (namely, operation and maintenance personnel) through deep learning algorithms such as reinforcement learning, and further carries out accurate knowledge popularization and application based on the established model. The recommendation technology has the following characteristics:
(1) the technology is used for training the portrait of the operation and maintenance personnel by off-line learning of historical operation behaviors of the operation and maintenance personnel and continuously correcting the portrait model of the operation and maintenance personnel by combining subsequent operation behavior data of the operation and maintenance personnel, so that the portrait of the operation and maintenance personnel is more and more accurate.
(2) The technology analyzes the system operation scene of the operation and maintenance work in real time, evaluates (namely selects knowledge) by combining the trained operation and maintenance personnel figure, and can accurately recommend the knowledge.
The detailed implementation of the knowledge recommendation of the present invention will be described in detail below in terms of both noun explanation and implementation.
Interpretation of noun
The invention relates to the following terms:
knowledge recommendation: recommendations for knowledge data. The knowledge recommendation pushes and introduces related knowledge to the user through the processing content and the user model of the operation and maintenance platform system user, on one hand, the knowledge recommendation can help the user to accelerate the processing of work, and on the other hand, more deep related information is provided to enhance the user capability.
Deep learning: research from artificial neural networks has led to the discovery of distributed feature representations of data by combining lower-level features to form more abstract higher-level representation attribute classes or features.
A knowledge base: structured, easy-to-operate, easy-to-use and fully organized knowledge clusters in knowledge engineering are interconnected knowledge slice sets which are stored, organized, managed and used in a computer memory by adopting a certain knowledge representation mode (or a plurality of knowledge representation modes) according to the needs of solving problems in a certain (or certain) field. These knowledge pieces include theoretical knowledge related to the domain, factual data, heuristic knowledge derived from expert experience (such as definition, theorem, algorithm, etc. related to a domain), and common knowledge. The difference between general applications and knowledge-based systems is: while general applications implicitly encode problem-solving knowledge in the program, knowledge-based systems explicitly express problem-solving knowledge in the application domain and separately compose a relatively independent program entity.
Operation and maintenance informatization system: the method takes the daily operation maintenance management process of an IT department as a core, takes event tracking as a main line, aims to solve eight management problems (process management, event management, problem management, change management, release management, operation management, knowledge management and comprehensive analysis management) in the IT operation management, and provides an efficient and standard IT operation management platform for the IT department. The system not only realizes the interface with the service system used in the enterprise at present, but also integrates the system functions of customer service, operation and maintenance, service management and the like, and can carry out stage prompt on the responsible person in the forms of mails, mobile phone short messages and the like, thereby improving the service response efficiency of system maintenance; by integrating information, the comprehensive management of various resources is realized, including the effective management of various static resources, basic data and spare part resources, so that the quick response capability of the operation and maintenance of the IT department is comprehensively improved, and meanwhile, a perfect data model is established for the business knowledge accumulation and business assessment of the IT department.
The Boosting method comprises the following steps: the method is used for improving the accuracy of the weak classification algorithm, and can integrate a plurality of classifiers into one classifier.
(II) concrete implementation process of knowledge recommendation method
As shown in fig. 2, taking the application of the knowledge base of the operation and maintenance information system as an example, the knowledge recommendation method of the present invention specifically includes the following steps:
the method comprises the following steps: and carrying out scene analysis to generate a scene mode.
In order to acquire scene information operated by operation and maintenance personnel in real time and carry out scene-knowledge correlation analysis to improve the accuracy of knowledge recommendation, the invention provides a new scene analysis algorithm. As shown in fig. 1, the detailed steps of the new scene analysis algorithm are as follows:
step 1: acquiring scene data information: and acquiring scene data P operated by current operation and maintenance personnel.
Step 2: data cleaning: and (4) carrying out data cleaning on the scene data P, wherein the aim is to carry out examination and verification on the scene data P, delete repeated information, correct existing errors and ensure data consistency.
Step 3: scene mode generation: the process is simply a process of analyzing and labeling the collected and cleaned scene data, and finally the scene mark O can be obtainedC
Step two: and generating an operation and maintenance personnel portrait.
The invention provides a new operation and maintenance personnel analysis algorithm, the operation and maintenance personnel portrait is constructed by acquiring static data information and dynamic data information of the operation and maintenance personnel and applying machine learning algorithms such as reinforcement learning, the operation and maintenance personnel portrait model can be corrected and adjusted by using the new operation and maintenance personnel data acquired during operation, and the accuracy of the operation and maintenance personnel portrait is improved.
As shown in fig. 2 and fig. 3, the detailed steps of the operation and maintenance personnel analysis algorithm are as follows:
step 1: acquiring operation and maintenance personnel data information: on one hand, data are obtained from the account registration information of the operation and maintenance personnel, and on the other hand, the operation behavior data of the operation and maintenance personnel are collected in real time.
Step 2: data cleaning: and the collected operation and maintenance personnel data information is subjected to data cleaning, so that the operation and maintenance personnel data information is examined and checked, repeated information is deleted, existing errors are corrected, and the data consistency is ensured.
Step 3: text modeling: the data after data cleaning is mainly divided into static information data and dynamic information data. The static information data refers to the relatively stable information of the operation and maintenance personnel, such as name, birth date, gender and the like. The dynamic information data refers to the behavior information of the operation and maintenance personnel, which changes constantly, and comprises daily operation behavior, browsing behavior and the like.
Step 4: training a plurality of weak models: and training a plurality of weak models by using machine learning algorithms such as SVM and the like. When performing text classification, the computer may be made to look at the training samples provided to it (i.e., given training samples): each training sample is composed of a vector (i.e., a vector composed of text features) and a class label (for indicating which class this training sample belongs to), such as Di=(xi,yi) In xiThat is, a text vector (which is higher in dimensionality), yiAre the class labels. In a binary linear classification, this class label has only two values, 1 and-1 (used to indicate whether it belongs to or does not belong to this class, respectively). With this representation, the spacing of a sample point from a hyperplane can be defined as: deltai=yi(wxi+b)。
Step 5: boosting accuracy of a plurality of weak models: obtaining a training sample subset through the operation of a training sample set, then training the training sample subset by using a weak classification algorithm to generate a series of base classifiers, and finally obtaining a result classifier of the operation and maintenance personnel model through a Boosting method.
The method comprises the steps of putting other weak classification algorithms except Step4 into a Boosting framework as a basis classification algorithm, obtaining different training sample subsets through the operation of the Boosting framework on a training sample set, and then using the training sample subsets to train to generate a basis classifier (each time a training sample subset is obtained, one basis classifier is generated on the training sample subset by using one basis classification algorithm, so that the number n of training rounds is given0Then n can be generated0Individual basis classifier), and then the n is processed by Boosting framework algorithm0Weighted fusion by individual basis classifierAnd a final result classifier is generated.
Step 6: and (3) model verification: model verification is carried out on the trained portrait model of the operation and maintenance personnel (the model verification can be completed by sampling or given test samples) so as to ensure the accuracy of the model.
Step 7: and (4) saving the model: and storing the preliminary operation and maintenance personnel image model after model verification, and then acquiring new operation and maintenance personnel data information in real time in the operation process to continuously correct the operation and maintenance personnel image model and the operation and maintenance personnel image result.
Step 8: and (3) calculating the similarity of the operation and maintenance personnel and the knowledge: and calculating the similarity Sim (U, C) of the operation and maintenance personnel-knowledge according to the concept name, the attribute, the example and the relationship in the operation and maintenance personnel portrait.
The process of calculating the operation and maintenance personnel-knowledge similarity can be further subdivided into:
1) and (3) performing name similarity calculation:
Figure BDA0001314086590000151
wherein S isname(U, C) is the name similarity between the concept U in the operation and maintenance personnel figure and the concept C in the knowledge label, Ui(i is more than or equal to 1 and less than or equal to n) is a character string semantic word segmentation result of the name U in the concept U, cj(j is more than or equal to 1 and less than or equal to m) is the result of semantic segmentation of the character string of the name C in the concept C, n is the total number of the character strings of the name U in the concept U, m is the total number of the character strings of the name C in the concept C, and Sim (Ui,cj) Is uiAnd cjSimilarity between them;
2) and performing attribute similarity calculation, wherein the attribute similarity calculation formula is as follows:
Figure BDA0001314086590000152
wherein S isattri(U, C) is the attribute similarity between the concept U in the operation and maintenance personnel portrait and the concept C in the knowledge tag, UaAnd CaRepresenting the set of attributes of U and C, respectively, f being a given non-negative metric function, Ua∩CaRepresenting a set of two concepts, U and C, having the same attribute, Ua-CaIs shown onlyIs an attribute set that is present in the operation and maintenance person representation but not in the knowledge tag, Ca-UaRepresenting the attribute set which is only contained in the knowledge tag but not contained in the operation and maintenance personnel image, wherein lambda and mu are given weight coefficients;
3) example similarity calculations were performed:
Figure BDA0001314086590000153
wherein S isinst(U, C) is the example similarity between the concept U in the operation and maintenance personnel image and the concept C in the knowledge label, P (U, C) represents the probability that an example randomly extracted from the example space belongs to the concepts U and C at the same time,
Figure BDA0001314086590000154
representing the probability that an instance randomly drawn from the instance space belongs only to concept U and not to concept C,
Figure BDA0001314086590000155
represents the probability that an instance randomly drawn from the instance space belongs only to concept C and not to concept U;
4) calculating the relation similarity according to the operation and maintenance personnel portrait to obtain the relation similarity S between the concept U in the operation and maintenance personnel portrait and the concept C in the knowledge labelrelat(U, C). The relationships comprise synonymy relationships, inheritance relationships and containment relationships. When the relationship similarity is calculated, the weight of the synonymy relationship is greater than that of the inheritance relationship, and the weight of the synonymy relationship is greater than that of the inclusion relationship;
5) according to the name similarity Sname(U, C), Attribute similarity Sattri(U, C), example similarity Sinst(U, C) and relationship similarity Srelat(U, C) calculating the similarity of the operation and maintenance personnel and the knowledge, wherein the calculation formula of the similarity of the operation and maintenance personnel and the knowledge Sim (U, C) is that Sim (U, C) is α Sname(U,C)+βSattri(U,C)+εSinst(U,C)+δSrelat(U, C), wherein α, β, epsilon, and delta are given name similarity coefficient, attribute similarity coefficient, instance similarity coefficient, and relationship similarity coefficient, respectively.
Step three: knowledge selection: and selecting knowledge according to the result of the scene analysis and the portrait of the operation and maintenance personnel to obtain a knowledge recommendation set.
As shown in fig. 4, knowledge selection can be further refined into the following process:
1) searching the characteristics of the operation and maintenance personnel according to the operation and maintenance personnel images, and calculating an operation and maintenance similar knowledge set by combining the operation and maintenance personnel images;
2) acquiring scene characteristics in real time according to the scene analysis result, and calculating a scene similarity knowledge set according to the scene characteristics;
3) and judging whether the scene similar knowledge set and the operation and maintenance similar knowledge set have intersection, if so, forming a recommendation knowledge set according to the intersection, otherwise, re-acquiring scene characteristics, and re-training the operation and maintenance person portrait model to obtain a new operation and maintenance person portrait.
If the knowledge label is UkThen the set of recommended knowledge services can be represented as KS. At this time, the detailed refining steps of knowledge selection are as follows:
step 1: carrying out user scene retrieval on operation and maintenance personnel, wherein the user scene U ismContains a scene u, and
Figure BDA0001314086590000161
wherein, UTIs a full user scenario;
step 2: calculating a user scene U by adopting a set similarity calculation methodmSimilar knowledge scenes are sequenced to obtain a similar scene set Cn
Figure BDA0001314086590000162
Wherein, Um≈CuUser scene U representing operation and maintenance personnelmAnd knowledge scene CuA is an intersection symbol, CTIs a full knowledge scene set;
step 3: retrieving a given scene knowledge relationship model ORTo obtain a scene CnKnowledge-scene relationship pair P ofi,j
Figure BDA0001314086590000163
Wherein, PTAs a full-knowledge-scene-relationship pair, CiFor knowledge scenarios, R1As scene CiThe properties of (a) to (b) are,
Figure BDA0001314086590000164
represents Pi,jBy R1And CiConnecting;
step 4: retrieving a given set of knowledge domains ODTo obtain a compound containingi,jAssociated domain knowledge set Dk
Figure BDA0001314086590000165
Wherein D isTIn the field of full-scale knowledge, R2For domain knowledge DkThe properties of (a) to (b) are,
Figure BDA0001314086590000166
represents Pi,jBy R2And DkConnecting;
step 5: from a full-scale domain knowledge recommendation set
Figure BDA0001314086590000171
To obtain knowledge recommendation set KS
Figure BDA0001314086590000172
Compared with the prior art, the invention has the following advantages:
(1) high accuracy and high efficiency: the knowledge recommendation technology is adopted, so that the problem that the hit rate of knowledge retrieval is greatly different due to different understanding expressions of individuals on knowledge is avoided, the time consumption of manual knowledge retrieval is reduced, the knowledge application can be popularized in real time, and the processing efficiency of users is improved.
(2) Providing an operation and maintenance personnel portrait: the operation behavior of the user is introduced as input, the user model of the operation and maintenance personnel portrait model is created, knowledge pushing is carried out according to the user model, and the accuracy of knowledge recommendation is improved.
(3) The self-learning capability is provided: through a deep learning algorithm based on machine learning, the operation and maintenance personnel portrait model continuously acquires the operation behaviors of the operation and maintenance personnel to correct the model, so that the operation and maintenance personnel portrait model is advanced along with time, the operation times of the operation and maintenance personnel are increased, and the accuracy is higher and higher.
While the preferred embodiments of the present invention have been illustrated and described, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims (6)

1. A knowledge recommendation method based on big data and deep learning is characterized in that: the method comprises the following steps:
performing scene analysis according to the scene data information;
processing the data information of the operation and maintenance personnel by adopting a deep learning method to generate an operation and maintenance personnel portrait;
selecting knowledge according to a scene analysis result and the operation and maintenance personnel portrait to obtain a knowledge recommendation set;
performing knowledge recommendation according to the knowledge recommendation set;
the operation and maintenance personnel data information is processed by adopting a deep learning method to generate an operation and maintenance personnel portrait, and the operation and maintenance personnel portrait generation method comprises the following steps:
acquiring operation and maintenance personnel data information, wherein the operation and maintenance personnel data information comprises data acquired from information registered by an operation and maintenance personnel account and operation behavior data of the operation and maintenance personnel acquired in real time;
carrying out data cleaning on the collected operation and maintenance personnel data information;
carrying out model training on the operation and maintenance personnel by adopting a method of fusing weak model training and Boosting on the data information of the operation and maintenance personnel after data cleaning to obtain an operation and maintenance personnel portrait;
calculating the similarity between the operation and maintenance personnel and the knowledge label according to the operation and maintenance personnel figure to obtain the operation and maintenance personnel-knowledge similarity;
the step of calculating the similarity between the operation and maintenance personnel and the knowledge label according to the operation and maintenance personnel portrait to obtain the operation and maintenance personnel-knowledge similarity comprises the following steps:
and calculating the name similarity according to the operation and maintenance personnel image, wherein the calculation formula of the name similarity is as follows:
Figure FDA0002220381880000011
wherein S isname(U, C) is the name similarity between the concept U in the operation and maintenance personnel figure and the concept C in the knowledge label, Ui(i is more than or equal to 1 and less than or equal to n) is a character string semantic word segmentation result of the name U in the concept U, cj(j is more than or equal to 1 and less than or equal to m) is the result of semantic segmentation of the character string of the name C in the concept C, n is the total number of the character strings of the name U in the concept U, m is the total number of the character strings of the name C in the concept C, and Sim (Ui,cj) Is uiAnd cjSimilarity between them;
and performing attribute similarity calculation according to the operation and maintenance personnel image, wherein the attribute similarity calculation formula is as follows:
Figure FDA0002220381880000012
wherein S isattri(U, C) is the attribute similarity between the concept U in the operation and maintenance personnel portrait and the concept C in the knowledge tag, UaAnd CaRepresenting the set of attributes of U and C, respectively, f being a given non-negative metric function, Ua∩CaRepresenting a set of two concepts, U and C, having the same attribute, Ua-CaRepresenting an attribute set, C, that is only present in the representation of the operation and maintenance person but not in the knowledge taga-UaRepresenting the attribute set which is only contained in the knowledge tag but not contained in the operation and maintenance personnel image, wherein lambda and mu are given weight coefficients;
and calculating example similarity according to the operation and maintenance personnel image, wherein the example similarity calculation formula is as follows:
Figure FDA0002220381880000021
wherein S isinst(U, C) is the example similarity between the concept U in the operation and maintenance personnel image and the concept C in the knowledge label, and P (U, C) represents the random extraction from the example spaceThe probability that an instance is subordinate to both concepts U and C,
Figure FDA0002220381880000022
representing the probability that an instance randomly drawn from the instance space belongs only to concept U and not to concept C,
Figure FDA0002220381880000023
represents the probability that an instance randomly drawn from the instance space belongs only to concept C and not to concept U;
calculating the relation similarity according to the operation and maintenance personnel portrait to obtain the relation similarity S between the concept U in the operation and maintenance personnel portrait and the concept C in the knowledge labelrelat(U, C), wherein the relationship comprises a synonymy relationship, an inheritance relationship, and an inclusion relationship, the weight of the synonymy relationship is greater than the weight of the inheritance relationship, and the weight of the synonymy relationship is greater than the weight of the inclusion relationship;
according to the name similarity Sname(U, C), Attribute similarity Sattri(U, C), example similarity Sinst(U, C) and relationship similarity Srelat(U, C) calculating the similarity of the operation and maintenance personnel and the knowledge, wherein the calculation formula of the similarity of the operation and maintenance personnel and the knowledge Sim (U, C) is that Sim (U, C) is α Sname(U,C)+βSattri(U,C)+εSinst(U,C)+δSrelat(U, C), wherein α, β, epsilon, and delta are given name similarity coefficient, attribute similarity coefficient, instance similarity coefficient, and relationship similarity coefficient, respectively.
2. The knowledge recommendation method based on big data and deep learning of claim 1, wherein: the step of performing scene analysis according to the scene data information includes:
acquiring scene data information in real time, and acquiring scene data operated by current operation and maintenance personnel;
carrying out data cleaning on the acquired scene data;
and carrying out real-time scene analysis and labeling operation on the scene data after data cleaning to obtain a scene mark.
3. The knowledge recommendation method based on big data and deep learning of claim 1, wherein: the operation and maintenance personnel model training is carried out on the operation and maintenance personnel data information after data cleaning by adopting a method of fusing weak model training and Boosting to obtain the operation and maintenance personnel portrait, and the method comprises the following steps:
performing text modeling, so as to divide the operation and maintenance personnel data information after data cleaning into static information data and dynamic information data;
performing weak model training on a given training sample according to the requirement of text modeling to obtain a plurality of weak models;
the accuracy of the weak models is improved by adopting a Boosting method, and a result classifier of the operation and maintenance personnel model is obtained;
performing model verification on a result classifier of the operation and maintenance personnel model by adopting a given test sample;
and storing the operation and maintenance personnel model after the model verification is passed, and acquiring new operation and maintenance personnel data information after data cleaning in real time to continuously correct the operation and maintenance personnel model and the corresponding operation and maintenance personnel portrait.
4. The knowledge recommendation method based on big data and deep learning of claim 1, wherein: the step of selecting knowledge according to the result of the scene analysis and the portrait of the operation and maintenance personnel to obtain a knowledge recommendation set comprises the following steps:
searching the characteristics of the operation and maintenance personnel according to the operation and maintenance personnel images, and calculating an operation and maintenance similar knowledge set by combining the operation and maintenance personnel images;
acquiring scene characteristics in real time according to the scene analysis result, and calculating a scene similarity knowledge set according to the scene characteristics;
and judging whether the scene similar knowledge set and the operation and maintenance similar knowledge set have intersection, if so, forming a recommendation knowledge set according to the intersection, otherwise, re-acquiring scene characteristics, and re-training the operation and maintenance personnel model to obtain a new operation and maintenance personnel portrait.
5. The knowledge recommendation method based on big data and deep learning of claim 1, wherein: the step of selecting knowledge according to the result of the scene analysis and the portrait of the operation and maintenance personnel to obtain a knowledge recommendation set comprises the following steps:
carrying out user scene retrieval on operation and maintenance personnel, wherein the user scene U ismContains a scene u, and
Figure FDA0002220381880000031
wherein, UTIs a full user scenario;
calculating a user scene U by adopting a set similarity calculation methodmSimilar knowledge scenes are sequenced to obtain a similar scene set Cn
Figure FDA0002220381880000032
Wherein, Um≈CuUser scene U representing operation and maintenance personnelmAnd knowledge scene CuA is an intersection symbol, CTIs a full knowledge scene set;
retrieving a given scene knowledge relationship model ORTo obtain a scene CnKnowledge-scene relationship pair P ofi,j
Figure FDA0002220381880000033
Wherein, PTAs a full-knowledge-scene-relationship pair, CiFor knowledge scenarios, R1As scene CiThe properties of (a) to (b) are,
Figure FDA0002220381880000034
represents Pi,jBy R1And CiConnecting;
retrieving a given set of knowledge domains ODTo obtain a compound containingi,jAssociated domain knowledge set Dk
Figure FDA0002220381880000035
Wherein D isTIn the field of full-scale knowledge, R2For domain knowledge DkThe properties of (a) to (b) are,
Figure FDA0002220381880000036
represents Pi,jBy R2And DkConnecting;
from a full-scale domain knowledge recommendation set
Figure FDA0002220381880000038
To obtain knowledge recommendation set KS
Figure FDA0002220381880000037
6. A knowledge recommendation system based on big data and deep learning, characterized by: the method comprises the following steps:
the scene analysis module is used for carrying out scene analysis according to the scene data information;
the operation and maintenance personnel portrait generation module is used for processing the data information of the operation and maintenance personnel by adopting a deep learning method to generate an operation and maintenance personnel portrait;
the knowledge selection module is used for selecting knowledge according to the result of the scene analysis and the operation and maintenance personnel portrait to obtain a knowledge recommendation set;
the knowledge recommendation module is used for recommending knowledge according to the knowledge recommendation set;
the operation and maintenance personnel portrait generation module comprises:
the information acquisition unit is used for acquiring operation and maintenance personnel data information, and the operation and maintenance personnel data information comprises data acquired from information registered by an operation and maintenance personnel account and operation behavior data of the operation and maintenance personnel acquired in real time;
the data cleaning unit is used for cleaning the data of the collected operation and maintenance personnel data information;
the model training unit is used for carrying out model training on the operation and maintenance personnel by adopting a method of fusing weak model training and Boosting on the operation and maintenance personnel data information after data cleaning to obtain an operation and maintenance personnel portrait;
the similarity calculation unit is used for calculating the similarity between the operation and maintenance personnel and the knowledge label according to the operation and maintenance personnel portrait to obtain the operation and maintenance personnel-knowledge similarity;
the similarity calculation unit includes:
the name similarity calculation operator unit is used for calculating name similarity according to the operation and maintenance personnel image, and the calculation formula of the name similarity is as follows:
Figure FDA0002220381880000041
wherein S isname(U, C) is the name similarity between the concept U in the operation and maintenance personnel figure and the concept C in the knowledge label, Ui(i is more than or equal to 1 and less than or equal to n) is a character string semantic word segmentation result of the name U in the concept U, cj(j is more than or equal to 1 and less than or equal to m) is the result of semantic segmentation of the character string of the name C in the concept C, n is the total number of the character strings of the name U in the concept U, m is the total number of the character strings of the name C in the concept C, and Sim (Ui,cj) Is uiAnd cjSimilarity between them;
the attribute similarity calculation operator unit is used for calculating the attribute similarity according to the operation and maintenance personnel image, and the attribute similarity calculation formula is as follows:
Figure FDA0002220381880000042
wherein S isattri(U, C) is the attribute similarity between the concept U in the operation and maintenance personnel portrait and the concept C in the knowledge tag, UaAnd CaRepresenting the set of attributes of U and C, respectively, f being a given non-negative metric function, Ua∩CaRepresenting a set of two concepts, U and C, having the same attribute, Ua-CaRepresenting an attribute set, C, that is only present in the representation of the operation and maintenance person but not in the knowledge taga-UaRepresenting the attribute set which is only contained in the knowledge tag but not contained in the operation and maintenance personnel image, wherein lambda and mu are given weight coefficients;
the example similarity operator unit is used for calculating the example similarity according to the operation and maintenance personnel image, and the example similarity calculation formula is as follows:
Figure FDA0002220381880000051
wherein S isinst(U, C) is the example similarity between the concept U in the operation and maintenance personnel image and the concept C in the knowledge label, P (U, C) represents the probability that an example randomly extracted from the example space belongs to the concepts U and C at the same time,
Figure FDA0002220381880000052
representing the probability that an instance randomly drawn from the instance space belongs only to concept U and not to concept C,
Figure FDA0002220381880000053
represents the probability that an instance randomly drawn from the instance space belongs only to concept C and not to concept U;
the relational similarity calculation operator unit is used for calculating the relational similarity according to the operation and maintenance personnel image to obtain the relational similarity S between the concept U in the operation and maintenance personnel image and the concept C in the knowledge labelrelat(U, C), wherein the relationship comprises a synonymy relationship, an inheritance relationship, and an inclusion relationship, the weight of the synonymy relationship is greater than the weight of the inheritance relationship, and the weight of the synonymy relationship is greater than the weight of the inclusion relationship;
an operation and maintenance personnel-knowledge similarity calculation operator unit for calculating the similarity S according to the namename(U, C), Attribute similarity Sattri(U, C), example similarity Sinst(U, C) and relationship similarity Srelat(U, C) calculating the similarity of the operation and maintenance personnel and the knowledge, wherein the calculation formula of the similarity of the operation and maintenance personnel and the knowledge Sim (U, C) is that Sim (U, C) is α Sname(U,C)+βSattri(U,C)+εSinst(U,C)+δSrelat(U, C), wherein α, β, epsilon, and delta are given name similarity coefficient, attribute similarity coefficient, instance similarity coefficient, and relationship similarity coefficient, respectively.
CN201710417583.1A 2017-06-06 2017-06-06 Knowledge recommendation method and system based on big data and deep learning Active CN107368521B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710417583.1A CN107368521B (en) 2017-06-06 2017-06-06 Knowledge recommendation method and system based on big data and deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710417583.1A CN107368521B (en) 2017-06-06 2017-06-06 Knowledge recommendation method and system based on big data and deep learning

Publications (2)

Publication Number Publication Date
CN107368521A CN107368521A (en) 2017-11-21
CN107368521B true CN107368521B (en) 2020-04-14

Family

ID=60304824

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710417583.1A Active CN107368521B (en) 2017-06-06 2017-06-06 Knowledge recommendation method and system based on big data and deep learning

Country Status (1)

Country Link
CN (1) CN107368521B (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109871856B (en) * 2017-12-04 2022-03-04 北京京东尚科信息技术有限公司 Method and device for optimizing training sample
CN111291019B (en) * 2018-12-07 2023-09-29 中国移动通信集团陕西有限公司 Similarity discrimination method and device for data model
CN110991865A (en) * 2019-11-29 2020-04-10 杭州安恒信息技术股份有限公司 Intelligent threat analysis method for operation and maintenance auditing system
CN112035287B (en) * 2020-09-01 2023-07-14 中国银行股份有限公司 Method, device, storage medium and equipment for testing data cleaning result
CN114153716B (en) * 2022-02-08 2022-05-06 中国电子科技集团公司第五十四研究所 Real-time portrait generation method for people and nobody objects under semantic information exchange network
CN116010383B (en) * 2023-03-30 2023-06-23 中国船舶科学研究中心 Deep sea submersible result information collection and management method

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106294489A (en) * 2015-06-08 2017-01-04 北京三星通信技术研究有限公司 Content recommendation method, Apparatus and system
CN106570033A (en) * 2015-10-13 2017-04-19 赤子城网络技术(北京)有限公司 Individual recommendation method and device based on scene data
CN106682035A (en) * 2015-11-11 2017-05-17 中国移动通信集团公司 Individualized learning recommendation method and device

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160283548A1 (en) * 2015-03-27 2016-09-29 International Business Machines Corporation Recommending connections in a social network system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106294489A (en) * 2015-06-08 2017-01-04 北京三星通信技术研究有限公司 Content recommendation method, Apparatus and system
CN106570033A (en) * 2015-10-13 2017-04-19 赤子城网络技术(北京)有限公司 Individual recommendation method and device based on scene data
CN106682035A (en) * 2015-11-11 2017-05-17 中国移动通信集团公司 Individualized learning recommendation method and device

Also Published As

Publication number Publication date
CN107368521A (en) 2017-11-21

Similar Documents

Publication Publication Date Title
CN107368521B (en) Knowledge recommendation method and system based on big data and deep learning
Bharadiya A comparative study of business intelligence and artificial intelligence with big data analytics
CN112632385B (en) Course recommendation method, course recommendation device, computer equipment and medium
CN107368468B (en) Operation and maintenance knowledge map generation method and system
CN111444236B (en) Mobile terminal user portrait construction method and system based on big data
CN110633373B (en) Automobile public opinion analysis method based on knowledge graph and deep learning
CN111859160B (en) Session sequence recommendation method and system based on graph neural network
CN115002200B (en) Message pushing method, device, equipment and storage medium based on user portrait
CN111506723B (en) Question-answer response method, device, equipment and storage medium
CN108363748B (en) Topic portrait system and topic portrait method based on knowledge
CN114238573B (en) Text countercheck sample-based information pushing method and device
CN112836509A (en) Expert system knowledge base construction method and system
CN116049379A (en) Knowledge recommendation method, knowledge recommendation device, electronic equipment and storage medium
CN117271767A (en) Operation and maintenance knowledge base establishing method based on multiple intelligent agents
Dinh et al. A proposal of deep learning model for classifying user interests on social networks
CN112288337B (en) Behavior recommendation method, behavior recommendation device, behavior recommendation equipment and behavior recommendation medium
CN113935880A (en) Policy recommendation method, device, equipment and storage medium
CN113627797A (en) Image generation method and device for employee enrollment, computer equipment and storage medium
CN108363759A (en) Subject tree generation method and system based on structural data and Intelligent dialogue method
Yan et al. User recommendation with tensor factorization in social networks
CN116340516A (en) Entity relation cluster extraction method, device, equipment and storage medium
CN115718807A (en) Personnel relationship analysis method, device, equipment and storage medium
CN112328812B (en) Domain knowledge extraction method and system based on self-adjusting parameters and electronic equipment
CN115309865A (en) Interactive retrieval method, device, equipment and storage medium based on double-tower model
CN115099680A (en) Risk management method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant