CN117591668A - Science popularization theme and core knowledge extraction method based on concept lattice - Google Patents
Science popularization theme and core knowledge extraction method based on concept lattice Download PDFInfo
- Publication number
- CN117591668A CN117591668A CN202311603695.8A CN202311603695A CN117591668A CN 117591668 A CN117591668 A CN 117591668A CN 202311603695 A CN202311603695 A CN 202311603695A CN 117591668 A CN117591668 A CN 117591668A
- Authority
- CN
- China
- Prior art keywords
- concept
- decision
- lattice
- science popularization
- concept lattice
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000605 extraction Methods 0.000 title claims abstract description 17
- 238000000034 method Methods 0.000 claims abstract description 21
- 238000010276 construction Methods 0.000 claims description 2
- 238000003066 decision tree Methods 0.000 description 12
- 238000004458 analytical method Methods 0.000 description 10
- 238000013528 artificial neural network Methods 0.000 description 10
- 238000013467 fragmentation Methods 0.000 description 4
- 238000006062 fragmentation reaction Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 101150096839 Fcmr gene Proteins 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000001149 cognitive effect Effects 0.000 description 1
- 230000008094 contradictory effect Effects 0.000 description 1
- 230000000593 degrading effect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000012549 training Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/34—Browsing; Visualisation therefor
- G06F16/345—Summarisation for human users
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/35—Clustering; Classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/36—Creation of semantic tools, e.g. ontology or thesauri
- G06F16/367—Ontology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/042—Knowledge-based neural networks; Logical representations of neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/047—Probabilistic or stochastic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/01—Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computing arrangements using knowledge-based models
- G06N5/02—Knowledge representation; Symbolic representation
- G06N5/022—Knowledge engineering; Knowledge acquisition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computing arrangements based on specific mathematical models
- G06N7/01—Probabilistic graphical models, e.g. probabilistic networks
Abstract
A science popularization theme and core knowledge extraction method based on concept lattice comprises the following steps: s1, acquiring a science popularization theme as an object set O, acquiring core knowledge corresponding to each science popularization theme as an attribute set A, and simultaneously acquiring audience groups of each science popularization theme content as a label set D; s2, constructing a decision form background F, S3, selecting a part of subsets from the decision form background as decision form sub-backgrounds; s4, constructing a concept lattice L of the decision form sub-background according to the definition of the concept lattice on the decision form sub-background; s5, respectively constructing form background concept lattices by using decision form sub-background concept lattices through an incremental method; s6, acquiring non-trivial decision rules, and putting all non-trivial decision rules meeting the conditions into a rule set R.
Description
Technical Field
The invention relates to the technical field of formal concept analysis in grain calculation, in particular to a science popularization theme and core knowledge extraction method based on concept grids.
Background
With the development of mobile internet technology, the expression forms of knowledge are increasingly diverse, such as videos, texts, pictures, songs and the like, and the learning habit of 'fragmentation' of readers is objectively developed. The fragmented learning has the characteristics of the digital age, and firstly has high flexibility and is not limited by time and places. Second, the fragmentation learning is highly targeted, and the learner can focus on selecting the part of the content that the learner wants to learn. Thirdly, the learning efficiency of fragmentation learning is high, and the learning interest of readers is guaranteed due to the short learning time of single fragment content. In the face of the phenomenon of fragmentation of the current knowledge, how to systematically and individually provide popular science for people becomes a current urgent problem to be solved.
A concept lattice is a mathematical tool and method of concept analysis that can be used to process and analyze data, information, and knowledge. The core idea is to represent data or knowledge as a conceptual network structure, including objects and attributes. The network is presented in the form of a table or matrix, where rows represent objects and columns represent attributes. The elements in each lattice represent whether the object has a particular attribute. By analyzing the network structure, different concepts, namely, association relations between objects and attributes, can be identified. In fact, formal concept analysis may be used for knowledge extraction. Formal backgrounds can provide a solid basis for knowledge extraction and represent science popularization topics and core knowledge in a structured manner. On the other hand, formal concept analysis is helpful to systematically extract and analyze key knowledge, and makes knowledge extraction more accurate and reliable, thereby providing powerful support for science popularization and education.
Decision making research is a comprehensive discussion activity conducted by a multidisciplinary joint arm, and has a place in information science, management science, system science, behavior science and even psychology. Generally, data information with decisions is more attractive than data without decisions. On the one hand, the data with decision can comprehensively reflect the relation among various factors, and the inherent law of waiting for our discovery is hidden. On the other hand, one of the main purposes of the human cognitive world is to make predictions and decisions, and only on the basis of the original accumulated knowledge, the human can make a larger progress by obtaining general rules or guidance capable of guiding subsequent life, scientific research and creation. It is therefore desirable to obtain more valuable information, in particular information related to decision rules, from the data with decisions.
However, current recommendation methods generally do not have the systematicness and layering of formal concept analysis, and lack of deep mining and comprehensive understanding of knowledge by decision-making studies, resulting in unilateral and limited recommendation results. Second, these recommendation methods often rely too much on labels, i.e., the recommendation performance is highly dependent on accurate and comprehensive label information, so that when the labels are inaccurate or inadequate, the robustness of the recommendation system may be drastically reduced, thereby possibly degrading the user experience and recommendation quality.
Disclosure of Invention
In view of this, the present invention provides a method for extracting a subject of science popularization and core knowledge based on concept lattices, which is used for at least solving the problem that the performance of a recommendation model in the prior art excessively depends on a correct label.
In order to achieve the above purpose, the present invention adopts the following technical scheme:
a science popularization theme and core knowledge extraction method based on concept lattice comprises the following steps:
s1, acquiring a science popularization subject as an object set O= { O 1 ,o 2 ,...,o n And acquiring core knowledge corresponding to each science popularization topic as an attribute set A= { a 1 ,a 2 ,...,a m Simultaneously acquiring audience groups of all science popularization subject contents as tag sets D= { D } 1 ,d 2 ,...,d r -a }; wherein n, m and r are all greater than 0;
s2, constructing decision form background F= (O, A, I) OA ,D,I OD ) Wherein I OA For the binary relation between the object set O and the attribute set A, if I OA (o i ,a j ) =1, representing object o i Possession of attribute a j The method comprises the steps of carrying out a first treatment on the surface of the Conversely if I OA (o i ,a j ) =0, then object o is represented i Not possessing attribute a j ;I OD Representing the binary relationship between the object set O and the attribute set D, if I OD (o i ,d j ) =1, representing object o i The audience group of (2) is d j The method comprises the steps of carrying out a first treatment on the surface of the Conversely if I OD (o i ,d j ) =0, then object o is represented i The audience group of (2) is not d j ;
S3, selecting a part of subsets from the decision form background to be used as decision form sub-background;
s4, in decision form sub-contexts (O', A, I) O′A ) And decision form sub-contexts (O', D, I) O′D ) Respectively constructing concept lattices L (O', A, I) according to the definition of the concept lattices O′A ) And L (O', D, I) O′D );
S5, utilizing concept lattice L (O', A, I) O′A ) And L (O', D, I) O′D ) Construction of formal backgrounds by incremental methods (O, A, I) OA ) And L (O, D, I) OD ) Is a concept lattice of (2);
s6, acquiring non-trivial decision rules according to the decision rule definition, and putting all non-trivial decision rules meeting the conditions into a rule set R to be regarded as finishing rule and knowledge extraction.
Preferably, the concept lattice L (O, A, I) is obtained in S5 OA ) Specific content package of (a)The method comprises the following steps:
if for concept lattice L (O', A, I) O′A ) Newly add an object o i Wherein the object isAnd->Then for concept lattice L (O', A, I) O′A ) The original concept (X, B) of (C) is changed to satisfy the following conditions:
(1) If it isWherein f (o) i ) Representing object o i If the owned attribute is the concept (X, B) is not updated, except that B is an empty set;
(2) If B.u.f (o) i ) The concept (X, B) will be updated to (X ∈ { o) i },B);
(3) At the position ofAnd B.andf (o) i ) In case of not equal to B, if o i Outside has no object o j Satisfy the following requirementsThen a new concept (xU { o) is generated by means of the concept (X, B) i },B∩f(o i ));
(4) If any j < i is not satisfiedObject o i Generates a new concept by itself ({ o) i },f(o i ));
By continuously adding objects, a formal background (O, A, I) OA ) Concept lattice L (O, A, I) OA )。
Preferably, the concept lattice L (O, D, I) is obtained in S5 OD ) The specific contents of (3) include:
if the concept lattice L (O', D,I OD ) Newly add an object o i Wherein the object isAnd->Then for concept lattice L (O', D, I) OD ) The original concept (Y, C) of (C) is changed to satisfy the following conditions:
(1) If it isWherein f (o) i ) Representing object o i If the owned attribute is the concept (Y, C) is not updated, except that C is an empty set;
(2) If C.u.f (o) i ) The concept (Y, C) will be updated to (Y ≡o- i },C);
(3) At the position ofAnd C.andf (o) i ) In case of not equal to C, if o i Outside has no object o j Satisfy the following requirementsThen a new concept (Y.u.o) is generated by means of the concept (Y, C) i },C∩f(o i ));
(4) If any j < i is not satisfiedObject o i Generates a new concept by itself ({ o) i },f(o i ));
By continuously adding objects, a formal background (O, D, I) is finally obtained OD ) Concept lattice L (O, D, I) OD )。
Preferably, the defining specific content for obtaining the non-trivial decision rule according to the decision rule in S6 includes: for (X, B) ∈L (O, A, I) OA ) And (Y, C) εL (O, D, I) OD ) If (3)Then B.fwdarw.C is called a decision rule, and if X, Y, B, C are all non-null, B.fwdarw.C is called a non-trivial decision rule.
Compared with the prior art, the invention discloses a science popularization theme and core knowledge extraction method based on concept lattices, and the technology has the following beneficial effects:
formal concept analysis helps identify concepts of diversity and accuracy to better meet the needs of users, which can provide more diversity by representing different concepts in a concept lattice. The decision rule based on formal concept analysis is constructed based on concept lattices, has a clear hierarchical structure, is easier to analyze and interpret, and is helpful for avoiding contradictory decisions; in addition, because of the multi-level relationship between concepts, the model can be better adapted to changeable and complex situations, especially when the data volume is small, a considerable rule number can still be provided, so that the model can be widely applied to data with relatively small data volume. In addition, the decision rule based on the decision form background is more universal and adaptive, and can be simply applied to the fields such as a recommendation model by constructing indexes such as similarity. Most importantly, the decision rules generated based on the decision form background are more and are not easily affected by inaccuracy or incompleteness of the labels, so that compared with the prior art, the method has higher robustness and higher performance in the fields such as recommendation models.
Drawings
In order to more clearly illustrate the embodiments of the invention or the technical solutions in the prior art, the drawings that are required in the embodiments or the description of the prior art will be briefly described, it being obvious that the drawings in the following description are only some embodiments of the invention, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
Fig. 1 is a schematic flow chart of a science popularization topic and core knowledge extraction method based on concept lattices.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
The invention provides a science popularization theme and core knowledge extraction method based on concept lattices, which is shown in fig. 1 and comprises the following steps:
s1, acquiring a science popularization theme (such as various algorithm type science popularization themes) as an object set O= { O 1 ,o 2 ,...,o n And obtaining the corresponding core knowledge (such as decision tree algorithm, clustering algorithm, neural network algorithm, etc.) of each science popularization topic as an attribute set A= { a 1 ,a 2 ,...,a m Simultaneously acquiring audience groups (such as primary school, junior middle school, high school, university and above) of various science popularization subjects as a tag set D= { D 1 ,d 2 ,...,d r -a }; wherein n, m and r are all greater than 0;
s2, constructing decision form background F= (O, A, I) OA ,D,I OD ) Wherein I OA For the binary relation between the object set O and the attribute set A, if I OA (o i ,a j ) =1, representing object o i Possession of attribute a j The method comprises the steps of carrying out a first treatment on the surface of the Conversely if I OA (o i ,a j ) =0, then object o is represented i Not possessing attribute a j ;I OD Representing the binary relationship between the object set O and the attribute set D, if I OD (o i ,d j ) =1, representing object o i The audience group of (2) is d j The method comprises the steps of carrying out a first treatment on the surface of the Conversely if I OD (o i ,d j ) =0, then object o is represented i The audience group of (2) is not d j ;
S3, selecting a part of subsets from the decision form background to be used as decision form sub-background;
s4, in decision form sub-contexts (O', A, I) O′A ) And decision form sub-contexts (O', D, I) O′D ) Respectively constructing concept lattices L (O', A, I) according to the definition of the concept lattices O′A ) And L (O', D, I) O′D ) Wherein
S5, in concept lattice L (O', A, I) O′A ) And L (O', D, I) O′D ) Based on (a) respectively constructing form background (O, A, I) by incremental method OA ) And (O, D, I) OD ) Corresponding concept lattice L (O, A, I) OA ) And L (O, D, I) OD );
S6, acquiring non-trivial decision rules according to the decision rule definition, putting all non-trivial decision rules meeting the conditions into a rule set R, and taking the non-trivial decision rules as finishing rules and knowledge extraction.
The content acquired in the present embodiment includes:
object set o= { algorithm 1, algorithm 2, algorithm 3, algorithm 4};
attribute set a= { cluster, neural network, decision tree, naive bayes };
tag set d= { primary school, university, and above };
as shown in table 1, where "1" indicates that the science popularization content has the corresponding knowledge (or the corresponding tag), and "0" indicates that the science popularization content does not have the corresponding knowledge (or does not have the corresponding tag).
TABLE 1
In S3, a smaller-scale decision form sub-background is selected from the decision form backgrounds, and in this embodiment, only the set of object sets O ' = { algorithm 1, algorithm 2, algorithm 3} will be considered as decision form sub-backgrounds F ' = (O ', a, I) O′A ,D,I O′D ) In formal sub-contexts (O', A, I O′A ) In accordance with the conceptConsider a subset of all O', i.e. pairsConsider all concepts (gf (X), f (X)) and remove duplicate concepts, resulting in a set of concepts In the same way, the processing method comprises the steps of,
in order to further implement the above technical solution, the concept lattice L (O, A, I) is obtained in S5 OA ) The specific contents of (3) include:
if for concept lattice L (O', A, I) O′A ) Newly add an object o i Wherein the object isAnd->Then for concept lattice L (O', A, I) O′A ) The original concept (X, B) of (C) is changed to satisfy the following conditions:
(1) If it isWherein f (o) i ) Representing object o i If the owned attribute is the concept (X, B) is not updated, except that B is an empty set;
(2) If B.u.f (o) i ) The concept (X, B) will be updated to (X ∈ { o) i },B);
(3) At the position ofAnd B.andf (oi) +.B, if o i Outside has no object o j Satisfy the following requirementsThen a new concept (xU { o) is generated by means of the concept (X, B) i },B∩f(o i ));
(4) If any j < i is not satisfiedObject o i Generates a new concept by itself ({ o) i },f(o i ));
By continuously adding objects, a formal background (O, A, I) OA ) Concept lattice L (O, A, I) OA )。
In order to further implement the above technical solution, the concept lattice L (O, D, I) is obtained in S5 OD ) The specific contents of (3) include:
if the concept lattice L (O', D, I) OD ) Newly add an object o i Wherein the object isAnd->Then for concept lattice L (O', D, I) OD ) The original concept (Y, C) of (C) is changed to satisfy the following conditions:
(1) If it isWherein f (o) i ) Representing object o i If the owned attribute is the concept (Y, C) is not updated, except that C is an empty set;
(2) If C.u.f (o) i ) The concept (Y, C) will be updated to (Y ≡o- i },C);
(3) At the position ofAnd C.andf (o) i ) In case of not equal to C, if o i Outside has no object o j Satisfy the following requirementsThen a new concept (Y.u.o) is generated by means of the concept (Y, C) i },C∩f(o i ));
(4) If any j < i is not satisfiedObject o i Generates a new concept by itself ({ o) i },f(o i ));
By continuously adding objects, a formal background (O, D, I) is finally obtained OD ) Concept lattice L (O, D, I) OD )。
In this embodiment:
now a new object "Algorithm 4" is introduced, according to the incremental method, for the conceptDue to-> And a n f ({ algorithm 4 }) is not equal to a, so a new concept ({ algorithm 4}, { cluster, neural network, decision tree }) will be generated according to case (3);
for the concept ({ algorithm 1, algorithm 2}, { cluster }), since { cluster } ≡f ({ algorithm 4 }) = { cluster }, the concept is updated as the case (2) ({ algorithm 1, algorithm 2, algorithm 4}, { cluster });
for concepts ({ Algorithm 3}, { decision tree, naive Bayes }) due to And { decision tree, naive bayes } n f ({ algorithm 4 }) is not equal to { decision tree, naive bayes }, so a new concept will be generated according to case (3) ({ algorithm 3, algorithm 4}, { decision tree });
for the concept ({ algorithm 2}, { cluster, neural network }), the concept is updated as the case (2) ({ algorithm 2, algorithm 4, { cluster, neural network }) because { cluster, neural network } f ({ algorithm 4 }) = { cluster, neural network };
in particular, due toThus, the concept->Updated to->For case (4), a concept is generated ({ algorithm 4}, { cluster, neural network, decision tree }), but this concept appears and is therefore not considered. }
Therefore, it is
Is available in the same way
In order to further implement the above technical solution, the defining specific content of obtaining the non-trivial decision rule according to the decision rule in S6 includes: for (X, B) ∈L (O, A, I) OA ) And (Y, C) εL (O, D, I) OD ) If (3)Then B.fwdarw.C is called a decision rule, and if X, Y, B, C are all non-null, B.fwdarw.C is called a non-trivial decision rule.
In this embodiment:
traversing L (O, A, I) according to the definition of the decision rule OA )×L(O,D,I OD ) Searching for a concept extension meeting the condition. For example: for ({ Algorithm 3}, { decision tree, naive Bayes }) ε L (O, A, I) OA ) And ({ Algorithm 1, algorithm 3}, { Primary and secondary }) ε L (O, D, I) OD ) Due toSo { decision tree, naive Bayes } → { middle and primary school }, is a non-trivial decision rule.
Repeating the above steps can obtain all non-trivial decision rules r= { decision tree, naive bayes } → { middle and primary school }, { cluster, neural network } - { university and above }, { cluster, neural network, decision tree } - { university and above }.
The invention applies the decision rule based on formal concept analysis to the extraction of popular science content, thereby bringing a plurality of excellent effects for the extraction of popular science content, in particular: firstly, the decision rule based on formal concept analysis can more accurately identify and extract the content related to the science popularization subject, and ensures the high quality and relevance of related science popularization resources while not excessively depending on labels; second, concept lattices based on formal concept analysis can represent popular content in a structured manner, making it easier to understand and organize, which helps to improve discoverability and usability of knowledge; finally, the decision rules involved can be used to build a more intelligent science popularization platform to improve education and training areas, or to provide decision support for users to help them make informed decisions in the scientific and technical areas, thereby improving scientific literacy and scientific decision making capability.
The above embodiments are only for illustrating the technical solution of the present application, and are not limiting thereof; although the present application has been described in detail with reference to the foregoing embodiments, it should be understood by those of ordinary skill in the art that: the technical scheme described in the foregoing embodiments can be modified or some technical features thereof can be replaced by equivalents; such modifications and substitutions do not depart from the spirit and scope of the technical solutions of the embodiments of the present application, and are intended to be included in the scope of the present application.
Claims (4)
1. The science popularization theme and core knowledge extraction method based on the concept lattice is characterized by comprising the following steps of:
s1, acquiring a science popularization subject as an object set O= { O 1 ,o 2 ,...,o n And acquiring core knowledge corresponding to each science popularization topic as an attribute set A= { a 1 ,a 2 ,...,a m Simultaneously acquiring audience groups of all science popularization subject contents as tag sets D= { D } 1 ,d 2 ,...,d r -a }; wherein n, m and r are all greater than 0;
s2, constructing decision form background F= (O, A, I) OA ,D,I OD ) Wherein I OA For the binary relation between the object set O and the attribute set A, if I OA (o i ,a j ) =1, representing object o i Possession of attribute a j The method comprises the steps of carrying out a first treatment on the surface of the Conversely if I OA (o i ,a j ) =0, then object o is represented i Not possessing attribute a j ;I OD Representing the binary relationship between the object set O and the attribute set D, if I OD (o i ,d j ) =1, representing object o i The audience group of (2) is d j The method comprises the steps of carrying out a first treatment on the surface of the Conversely if I OD (o i ,d j ) =0, then object o is represented i The audience group of (2) is not d j ;
S3, selecting a part of subsets from the decision form background to be used as decision form sub-background;
s4, in decision form sub-contexts (O', A, I) O′A ) And decision form sub-contexts (O', D, I) O′D ) Respectively constructing concept lattices L (O', A, I) according to the definition of the concept lattices O′A ) And L (O', D, I) O′D );
S5, utilizing concept lattice L (O', A, I) O′A ) And L (O', D, I) O′D ) Construction of formal backgrounds by incremental methods (O, A, I) OA ) And L (O, D, I) OD ) Is a concept lattice of (2);
s6, acquiring non-trivial decision rules according to the decision rule definition, and putting all non-trivial decision rules meeting the conditions into a rule set R to be regarded as finishing rule and knowledge extraction.
2. The method for extracting subject matter and core knowledge based on concept lattice as claimed in claim 1, wherein the step S5 is to obtain concept lattice L (O, A, I) OA ) The specific contents of (3) include:
if for concept lattice L (O', A, I) O′A ) Newly add an object o i Wherein the object isAnd->Then for concept lattice L (O', A, I) O′A ) The original concept (X, B) of (C) is changed to satisfy the following conditions:
(1) If it isWherein f (o) i ) Representing object o i If the owned attribute is the concept (X, B) is not updated, except that B is an empty set;
(2) If B.u.f (o) i ) The concept (X, B) will be updated to (X ∈ { o) i },B);
(3) At the position ofAnd B.andf (o) i ) In case of not equal to B, if o i Outside has no object o j Satisfy the following requirementsThen a new concept (xU { o) is generated by means of the concept (X, B) i },B∩f(o i ));
(4) If any j < i is not satisfiedObject o i Generates a new concept by itself ({ o) i },f(o i ));
By continuously adding objects, a formal background (O, A, I) OA ) Concept lattice L (O, A, I) OA )。
3. The method for extracting subject matter and core knowledge based on concept lattice as claimed in claim 1, wherein the step S5 is to obtain concept lattice L (O, D, I) OD ) The specific contents of (3) include:
if the concept lattice L (O', D, I) OD ) Newly add an object o i Wherein the object isAnd->Then for concept lattice L (O', D, I) OD ) The original concept (Y, C) of (C) is changed to satisfy the following conditions:
(1) If it isWherein f (o) i ) Representing object o i If the owned attribute is the concept (Y, C) is not updated, except that C is an empty set;
(2) If C.u.f (o) i ) The concept (Y, C) will be updated to (Y ≡o- i },C);
(3) At the position ofAnd C.andf (o) i ) In case of not equal to C, if o i Outside has no object o j Satisfy the following requirementsThen a new concept (Y.u.o) is generated by means of the concept (Y, C) i },C∩f(o i ));
(4) If any j < i is not satisfiedObject o i Generates a new concept by itself ({ o) i },f(o i ));
By continuously adding objects, a formal background (O, D, I) is finally obtained OD ) Concept lattice L (O, D, I) OD )。
4. The method for extracting science popularization topics and core knowledge based on concept lattices according to claim 1, wherein the step of defining and acquiring specific contents of non-trivial decision rules according to the decision rules in S6 comprises: for (X, B) ∈L (O, A, I) OA ) And (Y, C) εL (O, D, I) OD ) If (3)Then B.fwdarw.C is called a decision rule, and if X, Y, B, C are all non-null, B.fwdarw.C is called a non-trivial decision rule.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311603695.8A CN117591668A (en) | 2023-11-28 | 2023-11-28 | Science popularization theme and core knowledge extraction method based on concept lattice |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202311603695.8A CN117591668A (en) | 2023-11-28 | 2023-11-28 | Science popularization theme and core knowledge extraction method based on concept lattice |
Publications (1)
Publication Number | Publication Date |
---|---|
CN117591668A true CN117591668A (en) | 2024-02-23 |
Family
ID=89909558
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202311603695.8A Pending CN117591668A (en) | 2023-11-28 | 2023-11-28 | Science popularization theme and core knowledge extraction method based on concept lattice |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN117591668A (en) |
-
2023
- 2023-11-28 CN CN202311603695.8A patent/CN117591668A/en active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110796190B (en) | Exponential modeling with deep learning features | |
CN110569353B (en) | Attention mechanism-based Bi-LSTM label recommendation method | |
Tong et al. | A shilling attack detector based on convolutional neural network for collaborative recommender system in social aware network | |
CN108038492A (en) | A kind of perceptual term vector and sensibility classification method based on deep learning | |
You et al. | BERTMeSH: deep contextual representation learning for large-scale high-performance MeSH indexing with full text | |
CN113344053B (en) | Knowledge tracking method based on examination question different composition representation and learner embedding | |
Pardos et al. | Imputing KCs with representations of problem content and context | |
CN112749326A (en) | Information processing method, information processing device, computer equipment and storage medium | |
CN111581368A (en) | Intelligent expert recommendation-oriented user image drawing method based on convolutional neural network | |
WO2021190662A1 (en) | Medical text sorting method and apparatus, electronic device, and storage medium | |
CN113392331A (en) | Text processing method and equipment | |
CN111125520A (en) | Event line extraction method for news text based on deep clustering model | |
CN111625715A (en) | Information extraction method and device, electronic equipment and storage medium | |
Volkova et al. | Online bayesian models for personal analytics in social media | |
Azzini et al. | A classifier to identify soft skills in a researcher textual description | |
Agrawal et al. | HSIL: hybrid semantic infused learning approach for course recommendation | |
Agarwal et al. | WGSDMM+ GA: A genetic algorithm-based service clustering methodology assimilating dirichlet multinomial mixture model with word embedding | |
Arif et al. | Extracting and reusing blocks of knowledge in learning classifier systems for text classification: a lifelong machine learning approach | |
CN112948561B (en) | Method and device for automatically expanding question-answer knowledge base | |
CN117591668A (en) | Science popularization theme and core knowledge extraction method based on concept lattice | |
Foote et al. | A computational analysis of social media scholarship | |
CN111615178B (en) | Method and device for identifying wireless network type and model training and electronic equipment | |
CN113495964A (en) | Method, device and equipment for screening triples and readable storage medium | |
Yang et al. | Bottom-up and top-down graph pooling | |
Xu et al. | What makes us curious? analysis of a corpus of open-domain questions |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |