CN106372099A - Agricultural field ontology validity assessment method - Google Patents
Agricultural field ontology validity assessment method Download PDFInfo
- Publication number
- CN106372099A CN106372099A CN201610534240.9A CN201610534240A CN106372099A CN 106372099 A CN106372099 A CN 106372099A CN 201610534240 A CN201610534240 A CN 201610534240A CN 106372099 A CN106372099 A CN 106372099A
- Authority
- CN
- China
- Prior art keywords
- concept
- relation
- agriculture field
- sim
- classification
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
- G06F16/285—Clustering or classification
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Machine Translation (AREA)
Abstract
The invention discloses an agricultural field ontology validity assessment method. Agricultural field ontology concepts are assessed in combination with a recall ratio R, a precision ratio P and an F1 index; the consistency, accuracy and conciseness of a classification relationship among the agricultural field ontology concepts are assessed in combination with semantic similarity among the concepts; and an object attribute relationship among the agricultural field ontology concepts is assessed according to an assessment index. Therefore, three important problems, including validity assessment of the ontology concepts, validity assessment of the ontology classification relationship and validity assessment of the ontology non-classification relationship, existent in agricultural field ontology validity assessment are solved emphatically. An agricultural field ontology validity assessment method system proposed by the invention takes an agricultural field ontology as a research object, and can effectively assess the quality of the agricultural field ontology and improve the agricultural ontology construction efficiency.
Description
Technical field
The present invention relates to computer information technology is in the application of agriculture field, more particularly, to a kind of agriculture field body is effective
Property appraisal procedure.
Background technology
Body as a kind of shared ideas model that can advise knowledge in semantic level upper table, day by day become information management,
The important component part of the multiple fields such as information retrieval and semantic web.Particularly since semantic web proposition, body is just gradually
Become an important research direction in artificial intelligence and knowledge engineering field, in the acquisition of knowledge, expression, analysis and application etc.
Aspect has far-reaching significance.Body, according to the level of detail and field dependency degree two indices, can be divided into 4 classifications: top
Body, domain body, task ontology and applied ontology.Wherein, domain body is professional body, describes a certain specific
Relation between the vocabulary of concept and concept in section field, or the most important theories of this ambit.Due to domain body
Knowledge has significant domain feature, so domain body can be more reasonable and effectively carry out expressing for knowledge.
The structure of domain body makes the shared of domain knowledge and reuse be possibly realized.In numerous domain knowledges, agriculture
Industry domain knowledge is a kind of high-quality production factors, and agricultural workforce and the efficiency of capital production can be made to increase substantially;
Agriculture knowledge be also Agricultural Information application basis, as agricultural information system, agricultural decision making system, agriculture specialist system,
Agriculture intelligence control system etc. all be unable to do without agricultural knowledge.Agriculture field body is one and comprises agriculture term, definition and art
Between language, the system of normative connection explanation, is the formalization table of the mutual relation between concept, concept and concept in agriculture ambit
Reach.It can be expressed as four-tuple, including body essential information, such as body title, founder, design time, modification time, mesh
And the body such as Knowledge Source metadata information;The set of agricultural knowledge concept;Classification relation between agricultural knowledge concept and
Non-categorical set of relationship;Axiomatic set theory present in body.
This physical ability of agriculture field describes on semantic level to the related data of agricultural, information and knowledge it is provided that disease
The ontology services such as pest control, Making A Strategic Decision of The Agricultural Productions support, intelligent diagnostics and control.Carry within 2000 since FAO (Food and Agriculture Organization of the United Nation)
Since going out Agricultural ontology service research plan, how effectively the agriculture field body quantity of different scales is being in just explosive growth,
Assess the structure quality of these agriculture field bodies, if really meet domain knowledge situation, could really embody knowledge organization
Function, effect of Agriculture knowledge service can be realized, automatically or semi-automatically whether Method for Ontology Learning scientific and reasonable, how to help
Help user to select more suitably Agricultural ontology, become problem demanding prompt solution in current agricultural ontology research.Accordingly, it would be desirable to agriculture
Industry domain body carries out efficiency assessment.
Body is assessed as an important means assessing this weight, by the method with science, it then follows unified
Index system, assessment performance under specific application area or specific environment for the body and the suitability, are whether impact body can be
An important factor in order of large-scale application in semantic net.Set up a set of scientific and effective body Assessment theory and assessment side
Method, contributes to weighing whether body constructing method has effectiveness, is conducive to existing body is effectively managed, updates and ties up
Shield, helps instruct ontological construction process and lifting body construction efficiency, thus forming more scientific rational domain body and more preferably
Ground provides associated body service.
Associated specialist scholar has done substantial amounts of research in terms of body assessment both at home and abroad, and proposes a lot of body assessment sides
Method.On the whole, body appraisal procedure mainly has seven kinds, and they are respectively: appraisal procedure based on user, the commenting of task based access control
Estimate method, the appraisal procedure based on principle, the appraisal procedure Ji Yu " gold criterion ", the appraisal procedure based on application, be based on language
The material appraisal procedure in storehouse and the appraisal procedure based on compound standard.With the constantly improve of body appraisal procedure, body automatic
Change assessment also to be developed rapidly, and have developed some body assessment tools, wherein odeval, ontoqa, core,
Ontomanager, aeon etc. are wherein to apply more extensive body assessment tool.
Although existing body appraisal procedure and assessment tool achieve some achievements, in terms of some, body can be entered
Row assessment, but for Chinese body and the efficiency assessment method being applied to complicated ambit body such as agriculture field body
Or few, and existing body appraisal procedure compares one-sided, does not form a whole set of scientific comprehensive body assessment
Theory and method system.
Content of the invention
The technical problem being existed based on background technology, the present invention proposes a kind of agriculture field body efficiency assessment side
Method.
A kind of agriculture field body efficiency assessment method proposed by the present invention, comprises the following steps:
S1, assessment agriculture field Ontological concept;
Classification relation between s2, assessment agriculture field Ontological concept;
Non-categorical relation between s3, assessment agriculture field Ontological concept;
Step s1 specifically include following step by step:
S11, agriculture field body to be assessed is parsed according to concept, obtained concept set;
S12, setting Concept Evaluation index, Concept Evaluation index includes recall ratio r, precision ratio p and f1 index, wherein,
It is defined on recall ratio and in the case that precision ratio occurs contradiction, this recall ratio and precision ratio are weighted be in harmonious proportion
The average value obtaining is f index,
F1 index is f index during α=1,
S13, the ambit scope according to body to be assessed, select or build corresponding domain lexicon;
S14, it is that recall ratio, precision ratio and f1 exponent pair concept are estimated according to Concept Evaluation index, obtain correctly general
Read number;
S15, the assessment result of output recall ratio, precision ratio and f1 index;
Step s2 specifically include following step by step:
S21, by agriculture field body to be assessed according to concept between subclassof relation parsed, and protected
Save as data structure be figure g=<v, e>data file, v be Ontological concept set, e be classification relation set;
S22, three evaluation indexes of setting classification relation assessment, are concordance, accuracy and terseness respectively;
Semantic similarity between s23, calculating concept, and optimal threshold ε is set;
S24, set up Concept Semantic Similarity set s (s1, s2..., sn) and concordance, accuracy and terseness judgement knot
Fruit set t (t1, t2..., tn), si=sim (vi, vj), i, j are natural numbers, and concept vi is concept vj hypernym, and vi≠
vj;
S25, judge that whether concept vi hypernym number is more than 1, be, then judge the classification error of concept vi and be stored in judgement and tie
Fruit set t (t1, t2..., tn);
S26, by sim (vi, vj) compare with optimal threshold ε, and judge whether concept vi and concept vj accord with according to comparative result
Close classification relation, meet, then by siIt is stored in Concept Semantic Similarity set s (s1, s2..., sn);Do not meet and then judge concept vi
Classification error simultaneously will determine that result is stored in judged result set t (t1, t2..., tn);
S27, judge whether sim (vi, vj)=sim (vi, vk), if it is present judging concept vjWith concept vkSend out
Raw redundancy error is simultaneously stored in judged result set t (t1, t2..., tn);
S28, repeat step s25 to s27, until all calculating and having judged Ontological concept set v and classification relation set e
Finish;
S29, output Similarity Measure results set s and concordance, accuracy and terseness judged result set t;
Step s3 specifically include following step by step:
S31, parsed according to object properties relation pair agriculture field body, the attributed relational graph that obtains of parsing is preserved
For data structure it isData file, and represented with adjacency matrix a;V is Ontological concept set, e is attribute
Set of relationship,For relation on attributes concept pair;
The evaluation index of object properties relation between s32, setting agriculture field Ontological concept, and sentenced according to evaluation index setting
Surely set q;
S33, according to evaluation index, between agriculture field Ontological concept, object properties relation is estimated, and by assessment result
Record is in judging set q;
Relation on attributes set e in s34, output agriculture field body, relation on attributes concept pairAnd result of determination set
q.
Preferably, in step s13, the structure of domain lexicon comprises the following steps:
S131, determine subject fields scope, and assembling sphere vocabulary;
S132, determine the classification characteristicses of vocabulary, mutual relation between organized words and vocabulary, form vocabulary structure, and specify
Attribute dictionary;
S133, the vocabulary to dictionary and relation carry out examination & verification and revise.
Preferably, in step s131, from document, subject classical works, Baidupedia, existing dictionary and expertise way
Footpath obtains Field Words.
Preferably, in step s23, concept c1And c2Between semantic similarity sim (c1, c2) computing formula is as follows:
sim(c1,c2)=[sim_dist (c1,c2)]α×[sim_depth(c1,c2)]β×[sim_width(c1,c2)
]γ,
Wherein, sim_dist (c1, c2) for semantic similarity and semantic distance corresponding relation, sim_depth (c1, c2
For the corresponding relation of semantic similarity and semantic depth, sim_width (c1, c2) it is semantic similarity and semantic node density
Corresponding relation;α represents the weight of semantic distance, the weight of β representational level depth, and γ represents the weight of node density, α, β, γ
For regulation parameter, and α, β, γ sum is 1.
Preferably,Wherein, dist (c1, c2) represent semantic distance, a
For regulation parameter;dist(c1,c2)=len (c1,c2), len (c1,c2) represent concept c1And c2Between path.
Preferably,Depth (c) represents concept c
Depth, depth (c)=depth (parent (c))+1, parent (c) represents the father node of concept c it is preferable that root node
Depth depth (root)=1.
Preferably, the density of concept c is expressed as width (c), and it is directly proportional to classification degree of refinement, as width (c1)≤
width(c2),
Preferably, in step s26, if ε is < sim (vi, vj) < 1, then concept viWith concept vjMeet classification relation requirement;
If 0≤sim is (vi, vj) < ε, then judge concept viWith concept vjThere is classification error and be stored in judged result set t.
Preferably, in judged result set t, tnCorresponding with concept vn, if concept vn meets evaluation index, correspond to tn
It is labeled;If concept vn does not meet evaluation index, the type of error that corresponding tn mark is judged.
Preferably, between agriculture field Ontological concept, the evaluation index of object properties relation includes:
Reversibility: there is relation r between concept a and concept b, meanwhile, there is relation r between concept b and concept a-1,
R is then claimed to have reversibility;
Relation inheritance: there is relation r between concept a and father's concept b, there is also between concept a and sub- concept c meanwhile
Relation r, then claim relation r to have relation inheritance;
Inverse relationship inheritance: there is relation r between father's concept and concept a, meanwhile, between its sub- concept and concept a
There is also relation r, then claim relation r to have inverse relationship inheritance;
Transitivity: if there is relation r between concept a and concept b, there is relation r between concept b and concept c, meanwhile, generally
Read and between a and concept c, there is relation r, then claim relation r to have transitivity;
Symmetry: if concept a and concept b have relation r, be denoted as a=r (b), then concept b and concept a have relation r, are denoted as b
=r (a);
Reflexivity: concept a has relation r, if r has reflexivity, a=r (a).
A kind of agriculture field body efficiency assessment method that the present invention provides, refers in conjunction with recall ratio r, precision ratio p and f1
Several agriculture field Ontological concept is estimated, in conjunction with concept between semantic similarity classify between agriculture field Ontological concept pass
The concordance of system, accuracy, terseness are estimated, according to evaluation index between agriculture field Ontological concept object properties relation
It is estimated.So, to solve three major issue bodies present in agriculture field body efficiency assessment general for emphasis
The efficiency assessment of the efficiency assessment, the efficiency assessment of Ontology relation and body non-categorical relation read.
Agriculture field body efficiency assessment method system proposed by the present invention, can with agriculture field body as object of study
Effectively assess the quality of agriculture field body, lifting Agricultural ontology builds efficiency.
Brief description
Fig. 1 is a kind of agriculture field body efficiency assessment method flow diagram proposed by the present invention;
Fig. 2 is assessment agriculture field Ontological concept method flow diagram;
Fig. 3 is classification relation method flow diagram between assessment agriculture field Ontological concept;
Fig. 4 is non-categorical relational approach flow chart between assessment agriculture field Ontological concept;
Fig. 5 assesses prototype system overall framework figure for tea-plant pests body.
Specific embodiment
With reference to Fig. 1, a kind of agriculture field body efficiency assessment method proposed by the present invention, comprise the following steps.
S1, assessment agriculture field Ontological concept;
Classification relation between s2, assessment agriculture field Ontological concept;
Non-categorical relation between s3, assessment agriculture field Ontological concept.
Concept, as the basic element of ontology model, is the core of expression domain knowledge.Therefore to domain body concept
Assessment is the pith completing body assessment.
With reference to Fig. 2, in present embodiment, assessment agriculture field Ontological concept be step s1 specifically include following step by step.
S11, agriculture field body to be assessed is parsed according to concept, obtained concept set.This step is to defeated
The agriculture field body to be assessed entering carries out body pretreatment, specifically can be led agricultural to be assessed by body analytical tool
Domain body is parsed according to concept.
S12, setting Concept Evaluation index, Concept Evaluation index includes recall ratio r, precision ratio p and f1 index.
Recall ratio r refers to the correct concept number in body and the ratio of the sum of the concept in domain lexicon,
Precision ratio p refers to the correct concept number in body and the ratio of the sum of the concept in body,
It is defined on recall ratio and in the case that precision ratio occurs contradiction, this recall ratio and precision ratio are weighted be in harmonious proportion
The average value obtaining is f index, and f1 index is the specific value of f index.F index is specifically calculated as follows with f1 index:
F1 index is f index during α=1,
The evaluation index of one group of scientific and reasonable concept should be able to evaluate the level of coverage of concept in constructed body, general
Read the accuracy of term, i.e. concept set in body to be assessed should farthest comprise the important terms in this field.
And the concept defined in body not only needs to have integrity in addition it is also necessary to ensure the correctness of Ontological concept.
Recall ratio, precision ratio and f index as reflection Concept Evaluation effect important indicator, f index can comprehensively this two
Individual evaluation index, for the index that concentrated expression is overall, f1 index is clear and definite to f index.In present embodiment, complete to look into
Rate, precision ratio and f1 index, as Concept Evaluation index, are conducive to the levels of precision of Concept Evaluation.
S13, the ambit scope according to body to be assessed, select or build corresponding domain lexicon.
Domain lexicon refers to the term of specific area or the set of vocabulary, can reflect information in certain specific field,
The elements such as concept, element therein can extract from related corpus and refine.With conventional dictionaries simply simply by word
Converge with organizing according to morphology or lexicographic order and compare, modern dictionary not only concept to be comprised, more to consider general in the middle of dictionary
Read the relation and concept between.The accuracy of Concept Evaluation result depends on the selection of domain lexicon, therefore, the choosing of domain lexicon
Fixed extremely important.
In step s13, the structure of domain lexicon comprises the following steps:
S131, determine subject fields scope, and assembling sphere vocabulary;Specifically, Field Words can be classical from document, subject
Works, Baidupedia, existing dictionary and expertise approach obtain.
S132, determine the classification characteristicses of vocabulary, mutual relation between organized words and vocabulary, form vocabulary structure, and specify
Attribute dictionary;
S133, the vocabulary to dictionary and relation carry out examination & verification and revise.
S14, it is that recall ratio, precision ratio and f1 exponent pair concept are estimated according to Concept Evaluation index, obtain correctly general
Read number.
S15, the assessment result of output recall ratio, precision ratio and f1 index, obtain Concept Evaluation result.
Relation is that in body, the one kind between concept and concept contacts, and the classification relation between agriculture field Ontological concept is concrete
Show as subclassof relation, describe the relation between father's concept and sub- concept.Classification relation system is general as tissue
The basis read, is the backbone structure of body.Therefore, how research is effectively assessed to classification relation is to be highly desirable to
's.
With reference to Fig. 3, in present embodiment, between assessment agriculture field Ontological concept classification relation be step s2 specifically include with
Under step by step.
S21, by body analytical tool by agriculture field body to be assessed according to concept between subclassof relation
Parsed, and be saved as data structure be figure g=<v, e>data file, v be Ontological concept set, e be classification close
Assembly is closed.
S22, three evaluation indexes of setting classification relation assessment, are concordance, accuracy and terseness respectively.
In present embodiment, three evaluation indexes have been directed to common three kinds of typical faults in classification relation and classify not
Unanimously, classification error and redundancy error.Concordance is mainly used to occur in that conflicting classification relation in detection body;Accurately
Property mainly in Ontology system occur concept classification Problem-Error;Terseness is primarily directed in Ontology system
There is the redundancy error of same concept in middle discovery.
Semantic similarity between s23, calculating concept, and optimal threshold ε is set.In present embodiment, optimal threshold ε can tie
Close Conventional wisdom and analysis expert to obtain through test.
Classification relation system in body is to arrange concept with tree according to subclassof, traditional based on language
The Concept Semantic Similarity algorithm of adopted distance can reasonably utilize this type of organization.Traditional general based on semantic distance
Read on the basis of Arithmetic of Semantic Similarity, the present invention has carried out certain optimization and adjustment, by level depth, node density
Take into account, and be provided with regulation parameter, using a kind of improved Concept Semantic Similarity algorithm.
Semantic similarity between concept is calculated using Concept Semantic Similarity algorithm.Concept c1And c2Between semantic phase
Like degree sim (c1, c2) computing formula is as follows:
sim(c1,c2)=[sim_dist (c1,c2)]α×[sim_depth(c1,c2)]β×[sim_width(c1,c2)
]γ,
Wherein, sim_dist (c1, c2) for semantic similarity and semantic distance corresponding relation, sim_depth (c1, c2For
Semantic similarity and the corresponding relation of semantic depth, sim_width (c1, c2) right for semantic similarity and semantic node density
Should be related to;α represents the weight of semantic distance, the weight of β representational level depth, and γ represents the weight of node density, and α, β, γ are
Regulation parameter, and α, β, γ sum is 1.
Concept Semantic Similarity value and semantic distance are inversely proportional to.
Wherein, dist (c1, c2) representing semantic distance, a is to adjust
Parameter;If the weight of each edge is the same, weight is all set to 1, then
dist(c1,c2)=len (c1,c2),
len(c1,c2) represent concept c1And c2Between path.
Concept Semantic Similarity value is directly proportional to level depth difference, with level depth and being inversely proportional to.
Depth (c) represents the depth of concept c,
Depth (c)=depth (parent (c))+1, parent (c) represents the father node of concept c.Depth depth of root node
(root)=1.
Concept Semantic Similarity value and node density are inversely proportional to.In classification relation system, density higher finger classification is thinner
Cause, degree of refinement is higher, and between concept and concept, similarity degree is lower.The density of concept c is expressed as width (c), its with classification
Degree of refinement is directly proportional, as width (c1)≤width(c2),
So:
S24, set up Concept Semantic Similarity set s (s1, s2..., sn) and concordance, accuracy and terseness judgement knot
Fruit set t (t1, t2..., tn), si=sim (vi, vj), i, j are natural numbers, and concept vi is concept vj hypernym, and vi≠
vj.
In judged result set t, tnCorresponding with concept vn, if concept vn meets evaluation index, correspond to tnEnter rower
Note, specifically can label symbol " √ ";If concept vn does not meet evaluation index, the type of error that corresponding tn mark is judged,
Type of error is classification error or redundancy error.
If s25 concept vi hypernym number is more than 1, judge the classification error of concept vi and in judged result set t
(t1, t2..., tn) in corresponding tiRecord sort mistake.
S26, by sim (vi, vj) compare with optimal threshold ε, and judge whether concept vi and concept vj accord with according to comparative result
Close classification relation, meet, then by siIt is stored in Concept Semantic Similarity set s (s1, s2..., sn);Do not meet and then judge concept vi
Classification error simultaneously will determine that result is stored in judged result set t (t1, t2..., tn).
Specifically, if ε is < sim (vi, vj) < 1, then concept viWith concept vjMeet classification relation requirement, by si=sim
(vi, vj) it is stored in Concept Semantic Similarity set s;If 0≤sim is (vi, vj) < ε, then judge concept viWith concept vjThere is classification
Mistake, is unsatisfactory for accuracy, corresponding t in judged result set ti、tjRecord sort mistake.
S27, judge whether sim (vi, vj)=sim (vi, vk), if it is present judging concept vjWith concept vkIt is
There is redundancy error and be stored in judged result set t (t in synonym1, t2..., tn), specifically, in judged result set t
Corresponding tj, t record redundancy error.
S28, repeat step s25 to s27, until all calculating and having judged Ontological concept set v and classification relation set e
Finish.
S29, output Similarity Measure results set s and concordance, accuracy and terseness judged result set t.
According to relation not same-action in the body, classification relation and two kinds of non-categorical relation can be divided into.Non-categorical closes
System illustrates all other all relations being not belonging to classification relation such as cause effect relation, relation on attributes, example relationship.Non-categorical closes
System enhances completeness and the complexity of ontology knowledge expression, makes body really become network structure.Therefore, to non-categorical relation
The efficiency assessment that carries out have important theory significance and researching value.
Non-categorical relation has many kinds, in agriculture field body, wherein function the strongest, using most be object properties
Relation.The present invention is mainly estimated to the object properties relation in non-categorical relation system, proposes a kind of theoretical based on figure
Non-categorical relationship assessment method between agriculture field Ontological concept.
With reference to Fig. 4, in present embodiment, between assessment agriculture field Ontological concept, non-categorical relation is that step s3 specifically includes
Below step by step.
S31, parsed according to object properties relation pair agriculture field body using analytical tool, will the genus that obtains of parsing
Sexual intercourse figure saves as data structureData file, and represented with adjacency matrix a;Wherein, v is this
Body concept set, e be relation on attributes set,For relation on attributes concept pair.
The evaluation index of object properties relation between s32, setting agriculture field Ontological concept, and sentenced according to evaluation index setting
Surely set q.
S33, according to evaluation index, between agriculture field Ontological concept, object properties relation is estimated, and by assessment result
Record is in judging set q.
In present embodiment, between agriculture field Ontological concept, the evaluation index of object properties relation includes following six.
1. reversibility: there is relation r between concept a and concept b, meanwhile, there is relation r between concept b and concept a-1,
R is then claimed to have reversibility.
The judgement of " reversibility " evaluation index can be carried out using the theoretical relevant knowledge of figure.If in adjacency matrix a=(aij) in
Find aij=1, aji=1, and fromFind aijCorresponding relation on attributes r1 and ajiCorresponding relation on attributes r2 is inverse each other closing
System, then meet " reversibility ".Will determine that result record in set of matrices q.
2. relation inheritance: there is relation r between concept a and father's concept b, also deposit between concept a and sub- concept c meanwhile
In relation r, then relation r is claimed to have relation inheritance.
The judgement of relation inheritance feature needs combining classification relation system.If aij=1, aik=1, and fromFind aijWith
aikCorresponding relation on attributes name r is also identical, and finds vjAnd vkBelong to father's subclass relation in classification relation system, then meet and " close
It is inheritance ";If aij=1, aik≠ 1, then it is unsatisfactory for " relation inheritance ".And will determine that result record in set of matrices q.
3. inverse relationship inheritance: there is relation r between father's concept and concept a, meanwhile, its sub- concept and concept a it
Between there is also relation r, then claim relation r there is inverse relationship inheritance.
The judgement of inverse relationship inheritance feature needs combining classification relation system.If aik=1, ajk=1, and fromFind
aikAnd ajkCorresponding non-categorical relation name r is also identical, and finds viAnd vjBelong to father's subclass relation in classification relation system, then full
Foot " the reverse inheritance of relation ";If aik=1, ajk≠ 1, then it is unsatisfactory for " inverse relationship inheritance ".And will determine that result record exists
In set of matrices q.
4. transitivity: if there is relation r between concept a and concept b, between concept b and concept c, there is relation r, meanwhile,
There is relation r between concept a and concept c, then claim relation r to have transitivity.
If in adjacency matrix a=(aij) in find aij=1, ajk=1, and aik=1, and fromFind relation on attributes name r
Identical, i.e. concept vi, vkBetween relation on attributes contradict with " there is no transitivity " this property;?In find out aij, ajk,
aikCorresponding relation on attributes name, and will determine that result record in set of matrices q.
5. symmetry: if concept a and concept b have relation r, be denoted as a=r (b), then concept b and concept a have relation r, are denoted as
B=r (a).
If in adjacency matrix a=(aij) in find aij=1, aji=1, and fromFind that relation on attributes name is also identical, that is, generally
Read viAnd vjContradict with relation on attributes " there is no symmetry " this property, find concept viAnd vjAnd institute is right between them
The relation on attributes answered, and will determine that result record in set of matrices q.
6. reflexivity: concept a has relation r, if r has reflexivity, a=r (a).
If in adjacency matrix a=(aij) in found aiiWhen=1 (1≤i≤n), then this result and relation on attributes " do not have
Having reflexivity " this property contradicts, and finds aiiCorresponding concept and relation on attributes, and will determine that result record in matrix stack
Close in q.
Relation on attributes refers to the attribute that certain concept is another concept, describes " property of things and its mutual pass
System ".Wherein, object properties relation is used for associating two concepts, the non-categorical relation between expression concept, and relation on attributes has inverse
The constraint such as reflexive, symmetry, transitivity, its domain of definition and codomain determine two classes of association respectively, and attribute-name can be according to reality
Situation is named.Six properties being had according to relation on attributes: have reversibility, relation inheritance, inverse relationship inheritance and
There is no transitivity, symmetry and reflexivity.In present embodiment, six evaluation index reflexivitys of setting, symmetry, can
Inverse property, transitivity, relation inheritance, inverse relationship inheritance are true to the assessment of the relation on attributes in non-categorical relation system
The real extent of reaction is very high.
S34, according to above step by all concepts in Ontological concept set v and relation on attributes set e and relation on attributes
All after the completion of decision analysis, the relation on attributes set e in output agriculture field body, relation on attributes concept pairAnd judge
Results set q.
Specifically commented using the tea-plant pests body of above agriculture field body efficiency assessment method below in conjunction with a kind of
Estimate system above method is further explained.
By verifying feasibility and the effectiveness of proposed agriculture field body appraisal procedure, java language specifically can be used
Speech, calls ontology development bag jena, completes the prototype system of tea-plant pests body assessment in eclipse development platform.Should
The development environment that system is used is window7, and development platform is eclipse3.3, and jdk version for 1.8, jena version is
jena2.6.4.This prototype system can be parsed to tea-plant pests body and be assessed.
Tea-plant pests body assessment system includes tea-plant pests body pretreatment module, the Concept Evaluation of tea-plant pests body
Non-categorical relationship assessment between the concept of classification relation evaluation module and tea-plant pests body between module, the concept of tea-plant pests body
Module.With reference to Fig. 5, the operating procedure of the system includes.
A) the tea-plant pests body of structure is imported body assessment system.
B) the tea-plant pests body that imports enters body pretreatment module, by call jena bag to the concept in body,
Classification relation, non-categorical relation etc. are parsed.
C) in the Concept Evaluation module of tea-plant pests body, using step 1) assessment agriculture field Ontological concept side
Method, calculates recall ratio, precision ratio and f index respectively, and exports the assessment result of recall ratio, precision ratio and f1 index.
D) in classification relation evaluation module between the concept of tea-plant pests body, using step 2) assessment agriculture field this
The method of classification relation between body concept, obtain classification relation between agriculture field Ontological concept Similarity Measure results set s and
Concordance, accuracy and terseness judged result, and export.
E) in non-categorical relationship assessment module between the concept of tea-plant pests body, using step 3) assessment agriculture field
Between Ontological concept, the method for non-categorical relation is estimated respectively to six evaluation indexes of relation on attributes, and exports.This step
Specifically, the relation on attributes in tea-plant pests body, after parsing, is shown in the form of relation on attributes name, domain of definition, codomain
In systems, and using the non-categorical relationship assessment method theoretical based on figure six evaluation indexes of relation on attributes are carried out respectively
Assessment, and show non-categorical relationship assessment result in systems.
By the test analysis to system, the present invention achieves good effect, demonstrates the feasibility of the method and has
Effect property.Tea-plant pests body assess prototype system using based on domain lexicon Concept Evaluation method, based on Concept Semantic similar
The classification relation appraisal procedure of degree and the non-categorical relationship assessment method based on figure theory can be with effectively solving agriculture field bodies
Evaluation problem, the effectively quality of the assessment agriculture field body of structure, assess existing automatically or semi-automatically ontological construction
The feasibility of method, instructs Ontology engineering teacher to construct higher-quality Agricultural ontology, provides more for agriculture field associated user
Suitable Agricultural ontology, for the rationale carrying out Agricultural ontology assessment in a deep going way and key technology research, has certain showing
Sincere justice and practical study are worth.
The above, the only present invention preferably specific embodiment, but protection scope of the present invention is not limited thereto,
Any those familiar with the art the invention discloses technical scope in, technology according to the present invention scheme and its
Inventive concept equivalent or change in addition, all should be included within the scope of the present invention.
Claims (10)
1. a kind of agriculture field body efficiency assessment method is it is characterised in that comprise the following steps:
S1, assessment agriculture field Ontological concept;
Classification relation between s2, assessment agriculture field Ontological concept;
Non-categorical relation between s3, assessment agriculture field Ontological concept;
Step s1 specifically include following step by step:
S11, agriculture field body to be assessed is parsed according to concept, obtained concept set;
S12, setting Concept Evaluation index, Concept Evaluation index includes recall ratio r, precision ratio p and f1 index, wherein,
It is defined on recall ratio and in the case that precision ratio occurs contradiction, harmonic average is weighted to this recall ratio and precision ratio
The value obtaining is f index,
F1 index is f index during α=1,
S13, the ambit scope according to body to be assessed, select or build corresponding domain lexicon;
S14, it is that recall ratio, precision ratio and f1 exponent pair concept are estimated according to Concept Evaluation index, obtain correct concept number;
S15, the assessment result of output recall ratio, precision ratio and f1 index;
Step s2 specifically include following step by step:
S21, by agriculture field body to be assessed according to concept between subclassof relation parsed, and be saved as
Data structure be figure g=<v, e>data file, v be Ontological concept set, e be classification relation set;
S22, three evaluation indexes of setting classification relation assessment, are concordance, accuracy and terseness respectively;
Semantic similarity between s23, calculating concept, and optimal threshold ε is set;
S24, set up Concept Semantic Similarity set s (s1, s2..., sn) and concordance, accuracy and terseness judged result collection
Close t (t1, t2..., tn), si=sim (vi, vj), i, j are natural numbers, and concept vi is concept vj hypernym, and vi≠vj;
S25, judge that whether concept vi hypernym number is more than 1, be then to judge the classification error of concept vi and be stored in judged result collection
Close t (t1, t2..., tn);
S26, by sim (vi, vj) compare with optimal threshold ε, and concept vi is judged according to comparative result and whether concept vj meets point
Class relation, meets, then by siIt is stored in Concept Semantic Similarity set s (s1, s2..., sn);Do not meet and then judge that concept vi is classified
Mistake simultaneously will determine that result is stored in judged result set t (t1, t2..., tn);
S27, judge whether sim (vi, vj)=sim (vi, vk), if it is present judging concept vjWith concept vkOccur superfluous
Remaining mistake is simultaneously stored in judged result set t (t1, t2..., tn);
S28, repeat step s25 to s27, finish until Ontological concept set v and classification relation set e is all calculated judgement;
S29, output Similarity Measure results set s and concordance, accuracy and terseness judged result set t;
Step s3 specifically include following step by step:
S31, parsed according to object properties relation pair agriculture field body, the attributed relational graph that parsing is obtained saves as number
According to structure it isData file, and represented with adjacency matrix a;V is Ontological concept set, e is relation on attributes
Set,For relation on attributes concept pair;
The evaluation index of object properties relation between s32, setting agriculture field Ontological concept, and collection is judged according to evaluation index setting
Close q;
S33, according to evaluation index, between agriculture field Ontological concept, object properties relation is estimated, and by assessment result record
In judging set q;
Relation on attributes set e in s34, output agriculture field body, relation on attributes concept pairAnd result of determination set q.
2. agriculture field body efficiency assessment method as claimed in claim 1 is it is characterised in that domain term in step s13
The structure of allusion quotation comprises the following steps:
S131, determine subject fields scope, and assembling sphere vocabulary;
S132, determine the classification characteristicses of vocabulary, mutual relation between organized words and vocabulary, form vocabulary structure, and specify attribute
Dictionary;
S133, the vocabulary to dictionary and relation carry out examination & verification and revise.
3. agriculture field body efficiency assessment method as claimed in claim 2 is it is characterised in that in step s131, from literary composition
Offer, subject classical works, Baidupedia, existing dictionary and expertise approach obtain Field Words.
4. agriculture field body efficiency assessment method as claimed in claim 1 is it is characterised in that in step s23, concept c1
And c2Between semantic similarity sim (c1, c2) computing formula is as follows:
sim(c1,c2)=[sim_dist (c1,c2)]α×[sim_depth(c1,c2)]β×[sim_width(c1,c2)]γ,
Wherein, sim_dist (c1, c2) for semantic similarity and semantic distance corresponding relation, sim_depth (c1, c2For semanteme
Similarity and the corresponding relation of semantic depth, sim_width (c1, c2) close for semantic similarity is corresponding with semantic node density
System;α represents the weight of semantic distance, the weight of β representational level depth, and γ represents the weight of node density, and α, β, γ are to adjust
Parameter, and α, β, γ sum is 1.
5. agriculture field body efficiency assessment method as claimed in claim 4 it is characterised in thatWherein, dist (c1, c2) representing semantic distance, a is regulation parameter;dist
(c1,c2)=len (c1,c2), len (c1,c2) represent concept c1And c2Between path.
6. agriculture field body efficiency assessment method as claimed in claim 4 it is characterised in thatThe depth of depth (c) expression concept c, depth (c)=
Depth (parent (c))+1, parent (c) represents the father node of concept c it is preferable that depth depth (root) of root node
=1.
7. agriculture field body efficiency assessment method as claimed in claim 4 is it is characterised in that the density of concept c represents
For width (c), it is directly proportional to classification degree of refinement, as width (c1)≤width(c2),
8. agriculture field body efficiency assessment method as claimed in claim 1 is it is characterised in that in step s26, if ε is <
sim(vi, vj) < 1, then concept viWith concept vjMeet classification relation requirement;If 0≤sim is (vi, vj) < ε, then judge concept vi
With concept vjThere is classification error and be stored in judged result set t.
9. agriculture field body efficiency assessment method as claimed in claim 1 is it is characterised in that in judged result set t,
tnCorresponding with concept vn, if concept vn meets evaluation index, correspond to tnIt is labeled;If concept vn does not meet assessment referred to
Mark, then correspond to the type of error that tn mark is judged.
10. agriculture field body efficiency assessment method as claimed in claim 1 is it is characterised in that agriculture field body is general
Between thought, the evaluation index of object properties relation includes:
Reversibility: there is relation r between concept a and concept b, meanwhile, there is relation r between concept b and concept a-1, then claim r
There is reversibility;
Relation inheritance: there is relation r between concept a and father's concept b, meanwhile, there is also relation between concept a and sub- concept c
R, then claim relation r to have relation inheritance;
Inverse relationship inheritance: there is relation r between father's concept and concept a, also deposit between its sub- concept and concept a meanwhile
In relation r, then relation r is claimed to have inverse relationship inheritance;
Transitivity: if there is relation r between concept a and concept b, between concept b and concept c, there is relation r, meanwhile, concept a with
There is relation r between concept c, then claim relation r to have transitivity;
Symmetry: if concept a and concept b have relation r, be denoted as a=r (b), then concept b and concept a have relation r, are denoted as b=r
(a);
Reflexivity: concept a has relation r, if r has reflexivity, a=r (a).
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610534240.9A CN106372099B (en) | 2016-07-07 | 2016-07-07 | Agricultural field ontology effectiveness evaluation method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610534240.9A CN106372099B (en) | 2016-07-07 | 2016-07-07 | Agricultural field ontology effectiveness evaluation method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106372099A true CN106372099A (en) | 2017-02-01 |
CN106372099B CN106372099B (en) | 2020-06-16 |
Family
ID=57880237
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610534240.9A Active CN106372099B (en) | 2016-07-07 | 2016-07-07 | Agricultural field ontology effectiveness evaluation method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106372099B (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107704602A (en) * | 2017-10-16 | 2018-02-16 | 西南大学 | Extensive agriculture Ontology matching process based on AGROVOC |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090125549A1 (en) * | 2007-11-13 | 2009-05-14 | Nec (China) Co., Ltd. | Method and system for calculating competitiveness metric between objects |
CN102231151A (en) * | 2011-05-19 | 2011-11-02 | 安徽农业大学 | Method for modelling adaptive learning of agricultural ontology |
CN102236641A (en) * | 2011-05-18 | 2011-11-09 | 安徽农业大学 | Method for generating similarity matrix between concepts in agricultural field |
CN103440314A (en) * | 2013-08-27 | 2013-12-11 | 北京工业大学 | Semantic retrieval method based on Ontology |
CN105335487A (en) * | 2015-10-16 | 2016-02-17 | 北京农业信息技术研究中心 | Agricultural specialist information retrieval system and method on basis of agricultural technology information ontology library |
-
2016
- 2016-07-07 CN CN201610534240.9A patent/CN106372099B/en active Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090125549A1 (en) * | 2007-11-13 | 2009-05-14 | Nec (China) Co., Ltd. | Method and system for calculating competitiveness metric between objects |
CN102236641A (en) * | 2011-05-18 | 2011-11-09 | 安徽农业大学 | Method for generating similarity matrix between concepts in agricultural field |
CN102231151A (en) * | 2011-05-19 | 2011-11-02 | 安徽农业大学 | Method for modelling adaptive learning of agricultural ontology |
CN103440314A (en) * | 2013-08-27 | 2013-12-11 | 北京工业大学 | Semantic retrieval method based on Ontology |
CN105335487A (en) * | 2015-10-16 | 2016-02-17 | 北京农业信息技术研究中心 | Agricultural specialist information retrieval system and method on basis of agricultural technology information ontology library |
Non-Patent Citations (1)
Title |
---|
章丽: "基于黄金准则的农业领域本体概念有效性评估方法研究", 《苏州大学学报》 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107704602A (en) * | 2017-10-16 | 2018-02-16 | 西南大学 | Extensive agriculture Ontology matching process based on AGROVOC |
CN107704602B (en) * | 2017-10-16 | 2021-02-02 | 西南大学 | Large-scale agricultural semantic ontology matching method based on AGROVOC |
Also Published As
Publication number | Publication date |
---|---|
CN106372099B (en) | 2020-06-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108182262A (en) | Intelligent Answer System construction method and system based on deep learning and knowledge mapping | |
CN108447534A (en) | A kind of electronic health record data quality management method based on NLP | |
CN106295186A (en) | A kind of method and system of aided disease diagnosis based on intelligent inference | |
CN106815293A (en) | System and method for constructing knowledge graph for information analysis | |
Ratajczyk et al. | Challenges and opportunities in coding the commons: problems, procedures, and potential solutions in large-N comparative case studies | |
CN107705839A (en) | Disease automatic coding and system | |
CN107577826A (en) | Classification of diseases coding method and system based on raw diagnostic data | |
CN107578106A (en) | A kind of neutral net natural language inference method for merging semanteme of word knowledge | |
CN103207855A (en) | Fine-grained sentiment analysis system and method specific to product comment information | |
CN106202955A (en) | Diagnosis associated packets method and system based on intellectual coded adaptation | |
CN101493820A (en) | Medicine Regulatory industry knowledge base platform and construct method thereof | |
CN110209839A (en) | Agricultural knowledge map construction device, method and computer readable storage medium | |
CN113962219A (en) | Semantic matching method and system for knowledge retrieval and question answering of power transformer | |
CN113032418B (en) | Method for converting complex natural language query into SQL (structured query language) based on tree model | |
CN104217013B (en) | The positive and negative mode excavation method and system of course based on the item weighted sum item collection degree of association | |
CN104537280B (en) | Protein interactive relation recognition methods based on text relation similitude | |
Elayidom et al. | A generalized data mining framework for placement chance prediction problems | |
CN109241199A (en) | A method of it is found towards financial knowledge mapping | |
CN114580418A (en) | Knowledge map system for police physical training | |
CN108932350A (en) | Based on shifty rice pest intelligent answer method | |
CN106372099A (en) | Agricultural field ontology validity assessment method | |
Ni et al. | Rapid generation of emergency response plans for unconventional emergencies | |
Bouhineau et al. | Share data treatment and analysis processes inTechnology enhanced learning | |
Campos | Designing a Network of Reference Ontologies for the Integration of Water Quality Data | |
Li et al. | Multi-source Education Knowledge Graph Construction and Fusion for College Curricula |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |