CN106372099A - Agricultural field ontology validity assessment method - Google Patents

Agricultural field ontology validity assessment method Download PDF

Info

Publication number
CN106372099A
CN106372099A CN201610534240.9A CN201610534240A CN106372099A CN 106372099 A CN106372099 A CN 106372099A CN 201610534240 A CN201610534240 A CN 201610534240A CN 106372099 A CN106372099 A CN 106372099A
Authority
CN
China
Prior art keywords
concept
relation
agriculture field
sim
classification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610534240.9A
Other languages
Chinese (zh)
Other versions
CN106372099B (en
Inventor
李绍稳
章丽
张筱丹
刘超
赵会园
许高建
徐济成
耿凡凡
孔晨
李景霞
孙靓
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Anhui Agricultural University AHAU
Original Assignee
Anhui Agricultural University AHAU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Anhui Agricultural University AHAU filed Critical Anhui Agricultural University AHAU
Priority to CN201610534240.9A priority Critical patent/CN106372099B/en
Publication of CN106372099A publication Critical patent/CN106372099A/en
Application granted granted Critical
Publication of CN106372099B publication Critical patent/CN106372099B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/285Clustering or classification

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Machine Translation (AREA)

Abstract

The invention discloses an agricultural field ontology validity assessment method. Agricultural field ontology concepts are assessed in combination with a recall ratio R, a precision ratio P and an F1 index; the consistency, accuracy and conciseness of a classification relationship among the agricultural field ontology concepts are assessed in combination with semantic similarity among the concepts; and an object attribute relationship among the agricultural field ontology concepts is assessed according to an assessment index. Therefore, three important problems, including validity assessment of the ontology concepts, validity assessment of the ontology classification relationship and validity assessment of the ontology non-classification relationship, existent in agricultural field ontology validity assessment are solved emphatically. An agricultural field ontology validity assessment method system proposed by the invention takes an agricultural field ontology as a research object, and can effectively assess the quality of the agricultural field ontology and improve the agricultural ontology construction efficiency.

Description

A kind of agriculture field body efficiency assessment method
Technical field
The present invention relates to computer information technology is in the application of agriculture field, more particularly, to a kind of agriculture field body is effective Property appraisal procedure.
Background technology
Body as a kind of shared ideas model that can advise knowledge in semantic level upper table, day by day become information management, The important component part of the multiple fields such as information retrieval and semantic web.Particularly since semantic web proposition, body is just gradually Become an important research direction in artificial intelligence and knowledge engineering field, in the acquisition of knowledge, expression, analysis and application etc. Aspect has far-reaching significance.Body, according to the level of detail and field dependency degree two indices, can be divided into 4 classifications: top Body, domain body, task ontology and applied ontology.Wherein, domain body is professional body, describes a certain specific Relation between the vocabulary of concept and concept in section field, or the most important theories of this ambit.Due to domain body Knowledge has significant domain feature, so domain body can be more reasonable and effectively carry out expressing for knowledge.
The structure of domain body makes the shared of domain knowledge and reuse be possibly realized.In numerous domain knowledges, agriculture Industry domain knowledge is a kind of high-quality production factors, and agricultural workforce and the efficiency of capital production can be made to increase substantially; Agriculture knowledge be also Agricultural Information application basis, as agricultural information system, agricultural decision making system, agriculture specialist system, Agriculture intelligence control system etc. all be unable to do without agricultural knowledge.Agriculture field body is one and comprises agriculture term, definition and art Between language, the system of normative connection explanation, is the formalization table of the mutual relation between concept, concept and concept in agriculture ambit Reach.It can be expressed as four-tuple, including body essential information, such as body title, founder, design time, modification time, mesh And the body such as Knowledge Source metadata information;The set of agricultural knowledge concept;Classification relation between agricultural knowledge concept and Non-categorical set of relationship;Axiomatic set theory present in body.
This physical ability of agriculture field describes on semantic level to the related data of agricultural, information and knowledge it is provided that disease The ontology services such as pest control, Making A Strategic Decision of The Agricultural Productions support, intelligent diagnostics and control.Carry within 2000 since FAO (Food and Agriculture Organization of the United Nation) Since going out Agricultural ontology service research plan, how effectively the agriculture field body quantity of different scales is being in just explosive growth, Assess the structure quality of these agriculture field bodies, if really meet domain knowledge situation, could really embody knowledge organization Function, effect of Agriculture knowledge service can be realized, automatically or semi-automatically whether Method for Ontology Learning scientific and reasonable, how to help Help user to select more suitably Agricultural ontology, become problem demanding prompt solution in current agricultural ontology research.Accordingly, it would be desirable to agriculture Industry domain body carries out efficiency assessment.
Body is assessed as an important means assessing this weight, by the method with science, it then follows unified Index system, assessment performance under specific application area or specific environment for the body and the suitability, are whether impact body can be An important factor in order of large-scale application in semantic net.Set up a set of scientific and effective body Assessment theory and assessment side Method, contributes to weighing whether body constructing method has effectiveness, is conducive to existing body is effectively managed, updates and ties up Shield, helps instruct ontological construction process and lifting body construction efficiency, thus forming more scientific rational domain body and more preferably Ground provides associated body service.
Associated specialist scholar has done substantial amounts of research in terms of body assessment both at home and abroad, and proposes a lot of body assessment sides Method.On the whole, body appraisal procedure mainly has seven kinds, and they are respectively: appraisal procedure based on user, the commenting of task based access control Estimate method, the appraisal procedure based on principle, the appraisal procedure Ji Yu " gold criterion ", the appraisal procedure based on application, be based on language The material appraisal procedure in storehouse and the appraisal procedure based on compound standard.With the constantly improve of body appraisal procedure, body automatic Change assessment also to be developed rapidly, and have developed some body assessment tools, wherein odeval, ontoqa, core, Ontomanager, aeon etc. are wherein to apply more extensive body assessment tool.
Although existing body appraisal procedure and assessment tool achieve some achievements, in terms of some, body can be entered Row assessment, but for Chinese body and the efficiency assessment method being applied to complicated ambit body such as agriculture field body Or few, and existing body appraisal procedure compares one-sided, does not form a whole set of scientific comprehensive body assessment Theory and method system.
Content of the invention
The technical problem being existed based on background technology, the present invention proposes a kind of agriculture field body efficiency assessment side Method.
A kind of agriculture field body efficiency assessment method proposed by the present invention, comprises the following steps:
S1, assessment agriculture field Ontological concept;
Classification relation between s2, assessment agriculture field Ontological concept;
Non-categorical relation between s3, assessment agriculture field Ontological concept;
Step s1 specifically include following step by step:
S11, agriculture field body to be assessed is parsed according to concept, obtained concept set;
S12, setting Concept Evaluation index, Concept Evaluation index includes recall ratio r, precision ratio p and f1 index, wherein,
It is defined on recall ratio and in the case that precision ratio occurs contradiction, this recall ratio and precision ratio are weighted be in harmonious proportion The average value obtaining is f index,
f = ( α 2 + 1 ) p × r α 2 ( p + r )
F1 index is f index during α=1,
f 1 = 2 × p × r p + r
S13, the ambit scope according to body to be assessed, select or build corresponding domain lexicon;
S14, it is that recall ratio, precision ratio and f1 exponent pair concept are estimated according to Concept Evaluation index, obtain correctly general Read number;
S15, the assessment result of output recall ratio, precision ratio and f1 index;
Step s2 specifically include following step by step:
S21, by agriculture field body to be assessed according to concept between subclassof relation parsed, and protected Save as data structure be figure g=<v, e>data file, v be Ontological concept set, e be classification relation set;
S22, three evaluation indexes of setting classification relation assessment, are concordance, accuracy and terseness respectively;
Semantic similarity between s23, calculating concept, and optimal threshold ε is set;
S24, set up Concept Semantic Similarity set s (s1, s2..., sn) and concordance, accuracy and terseness judgement knot Fruit set t (t1, t2..., tn), si=sim (vi, vj), i, j are natural numbers, and concept vi is concept vj hypernym, and vi≠ vj
S25, judge that whether concept vi hypernym number is more than 1, be, then judge the classification error of concept vi and be stored in judgement and tie Fruit set t (t1, t2..., tn);
S26, by sim (vi, vj) compare with optimal threshold ε, and judge whether concept vi and concept vj accord with according to comparative result Close classification relation, meet, then by siIt is stored in Concept Semantic Similarity set s (s1, s2..., sn);Do not meet and then judge concept vi Classification error simultaneously will determine that result is stored in judged result set t (t1, t2..., tn);
S27, judge whether sim (vi, vj)=sim (vi, vk), if it is present judging concept vjWith concept vkSend out Raw redundancy error is simultaneously stored in judged result set t (t1, t2..., tn);
S28, repeat step s25 to s27, until all calculating and having judged Ontological concept set v and classification relation set e Finish;
S29, output Similarity Measure results set s and concordance, accuracy and terseness judged result set t;
Step s3 specifically include following step by step:
S31, parsed according to object properties relation pair agriculture field body, the attributed relational graph that obtains of parsing is preserved For data structure it isData file, and represented with adjacency matrix a;V is Ontological concept set, e is attribute Set of relationship,For relation on attributes concept pair;
The evaluation index of object properties relation between s32, setting agriculture field Ontological concept, and sentenced according to evaluation index setting Surely set q;
S33, according to evaluation index, between agriculture field Ontological concept, object properties relation is estimated, and by assessment result Record is in judging set q;
Relation on attributes set e in s34, output agriculture field body, relation on attributes concept pairAnd result of determination set q.
Preferably, in step s13, the structure of domain lexicon comprises the following steps:
S131, determine subject fields scope, and assembling sphere vocabulary;
S132, determine the classification characteristicses of vocabulary, mutual relation between organized words and vocabulary, form vocabulary structure, and specify Attribute dictionary;
S133, the vocabulary to dictionary and relation carry out examination & verification and revise.
Preferably, in step s131, from document, subject classical works, Baidupedia, existing dictionary and expertise way Footpath obtains Field Words.
Preferably, in step s23, concept c1And c2Between semantic similarity sim (c1, c2) computing formula is as follows:
sim(c1,c2)=[sim_dist (c1,c2)]α×[sim_depth(c1,c2)]β×[sim_width(c1,c2) ]γ,
Wherein, sim_dist (c1, c2) for semantic similarity and semantic distance corresponding relation, sim_depth (c1, c2 For the corresponding relation of semantic similarity and semantic depth, sim_width (c1, c2) it is semantic similarity and semantic node density Corresponding relation;α represents the weight of semantic distance, the weight of β representational level depth, and γ represents the weight of node density, α, β, γ For regulation parameter, and α, β, γ sum is 1.
Preferably,Wherein, dist (c1, c2) represent semantic distance, a For regulation parameter;dist(c1,c2)=len (c1,c2), len (c1,c2) represent concept c1And c2Between path.
Preferably,Depth (c) represents concept c Depth, depth (c)=depth (parent (c))+1, parent (c) represents the father node of concept c it is preferable that root node Depth depth (root)=1.
Preferably, the density of concept c is expressed as width (c), and it is directly proportional to classification degree of refinement, as width (c1)≤ width(c2),
s i m _ w i d t h ( c 1 , c 2 ) = w i d t h ( c 1 ) w i d t h ( c 2 ) .
Preferably, in step s26, if ε is < sim (vi, vj) < 1, then concept viWith concept vjMeet classification relation requirement; If 0≤sim is (vi, vj) < ε, then judge concept viWith concept vjThere is classification error and be stored in judged result set t.
Preferably, in judged result set t, tnCorresponding with concept vn, if concept vn meets evaluation index, correspond to tn It is labeled;If concept vn does not meet evaluation index, the type of error that corresponding tn mark is judged.
Preferably, between agriculture field Ontological concept, the evaluation index of object properties relation includes:
Reversibility: there is relation r between concept a and concept b, meanwhile, there is relation r between concept b and concept a-1, R is then claimed to have reversibility;
Relation inheritance: there is relation r between concept a and father's concept b, there is also between concept a and sub- concept c meanwhile Relation r, then claim relation r to have relation inheritance;
Inverse relationship inheritance: there is relation r between father's concept and concept a, meanwhile, between its sub- concept and concept a There is also relation r, then claim relation r to have inverse relationship inheritance;
Transitivity: if there is relation r between concept a and concept b, there is relation r between concept b and concept c, meanwhile, generally Read and between a and concept c, there is relation r, then claim relation r to have transitivity;
Symmetry: if concept a and concept b have relation r, be denoted as a=r (b), then concept b and concept a have relation r, are denoted as b =r (a);
Reflexivity: concept a has relation r, if r has reflexivity, a=r (a).
A kind of agriculture field body efficiency assessment method that the present invention provides, refers in conjunction with recall ratio r, precision ratio p and f1 Several agriculture field Ontological concept is estimated, in conjunction with concept between semantic similarity classify between agriculture field Ontological concept pass The concordance of system, accuracy, terseness are estimated, according to evaluation index between agriculture field Ontological concept object properties relation It is estimated.So, to solve three major issue bodies present in agriculture field body efficiency assessment general for emphasis The efficiency assessment of the efficiency assessment, the efficiency assessment of Ontology relation and body non-categorical relation read.
Agriculture field body efficiency assessment method system proposed by the present invention, can with agriculture field body as object of study Effectively assess the quality of agriculture field body, lifting Agricultural ontology builds efficiency.
Brief description
Fig. 1 is a kind of agriculture field body efficiency assessment method flow diagram proposed by the present invention;
Fig. 2 is assessment agriculture field Ontological concept method flow diagram;
Fig. 3 is classification relation method flow diagram between assessment agriculture field Ontological concept;
Fig. 4 is non-categorical relational approach flow chart between assessment agriculture field Ontological concept;
Fig. 5 assesses prototype system overall framework figure for tea-plant pests body.
Specific embodiment
With reference to Fig. 1, a kind of agriculture field body efficiency assessment method proposed by the present invention, comprise the following steps.
S1, assessment agriculture field Ontological concept;
Classification relation between s2, assessment agriculture field Ontological concept;
Non-categorical relation between s3, assessment agriculture field Ontological concept.
Concept, as the basic element of ontology model, is the core of expression domain knowledge.Therefore to domain body concept Assessment is the pith completing body assessment.
With reference to Fig. 2, in present embodiment, assessment agriculture field Ontological concept be step s1 specifically include following step by step.
S11, agriculture field body to be assessed is parsed according to concept, obtained concept set.This step is to defeated The agriculture field body to be assessed entering carries out body pretreatment, specifically can be led agricultural to be assessed by body analytical tool Domain body is parsed according to concept.
S12, setting Concept Evaluation index, Concept Evaluation index includes recall ratio r, precision ratio p and f1 index.
Recall ratio r refers to the correct concept number in body and the ratio of the sum of the concept in domain lexicon,
Precision ratio p refers to the correct concept number in body and the ratio of the sum of the concept in body,
It is defined on recall ratio and in the case that precision ratio occurs contradiction, this recall ratio and precision ratio are weighted be in harmonious proportion The average value obtaining is f index, and f1 index is the specific value of f index.F index is specifically calculated as follows with f1 index:
f = ( &alpha; 2 + 1 ) p &times; r &alpha; 2 ( p + r ) ;
F1 index is f index during α=1,
f 1 = 2 &times; p &times; r p + r .
The evaluation index of one group of scientific and reasonable concept should be able to evaluate the level of coverage of concept in constructed body, general Read the accuracy of term, i.e. concept set in body to be assessed should farthest comprise the important terms in this field. And the concept defined in body not only needs to have integrity in addition it is also necessary to ensure the correctness of Ontological concept.
Recall ratio, precision ratio and f index as reflection Concept Evaluation effect important indicator, f index can comprehensively this two Individual evaluation index, for the index that concentrated expression is overall, f1 index is clear and definite to f index.In present embodiment, complete to look into Rate, precision ratio and f1 index, as Concept Evaluation index, are conducive to the levels of precision of Concept Evaluation.
S13, the ambit scope according to body to be assessed, select or build corresponding domain lexicon.
Domain lexicon refers to the term of specific area or the set of vocabulary, can reflect information in certain specific field, The elements such as concept, element therein can extract from related corpus and refine.With conventional dictionaries simply simply by word Converge with organizing according to morphology or lexicographic order and compare, modern dictionary not only concept to be comprised, more to consider general in the middle of dictionary Read the relation and concept between.The accuracy of Concept Evaluation result depends on the selection of domain lexicon, therefore, the choosing of domain lexicon Fixed extremely important.
In step s13, the structure of domain lexicon comprises the following steps:
S131, determine subject fields scope, and assembling sphere vocabulary;Specifically, Field Words can be classical from document, subject Works, Baidupedia, existing dictionary and expertise approach obtain.
S132, determine the classification characteristicses of vocabulary, mutual relation between organized words and vocabulary, form vocabulary structure, and specify Attribute dictionary;
S133, the vocabulary to dictionary and relation carry out examination & verification and revise.
S14, it is that recall ratio, precision ratio and f1 exponent pair concept are estimated according to Concept Evaluation index, obtain correctly general Read number.
S15, the assessment result of output recall ratio, precision ratio and f1 index, obtain Concept Evaluation result.
Relation is that in body, the one kind between concept and concept contacts, and the classification relation between agriculture field Ontological concept is concrete Show as subclassof relation, describe the relation between father's concept and sub- concept.Classification relation system is general as tissue The basis read, is the backbone structure of body.Therefore, how research is effectively assessed to classification relation is to be highly desirable to 's.
With reference to Fig. 3, in present embodiment, between assessment agriculture field Ontological concept classification relation be step s2 specifically include with Under step by step.
S21, by body analytical tool by agriculture field body to be assessed according to concept between subclassof relation Parsed, and be saved as data structure be figure g=<v, e>data file, v be Ontological concept set, e be classification close Assembly is closed.
S22, three evaluation indexes of setting classification relation assessment, are concordance, accuracy and terseness respectively.
In present embodiment, three evaluation indexes have been directed to common three kinds of typical faults in classification relation and classify not Unanimously, classification error and redundancy error.Concordance is mainly used to occur in that conflicting classification relation in detection body;Accurately Property mainly in Ontology system occur concept classification Problem-Error;Terseness is primarily directed in Ontology system There is the redundancy error of same concept in middle discovery.
Semantic similarity between s23, calculating concept, and optimal threshold ε is set.In present embodiment, optimal threshold ε can tie Close Conventional wisdom and analysis expert to obtain through test.
Classification relation system in body is to arrange concept with tree according to subclassof, traditional based on language The Concept Semantic Similarity algorithm of adopted distance can reasonably utilize this type of organization.Traditional general based on semantic distance Read on the basis of Arithmetic of Semantic Similarity, the present invention has carried out certain optimization and adjustment, by level depth, node density Take into account, and be provided with regulation parameter, using a kind of improved Concept Semantic Similarity algorithm.
Semantic similarity between concept is calculated using Concept Semantic Similarity algorithm.Concept c1And c2Between semantic phase Like degree sim (c1, c2) computing formula is as follows:
sim(c1,c2)=[sim_dist (c1,c2)]α×[sim_depth(c1,c2)]β×[sim_width(c1,c2) ]γ,
Wherein, sim_dist (c1, c2) for semantic similarity and semantic distance corresponding relation, sim_depth (c1, c2For Semantic similarity and the corresponding relation of semantic depth, sim_width (c1, c2) right for semantic similarity and semantic node density Should be related to;α represents the weight of semantic distance, the weight of β representational level depth, and γ represents the weight of node density, and α, β, γ are Regulation parameter, and α, β, γ sum is 1.
Concept Semantic Similarity value and semantic distance are inversely proportional to.
Wherein, dist (c1, c2) representing semantic distance, a is to adjust Parameter;If the weight of each edge is the same, weight is all set to 1, then
dist(c1,c2)=len (c1,c2),
len(c1,c2) represent concept c1And c2Between path.
Concept Semantic Similarity value is directly proportional to level depth difference, with level depth and being inversely proportional to.
Depth (c) represents the depth of concept c, Depth (c)=depth (parent (c))+1, parent (c) represents the father node of concept c.Depth depth of root node (root)=1.
Concept Semantic Similarity value and node density are inversely proportional to.In classification relation system, density higher finger classification is thinner Cause, degree of refinement is higher, and between concept and concept, similarity degree is lower.The density of concept c is expressed as width (c), its with classification Degree of refinement is directly proportional, as width (c1)≤width(c2),
s i m _ w i d t h ( c 1 , c 2 ) = w i d t h ( c 1 ) w i d t h ( c 2 ) .
So:
s i m ( c 1 , c 2 ) = &lsqb; s i m _ d i s t ( c 1 , c 2 ) &rsqb; &alpha; &times; &lsqb; s i m _ d e p t h ( c 1 , c 2 ) &rsqb; &beta; &times; &lsqb; s i m _ w i d t h ( c 1 , c 2 ) &rsqb; &gamma; = &lsqb; a a + d i s t ( c 1 , c 2 ) &rsqb; &alpha; &times; &lsqb; 1 - | d e p t h ( c 1 ) - d e p t h ( c 2 ) | d e p t h ( c 1 ) + d e p t h ( c 2 ) &rsqb; &beta; &times; &lsqb; w i d t h ( c 1 ) w i d t h ( c 2 ) &rsqb; &gamma; .
S24, set up Concept Semantic Similarity set s (s1, s2..., sn) and concordance, accuracy and terseness judgement knot Fruit set t (t1, t2..., tn), si=sim (vi, vj), i, j are natural numbers, and concept vi is concept vj hypernym, and vi≠ vj.
In judged result set t, tnCorresponding with concept vn, if concept vn meets evaluation index, correspond to tnEnter rower Note, specifically can label symbol " √ ";If concept vn does not meet evaluation index, the type of error that corresponding tn mark is judged, Type of error is classification error or redundancy error.
If s25 concept vi hypernym number is more than 1, judge the classification error of concept vi and in judged result set t (t1, t2..., tn) in corresponding tiRecord sort mistake.
S26, by sim (vi, vj) compare with optimal threshold ε, and judge whether concept vi and concept vj accord with according to comparative result Close classification relation, meet, then by siIt is stored in Concept Semantic Similarity set s (s1, s2..., sn);Do not meet and then judge concept vi Classification error simultaneously will determine that result is stored in judged result set t (t1, t2..., tn).
Specifically, if ε is < sim (vi, vj) < 1, then concept viWith concept vjMeet classification relation requirement, by si=sim (vi, vj) it is stored in Concept Semantic Similarity set s;If 0≤sim is (vi, vj) < ε, then judge concept viWith concept vjThere is classification Mistake, is unsatisfactory for accuracy, corresponding t in judged result set ti、tjRecord sort mistake.
S27, judge whether sim (vi, vj)=sim (vi, vk), if it is present judging concept vjWith concept vkIt is There is redundancy error and be stored in judged result set t (t in synonym1, t2..., tn), specifically, in judged result set t Corresponding tj, t record redundancy error.
S28, repeat step s25 to s27, until all calculating and having judged Ontological concept set v and classification relation set e Finish.
S29, output Similarity Measure results set s and concordance, accuracy and terseness judged result set t.
According to relation not same-action in the body, classification relation and two kinds of non-categorical relation can be divided into.Non-categorical closes System illustrates all other all relations being not belonging to classification relation such as cause effect relation, relation on attributes, example relationship.Non-categorical closes System enhances completeness and the complexity of ontology knowledge expression, makes body really become network structure.Therefore, to non-categorical relation The efficiency assessment that carries out have important theory significance and researching value.
Non-categorical relation has many kinds, in agriculture field body, wherein function the strongest, using most be object properties Relation.The present invention is mainly estimated to the object properties relation in non-categorical relation system, proposes a kind of theoretical based on figure Non-categorical relationship assessment method between agriculture field Ontological concept.
With reference to Fig. 4, in present embodiment, between assessment agriculture field Ontological concept, non-categorical relation is that step s3 specifically includes Below step by step.
S31, parsed according to object properties relation pair agriculture field body using analytical tool, will the genus that obtains of parsing Sexual intercourse figure saves as data structureData file, and represented with adjacency matrix a;Wherein, v is this Body concept set, e be relation on attributes set,For relation on attributes concept pair.
The evaluation index of object properties relation between s32, setting agriculture field Ontological concept, and sentenced according to evaluation index setting Surely set q.
S33, according to evaluation index, between agriculture field Ontological concept, object properties relation is estimated, and by assessment result Record is in judging set q.
In present embodiment, between agriculture field Ontological concept, the evaluation index of object properties relation includes following six.
1. reversibility: there is relation r between concept a and concept b, meanwhile, there is relation r between concept b and concept a-1, R is then claimed to have reversibility.
The judgement of " reversibility " evaluation index can be carried out using the theoretical relevant knowledge of figure.If in adjacency matrix a=(aij) in Find aij=1, aji=1, and fromFind aijCorresponding relation on attributes r1 and ajiCorresponding relation on attributes r2 is inverse each other closing System, then meet " reversibility ".Will determine that result record in set of matrices q.
2. relation inheritance: there is relation r between concept a and father's concept b, also deposit between concept a and sub- concept c meanwhile In relation r, then relation r is claimed to have relation inheritance.
The judgement of relation inheritance feature needs combining classification relation system.If aij=1, aik=1, and fromFind aijWith aikCorresponding relation on attributes name r is also identical, and finds vjAnd vkBelong to father's subclass relation in classification relation system, then meet and " close It is inheritance ";If aij=1, aik≠ 1, then it is unsatisfactory for " relation inheritance ".And will determine that result record in set of matrices q.
3. inverse relationship inheritance: there is relation r between father's concept and concept a, meanwhile, its sub- concept and concept a it Between there is also relation r, then claim relation r there is inverse relationship inheritance.
The judgement of inverse relationship inheritance feature needs combining classification relation system.If aik=1, ajk=1, and fromFind aikAnd ajkCorresponding non-categorical relation name r is also identical, and finds viAnd vjBelong to father's subclass relation in classification relation system, then full Foot " the reverse inheritance of relation ";If aik=1, ajk≠ 1, then it is unsatisfactory for " inverse relationship inheritance ".And will determine that result record exists In set of matrices q.
4. transitivity: if there is relation r between concept a and concept b, between concept b and concept c, there is relation r, meanwhile, There is relation r between concept a and concept c, then claim relation r to have transitivity.
If in adjacency matrix a=(aij) in find aij=1, ajk=1, and aik=1, and fromFind relation on attributes name r Identical, i.e. concept vi, vkBetween relation on attributes contradict with " there is no transitivity " this property;?In find out aij, ajk, aikCorresponding relation on attributes name, and will determine that result record in set of matrices q.
5. symmetry: if concept a and concept b have relation r, be denoted as a=r (b), then concept b and concept a have relation r, are denoted as B=r (a).
If in adjacency matrix a=(aij) in find aij=1, aji=1, and fromFind that relation on attributes name is also identical, that is, generally Read viAnd vjContradict with relation on attributes " there is no symmetry " this property, find concept viAnd vjAnd institute is right between them The relation on attributes answered, and will determine that result record in set of matrices q.
6. reflexivity: concept a has relation r, if r has reflexivity, a=r (a).
If in adjacency matrix a=(aij) in found aiiWhen=1 (1≤i≤n), then this result and relation on attributes " do not have Having reflexivity " this property contradicts, and finds aiiCorresponding concept and relation on attributes, and will determine that result record in matrix stack Close in q.
Relation on attributes refers to the attribute that certain concept is another concept, describes " property of things and its mutual pass System ".Wherein, object properties relation is used for associating two concepts, the non-categorical relation between expression concept, and relation on attributes has inverse The constraint such as reflexive, symmetry, transitivity, its domain of definition and codomain determine two classes of association respectively, and attribute-name can be according to reality Situation is named.Six properties being had according to relation on attributes: have reversibility, relation inheritance, inverse relationship inheritance and There is no transitivity, symmetry and reflexivity.In present embodiment, six evaluation index reflexivitys of setting, symmetry, can Inverse property, transitivity, relation inheritance, inverse relationship inheritance are true to the assessment of the relation on attributes in non-categorical relation system The real extent of reaction is very high.
S34, according to above step by all concepts in Ontological concept set v and relation on attributes set e and relation on attributes All after the completion of decision analysis, the relation on attributes set e in output agriculture field body, relation on attributes concept pairAnd judge Results set q.
Specifically commented using the tea-plant pests body of above agriculture field body efficiency assessment method below in conjunction with a kind of Estimate system above method is further explained.
By verifying feasibility and the effectiveness of proposed agriculture field body appraisal procedure, java language specifically can be used Speech, calls ontology development bag jena, completes the prototype system of tea-plant pests body assessment in eclipse development platform.Should The development environment that system is used is window7, and development platform is eclipse3.3, and jdk version for 1.8, jena version is jena2.6.4.This prototype system can be parsed to tea-plant pests body and be assessed.
Tea-plant pests body assessment system includes tea-plant pests body pretreatment module, the Concept Evaluation of tea-plant pests body Non-categorical relationship assessment between the concept of classification relation evaluation module and tea-plant pests body between module, the concept of tea-plant pests body Module.With reference to Fig. 5, the operating procedure of the system includes.
A) the tea-plant pests body of structure is imported body assessment system.
B) the tea-plant pests body that imports enters body pretreatment module, by call jena bag to the concept in body, Classification relation, non-categorical relation etc. are parsed.
C) in the Concept Evaluation module of tea-plant pests body, using step 1) assessment agriculture field Ontological concept side Method, calculates recall ratio, precision ratio and f index respectively, and exports the assessment result of recall ratio, precision ratio and f1 index.
D) in classification relation evaluation module between the concept of tea-plant pests body, using step 2) assessment agriculture field this The method of classification relation between body concept, obtain classification relation between agriculture field Ontological concept Similarity Measure results set s and Concordance, accuracy and terseness judged result, and export.
E) in non-categorical relationship assessment module between the concept of tea-plant pests body, using step 3) assessment agriculture field Between Ontological concept, the method for non-categorical relation is estimated respectively to six evaluation indexes of relation on attributes, and exports.This step Specifically, the relation on attributes in tea-plant pests body, after parsing, is shown in the form of relation on attributes name, domain of definition, codomain In systems, and using the non-categorical relationship assessment method theoretical based on figure six evaluation indexes of relation on attributes are carried out respectively Assessment, and show non-categorical relationship assessment result in systems.
By the test analysis to system, the present invention achieves good effect, demonstrates the feasibility of the method and has Effect property.Tea-plant pests body assess prototype system using based on domain lexicon Concept Evaluation method, based on Concept Semantic similar The classification relation appraisal procedure of degree and the non-categorical relationship assessment method based on figure theory can be with effectively solving agriculture field bodies Evaluation problem, the effectively quality of the assessment agriculture field body of structure, assess existing automatically or semi-automatically ontological construction The feasibility of method, instructs Ontology engineering teacher to construct higher-quality Agricultural ontology, provides more for agriculture field associated user Suitable Agricultural ontology, for the rationale carrying out Agricultural ontology assessment in a deep going way and key technology research, has certain showing Sincere justice and practical study are worth.
The above, the only present invention preferably specific embodiment, but protection scope of the present invention is not limited thereto, Any those familiar with the art the invention discloses technical scope in, technology according to the present invention scheme and its Inventive concept equivalent or change in addition, all should be included within the scope of the present invention.

Claims (10)

1. a kind of agriculture field body efficiency assessment method is it is characterised in that comprise the following steps:
S1, assessment agriculture field Ontological concept;
Classification relation between s2, assessment agriculture field Ontological concept;
Non-categorical relation between s3, assessment agriculture field Ontological concept;
Step s1 specifically include following step by step:
S11, agriculture field body to be assessed is parsed according to concept, obtained concept set;
S12, setting Concept Evaluation index, Concept Evaluation index includes recall ratio r, precision ratio p and f1 index, wherein,
It is defined on recall ratio and in the case that precision ratio occurs contradiction, harmonic average is weighted to this recall ratio and precision ratio The value obtaining is f index,
f = ( &alpha; 2 + 1 ) p &times; r &alpha; 2 ( p + r )
F1 index is f index during α=1,
f 1 = 2 &times; p &times; r p + r
S13, the ambit scope according to body to be assessed, select or build corresponding domain lexicon;
S14, it is that recall ratio, precision ratio and f1 exponent pair concept are estimated according to Concept Evaluation index, obtain correct concept number;
S15, the assessment result of output recall ratio, precision ratio and f1 index;
Step s2 specifically include following step by step:
S21, by agriculture field body to be assessed according to concept between subclassof relation parsed, and be saved as Data structure be figure g=<v, e>data file, v be Ontological concept set, e be classification relation set;
S22, three evaluation indexes of setting classification relation assessment, are concordance, accuracy and terseness respectively;
Semantic similarity between s23, calculating concept, and optimal threshold ε is set;
S24, set up Concept Semantic Similarity set s (s1, s2..., sn) and concordance, accuracy and terseness judged result collection Close t (t1, t2..., tn), si=sim (vi, vj), i, j are natural numbers, and concept vi is concept vj hypernym, and vi≠vj
S25, judge that whether concept vi hypernym number is more than 1, be then to judge the classification error of concept vi and be stored in judged result collection Close t (t1, t2..., tn);
S26, by sim (vi, vj) compare with optimal threshold ε, and concept vi is judged according to comparative result and whether concept vj meets point Class relation, meets, then by siIt is stored in Concept Semantic Similarity set s (s1, s2..., sn);Do not meet and then judge that concept vi is classified Mistake simultaneously will determine that result is stored in judged result set t (t1, t2..., tn);
S27, judge whether sim (vi, vj)=sim (vi, vk), if it is present judging concept vjWith concept vkOccur superfluous Remaining mistake is simultaneously stored in judged result set t (t1, t2..., tn);
S28, repeat step s25 to s27, finish until Ontological concept set v and classification relation set e is all calculated judgement;
S29, output Similarity Measure results set s and concordance, accuracy and terseness judged result set t;
Step s3 specifically include following step by step:
S31, parsed according to object properties relation pair agriculture field body, the attributed relational graph that parsing is obtained saves as number According to structure it isData file, and represented with adjacency matrix a;V is Ontological concept set, e is relation on attributes Set,For relation on attributes concept pair;
The evaluation index of object properties relation between s32, setting agriculture field Ontological concept, and collection is judged according to evaluation index setting Close q;
S33, according to evaluation index, between agriculture field Ontological concept, object properties relation is estimated, and by assessment result record In judging set q;
Relation on attributes set e in s34, output agriculture field body, relation on attributes concept pairAnd result of determination set q.
2. agriculture field body efficiency assessment method as claimed in claim 1 is it is characterised in that domain term in step s13 The structure of allusion quotation comprises the following steps:
S131, determine subject fields scope, and assembling sphere vocabulary;
S132, determine the classification characteristicses of vocabulary, mutual relation between organized words and vocabulary, form vocabulary structure, and specify attribute Dictionary;
S133, the vocabulary to dictionary and relation carry out examination & verification and revise.
3. agriculture field body efficiency assessment method as claimed in claim 2 is it is characterised in that in step s131, from literary composition Offer, subject classical works, Baidupedia, existing dictionary and expertise approach obtain Field Words.
4. agriculture field body efficiency assessment method as claimed in claim 1 is it is characterised in that in step s23, concept c1 And c2Between semantic similarity sim (c1, c2) computing formula is as follows:
sim(c1,c2)=[sim_dist (c1,c2)]α×[sim_depth(c1,c2)]β×[sim_width(c1,c2)]γ,
Wherein, sim_dist (c1, c2) for semantic similarity and semantic distance corresponding relation, sim_depth (c1, c2For semanteme Similarity and the corresponding relation of semantic depth, sim_width (c1, c2) close for semantic similarity is corresponding with semantic node density System;α represents the weight of semantic distance, the weight of β representational level depth, and γ represents the weight of node density, and α, β, γ are to adjust Parameter, and α, β, γ sum is 1.
5. agriculture field body efficiency assessment method as claimed in claim 4 it is characterised in thatWherein, dist (c1, c2) representing semantic distance, a is regulation parameter;dist (c1,c2)=len (c1,c2), len (c1,c2) represent concept c1And c2Between path.
6. agriculture field body efficiency assessment method as claimed in claim 4 it is characterised in thatThe depth of depth (c) expression concept c, depth (c)= Depth (parent (c))+1, parent (c) represents the father node of concept c it is preferable that depth depth (root) of root node =1.
7. agriculture field body efficiency assessment method as claimed in claim 4 is it is characterised in that the density of concept c represents For width (c), it is directly proportional to classification degree of refinement, as width (c1)≤width(c2),
s i m _ w i d t h ( c 1 , c 2 ) = w i d t h ( c 1 ) w i d t h ( c 2 ) .
8. agriculture field body efficiency assessment method as claimed in claim 1 is it is characterised in that in step s26, if ε is < sim(vi, vj) < 1, then concept viWith concept vjMeet classification relation requirement;If 0≤sim is (vi, vj) < ε, then judge concept vi With concept vjThere is classification error and be stored in judged result set t.
9. agriculture field body efficiency assessment method as claimed in claim 1 is it is characterised in that in judged result set t, tnCorresponding with concept vn, if concept vn meets evaluation index, correspond to tnIt is labeled;If concept vn does not meet assessment referred to Mark, then correspond to the type of error that tn mark is judged.
10. agriculture field body efficiency assessment method as claimed in claim 1 is it is characterised in that agriculture field body is general Between thought, the evaluation index of object properties relation includes:
Reversibility: there is relation r between concept a and concept b, meanwhile, there is relation r between concept b and concept a-1, then claim r There is reversibility;
Relation inheritance: there is relation r between concept a and father's concept b, meanwhile, there is also relation between concept a and sub- concept c R, then claim relation r to have relation inheritance;
Inverse relationship inheritance: there is relation r between father's concept and concept a, also deposit between its sub- concept and concept a meanwhile In relation r, then relation r is claimed to have inverse relationship inheritance;
Transitivity: if there is relation r between concept a and concept b, between concept b and concept c, there is relation r, meanwhile, concept a with There is relation r between concept c, then claim relation r to have transitivity;
Symmetry: if concept a and concept b have relation r, be denoted as a=r (b), then concept b and concept a have relation r, are denoted as b=r (a);
Reflexivity: concept a has relation r, if r has reflexivity, a=r (a).
CN201610534240.9A 2016-07-07 2016-07-07 Agricultural field ontology effectiveness evaluation method Active CN106372099B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610534240.9A CN106372099B (en) 2016-07-07 2016-07-07 Agricultural field ontology effectiveness evaluation method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610534240.9A CN106372099B (en) 2016-07-07 2016-07-07 Agricultural field ontology effectiveness evaluation method

Publications (2)

Publication Number Publication Date
CN106372099A true CN106372099A (en) 2017-02-01
CN106372099B CN106372099B (en) 2020-06-16

Family

ID=57880237

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610534240.9A Active CN106372099B (en) 2016-07-07 2016-07-07 Agricultural field ontology effectiveness evaluation method

Country Status (1)

Country Link
CN (1) CN106372099B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107704602A (en) * 2017-10-16 2018-02-16 西南大学 Extensive agriculture Ontology matching process based on AGROVOC

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090125549A1 (en) * 2007-11-13 2009-05-14 Nec (China) Co., Ltd. Method and system for calculating competitiveness metric between objects
CN102231151A (en) * 2011-05-19 2011-11-02 安徽农业大学 Method for modelling adaptive learning of agricultural ontology
CN102236641A (en) * 2011-05-18 2011-11-09 安徽农业大学 Method for generating similarity matrix between concepts in agricultural field
CN103440314A (en) * 2013-08-27 2013-12-11 北京工业大学 Semantic retrieval method based on Ontology
CN105335487A (en) * 2015-10-16 2016-02-17 北京农业信息技术研究中心 Agricultural specialist information retrieval system and method on basis of agricultural technology information ontology library

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090125549A1 (en) * 2007-11-13 2009-05-14 Nec (China) Co., Ltd. Method and system for calculating competitiveness metric between objects
CN102236641A (en) * 2011-05-18 2011-11-09 安徽农业大学 Method for generating similarity matrix between concepts in agricultural field
CN102231151A (en) * 2011-05-19 2011-11-02 安徽农业大学 Method for modelling adaptive learning of agricultural ontology
CN103440314A (en) * 2013-08-27 2013-12-11 北京工业大学 Semantic retrieval method based on Ontology
CN105335487A (en) * 2015-10-16 2016-02-17 北京农业信息技术研究中心 Agricultural specialist information retrieval system and method on basis of agricultural technology information ontology library

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
章丽: "基于黄金准则的农业领域本体概念有效性评估方法研究", 《苏州大学学报》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107704602A (en) * 2017-10-16 2018-02-16 西南大学 Extensive agriculture Ontology matching process based on AGROVOC
CN107704602B (en) * 2017-10-16 2021-02-02 西南大学 Large-scale agricultural semantic ontology matching method based on AGROVOC

Also Published As

Publication number Publication date
CN106372099B (en) 2020-06-16

Similar Documents

Publication Publication Date Title
CN108182262A (en) Intelligent Answer System construction method and system based on deep learning and knowledge mapping
CN108447534A (en) A kind of electronic health record data quality management method based on NLP
CN106295186A (en) A kind of method and system of aided disease diagnosis based on intelligent inference
CN106815293A (en) System and method for constructing knowledge graph for information analysis
Ratajczyk et al. Challenges and opportunities in coding the commons: problems, procedures, and potential solutions in large-N comparative case studies
CN107705839A (en) Disease automatic coding and system
CN107577826A (en) Classification of diseases coding method and system based on raw diagnostic data
CN107578106A (en) A kind of neutral net natural language inference method for merging semanteme of word knowledge
CN103207855A (en) Fine-grained sentiment analysis system and method specific to product comment information
CN106202955A (en) Diagnosis associated packets method and system based on intellectual coded adaptation
CN101493820A (en) Medicine Regulatory industry knowledge base platform and construct method thereof
CN110209839A (en) Agricultural knowledge map construction device, method and computer readable storage medium
CN113962219A (en) Semantic matching method and system for knowledge retrieval and question answering of power transformer
CN113032418B (en) Method for converting complex natural language query into SQL (structured query language) based on tree model
CN104217013B (en) The positive and negative mode excavation method and system of course based on the item weighted sum item collection degree of association
CN104537280B (en) Protein interactive relation recognition methods based on text relation similitude
Elayidom et al. A generalized data mining framework for placement chance prediction problems
CN109241199A (en) A method of it is found towards financial knowledge mapping
CN114580418A (en) Knowledge map system for police physical training
CN108932350A (en) Based on shifty rice pest intelligent answer method
CN106372099A (en) Agricultural field ontology validity assessment method
Ni et al. Rapid generation of emergency response plans for unconventional emergencies
Bouhineau et al. Share data treatment and analysis processes inTechnology enhanced learning
Campos Designing a Network of Reference Ontologies for the Integration of Water Quality Data
Li et al. Multi-source Education Knowledge Graph Construction and Fusion for College Curricula

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant