CN109582961A - A kind of efficient robot data similarity calculation algorithm - Google Patents

A kind of efficient robot data similarity calculation algorithm Download PDF

Info

Publication number
CN109582961A
CN109582961A CN201811433367.7A CN201811433367A CN109582961A CN 109582961 A CN109582961 A CN 109582961A CN 201811433367 A CN201811433367 A CN 201811433367A CN 109582961 A CN109582961 A CN 109582961A
Authority
CN
China
Prior art keywords
word
term
similarity
mapping
ontology
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811433367.7A
Other languages
Chinese (zh)
Inventor
罗志勇
范志鹏
赵杰
王月
韩冷
于士杰
郑焕平
蔡婷
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chongqing University of Post and Telecommunications
Original Assignee
Chongqing University of Post and Telecommunications
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chongqing University of Post and Telecommunications filed Critical Chongqing University of Post and Telecommunications
Priority to CN201811433367.7A priority Critical patent/CN109582961A/en
Publication of CN109582961A publication Critical patent/CN109582961A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • G06F40/247Thesauruses; Synonyms
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • G06F40/295Named entity recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)

Abstract

The phenomenon that a kind of efficient robot data similarity calculation algorithm is claimed in the present invention, is related to field there are multiple and different ontologies, and then cause the Interoperability between isomery ontology.This method defines ontology pretreatment, maps candidate generation, the merging of the Word similarity of concept term, multiple mapping and mapping result output.In all kinds of mapping methods, the concept similarity based on word constitutive characteristic calculates other corpus resources not needed in addition to word itself substantially and supports that calculating has the characteristics that direct and agility.But there are still for semantic identical but write the problems such as not fully consistent allosome, synonym similarity calculation be difficult and the allocation strategy of concept term composition term weighing to be matched is not perfect for presently relevant method.For these problems, the invention proposes a kind of efficient robot data similarity calculation algorithms, to promote Ontology Mapping resultant effect.

Description

A kind of efficient robot data similarity calculation algorithm
Technical field
The invention belongs to computer information processing fields more particularly to a kind of efficient robot data similarity calculation to calculate Method.
Background technique
Ontology Mapping Method can generally be summarized as following 4 kinds:
(1) by concept similarity calculation method, compare the similarity penetrated between object, to find between isomery ontology Connection, such as Rodr í guez etc. propose a kind of method for calculating Concept Similarity using concept definition, by the concept in ontology It is divided into three parts of semantic relation collection for indicating the synset of concept, the feature set for portraying concept, concept, and utilizes this three Divide and carries out similarity calculation, the final mapping relations determined between concept.
(2) similitude between isomery ontology in structure is analyzed, finds mapping relations by writing mapping ruler. Sunna etc. proposes a kind of method for using bulk junction composition as contextual information to realize Ontology Mapping.This method is in addition to examining Consider outside node self-information, reference is also made to the multi-level information such as its father node, child nodes, grandchild node.
(3) by the example in ontology, the mapping relations between ontology are found using technologies such as machine learning.It is typical Example is the GLUE system of the propositions such as the Doan of University of Washington.This method comprehensively considers the various Heterogeneities of ontology, passes through Machine learning classifies to the example of concept, and the Joint Distribution probability that then occurs in concept using example calculates concept Between similarity and combine domain constraint and heuristic knowledge finally to determine mapping relations.
(4) a variety of methods are carried out to comprehensive Ontology Mapping Method.Shailendra etc. develops a hybrid ontology and reflects Engine is penetrated, by the Ontology Mapping algorithm based on syntax, the Ontology Mapping algorithm based on background knowledge and structure-based ontology Mapping algorithm is integrated, and multi-angle is mapped from many aspects, while playing various method advantages, also compensates for difference The shortcoming of method.High bright wait improves similarity calculating method in terms of Ontological concept title, structure, example, attribute 4, And propose the similarity calculating method of fusion.Li Jia etc. proposes the side using Hownet, calculated in conjunction with a variety of Lexical Similarities Method realizes the ELOMC system of Chinese Ontology Mapping.
In above-mentioned Ontology Mapping Method, find that the method contacted between ontology is generally only borrowed by concept similarity calculating Help word, phrase of expression Ontological concept etc. that can carry out similarity calculation as input, it is of less demanding to design conditions.This Outside, such method not only can individually carry out Ontology Mapping calculating, but also can easily be counted with other types method integration It calculates, use is more flexible, therefore is widely used.The core of such Ontology Mapping Method is the similarity based on word It calculates.There are some similarity calculating methods based on word at present, following 4 major class can be divided into:
(1) cosine value method.Cosine similarity (Cosine Similarity) algorithm by calculate have n dimension two to Cosine angle between amount reflects estimating for similarity degree between vector.Such algorithm has widely applicable, realization simply The features such as, it is one of the Words similarity algorithm being most widely used.
(2) edit distance approach.In this type of method, the Smith- that Smith and Wa-terma was proposed in 1981 Waterman algorithm is most representative.There is no directly two sequences of calculating othernesses on the whole for the algorithm, but by Different location insertion space, multiple calculate take optimal method in shorter sequence, find out two sequence highest similarity value conducts Final result.The Jaro-Winkler Distance algorithm that Winkler is proposed is also a kind of more typical editing distance calculation Method, he improves on the basis of JaroDistance algorithm, the quantity factor of character match is not only allowed for, also by character Matched positional factor takes into account.
(3) similarity calculating method based on word role.The word of Ontological concept title is constituted, that is, in title There is different similarities to contribute by centre word (Head) and qualifier part (Modifiers), can be used as that estimate word similar The important symbol of property.Each term is expressed as by Nenadic etc. in carrying out field of biomedicine Similarity of Term calculating task Bipartite word description scheme, i.e., the modification composition part of key vocabularies and term in term;Then one is used The Dice coefficients (Dicelike Coefficient) of a weighting compare the word description scheme of two terms.Those are shared more The long term for collectively constituting part will obtain more score values;If two term keywords having the same, additional point Value will increase in similarity measure result.
(4) a variety of concept similarities calculate the method combined.Noess-ner etc. proposes the integrated side of CODI Words similarity Method, by cosine similarity algorithm, Levenshtein similarity algorithm, Jaro-Winkler Dis-tance similarity algorithm, Smith-Waterman similarity algorithm, Over-lap Coefficient similarity algorithm and Jaccard similarity algorithm It is integrated with specified weight, to promote the concept term similarity calculation effect during Ontology Matching.
In the above-mentioned experiment for calculating progress domain body mapping by concept similarity of application, find existing method two A aspect still has deficiency:
(1) for allosome word as similar " Mutation " (variation) and " Variation " (variation), synonym, closely Adopted word hardly results in correct mapping relations using the existing similarity calculating method based on word.Especially when these words When the centre word of the term as expression Ontological concept, counting loss will lead to corresponding Ontological concept matching fault.
(2) existing algorithm distributes the similar weight of qualifier in term in average mode, this makes similarity calculation As a result discrimination is bad.
The present invention proposes a kind of efficient robot data similarity calculation algorithm, introduces the synonym of WordNet, nearly justice Word and search and editing distance calculate, to solve the problems, such as that related algorithm is difficult to allosome word, synonym, near synonym;By again The Weight Value Distributed Methods for designing term centre word and term qualifier, keep centre word, qualifier weight distribution more reasonable, in turn Effectively promote the resultant effect of Ontological concept mapping.
Summary of the invention
Present invention seek to address that the above problem of the prior art.Propose it is a kind of needed for condition it is few, dispose quick and easy, energy It is enough effectively to promote the efficient robot data similarity calculation algorithm for calculating effect.Technical scheme is as follows:
A kind of efficient robot data similarity calculation algorithm comprising following steps:
A. ontology to be matched is imported, ontology to be matched is pre-processed;
B. mapping is candidate generates, including between the similar situation in pretreatment ontology concept characteristic, conceptual example Factor is analyzed, and selects concept mapping candidate right;
C. synonymous using the editing distance similarity calculation, the Word Net based on centre word that constitute word based on term The method that word, near synonym retrieval and the weight automatic Assignment based on term centre word, qualifier combine, between ontology The term description collections that combination of two is formed are to progress similarity calculation;
D. knot of the highest concept of mapping of similarity to set, as Ontology Mapping is selected from qualified concept group Fruit is exported, and is formatted output and storage to Ontology Mapping result.
Further, in the step a ontology pretreatment specifically include: import ontology to be matched, in ontology include class Concept, attribute, the term for reading including example, attribute instance parsed, feature extraction and formatted storage, be subsequent With ready for operation.
Further, candidate generate is mapped in the step b to have used for reference Huber etc. and use in CODI Ontology Mapping Method Mapping candidate's generation method, have main steps that: factors such as similar situation in 1. betweens of ontologies concept characteristic, conceptual example point Analysis;2. the possible concept mapping of selection and combination is candidate right.
Further, described using the editing distance similarity calculation for constituting word based on term, it specifically includes: 3.1) art Language constitutes the similarity calculation of word level, on the basis of establishing the word one-to-one similarity calculation in term, therefore, leads to The mode of two term word matrixes of building is crossed, the best match corresponding relationship in term between word is found, in the list of two terms In set of words matching process, in order to find optimum matching relation, following formula is proposed:
sim(wi,wj)=dω(wi,wj)if dω(wi,wj)≥0.8
Wherein, sim (wi,wj) indicating similarity in two terms between any word pair, similarity threshold takes 0.8, dω (wi,wj) indicate to calculate the word w obtained by editing distance formulaiAnd wjBetween editing distance.
Further, the similarity calculation between the word pair is divided into three kinds of situations:
1. if the d of two wordsωValue is less than 0.8, then it is assumed that mismatches between them, similarity 0;
2. if the d of two wordsωValue is more than or equal to 0.8, then it is assumed that mismatches between them, similarity dω
3. if first retrieving Word Net two words are the centre word of respectively place term and judging whether the two is same each other Adopted word, near synonym or antonym, if it is return value is 1, and otherwise word calculates similarity by two.
Further, the Words similarity weight distribution specifically includes: parsing, is found out in analytic tree to term The most deep noun in position, and using the word as term centre word, other words are as qualifier, to term centre word and qualifier It carries out after correctly distinguishing, carries out the distribution of similarity weight to each word using following formula;
Wherein, wt1iAnd wt2jRespectively indicate term t1In i-th of word and term t2In j-th of word, d (wt1i) indicate word Language wt1iWith t1The distance between subject term in term, Weight (wt1i,wt2j) indicate that matching word is to similar in progress in two terms Weight distribution when degree calculates.
Further, the Words similarity COMPREHENSIVE CALCULATING formula are as follows:
Wherein, t1And t2For the term pair for needing to calculate Words similarity, sim (wt1i,wt2j) indicate term t1In i-th Word and term t2In similarity between j-th of word, Weight (wt1i,wt2j) indicate according to and subject term in respective term it Between distance, be wt1iAnd wt2jThe similarity weight of distribution, in denominator the value range of l be [0, Max (| t1|,|t2|) -1], really Entire calculated result has been protected to be in always in [0,1] range.
Further, multiple mapping merges in the step d and mapping result exports, and specifically includes: according to experimental data Analysis sets optimal trusted degree threshold value, filters out the concept pair for having more than this threshold value, is being more than the ontology to be matched of threshold value Concept centering will form the mapping relations of 1:n, m:1 or m:n between concept, in order to be translated into 1:1 mapping relations, so as to Mapping result collection is manually marked with domain expert to compare, and needs to select similarity from these qualified concept groups again To set, the result as Ontology Mapping is exported highest concept of mapping;And Ontology Mapping result is formatted Output and storage.
It advantages of the present invention and has the beneficial effect that:
Innovative point of the invention is to propose a kind of efficient robot data similarity calculation algorithm, in Ontology Mapping Concept similarity calculates existing deficiency, proposes a kind of improved method.By the synonym of Word Net, near synonym retrieval and editor Distance algorithm introduces the similarity deterministic process between term centre word, and by new automatic Weight Value Distributed Methods in term Heart word and term modification Word similarity are integrated.It comprises the concrete steps that: ontology pretreatment, the candidate generation of mapping, Words similarity Calculating, multiple mapping merge and mapping result output, wherein in Word similarity, by constituting centre word to term Carry out Word Net synonym, near synonym retrieval, and to the centre word and all qualifiers that can not retrieve edited away from From calculating, the correct matching relationship between two terms composition word is found, and is each section by similarity Weight Value Distributed Methods With relation allocation weight, similarity calculation result between term is obtained eventually by COMPREHENSIVE CALCULATING formula.The advantages of this method is: tool Have required condition it is few, deployment it is quick and easy, can effectively promote calculating effect
Detailed description of the invention
Fig. 1 is that the present invention provides Ontology Mapping procedure chart of the preferred embodiment based on Word similarity;
Fig. 2 is Word similarity flow chart of the invention.
Specific embodiment
Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, detailed Carefully describe.Described embodiment is only a part of the embodiments of the present invention.
The technical solution that the present invention solves above-mentioned technical problem is:
The invention proposes a kind of efficient robot data similarity calculation algorithms, it is characterised in that: by editing distance The composition word match that algorithm is applied to Ontological concept term calculates, and the synonym of WordNet, near synonym retrieval are applied to art Similarity judgement between language centre word, and the Weight Value Distributed Methods of Design Technology language centre word and term qualifier again, to mention Rise Ontology Mapping resultant effect.Hereinafter reference will be made to the drawings and invention is further described in detail in conjunction with example.
As shown in Figure 1, the Ontology Mapping procedure chart of the invention based on Word similarity, is specifically realized in :
1. ontology pre-processes, it is responsible for importing ontology to be matched, to the concept of class in ontology, attribute, reads example, attribute instance Equal terms are parsed, feature extraction and formatted storage, are subsequent match ready for operation.
It is generated 2. mapping is candidate, by between the factor analyses such as similar situation in ontology concept characteristic, conceptual example, choosing It selects and to combine the mapping of possible concept candidate right.
3. Word similarity, using constituting the editing distance similarity calculation of word based on term, be based on centre word Word Net synonym, near synonym retrieval and the weight automatic Assignment based on term centre word, qualifier combine Method, the term description collections formed between combination of two ontology are to similarity calculation is carried out, and result is as judging ontology The important evidence of middle concept mapping, the specific implementation process is as follows step:
3.1) term constitutes the similarity calculation of word level, it should establish the one-to-one similarity of word in term On the basis of calculating.Therefore, it is necessary to find the best match in term between word by way of constructing two term word matrixes Corresponding relationship.In the set of letters matching process of two terms, in order to find optimum matching relation, following formula is proposed:
sim(wi,wj)=dω(wi,wj)if dω(wi,wj)≥0.8
Wherein, sim (wi,wj) indicate similarity in two terms between any word pair, it is determined here by Germicidal efficacy Similarity threshold take it is 0.8 more appropriate.dω(wi,wj) indicate to calculate the word w obtained by editing distance formulaiAnd wjIt Between editing distance.Similarity calculation between word pair is divided into three kinds of situations:
1. if the d of two wordsωValue is less than 0.8, then it is assumed that mismatches between them, similarity 0;
2. if the d of two wordsωValue is more than or equal to 0.8, then it is assumed that mismatches between them, similarity dω
3. if first retrieving Word Net two words are the centre word of respectively place term and judging whether the two is same each other Adopted word, near synonym or antonym, if it is return value is 1, and otherwise word calculates similarity by two.
3.2) Words similarity weight distribution designs, and parses to term, finds out the name that position is most deep in analytic tree Word, and using the word as term centre word, other words are as qualifier.It is correctly distinguished to term centre word and qualifier Afterwards, the distribution of similarity weight is carried out to each word using following formula.
Wherein, wt1iAnd wt2jRespectively indicate term t1In i-th of word and term t2In j-th of word, d (wt1i) indicate word Language wt1iWith t1The distance between subject term in term, Weight (wt1i,wt2j) indicate that matching word is to similar in progress in two terms Weight distribution when degree calculates.
3.3) Words similarity COMPREHENSIVE CALCULATING designs
Wherein, t1And t2For the term pair for needing to calculate Words similarity, sim (wt1i,wt2j) indicate term t1In i-th Word and term t2In similarity between j-th of word, Weight (wt1i,wt2j) indicate according to and subject term in respective term it Between distance, be wt1iAnd wt2jThe similarity weight of distribution, in denominator the value range of l be [0, Max (| t1|,|t2|) -1], really Entire calculated result has been protected to be in always in [0,1] range.
4. multiple mapping merges and mapping result output, according to analysis of experimental data, optimal trusted degree threshold value, screening are set The concept pair of this threshold value is had more than out.In the Ontological concept pair to be matched for being more than threshold value, it may will form between concept The mapping relations of 1:n, m:1 or m:n, in order to be translated into 1:1 mapping relations, manually to mark mapping knot with domain expert Fruit collection compares, and needs to select the highest concept of mapping of similarity to set from these qualified concept groups again, make It is exported for the result of Ontology Mapping;And output and storage are formatted to Ontology Mapping result, in order to subsequent benefit With and calculate effect evaluation and test.
As shown in Fig. 2, Word similarity flow chart of the invention, Fig. 2 is done for Word similarity in Fig. 1 Further refinement.Specifically it is achieved in that
1. obtaining two words first from ontology library;
2. two words that will acquire carry out judging whether it is synonym;
3. if two words are synonym, return value 1;
4. if obtaining the distance of two words by calculating two words are not synonyms;
5. the similarity of two words is 0 when if the distance of two words is less than 0.8;
6. the similarity of two words is d when if the distance of two words is greater than or equal to 0.8w(between two words away from From value).
The above embodiment is interpreted as being merely to illustrate the present invention rather than limit the scope of the invention.? After the content for having read record of the invention, technical staff can be made various changes or modifications the present invention, these equivalent changes Change and modification equally falls into the scope of the claims in the present invention.

Claims (8)

1. a kind of efficient robot data similarity calculation algorithm, which comprises the following steps:
A. ontology to be matched is imported, ontology to be matched is pre-processed;
B. mapping is candidate generates, by between the factor including the similar situation in pretreatment ontology concept characteristic, conceptual example It is analyzed, selects concept mapping candidate right;
C. the editing distance similarity calculation of word, the Word Net synonym based on centre word, close is constituted using based on term The method that adopted word and search and weight automatic Assignment based on term centre word, qualifier combine, between ontology two-by-two The term description collections formed are combined to progress similarity calculation;
D. select the highest concept of mapping of similarity to set from qualified concept group, as Ontology Mapping result into Row output, and output and storage are formatted to Ontology Mapping result.
2. a kind of efficient robot data similarity calculation algorithm according to claim 1, which is characterized in that the step Ontology pretreatment specifically includes in rapid a: ontology to be matched is imported, to including the concept of class in ontology, attribute, read example, attribute Term including example parsed, feature extraction and formatted storage, is subsequent match ready for operation.
3. a kind of efficient robot data similarity calculation algorithm according to claim 1, which is characterized in that the step Candidate generate of mapping has used for reference Huber etc. and map candidate's generation method used in CODI Ontology Mapping Method in rapid b, mainly Step is: the similar situation factor analysis in 1. betweens of ontologies concept characteristic, conceptual example;2. the possible concept of selection and combination Mapping is candidate right.
4. a kind of efficient robot data similarity calculation algorithm according to claim 1, which is characterized in that described to adopt With the editing distance similarity calculation for constituting word based on term, specifically include: 3.1) term constitutes the similarity of word level It calculates, on the basis of establishing the word one-to-one similarity calculation in term, therefore, passes through two term word matrixes of building Mode finds the best match corresponding relationship in term between word, in the set of letters matching process of two terms, in order to Optimum matching relation is enough found, proposes following formula:
sim(wi,wj)=dω(wi,wj)if dω(wi,wj)≥0.8
Wherein, sim (wi,wj) indicating similarity in two terms between any word pair, similarity threshold takes 0.8, dω(wi,wj) It indicates to calculate the word w obtained by editing distance formulaiAnd wjBetween editing distance.
5. a kind of efficient robot data similarity calculation algorithm according to claim 4, which is characterized in that the list Similarity calculation between word pair is divided into three kinds of situations:
1. if the d of two wordsωValue is less than 0.8, then it is assumed that mismatches between them, similarity 0;
2. if the d of two wordsωValue is more than or equal to 0.8, then it is assumed that mismatches between them, similarity dω
3. if first retrieving Word Net judges whether the two is synonymous each other two words are the centre word of respectively place term Word, near synonym or antonym, if it is return value is 1, and otherwise word calculates similarity by two.
6. a kind of efficient robot data similarity calculation algorithm according to claim 4, which is characterized in that institute's predicate Language similarity weight distribution specifically includes: parsing to term, finds out the noun that position is most deep in analytic tree, and by the word As term centre word, other words are as qualifier, and after correctly distinguish to term centre word and qualifier, use is following Formula carries out the distribution of similarity weight to each word;
Wherein, wt1iAnd wt2jRespectively indicate term t1In i-th of word and term t2In j-th of word, d (wt1i) indicate word wt1iWith t1The distance between subject term in term, Weight (wt1i,wt2j) indicate that matching word is in progress similarity in two terms Weight distribution when calculating.
7. a kind of efficient robot data similarity calculation algorithm according to claim 6, which is characterized in that institute's predicate Language similarity COMPREHENSIVE CALCULATING formula are as follows:
Wherein, t1And t2For the term pair for needing to calculate Words similarity, sim (wt1i,wt2j) indicate term t1In i-th of word With term t2In similarity between j-th of word, Weight (wt1i,wt2j) indicate according to between subject term in respective term Distance is wt1iAnd wt2jThe similarity weight of distribution, in denominator the value range of l be [0, Max (| t1|,|t2|) -1], it is ensured that Entire calculated result is in always in [0,1] range.
8. a kind of efficient robot data similarity calculation algorithm according to claim 7, which is characterized in that the step Multiple mapping merges in rapid d and mapping result exports, and specifically includes: according to analysis of experimental data, optimal trusted degree threshold value is set, The concept pair for having more than this threshold value is filtered out, in the Ontological concept pair to be matched for being more than threshold value, will form between concept The mapping relations of 1:n, m:1 or m:n, in order to be translated into 1:1 mapping relations, manually to mark mapping knot with domain expert Fruit collection compares, and needs to select the highest concept of mapping of similarity to set from these qualified concept groups again, make It is exported for the result of Ontology Mapping;And output and storage are formatted to Ontology Mapping result.
CN201811433367.7A 2018-11-28 2018-11-28 A kind of efficient robot data similarity calculation algorithm Pending CN109582961A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811433367.7A CN109582961A (en) 2018-11-28 2018-11-28 A kind of efficient robot data similarity calculation algorithm

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811433367.7A CN109582961A (en) 2018-11-28 2018-11-28 A kind of efficient robot data similarity calculation algorithm

Publications (1)

Publication Number Publication Date
CN109582961A true CN109582961A (en) 2019-04-05

Family

ID=65924735

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811433367.7A Pending CN109582961A (en) 2018-11-28 2018-11-28 A kind of efficient robot data similarity calculation algorithm

Country Status (1)

Country Link
CN (1) CN109582961A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110704522A (en) * 2019-09-04 2020-01-17 南京航空航天大学 Concept data model automatic conversion method based on semantic analysis
CN111768821A (en) * 2020-05-29 2020-10-13 上海森亿医疗科技有限公司 Distributed patient record matching method, system and terminal
WO2021051763A1 (en) * 2019-09-16 2021-03-25 深圳中兴网信科技有限公司 Term matching method and apparatus, terminal, and computer readable storage medium
CN113326686A (en) * 2020-02-28 2021-08-31 株式会社斯库林集团 Similarity calculation device, recording medium, and similarity calculation method

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104699767A (en) * 2015-02-15 2015-06-10 首都经济贸易大学 Large-scale ontology mapping method for Chinese languages
CN107885749A (en) * 2016-09-30 2018-04-06 南京理工大学 Ontology extends the process knowledge search method with collaborative filtering Weighted Fusion

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104699767A (en) * 2015-02-15 2015-06-10 首都经济贸易大学 Large-scale ontology mapping method for Chinese languages
CN107885749A (en) * 2016-09-30 2018-04-06 南京理工大学 Ontology extends the process knowledge search method with collaborative filtering Weighted Fusion

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
徐健 等: "一种基于词语相似度计算的本体映射方法", 《现代图书情报技术》 *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110704522A (en) * 2019-09-04 2020-01-17 南京航空航天大学 Concept data model automatic conversion method based on semantic analysis
CN110704522B (en) * 2019-09-04 2023-04-07 南京航空航天大学 Concept data model automatic conversion method based on semantic analysis
WO2021051763A1 (en) * 2019-09-16 2021-03-25 深圳中兴网信科技有限公司 Term matching method and apparatus, terminal, and computer readable storage medium
CN113326686A (en) * 2020-02-28 2021-08-31 株式会社斯库林集团 Similarity calculation device, recording medium, and similarity calculation method
CN113326686B (en) * 2020-02-28 2024-05-10 株式会社斯库林集团 Similarity calculation device, recording medium, and similarity calculation method
CN111768821A (en) * 2020-05-29 2020-10-13 上海森亿医疗科技有限公司 Distributed patient record matching method, system and terminal

Similar Documents

Publication Publication Date Title
Grishman Information extraction
CN109582961A (en) A kind of efficient robot data similarity calculation algorithm
Kashyap et al. TaxaMiner: an experimentation framework for automated taxonomy bootstrapping
Sarkar et al. A new approach to keyphrase extraction using neural networks
CN106844658A (en) A kind of Chinese text knowledge mapping method for auto constructing and system
US20110208776A1 (en) Method and apparatus of semantic technological approach based on semantic relation in context and storage media having program source thereof
US9659001B2 (en) Query evaluation using ancestor information
Lin et al. An integrated approach to extracting ontological structures from folksonomies
CN101685455A (en) Method and system of data retrieval
Thijs Using neural-network based paragraph embeddings for the calculation of within and between document similarities
Janik et al. Training-less ontology-based text categorization
CN114997288A (en) Design resource association method
Babekr et al. Personalized semantic retrieval and summarization of web based documents
Ramachandran et al. A Novel Method for Text Summarization and Clustering of Documents
Luong et al. Ontology learning using word net lexical expansion and text mining
Chifu et al. Unsupervised semantic annotation of Web service datatypes
Bounhas et al. A hierarchical approach for semi-structured document indexing and terminology extraction
Chen English translation template retrieval based on semantic distance ontology knowledge recognition algorithm
Wang et al. A graph-based approach for semantic similar word retrieval
Abel et al. The impact of multifaceted tagging on learning tag relations and search
Boese et al. Semantic document networks to support concept retrieval
Yu et al. Performance analysis and assessment of a tf-idf based archetype-SNOMED-CT binding algorithm
Lauser Semi-automatic ontology engineering and ontology supported document indexing in a multilingual environment
Pernelle Semantic enrichment of data: annotation and data linking
Mousavi et al. Mining semantics structures from syntactic structures in web document corpora

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190405

RJ01 Rejection of invention patent application after publication