CN111626567A - Identification and calculation method for guaranteeing resource similarity - Google Patents

Publication number
CN111626567A
Authority
CN
China
Prior art keywords
similarity
resource
guaranteed
calculating
resources
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010370021.8A
Other languages
Chinese (zh)
Inventor
辛冀
王斌
刘松宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Helicopter Research and Development Institute
Original Assignee
China Helicopter Research and Development Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2020-04-30
Filing date
2020-04-30
Publication date
2020-09-04
Application filed by China Helicopter Research and Development Institute
Priority to CN202010370021.8A
Publication of CN111626567A
Legal status: Pending

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06Q: INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00: Administration; Management
    • G06Q10/06: Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063: Operations research, analysis or management
    • G06Q10/0631: Resource planning, allocation, distributing or scheduling for enterprises or organisations
    • G06Q10/06313: Resource planning in a project environment
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00: Pattern recognition
    • G06F18/20: Analysing
    • G06F18/22: Matching criteria, e.g. proximity measures
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06Q: INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00: Administration; Management
    • G06Q10/20: Administration of product repair or maintenance

Abstract

The invention belongs to the technical field of operation and maintenance support and relates to a method for identifying and calculating the similarity of support resources. The method fuses the Jaccard similarity coefficient with the vector-space cosine similarity: the support resource similarity of two pieces of equipment is expressed jointly by the number of common features, represented by the Jaccard similarity coefficient, and the values of the common features, represented by the cosine similarity, and the two are quantified in a unified way. The similarity calculation rule that the invention establishes for heterogeneous support resources is used to analyze resource similarity, to intelligently identify support resources that serve different support tasks but are similar in type and key features, and to provide suggestions for merging similar resources.

Description

Identification and calculation method for guaranteeing resource similarity
Technical Field
The invention belongs to the technical field of operation and maintenance support, relates to text data mining technology, and in particular relates to a method for identifying and calculating the similarity of support resources.
Background
At present, helicopter support equipment still commonly suffers from too many models, too many varieties, single-purpose functions, and large size. On the whole it lags behind the development of helicopter equipment, runs counter to the current support concept of high intensity, all-region operation, and rapid deployment, and restricts the overall combat capability and rapid maneuverability of helicopter aviation units. Although the development requirements of "integration, generalization, and miniaturization" (abbreviated as the "three -izations") have been put forward for aviation support equipment, the degree of generalization of support resources is currently low. When the similarity of different support resources is calculated, the attributes of the resources themselves must be considered, including function, weight, and external geometry, as well as the target subsystem or component they serve, their frequency of use, and so on. The discussion here focuses on the data sparsity problem in evaluating support resource similarity and the scalability problem that it causes.
Existing methods for calculating the similarity of support resources mainly include cosine similarity, the Pearson correlation coefficient, and the Jaccard similarity coefficient.
These methods have the following problems:
1. Cosine similarity uses the cosine of the angle between two vectors in a vector space as a measure of the difference between two individuals. It emphasizes the difference in direction between the two vectors rather than distance or length, so in the similarity calculation of support equipment it distinguishes differences mainly by direction and is insensitive to absolute values;
2. The Pearson correlation coefficient is mainly used to find the set of items that two users have both rated and then to calculate the correlation coefficient of the two resulting vectors. In the similarity calculation of support equipment it is insensitive to absolute values and only describes the trend of variation in the data;
3. The Jaccard similarity coefficient is mainly used to calculate the similarity between individuals described by symbolic or Boolean measures. Because the feature attributes of the individuals are symbolic or Boolean flags, the magnitude of a difference cannot be measured, and only the result of "whether they are the same" can be obtained.
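The three limitations listed above can be illustrated with a minimal sketch. The function names and the sample vectors are hypothetical and chosen only for illustration; they do not come from the patent.

```python
import math


def cosine(x, y):
    """Cosine of the angle between two equal-length numeric vectors."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return dot / (norm_x * norm_y)


def pearson(x, y):
    """Pearson correlation, computed as the cosine of the mean-centered vectors."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    return cosine([a - mean_x for a in x], [b - mean_y for b in y])


def jaccard(a, b):
    """Jaccard coefficient of two sets: size of intersection over size of union."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b)


# Hypothetical vectors: same direction, values differing by a factor of 100.
small = [1.0, 2.0, 3.0]
large = [100.0, 200.0, 300.0]

print(cosine(small, large))   # close to 1: magnitude difference is invisible
print(pearson(small, large))  # close to 1: only the trend is compared
# Jaccard sees only which features exist, not how large their values are.
print(jaccard({"weight", "power"}, {"weight", "power"}))
```

Cosine and Pearson both report near-perfect similarity for vectors that differ by two orders of magnitude, and Jaccard reports identity for any two items that merely share the same feature names.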
Disclosure of Invention
The purpose of the invention is to provide a method for identifying and calculating the similarity of support resources, so as to solve the technical problem that existing single-measure similarity calculations cannot consider the number of common features and their values at the same time.
To solve this technical problem, the technical solution of the invention is as follows:
A method for identifying and calculating the similarity of support resources, characterized in that the calculation fuses the Jaccard similarity coefficient with the vector-space cosine similarity, and the support resource similarity of two pieces of equipment is expressed by the number of common features, represented by the Jaccard similarity coefficient, together with the values of the common features, represented by the cosine similarity.
The method further comprises an operation of uniformly quantifying the number of common features and the values of the common features.
The method adopts the following fusion similarity formula:
[Figure RE-GDA0002600848310000021: fusion similarity formula, available only as an image]
In the formula, the similarity between equipment X and equipment Y is determined by two parts: the measure of the number of common features, represented by Jaccard, and the similarity of the common feature values, represented by the cosine cos. Fusing Jaccard and cosine in the similarity calculation allows the number and the values of the common features to be considered at the same time and quantified in a unified way. Because the Jaccard formula is treated as a weight, the similarity is attenuated according to the number of common features, so that a more accurate similarity between the features of heterogeneous equipment can be obtained when the data are sparse.
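The formula itself survives only as an image reference. One reading that is consistent with the description, in which the Jaccard term acts as a weight on the cosine term, is a simple product. This is an assumption and not the published formula; the symbols F_X and F_Y (the sets of features present for equipment X and Y) are introduced here for illustration only.

```latex
\operatorname{sim}(X, Y)
  = \underbrace{\frac{|F_X \cap F_Y|}{|F_X \cup F_Y|}}_{\text{Jaccard weight}}
    \cdot
    \underbrace{\frac{X \cdot Y}{\lVert X \rVert \, \lVert Y \rVert}}_{\text{cosine of common feature values}}
```

Under this reading, two resources that share every feature keep their full cosine similarity, while resources that share only a few features have their similarity scaled down in proportion.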
The method for identifying and calculating the similarity of support resources comprises the following steps:
Step one, collate the feature vectors of all support resources;
Step two, calculate the value of each element of the similarity matrix using the fusion similarity formula;
Step three, analyze the similarity of the support resources based on threshold judgment.
Preferably, the method further comprises a step of giving a resource merging suggestion according to the similarity.
Preferably, the support resource features include weight, output power, number of users, and frequency of use.
Preferably, the similarity matrix is corrected by taking the characteristics of the two pieces of equipment into account during value judgment.
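The steps above can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the fusion is assumed to be the Jaccard coefficient over feature names multiplied by the cosine over the shared feature values, and the resource names, feature values, and threshold are all hypothetical.

```python
import math
from itertools import combinations


def fused_similarity(x, y):
    """Assumed fusion: Jaccard over feature names times cosine over shared values."""
    common = sorted(set(x) & set(y))
    union = set(x) | set(y)
    if not common:
        return 0.0
    dot = sum(x[k] * y[k] for k in common)
    norm_x = math.sqrt(sum(x[k] ** 2 for k in common))
    norm_y = math.sqrt(sum(y[k] ** 2 for k in common))
    if norm_x == 0.0 or norm_y == 0.0:
        return 0.0
    jaccard_weight = len(common) / len(union)
    return jaccard_weight * dot / (norm_x * norm_y)


def similarity_matrix(resources):
    """Step two: fused similarity for every pair of resources."""
    names = list(resources)
    matrix = [[fused_similarity(resources[a], resources[b]) for b in names]
              for a in names]
    return names, matrix


def merge_suggestions(names, matrix, threshold):
    """Step three: list the pairs whose similarity reaches the threshold."""
    return [
        (names[i], names[j], matrix[i][j])
        for i, j in combinations(range(len(names)), 2)
        if matrix[i][j] >= threshold
    ]


# Step one: hypothetical feature vectors (values invented for illustration).
resources = {
    "tool_a": {"weight": 2.0, "users": 1.0, "frequency": 3.0},
    "tool_b": {"weight": 2.1, "users": 1.0, "frequency": 3.0},
    "tool_c": {"weight": 500.0, "output_power": 100.0},
}

names, matrix = similarity_matrix(resources)
for a, b, score in merge_suggestions(names, matrix, threshold=0.9):
    print(f"consider merging {a} and {b} (similarity {score:.3f})")
```

Representing each resource as a mapping from feature name to value makes a missing feature explicit, which is what lets the Jaccard term count common features when the data are sparse.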
The beneficial effects of the invention are as follows. The similarity calculation method that fuses Jaccard and cosine proposes, for the first time, a helicopter support resource similarity identification method that identifies the similarity of each attribute of heterogeneous support resources and provides input for subsequent support resource optimization. Because support space and other conditions are limited, the support resources that can be carried for helicopter equipment are very limited. Research on the generalization of support resources, which reduces the types and number of support resources carried, is therefore of great significance for improving helicopter support efficiency. The similarity calculation rule that the invention establishes for heterogeneous support resources is used to analyze resource similarity, to intelligently identify support resources that serve different support tasks but are similar in type and key features, and to provide suggestions for merging similar resources.
Drawings
In order to more clearly illustrate the technical solution of the present invention, the drawings used in the embodiment of the present invention will be briefly explained. It is obvious that the drawings described below are only some embodiments of the invention, and that for a person skilled in the art, other drawings can be obtained from these drawings without inventive effort.
FIG. 1 is a flow chart of the method of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention. It is to be understood that the embodiments described are only a few embodiments of the present invention, and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Features of various aspects of embodiments of the invention are described in detail below. In the following detailed description, numerous specific details are set forth in order to provide a thorough understanding of the present invention. It will be apparent, however, to one skilled in the art that the present invention may be practiced without these specific details. The following description of the embodiments is intended only to give a better understanding of the present invention by illustrating examples of it. The present invention is not limited to any particular arrangement or method provided below, but covers all product structures and any modifications or alterations of the method that do not depart from the spirit of the invention.
In the drawings and the following description, well-known structures and techniques are not shown to avoid unnecessarily obscuring the present invention.
The support resource similarity identification and calculation method of the invention uses a similarity calculation that fuses Jaccard and cosine and provides a helicopter support resource similarity identification method to identify the similarity of each attribute of heterogeneous support resources. The analysis below takes the support tools associated with the electrical system of a helicopter as an example. Following the flow chart shown in FIG. 1, the specific steps are as follows:
Step one, determine the equipment support resources and collate the support resource feature vectors;
Five typical tool resources are selected: a power supply vehicle, a multimeter, a brush extractor, a motor brush measuring tool, and a battery charging and discharging station. In selecting the features, attention is paid to several commonly used attributes of the tools: weight, volume, function, and frequency of use. The equipment support resource features are selected and expressed as vectors, and the value of each element of the similarity matrix is then calculated using the fusion similarity formula.
Step two, calculating the value of each element of the similarity matrix by using the fusion similarity formula;
The five tool resources (power supply vehicle, multimeter, brush extractor, motor brush measuring tool, and battery charging and discharging station) are denoted T1, T2, T3, T4, and T5 respectively. Each vector has 4 dimensions, representing weight, output power, number of users, and frequency of use (average number of uses per day). Specifically:
T1: [Figure RE-GDA0002600848310000041, vector available only as an image]
T2={1,0.01,1,1};
T3: [Figure RE-GDA0002600848310000042, vector available only as an image]
T4: [Figure RE-GDA0002600848310000043, vector available only as an image]
T5={1000,220,4,8};
The fused Jaccard-cosine similarity is then calculated to obtain the similarity matrix among the 5 resources, as follows:
[Figure RE-GDA0002600848310000044: similarity matrix of T1 to T5, available only as an image]
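Only two of the five vectors, T2 and T5, are given as text. The sketch below computes the plain cosine similarity of that pair and a Jaccard weight under one assumption that the patent does not state: a feature counts as present when its value is non-zero. The result is not claimed to match the published matrix, which is not reproduced in the text.

```python
import math

# Vectors as given in the text: weight, output power, number of users, frequency.
T2 = [1, 0.01, 1, 1]      # multimeter
T5 = [1000, 220, 4, 8]    # battery charging and discharging station


def cosine(x, y):
    """Cosine of the angle between two equal-length numeric vectors."""
    dot = sum(a * b for a, b in zip(x, y))
    norm_x = math.sqrt(sum(a * a for a in x))
    norm_y = math.sqrt(sum(b * b for b in y))
    return dot / (norm_x * norm_y)


plain = cosine(T2, T5)

# Assumption: a feature is "present" when its value is non-zero.
present_2 = {i for i, v in enumerate(T2) if v != 0}
present_5 = {i for i, v in enumerate(T5) if v != 0}
jaccard_weight = len(present_2 & present_5) / len(present_2 | present_5)

print(f"plain cosine = {plain:.3f}, Jaccard weight = {jaccard_weight}")
```

Under this assumption both tools have all four features, so the Jaccard weight does not penalize the pair. The weight dimension dominates the cosine because the raw features are on very different scales; whether the method normalizes features before the calculation is not stated in the text.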
Step three, analyze the similarity of the support resources based on threshold judgment;
From the similarity matrix, the highest similarity is between T3 and T4, at 0.998. It is worth noting that the similarity between T1 and T5 is high when calculated with the ordinary cosine formula. After Jaccard is introduced as a correction, the number of similar support features is also taken into account and the similarity is penalized, which better matches reality. A similar situation exists for T2 with T3 and T4. In the similarity matrix, the values for a given dimension may be close while the features are in fact of two different kinds, such as electronic and mechanical; such cases need to be corrected manually through a threshold.
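The manual correction described above could be sketched as a cap on the similarity of pairs whose kinds differ. The text does not specify how the correction is carried out, so the category labels, the cap value, and the 3x3 matrix below are hypothetical placeholders and are not taken from the patent.

```python
def correct_matrix(matrix, categories, cap):
    """Return a copy in which pairs with different category labels are capped."""
    size = len(matrix)
    corrected = [row[:] for row in matrix]
    for i in range(size):
        for j in range(size):
            if i != j and categories[i] != categories[j]:
                corrected[i][j] = min(corrected[i][j], cap)
    return corrected


# Placeholder similarity matrix; values are illustrative only.
matrix = [
    [1.00, 0.95, 0.20],
    [0.95, 1.00, 0.30],
    [0.20, 0.30, 1.00],
]
# Hypothetical kind of each resource, assigned by a human reviewer.
categories = ["electronic", "mechanical", "mechanical"]

corrected = correct_matrix(matrix, categories, cap=0.5)
for row in corrected:
    print(row)
```

Capping rather than zeroing keeps the corrected value visible for review while ensuring it stays below any merge threshold set above the cap.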
Step four, provide a resource generalization suggestion according to the similarity;
The highest similarity, 0.998, is between T3 and T4, so merging them into a generalized tool is recommended.
The foregoing is merely a detailed description of the embodiments of the present invention, and some of the conventional techniques are not detailed. The scope of the present invention is not limited thereto, and any changes or substitutions that can be easily made by those skilled in the art within the technical scope of the present invention will be covered by the scope of the present invention. The protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (8)

1. A method for identifying and calculating the similarity of support resources, characterized in that: the method performs the calculation by fusing the Jaccard similarity coefficient with the vector-space cosine similarity, and the support resource similarity of two pieces of equipment is expressed by the number of common features, represented by the Jaccard similarity coefficient, and the values of the common features, represented by the cosine similarity.
2. The support resource similarity identification and calculation method of claim 1, wherein: the method further comprises an operation of uniformly quantifying the number of common features and the values of the common features.
3. The support resource similarity identification and calculation method of claim 2, wherein: the fusion similarity of equipment X and equipment Y in the method is given by the following formula:
[Figure RE-FDA0002600848300000011: fusion similarity formula, available only as an image]
wherein X represents XXXX, Y represents XXXX, X vector represents XXX, YX vector represents XXX, ||X|| represents XXX, and ||Y|| represents XXX.
4. The support resource similarity identification and calculation method of claim 3, wherein: the Jaccard formula in the fusion formula serves as a weight.
5. The support resource similarity identification and calculation method of claim 1, wherein: the method comprises the following steps:
step one, collating the feature vectors of all support resources;
step two, calculating the value of each element of the similarity matrix using the fusion similarity formula;
step three, analyzing the similarity of the support resources based on threshold judgment.
6. The support resource similarity identification and calculation method of claim 5, wherein: the method further comprises a step of giving a resource merging suggestion according to the similarity.
7. The support resource similarity identification and calculation method of claim 5, wherein: the support resource features comprise weight, output power, number of users, and frequency of use.
8. The support resource similarity identification and calculation method of claim 5, wherein: the similarity matrix is corrected by taking the characteristics of the two pieces of equipment into account during value judgment.
CN202010370021.8A 2020-04-30 2020-04-30 Identification and calculation method for guaranteeing resource similarity Pending CN111626567A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010370021.8A CN111626567A (en) 2020-04-30 2020-04-30 Identification and calculation method for guaranteeing resource similarity

Publications (1)

Publication Number Publication Date
CN111626567A true CN111626567A (en) 2020-09-04

Family

ID=72273003

Country Status (1)

Country Link
CN (1) CN111626567A (en)

Legal Events

Code Title
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication (application publication date: 20200904)