CN111626567A - Identification and calculation method for guaranteeing resource similarity - Google Patents
Identification and calculation method for guaranteeing resource similarity Download PDFInfo
- Publication number
- CN111626567A CN111626567A CN202010370021.8A CN202010370021A CN111626567A CN 111626567 A CN111626567 A CN 111626567A CN 202010370021 A CN202010370021 A CN 202010370021A CN 111626567 A CN111626567 A CN 111626567A
- Authority
- CN
- China
- Prior art keywords
- similarity
- resource
- guaranteed
- calculating
- resources
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000004364 calculation method Methods 0.000 title claims abstract description 22
- 238000000034 method Methods 0.000 claims abstract description 22
- 239000013598 vector Substances 0.000 claims abstract description 13
- 239000011159 matrix material Substances 0.000 claims description 10
- 230000004927 fusion Effects 0.000 claims description 7
- 238000004458 analytical method Methods 0.000 claims description 4
- 230000002159 abnormal effect Effects 0.000 abstract description 3
- 238000012423 maintenance Methods 0.000 abstract description 2
- 238000011002 quantification Methods 0.000 abstract description 2
- 238000005259 measurement Methods 0.000 description 4
- 238000012937 correction Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 238000007599 discharging Methods 0.000 description 2
- 230000004075 alteration Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000007418 data mining Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
- G06Q10/063—Operations research, analysis or management
- G06Q10/0631—Resource planning, allocation, distributing or scheduling for enterprises or organisations
- G06Q10/06313—Resource planning in a project environment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/20—Administration of product repair or maintenance
Landscapes
- Business, Economics & Management (AREA)
- Engineering & Computer Science (AREA)
- Human Resources & Organizations (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Entrepreneurship & Innovation (AREA)
- Economics (AREA)
- General Physics & Mathematics (AREA)
- Strategic Management (AREA)
- Tourism & Hospitality (AREA)
- Marketing (AREA)
- Operations Research (AREA)
- Quality & Reliability (AREA)
- General Business, Economics & Management (AREA)
- Life Sciences & Earth Sciences (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Biology (AREA)
- Development Economics (AREA)
- Educational Administration (AREA)
- Biodiversity & Conservation Biology (AREA)
- Game Theory and Decision Science (AREA)
- General Engineering & Computer Science (AREA)
- Evolutionary Computation (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Artificial Intelligence (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention belongs to the technical field of use and maintenance support, and relates to a support resource similarity identification and calculation method. The method for identifying and calculating the guaranteed resource similarity adopts a mode of fusing the Jaccard similarity coefficient and the vector space cosine similarity to calculate, commonly represents the guaranteed resource similarity of two devices by the common characteristic quantity represented by the Jaccard similarity coefficient and the common characteristic value represented by the cosine similarity, and carries out unified quantification operation. The computer rule for similarity of the abnormal security resources established by the invention is used for analyzing the similarity of the resources, intelligently identifying the security resources with different security tasks having the types and the similar key characteristics, and providing the merging suggestion of the similar resources.
Description
Technical Field
The invention belongs to the technical field of use and maintenance support, relates to a text data mining technology, and particularly relates to a support resource similarity identification and calculation method.
Background
At present, helicopter guarantee equipment still generally has the problems of multiple models, miscellaneous types, single function, large size and the like, is backward to helicopter equipment development on the whole, is contrary to the current guarantee concept of high strength, full territory and rapid deployment, and restricts the whole fighting capacity and the rapid maneuvering capacity of helicopter aviation soldier troops. Although the development requirements of "integration, generalization and miniaturization" (abbreviated as "three") of aviation support equipment are also provided, the generalization degree of support resources is low at present. For different guaranteed resources, when performing similarity calculation, the attributes of the resources themselves need to be considered, including: functional, weight, appearance geometry, etc., as well as the associated target subsystem/component for which it is intended, frequency of use, etc. Here we mainly discuss the data sparsity problem of the guaranteed resource similarity evaluation computation and the scalability problem caused thereby.
In the existing method for calculating similarity of guaranteed resources, there are mainly cosine similarity (cosine similarity), Pearson Correlation Coefficient (Pearson Correlation Coefficient), and Jaccard similarity Coefficient (Jaccard Correlation Coefficient).
The above method has the following problems:
1. cosine similarity uses a cosine value of an included angle between two vectors in a vector space as a measure for the difference between two individuals, the cosine similarity focuses more on the difference of the two vectors in the direction rather than on the good length of the distance, and in the calculation of the similarity of the security equipment, the cosine similarity distinguishes more differences from the direction and is insensitive to absolute numerical values;
2. the Pearson correlation coefficient is mainly used for finding a project set which is scored by two users together, then the correlation coefficient of the two vectors is calculated, and in the calculation of the similarity of the security equipment, the correlation coefficient is insensitive to absolute numerical values and only describes the variation trend of data;
3. the Jaccard similarity coefficient is mainly used for calculating the similarity between individuals with symbol measurement or Boolean value measurement, and because the characteristic attributes of the individuals are symbol measurement or Boolean value marks, the specific value of the difference cannot be measured, and only the result of 'whether the differences are the same' can be obtained.
Disclosure of Invention
The purpose of the invention is: the method for identifying and calculating the resource similarity is provided to solve the technical problem that the quantity and the numerical value of common characteristics cannot be considered simultaneously in the conventional single similarity calculation.
In order to solve the technical problem, the technical scheme of the invention is as follows:
a guaranteed resource similarity recognition and calculation method is characterized in that a mode of fusing a Jaccard similarity coefficient and a vector space cosine similarity is adopted for calculation, and the guaranteed resource similarity of two devices is represented by a common characteristic quantity represented by the Jaccard similarity coefficient and a common characteristic numerical value represented by the cosine similarity.
The method for identifying and calculating the similarity of the guaranteed resources further comprises the operation of uniformly quantizing the number of the common characteristics and the common characteristic value.
The method for identifying and calculating the similarity of the guarantee resources adopts the following fusion similarity formula:
in the formula, the similarity between the X equipment and the Y equipment is determined by two parts, namely the measurement of the quantity of the common characteristics represented by Jaccard, and the similarity of the common characteristic value represented by cosine cos. The similarity calculation is carried out by fusing the Jaccard and the cosine, the number and the numerical value of the common characteristics can be considered at the same time, the unified quantification is carried out, meanwhile, the Jaccard formula is regarded as the weight, the similarity attenuation can be considered to be carried out according to the number of the common characteristics, and under the condition of sparse data, the more accurate similarity of the special-shaped equipment characteristics can be obtained.
The method for identifying and calculating the similarity of the guarantee resources comprises the following steps:
step one, sorting all the guarantee resource feature vectors;
step two, calculating the value of each element of the similarity matrix by using the fusion similarity formula;
thirdly, based on threshold judgment, carrying out similarity analysis on the guaranteed resources;
preferably, the method for identifying and calculating the similarity of the safeguard resources further comprises the step of giving a resource merging suggestion according to the similarity.
Preferably, the guaranteed resource characteristics include weight, output power, number of users, and frequency of use.
Preferably, the similarity matrix is corrected in combination with the characteristics of the two devices in the value judgment.
The invention has the beneficial effects that: the similarity calculation method fusing the Jaccard and the cosine firstly provides a helicopter guarantee resource similarity recognition method for recognizing the similarity of each attribute of an abnormal guarantee resource and provides input for subsequent guarantee resource optimization. Due to the limitations of the guaranteed space and other conditions, the carrying of guaranteed resources for helicopter equipment is very limited. Therefore, the universal research on the guarantee resources is developed, the types and the number of the carried guarantee resources are reduced, and the method has very important significance for improving the guarantee efficiency of the helicopter. The computer rule for similarity of the abnormal security resources established by the invention is used for analyzing the similarity of the resources, intelligently identifying the security resources with different security tasks having the types and the similar key characteristics, and providing the merging suggestion of the similar resources.
Drawings
In order to more clearly illustrate the technical solution of the present invention, the drawings used in the embodiment of the present invention will be briefly explained. It is obvious that the drawings described below are only some embodiments of the invention, and that for a person skilled in the art, other drawings can be obtained from these drawings without inventive effort.
FIG. 1 is a flow chart of the method of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention. It is to be understood that the embodiments described are only a few embodiments of the present invention, and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Features of various aspects of embodiments of the invention will be described in detail below. In the following detailed description, numerous specific details are set forth in order to provide a thorough understanding of the present invention. It will be apparent, however, to one skilled in the art that the present invention may be practiced without these specific details. The following description of the embodiments is merely intended to better understand the present invention by illustrating examples thereof. The present invention is not limited to any particular arrangement or method provided below, but rather covers all product structures, any modifications, alterations, etc. of the method covered without departing from the spirit of the invention.
In the drawings and the following description, well-known structures and techniques are not shown to avoid unnecessarily obscuring the present invention.
Aiming at the similarity identification and calculation method of the guarantee resource, which is disclosed by the invention, a similarity calculation method of Jaccard and cosine is fused, a similarity identification method of the guarantee resource of the helicopter is provided to identify the similarity of each attribute of the special-shaped guarantee resource, and the similarity is analyzed by taking a related guarantee tool of an electrical system of the helicopter as an example. According to the step flow chart shown in fig. 1, the specific steps are as follows:
firstly, determining equipment guarantee resources, and sorting guarantee resource feature vectors;
selecting a typical tool, 5 tool resources of a power supply vehicle, a universal meter, an electric brush extractor, a motor electric brush measuring tool and a storage battery charging and discharging station, and paying attention to several frequently used attributes of the tool in the selection of characteristics: weight, volume, function, frequency of use. And selecting equipment guarantee resource characteristics, expressing the characteristics in a vectorization manner, and calculating the value of each element of the similarity matrix by using the fusion similarity formula.
Step two, calculating the value of each element of the similarity matrix by using the fusion similarity formula;
5 tool resources of a power supply vehicle, a universal meter, a brush extractor, a motor brush measuring tool and a storage battery charging and discharging station are represented by T1, T2, T3, T4 and T5 respectively, each vector is composed of 4 dimensions and represents weight, output power, number of users and frequency (average daily use times). The method comprises the following specific steps:
T2={1,0.01,1,1};
T5={1000,220,4,8};
calculating by adopting the fused Jaccard cosine similarity to obtain a similarity matrix among 5 resources, wherein the similarity matrix is as follows:
thirdly, based on threshold judgment, carrying out similarity analysis on the guaranteed resources;
from the similarity matrix analysis, the highest similarity is that T3 and T4 reach 0.998. It is worth noting that the similarity between T1 and T5 is high according to the calculation of a common cosine formula, but through introducing Jaccard to carry out similarity correction, the quantity of similar guarantee characteristics is comprehensively considered, the similarity is punished, the similarity is more practical, and similar situations also exist between T2 and T3 and T4. In the similarity matrix, features for a certain dimension may be close, but actually may have two different features, such as electronic and mechanical, and the correction needs to be performed manually through a threshold.
Step four, providing a resource generalization suggestion according to the similarity
The highest similarity is that T3 and T4 reach 0.998, and the generalized merging is recommended.
The foregoing is merely a detailed description of the embodiments of the present invention, and some of the conventional techniques are not detailed. The scope of the present invention is not limited thereto, and any changes or substitutions that can be easily made by those skilled in the art within the technical scope of the present invention will be covered by the scope of the present invention. The protection scope of the present invention shall be subject to the protection scope of the claims.
Claims (8)
1. A method for identifying and calculating the similarity of guarantee resources is characterized by comprising the following steps: the method for identifying and calculating the guaranteed resource similarity adopts a mode of fusing the Jaccard similarity coefficient and the vector space cosine similarity to calculate, and the guaranteed resource similarity of the two devices is represented by the common characteristic quantity represented by the Jaccard similarity coefficient and the common characteristic value represented by the cosine similarity.
2. The guaranteed resource similarity recognition computing method of claim 1, wherein: the method for identifying and calculating the similarity of the guaranteed resources further comprises the operation of uniformly quantizing the number of the common characteristics and the common characteristic value.
3. The guaranteed resource similarity recognition computing method of claim 2, wherein: the fusion similarity of the equipment X and the equipment Y in the method for identifying and calculating the resource similarity adopts the following formula:
wherein X represents XXXX, Y represents XXXX, X vector represents XXX, YX vector represents XXX, | X | | represents XXX, and | Y | | represents XXX.
4. The guaranteed resource similarity recognition computing method of claim 3, wherein: the Jaccard formula in the fusion formula is a weight.
5. The guaranteed resource similarity recognition computing method of claim 1, wherein: the method for identifying and calculating the similarity of the guarantee resources comprises the following steps:
step one, sorting all the guarantee resource feature vectors;
step two, calculating the value of each element of the similarity matrix by using the fusion similarity formula;
and thirdly, performing similarity analysis on the guaranteed resources based on threshold judgment.
6. The guaranteed resource similarity recognition computing method of claim 5, wherein: the method for identifying and calculating the similarity of the guarantee resources further comprises the step of giving a resource merging suggestion according to the similarity.
7. The guaranteed resource similarity recognition computing method of claim 5, wherein: the guaranteed resource characteristics comprise weight, output power, number of users and use frequency.
8. The guaranteed resource similarity recognition computing method of claim 5, wherein: and correcting the similarity matrix by combining the characteristics of the two devices during value judgment.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010370021.8A CN111626567A (en) | 2020-04-30 | 2020-04-30 | Identification and calculation method for guaranteeing resource similarity |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010370021.8A CN111626567A (en) | 2020-04-30 | 2020-04-30 | Identification and calculation method for guaranteeing resource similarity |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111626567A true CN111626567A (en) | 2020-09-04 |
Family
ID=72273003
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010370021.8A Pending CN111626567A (en) | 2020-04-30 | 2020-04-30 | Identification and calculation method for guaranteeing resource similarity |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111626567A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112464991A (en) * | 2020-11-04 | 2021-03-09 | 西北工业大学 | Multi-sensor evidence evolution game fusion recognition method based on multi-population dynamics |
CN113552496A (en) * | 2021-06-29 | 2021-10-26 | 哈尔滨理工大学 | Voltage cosine similarity-based diagnosis method for short circuit fault in battery series module |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080114750A1 (en) * | 2006-11-14 | 2008-05-15 | Microsoft Corporation | Retrieval and ranking of items utilizing similarity |
CN103823880A (en) * | 2014-03-03 | 2014-05-28 | 国家认证认可监督管理委员会信息中心 | Attribute weight-based method for calculating similarity between detection mechanisms |
CN103984685A (en) * | 2013-02-07 | 2014-08-13 | 百度国际科技(深圳)有限公司 | Method, device and equipment for classifying items to be classified |
CN108549883A (en) * | 2018-08-06 | 2018-09-18 | 国网浙江省电力有限公司 | A kind of face recognition methods again |
CN110991793A (en) * | 2019-10-25 | 2020-04-10 | 中国飞行试验研究院 | Aviation security equipment identification resource allocation method |
-
2020
- 2020-04-30 CN CN202010370021.8A patent/CN111626567A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080114750A1 (en) * | 2006-11-14 | 2008-05-15 | Microsoft Corporation | Retrieval and ranking of items utilizing similarity |
CN103984685A (en) * | 2013-02-07 | 2014-08-13 | 百度国际科技(深圳)有限公司 | Method, device and equipment for classifying items to be classified |
CN103823880A (en) * | 2014-03-03 | 2014-05-28 | 国家认证认可监督管理委员会信息中心 | Attribute weight-based method for calculating similarity between detection mechanisms |
CN108549883A (en) * | 2018-08-06 | 2018-09-18 | 国网浙江省电力有限公司 | A kind of face recognition methods again |
CN110991793A (en) * | 2019-10-25 | 2020-04-10 | 中国飞行试验研究院 | Aviation security equipment identification resource allocation method |
Non-Patent Citations (4)
Title |
---|
李征等: "一种基于改进相似度计算的文本聚类方法", 《河南大学学报(自然科学版)》 * |
王焱等: "飞机结构件分类编码系统", 《航空制造技术》 * |
蒋宗礼等: "融合用户相似度与信任度的协同过滤推荐算法", 《软件导刊》 * |
闫纪红等: "一种基于改进成组算法的特征提取方法", 《计算机集成制造系统》 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112464991A (en) * | 2020-11-04 | 2021-03-09 | 西北工业大学 | Multi-sensor evidence evolution game fusion recognition method based on multi-population dynamics |
CN113552496A (en) * | 2021-06-29 | 2021-10-26 | 哈尔滨理工大学 | Voltage cosine similarity-based diagnosis method for short circuit fault in battery series module |
CN113552496B (en) * | 2021-06-29 | 2024-04-02 | 哈尔滨理工大学 | Battery series module internal short circuit fault diagnosis method based on voltage cosine similarity |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Sun et al. | Classification of electric vehicle charging time series with selective clustering | |
US10215814B2 (en) | System and method for cognitive alarm management for the power grid | |
CN111626567A (en) | Identification and calculation method for guaranteeing resource similarity | |
CN106407280A (en) | Query target matching method and device | |
Chandra et al. | A multivariate time series clustering approach for crime trends prediction | |
Xie et al. | Item similarity learning methods for collaborative filtering recommender systems | |
CN109614074A (en) | Approximate adder reliability degree calculation method based on probability transfer matrix model | |
CN111008673A (en) | Method for collecting and extracting malignant data chain in power distribution network information physical system | |
Zhao et al. | Anomaly detection of aircraft lead‐acid battery | |
Saxena et al. | Rank me thou shalln't compare me | |
Wu et al. | Classification of complex power quality disturbances based on modified empirical wavelet transform and light gradient boosting machine | |
Chao et al. | Hesitant Mahalanobis distance with applications to estimating the optimal number of clusters | |
Orang et al. | Improving performance of similarity measures for uncertain time series using preprocessing techniques | |
Dagnely et al. | Annotating the performance of industrial assets via relevancy estimation of event logs | |
Wang | Application of E-Commerce Recommendation Algorithm in Consumer Preference Prediction | |
Abdelwahab et al. | Alleviating the sparsity problem of collaborative filtering using an efficient iterative clustered prediction technique | |
Zhang et al. | Online social network profile linkage based on cost-sensitive feature acquisition | |
He et al. | Semi-supervised dialogue abstractive summarization via high-quality pseudolabel selection | |
Chen et al. | Pseudo-Relevance Feedback Method Based on the Topic Relevance Model | |
Shahbaba et al. | Efficient unimodality test in clustering by signature testing | |
Dhanalakshmi et al. | Hybrid Cohort Rating Prediction Technique to leverage Recommender System | |
Bellandi et al. | A Comparative Study of Clustering Techniques Applied on Covid-19 Scientific Literature | |
US20020138466A1 (en) | Method, computer program and data processing system for data clustering | |
Zhang et al. | Improved locally linear embedding based method for nonlinear system fault detection | |
Jain et al. | A Proposed similarity measure for text-classification |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20200904 |
|
RJ01 | Rejection of invention patent application after publication |