CN115422991A - Fault record classification method - Google Patents

Publication number
CN115422991A
Authority
CN
China
Prior art keywords
classification
fault
record
name
distance
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210904343.5A
Other languages
Chinese (zh)
Inventor
张介
戴妮理
赵利明
杨家荣
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hongta Tobacco Group Co Ltd
Original Assignee
Hongta Tobacco Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hongta Tobacco Group Co Ltd
Priority to CN202210904343.5A
Publication of CN115422991A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00 Administration; Management
    • G06Q10/20 Administration of product repair or maintenance
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00 Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/04 Manufacturing

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Human Resources & Organizations (AREA)
  • General Business, Economics & Management (AREA)
  • Tourism & Hospitality (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Marketing (AREA)
  • Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • Primary Health Care (AREA)
  • Manufacturing & Machinery (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention relates to a fault record classification method and belongs to the technical field of automated equipment management in tobacco MES (manufacturing execution systems). The method combines a text enhancement method, which performs keyword replacement on fault records using common expressions and strengthens equipment part classification, with a weighted text edit distance algorithm, so that fuzzy, colloquially written fault information can be classified automatically. Because it builds on experience, the method can classify MES fault maintenance records from a small number of samples; its algorithm is stable, and it classifies faults automatically and classifies information under different management systems. The calculated total edit distance serves as the classification basis, and the part name in the target classification system that corresponds to the maximum value after sorting is taken as the classification result, which improves the accuracy of fault record identification.

Description

Fault record classification method
Technical Field
The invention belongs to the technical field of tobacco MES (manufacturing execution system) automatic equipment management, and particularly relates to a fault record classification method.
Background
Tobacco MES management involves records produced during equipment maintenance, including the faulty equipment part, the fault description, and similar items. To use this information for purposes such as accessory life-cycle management and key function point control of equipment, the fuzzy, manually entered MES maintenance records must be matched to the corresponding key function management positions of the equipment. However, the MES records are ambiguous and the number of samples is small, so the data cannot support a more complex automatic classification algorithm; using one would easily cause problems such as overfitting.
Therefore, there is a need for a fault record classification method whose algorithm is stable and which can classify faults automatically and classify information under different management systems.
Disclosure of Invention
To overcome the problems described in the background, namely that MES records are ambiguous and that the small number of samples cannot support a relatively complex automatic classification algorithm without risking overfitting, the invention provides a fault record classification method whose algorithm is stable and which classifies faults automatically and classifies information under different management systems.
To achieve this purpose, the invention adopts the following technical scheme:
The invention provides a fault record classification method comprising a text enhancement method, which performs keyword replacement on fault records using common expressions and strengthens equipment part classification, and a weighted text edit distance algorithm, which together achieve automatic classification of fuzzy, colloquially written fault information.
The fault record classification method determines the correspondence between MES maintenance records and key function point control positions by combining empirical keyword substitution with a weighted Jaro-Winkler distance. The algorithm flow is as follows:
general handling of abnormal characters in the fault records;
semi-empirically unifying common names of equipment and materials into the unified keywords used in the key function points;
replacing equipment names with keywords that have a Jaro-Winkler distance of 0;
recording separately the Jaro-Winkler distances between the maintenance record contents and the original classification names;
adding the distance terms according to their weights;
and sorting by the weighted combined value to determine the final classification.
Preferably, keyword replacement of fault records using common expressions proceeds as follows: the many colloquial expressions that refer to the materials used by the equipment and to the names of parts on the equipment are all replaced with the unified names used in the equipment key function position management system or the accessory life-cycle management system. Alternatively, information such as equipment names is removed uniformly from every record.
Preferably, the text enhancement method for strengthening equipment part classification is as follows: similar part names used on different pieces of equipment are replaced with name phrases that differ strongly from one another, for use by the subsequent classification algorithm.
Preferably, the weighted text edit distance algorithm classifies fuzzy, colloquially written fault information as follows: based on the standard Jaro-Winkler edit distance algorithm, the Jaro-Winkler edit distance to the part names in the target classification system is calculated separately for the original-system part name in the fault record and for the fault description, and the edit distances are summed using the weight coefficients 2.5/4 and 1.5/4.
Preferably, the weighted Jaro-Winkler distance algorithm quantifies the match between the target classification name and several information items in the fault record system according to given weights and sorts the results to determine the best-matching classification item. The calculated total edit distance serves as the classification basis, and the part name in the target classification system that corresponds to the maximum value after sorting is taken as the classification result, which improves the accuracy of fault record identification.
The invention has the following beneficial effects:
The invention can classify MES fault maintenance records from a small number of samples on the basis of experience; its algorithm is stable, and it classifies faults automatically and classifies information under different management systems.
Drawings
FIG. 1 is a flow chart of the algorithm of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, preferred embodiments of the present invention will be described in detail below with reference to the accompanying drawings to facilitate understanding of the skilled person.
As shown in FIG. 1, the fault record classification method comprises a text enhancement method, which performs keyword replacement on fault records using common expressions and strengthens equipment part classification, and a weighted text edit distance algorithm, which together achieve automatic classification of fuzzy, colloquially written fault information.
The fault record classification method determines the correspondence between MES maintenance records and key function point control positions by combining empirical keyword substitution with a weighted Jaro-Winkler distance. The algorithm flow is as follows:
general handling of abnormal characters in the fault records;
semi-empirically unifying common names of equipment and materials into the unified keywords used in the key function points;
replacing equipment names with keywords that have a Jaro-Winkler distance of 0;
recording separately the Jaro-Winkler distances between the maintenance record contents and the original classification names;
adding the distance terms according to their weights;
and sorting by the weighted combined value to determine the final classification.
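The text does not say what counts as an "abnormal character" in the first step. The following Python sketch shows one possible cleanup, assuming that the goal is to fold full-width characters, drop control characters, and collapse whitespace; the function name `clean_record` and these rules are illustrative assumptions, not part of the disclosure.

```python
import re
import unicodedata


def clean_record(text: str) -> str:
    """Hypothetical cleanup for the abnormal-character step (rules are assumed)."""
    # Fold full-width letters, digits, and spaces to their ordinary forms.
    text = unicodedata.normalize("NFKC", text)
    # Replace control and format characters (Unicode categories "C*") with a space.
    text = "".join(
        " " if unicodedata.category(ch).startswith("C") else ch for ch in text
    )
    # Collapse runs of whitespace and trim both ends.
    return re.sub(r"\s+", " ", text).strip()


if __name__ == "__main__":
    print(clean_record("  belt\t\tjam\x00 \u3000ＡＢ１ "))
```

Running the cleanup before any keyword replacement keeps the later string comparisons from being skewed by invisible or variant characters.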
Keyword replacement of fault records using common expressions proceeds as follows: the many colloquial expressions that refer to the materials used by the equipment and to the names of parts on the equipment are all replaced with the unified names used in the equipment key function position management system or the accessory life-cycle management system. Alternatively, information such as equipment names is removed uniformly from every record.
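The keyword replacement described above can be sketched as a table lookup. In this minimal Python sketch the synonym table, the equipment names, and the function name `unify_keywords` are all hypothetical; the patent does not publish its actual vocabulary.

```python
import re

# Hypothetical table: colloquial expression -> unified name in the target system.
UNIFIED_NAMES = {
    "glue gun": "adhesive nozzle",
    "glue sprayer": "adhesive nozzle",
    "wrapping paper": "cigarette paper",
}


def unify_keywords(text, unified, remove=()):
    """Replace colloquial expressions with unified names and strip listed names."""
    table = dict(unified)
    # Names listed in `remove` (for example equipment names) are deleted outright.
    table.update({name: "" for name in remove})
    if not table:
        return text
    # Longest phrases first, so a long expression wins over any phrase it contains.
    phrases = sorted(table, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(p) for p in phrases))
    # A single pass avoids re-replacing text that was already substituted.
    replaced = pattern.sub(lambda m: table[m.group(0)], text)
    return " ".join(replaced.split())


if __name__ == "__main__":
    print(unify_keywords("packer 3 glue gun blocked", UNIFIED_NAMES, remove=["packer 3"]))
```

The `remove` argument corresponds to the alternative mentioned in the text, in which equipment names are removed from every record.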
The text enhancement method for strengthening equipment part classification is as follows: similar part names used on different pieces of equipment are replaced with name phrases that differ strongly from one another, for use by the subsequent classification algorithm.
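One way to read this step is as a lookup keyed on both the equipment and the part name. The Python sketch below assumes a hypothetical table `DISTINCT_NAMES` with made-up equipment names and replacement phrases, and it assumes that the part names in the target classification system are rewritten with the same table so that matching still works.

```python
# Hypothetical table: (equipment, part name as written) -> distinctive phrase.
DISTINCT_NAMES = {
    ("maker", "suction belt"): "QXQX",
    ("packer", "suction belt"): "JVJV",
}


def enhance_part_name(equipment, part, table=DISTINCT_NAMES):
    """Return the distinctive phrase for this equipment/part pair, else the part unchanged."""
    return table.get((equipment, part), part)


def share_no_characters(a, b):
    """True when two phrases have no character in common."""
    return set(a).isdisjoint(b)


if __name__ == "__main__":
    a = enhance_part_name("maker", "suction belt")
    b = enhance_part_name("packer", "suction belt")
    print(a, b, share_no_characters(a, b))
```

Two phrases with no character in common produce no Jaro matches, so their Jaro-Winkler similarity is 0. This is one plausible reading of the flow step that replaces equipment names with keywords whose Jaro-Winkler distance is 0.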
The weighted text edit distance algorithm classifies fuzzy, colloquially written fault information as follows: based on the standard Jaro-Winkler edit distance algorithm, the Jaro-Winkler edit distance to the part names in the target classification system is calculated separately for the original-system part name in the fault record and for the fault description, and the edit distances are summed using the weight coefficients 2.5/4 and 1.5/4.
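Written as a formula, the weighted sum might look as follows. The symbols are introduced here for illustration only: p is the original-system part name in the fault record, d is the fault description, and c is a candidate part name in the target classification system. Assigning 2.5/4 to the part name and 1.5/4 to the description follows the order in which the items are listed and is an assumption; the text does not state which weight belongs to which item.

```latex
S(c) = \frac{2.5}{4}\,\mathrm{JW}(p, c) + \frac{1.5}{4}\,\mathrm{JW}(d, c)
```

The two coefficients sum to 1, so S(c) stays in the interval [0, 1] whenever each Jaro-Winkler value does.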
The weighted Jaro-Winkler distance algorithm quantifies the match between the target classification name and several information items in the fault record system according to given weights and sorts the results to determine the best-matching classification item. The calculated total edit distance serves as the classification basis, and the part name in the target classification system that corresponds to the maximum value after sorting is taken as the classification result, which improves the accuracy of fault record identification.
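The scoring and ranking described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation: it uses the textbook Jaro-Winkler measure, which returns a similarity in [0, 1] where 1 means identical (consistent with the text selecting the maximum value), and it assumes that the weight 2.5/4 applies to the part name and 1.5/4 to the fault description. The function names and sample strings are hypothetical.

```python
def jaro(s1, s2):
    """Textbook Jaro similarity in [0, 1]."""
    if s1 == s2:
        return 1.0
    len1, len2 = len(s1), len(s2)
    if len1 == 0 or len2 == 0:
        return 0.0
    window = max(max(len1, len2) // 2 - 1, 0)
    matched1, matched2 = [False] * len1, [False] * len2
    matches = 0
    for i, ch in enumerate(s1):
        # Look for an unused equal character within the match window.
        for j in range(max(0, i - window), min(len2, i + window + 1)):
            if not matched2[j] and s2[j] == ch:
                matched1[i] = matched2[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    seq1 = [c for c, m in zip(s1, matched1) if m]
    seq2 = [c for c, m in zip(s2, matched2) if m]
    transpositions = sum(a != b for a, b in zip(seq1, seq2)) // 2
    return (matches / len1 + matches / len2
            + (matches - transpositions) / matches) / 3


def jaro_winkler(s1, s2, scaling=0.1):
    """Jaro similarity boosted for a common prefix of up to four characters."""
    sim = jaro(s1, s2)
    prefix = 0
    for a, b in zip(s1[:4], s2[:4]):
        if a != b:
            break
        prefix += 1
    return sim + prefix * scaling * (1 - sim)


# Weight coefficients from the text; which item gets which weight is assumed.
W_PART, W_DESC = 2.5 / 4, 1.5 / 4


def classify(part_name, description, target_names):
    """Return the best-scoring target name and the full ranking, highest first."""
    scored = [
        (W_PART * jaro_winkler(part_name, t) + W_DESC * jaro_winkler(description, t), t)
        for t in target_names
    ]
    scored.sort(key=lambda item: item[0], reverse=True)
    return scored[0][1], scored


if __name__ == "__main__":
    targets = ["suction belt", "glue nozzle", "cutter head"]
    best, ranking = classify("suction belt", "suction belt worn", targets)
    print(best)
    for score, name in ranking:
        print(f"{score:.3f}  {name}")
```

Returning the full ranking alongside the best match makes it easy to inspect close runners-up when a record is ambiguous.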
Table 1 shows partial fault classification results:
Table 1. Partial fault classification results
[Table 1 appears as images in the original publication: Figures BDA0003771912930000041, BDA0003771912930000051, and BDA0003771912930000061.]
Because it builds on experience, the method can classify MES fault maintenance records from a small number of samples; its algorithm is stable, and it classifies faults automatically and classifies information under different management systems. The calculated total edit distance serves as the classification basis, and the part name in the target classification system that corresponds to the maximum value after sorting is taken as the classification result, which improves the accuracy of fault record identification.
Finally, it is noted that the above-mentioned preferred embodiments illustrate rather than limit the invention, and that, although the invention has been described in detail with reference to the above-mentioned preferred embodiments, it will be understood by those skilled in the art that various changes in form and detail may be made therein without departing from the scope of the invention as defined by the appended claims.

Claims (6)

1. A fault record classification method, characterized in that: the fault record classification method comprises a text enhancement method, which performs keyword replacement on fault records using common expressions and strengthens equipment part classification, and a weighted text edit distance algorithm, so as to achieve automatic classification of fuzzy, colloquially written fault information;
the fault record classification method determines the correspondence between MES maintenance records and key function point control positions by combining empirical keyword substitution with a weighted Jaro-Winkler distance, and the algorithm flow is as follows:
general handling of abnormal characters in the fault records;
semi-empirically unifying common names of equipment and materials into the unified keywords used in the key function points;
replacing equipment names with keywords that have a Jaro-Winkler distance of 0;
recording separately the Jaro-Winkler distances between the maintenance record contents and the original classification names;
adding the distance terms according to their weights;
and sorting by the weighted combined value to determine the final classification.
2. The fault record classification method according to claim 1, characterized in that: keyword replacement of fault records using common expressions proceeds as follows: the colloquial expressions that refer to the materials used by the equipment and to the names of parts on the equipment are all replaced with the unified names used in the equipment key function position management system or the accessory life-cycle management system, or information such as equipment names is removed uniformly from every record.
3. The fault record classification method according to claim 1 or 2, characterized in that: the text enhancement method for strengthening equipment part classification is as follows: similar part names used on different pieces of equipment are replaced with name phrases that differ strongly from one another, for use by the subsequent classification algorithm.
4. The fault record classification method according to claim 1 or 2, characterized in that: the weighted text edit distance algorithm classifies fuzzy, colloquially written fault information as follows: based on the standard Jaro-Winkler edit distance algorithm, the Jaro-Winkler edit distance to the part names in the target classification system is calculated separately for the original-system part name in the fault record and for the fault description, and the edit distances are summed using the weight coefficients 2.5/4 and 1.5/4.
5. The fault record classification method according to claim 1 or 2, characterized in that: the weighted Jaro-Winkler distance algorithm quantifies the match between the target classification name and several information items in the fault record system according to given weights and sorts the results to determine the best-matching classification item; the calculated total edit distance serves as the classification basis, and the part name in the target classification system that corresponds to the maximum value after sorting is taken as the classification result, which improves the accuracy of fault record identification.
6. The fault record classification method according to claim 4, characterized in that: the weighted Jaro-Winkler distance algorithm quantifies the match between the target classification name and several information items in the fault record system according to given weights and sorts the results to determine the best-matching classification item; the calculated total edit distance serves as the classification basis, and the part name in the target classification system that corresponds to the maximum value after sorting is taken as the classification result, which improves the accuracy of fault record identification.
CN202210904343.5A 2022-07-29 2022-07-29 Fault record classification method Pending CN115422991A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210904343.5A CN115422991A (en) 2022-07-29 2022-07-29 Fault record classification method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210904343.5A CN115422991A (en) 2022-07-29 2022-07-29 Fault record classification method

Publications (1)

Publication Number Publication Date
CN115422991A (en) 2022-12-02

Family

ID=84196329

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210904343.5A Pending CN115422991A (en) 2022-07-29 2022-07-29 Fault record classification method

Country Status (1)

Country Link
CN (1) CN115422991A (en)

Similar Documents

Publication Publication Date Title
CN109165294B (en) Short text classification method based on Bayesian classification
CN111414479A (en) Label extraction method based on short text clustering technology
CN112632980A (en) Enterprise classification method and system based on big data deep learning and electronic equipment
CN111860981A (en) Enterprise national industry category prediction method and system based on LSTM deep learning
CN113468317B (en) Resume screening method, system, equipment and storage medium
CN111026870A (en) ICT system fault analysis method integrating text classification and image recognition
CN111462752A (en) Client intention identification method based on attention mechanism, feature embedding and Bi-LSTM
CN108363691A (en) A kind of field term identifying system and method for 95598 work order of electric power
CN111078979A (en) Method and system for identifying network credit website based on OCR and text processing technology
CN115794798A (en) Market supervision informationized standard management and dynamic maintenance system and method
CN114491034B (en) Text classification method and intelligent device
CN109543038B (en) Emotion analysis method applied to text data
CN116841779A (en) Abnormality log detection method, abnormality log detection device, electronic device and readable storage medium
CN107480126B (en) Intelligent identification method for engineering material category
CN111104422B (en) Training method, device, equipment and storage medium of data recommendation model
CN117454220A (en) Data hierarchical classification method, device, equipment and storage medium
CN117436440A (en) Log identification method, system, terminal equipment and storage medium
CN102103604B (en) Method and device for determining core weight of term
CN108615124B (en) Enterprise evaluation method and system based on word frequency analysis
CN110688445B (en) Digital archive construction method
CN115422991A (en) Fault record classification method
CN115936748A (en) Business big data analysis method and system
CN115952282A (en) Intelligent bank customer complaint diversion handling method and system based on NLP technology
CN114638219A (en) Intelligent wrong word recognition method based on machine learning algorithm
CN114490937A (en) Comment analysis method and device based on semantic perception

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination