CN106126364A - A kind of fault event memory collection method based on Linux system and system - Google Patents

A kind of fault event memory collection method based on Linux system and system Download PDF

Info

Publication number
CN106126364A
CN106126364A CN201610494538.1A CN201610494538A CN106126364A CN 106126364 A CN106126364 A CN 106126364A CN 201610494538 A CN201610494538 A CN 201610494538A CN 106126364 A CN106126364 A CN 106126364A
Authority
CN
China
Prior art keywords
event
data
error
mistake
mcelog
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610494538.1A
Other languages
Chinese (zh)
Inventor
郭美思
宗栋瑞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Beijing Electronic Information Industry Co Ltd
Original Assignee
Inspur Beijing Electronic Information Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Beijing Electronic Information Industry Co Ltd filed Critical Inspur Beijing Electronic Information Industry Co Ltd
Priority to CN201610494538.1A priority Critical patent/CN106126364A/en
Publication of CN106126364A publication Critical patent/CN106126364A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0766Error or fault reporting or storing
    • G06F11/0781Error filtering or prioritizing based on a policy defined by the user or on a policy defined by a hardware/software module, e.g. according to a severity level

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention discloses a kind of fault event memory collection method based on Linux system and system, including the wrong primitive event data transferred in EMS memory error record depositor, and send it to mcelog equipment and carry out record;Analyze the wrong primitive event data in mcelog equipment, extract the critical data in mistake primitive event data;Critical data is integrated into error event file, and is defined as the form of error event file presetting after error event form as internal memory event source.The wrong primitive event data that the present invention can actively will be located in EMS memory error record depositor are transferred, and by analyzing, the critical data including error message therein is extracted, it is integrated into the error event file of uniform format, it is easy to during consequent malfunction diagnostic process be identified, availability is strong, and improves convenience when fault diagnosis processes.

Description

A kind of fault event memory collection method based on Linux system and system
Technical field
The present invention relates to Linux system troubleshooting technical field, particularly relate to a kind of internal memory based on Linux system Event of failure collection method and system.
Background technology
Along with the fast development of the Internet, computer serves the effect of key to the development of the mankind.Depositing in computer Reservoir is divided into internal memory and external memory.Internal memory is used to deposit that be being currently used or to be used program and data.In once Deposit appearance mistake or fault, program cisco unity malfunction or machine of delaying can be caused.Therefore the error message to internal memory is collected right and wrong The most important.
But in current Linux system, the wrong primitive event data in EMS memory error record depositor cannot be actively Obtain, and in these data, include many data in addition to error message, and the most unified form, therefore according to Mistake primitive event data judge that the fault occurred is the most difficult, the availability of the error message of the internal memory collected i.e. at present Difference.
Therefore, how the internal memory based on Linux system of the availability of a kind of error message that can improve collection is provided Event of failure collection method and system are the problems that those skilled in the art are presently required solution.
Summary of the invention
It is an object of the invention to provide a kind of fault event memory collection method based on Linux system and system, it is possible to Extract uniform format, error event file that availability is strong, it is simple to follow-up carry out fault diagnosis and process.
For solving above-mentioned technical problem, the invention provides a kind of fault event memory collection side based on Linux system Method, including:
Transfer the wrong primitive event data in EMS memory error record depositor, and send it to mcelog equipment and carry out Record;
Analyze the described mistake primitive event data in described mcelog equipment, extract in described mistake primitive event data Critical data;
Described critical data is integrated into error event file, and is defined as presetting by the form of described error event file As internal memory event source after error event form.
Preferably, described mistake primitive event data are 64BIT integer data.
Preferably, described mistake primitive event data include memory pages class mistake primitive event data.
Preferably, described default error event form is ereport.cpu.intel.mem_dev.
For solving above-mentioned technical problem, present invention also offers a kind of fault event memory based on Linux system and collect System, including:
Transfer module, for transferring the wrong primitive event data in EMS memory error record depositor, and send it to Mcelog equipment;
Described mcelog equipment, is used for recording described mistake primitive event data;
Critical data acquisition module, for analyzing the described mistake primitive event data in described mcelog equipment, extracts Critical data in described mistake primitive event data;
Integrate module, for described critical data being integrated into error event file, and by described error event file Form is defined as presetting after error event form as internal memory event source.
Preferably, also include:
Transfer module with described respectively and communication module that described mcelog equipment is connected, be used for receiving described in transfer mould The described mistake primitive event data that block sends, and send it to described mcelog equipment.
The invention provides a kind of fault event memory collection method based on Linux system and system, it is possible to actively will The wrong primitive event data being positioned at EMS memory error record depositor are transferred, and by analyze by therein comprise wrong The critical data of false information extracts, and is integrated into the error event file of uniform format, it is simple to during consequent malfunction diagnostic process Being identified, availability is strong, and improves convenience when fault diagnosis processes.
Accompanying drawing explanation
For the technical scheme being illustrated more clearly that in the embodiment of the present invention, below will be to institute in prior art and embodiment The accompanying drawing used is needed to be briefly described, it should be apparent that, the accompanying drawing in describing below is only some enforcements of the present invention Example, for those of ordinary skill in the art, on the premise of not paying creative work, it is also possible to obtains according to these accompanying drawings Obtain other accompanying drawing.
The flow process of the process of a kind of based on Linux system the fault event memory collection method that Fig. 1 provides for the present invention Figure;
The structural representation of a kind of based on Linux system the fault event memory collection system that Fig. 2 provides for the present invention.
Detailed description of the invention
The core of the present invention is to provide a kind of fault event memory collection method based on Linux system and system, it is possible to Extract uniform format, error event file that availability is strong, it is simple to follow-up carry out fault diagnosis and process.
For making the purpose of the embodiment of the present invention, technical scheme and advantage clearer, below in conjunction with the embodiment of the present invention In accompanying drawing, the technical scheme in the embodiment of the present invention is clearly and completely described, it is clear that described embodiment is The a part of embodiment of the present invention rather than whole embodiments.Based on the embodiment in the present invention, those of ordinary skill in the art The every other embodiment obtained under not making creative work premise, broadly falls into the scope of protection of the invention.
The invention provides a kind of fault event memory collection method based on Linux system, shown in Figure 1, Fig. 1 is The flow chart of the process of a kind of based on Linux system the fault event memory collection method that the present invention provides;The method includes:
Step s101: transfer the wrong primitive event data in EMS memory error record depositor, and send it to Mcelog equipment carries out record;
Wherein, wrong primitive event data here are 64BIT integer data.
It is further known that, wrong primitive event data here include memory pages class mistake primitive event data.Certainly, Here wrong primitive event data also include other kinds of data, and the present invention does not limit institute in mistake primitive event data Comprise data type.
It is understood that under normal circumstances, the wrong primitive event data in EMS memory error record depositor are cannot Actively transfer, can only be passively to provide when the system failure, and mistake primitive event data can be carried out actively by the present invention Transfer, and the wrong primitive event data after transferring are positioned in mcelog equipment, want afterwards mistake primitive event number According to when being analyzed processing, then can realize by obtaining the wrong primitive event data placed in mcelog equipment, significantly carry High convenience.
Step s102: analyze the wrong primitive event data in mcelog equipment, extracts in mistake primitive event data Critical data;
Step s103: critical data is integrated into error event file, and the form of error event file is defined as pre- If as internal memory event source after error event form.
It is understood that mistake primitive event data include many data unrelated with error message, these numbers According to affecting the diagnosis of fault, thus need by mistake primitive event data critical data (data that comprise error message or The data relevant to error message) extract, it is integrated into error event file, it is simple to follow-up carry out fault diagnosis, also allows for Staff checks, mistake primitive event data compared by error event file, and availability is strong.
Wherein, preset error event form and could be arranged to ereport.cpu.intel.mem_dev.Certainly, here Form can be not construed as limiting according to practical situation sets itself, the present invention.
The invention provides a kind of fault event memory collection method based on Linux system, it is possible in actively will be located in The wrong primitive event data deposited in error logging depositor are transferred, and include error message by analyzing by therein Critical data extract, be integrated into the error event file of uniform format, it is simple to know during consequent malfunction diagnostic process Not, availability is strong, and improves convenience when fault diagnosis processes.
Present invention also offers a kind of fault event memory collection system based on Linux system, shown in Figure 2, Fig. 2 Structural representation for a kind of based on Linux system the fault event memory collection system that the present invention provides.This system includes:
Transfer module 11, for transferring the wrong primitive event data in EMS memory error record depositor, and be sent to To mcelog equipment 12;
Mcelog equipment 12, for misregistration primitive event data;
Critical data acquisition module 13, for analyzing the wrong primitive event data in mcelog equipment 12, extracts mistake Critical data in primitive event data;
Integrate module 14, for critical data being integrated into error event file and the form of error event file is fixed Justice is for presetting after error event form as internal memory event source.
As preferably, this system also includes:
Respectively with transfer the communication module 15 that module 11 and mcelog equipment 12 is connected, transfer module 11 for reception The wrong primitive event data sent, and send it to mcelog equipment 12.
Wherein, transfer module 11, EMS memory error record depositor, communication module 15 and mcelog equipment 12 are respectively positioned on In MCE kernel module in Linux system.
The invention provides a kind of fault event memory collection system based on Linux system, it is possible in actively will be located in The wrong primitive event data deposited in error logging depositor are transferred, and include error message by analyzing by therein Critical data extract, be integrated into the error event file of uniform format, it is simple to know during consequent malfunction diagnostic process Not, availability is strong, and improves convenience when fault diagnosis processes.
It should be noted that in this manual, term " includes ", " comprising " or its any other variant are intended to Comprising of nonexcludability, so that include that the process of a series of key element, method, article or equipment not only include that those are wanted Element, but also include other key elements being not expressly set out, or also include for this process, method, article or equipment Intrinsic key element.In the case of there is no more restriction, statement " including ... " key element limited, it is not excluded that Including process, method, article or the equipment of described key element there is also other identical element.
Described above to the disclosed embodiments, makes professional and technical personnel in the field be capable of or uses the present invention. Multiple amendment to these embodiments will be apparent from for those skilled in the art, as defined herein General Principle can realize without departing from the spirit or scope of the present invention in other embodiments.Therefore, the present invention It is not intended to be limited to the embodiments shown herein, and is to fit to and principles disclosed herein and features of novelty phase one The widest scope caused.

Claims (6)

1. a fault event memory collection method based on Linux system, it is characterised in that including:
Transfer the wrong primitive event data in EMS memory error record depositor, and send it to mcelog equipment and remember Record;
Analyze the described mistake primitive event data in described mcelog equipment, extract the pass in described mistake primitive event data Key data;
Described critical data is integrated into error event file, and is defined as the form of described error event file presetting mistake As internal memory event source after event format.
Method the most according to claim 1, it is characterised in that described mistake primitive event data are 64BIT integer data.
Method the most according to claim 2, it is characterised in that described mistake primitive event data include that memory pages class is wrong Primitive event data by mistake.
Method the most according to claim 1, it is characterised in that described default error event form is ereport.cpu.intel.mem_dev。
5. a fault event memory collection system based on Linux system, it is characterised in that including:
Transfer module, for transferring the wrong primitive event data in EMS memory error record depositor, and send it to Mcelog equipment;
Described mcelog equipment, is used for recording described mistake primitive event data;
Critical data acquisition module, for analyzing the described mistake primitive event data in described mcelog equipment, extracts described Critical data in mistake primitive event data;
Integrate module, for described critical data being integrated into error event file, and by the form of described error event file It is defined as presetting after error event form as internal memory event source.
System the most according to claim 5, it is characterised in that also include:
Transfer module with described respectively and communication module that described mcelog equipment is connected, be used for receiving described in transfer module and send out The described mistake primitive event data sent, and send it to described mcelog equipment.
CN201610494538.1A 2016-06-28 2016-06-28 A kind of fault event memory collection method based on Linux system and system Pending CN106126364A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610494538.1A CN106126364A (en) 2016-06-28 2016-06-28 A kind of fault event memory collection method based on Linux system and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610494538.1A CN106126364A (en) 2016-06-28 2016-06-28 A kind of fault event memory collection method based on Linux system and system

Publications (1)

Publication Number Publication Date
CN106126364A true CN106126364A (en) 2016-11-16

Family

ID=57284351

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610494538.1A Pending CN106126364A (en) 2016-06-28 2016-06-28 A kind of fault event memory collection method based on Linux system and system

Country Status (1)

Country Link
CN (1) CN106126364A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109343993A (en) * 2018-09-28 2019-02-15 郑州云海信息技术有限公司 A kind of error message processing method and processing device of cloud platform
CN113076264A (en) * 2020-01-03 2021-07-06 阿里巴巴集团控股有限公司 Memory management method and device

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103198000A (en) * 2013-04-02 2013-07-10 浪潮电子信息产业股份有限公司 Method for positioning faulted memory in linux system
CN103227734A (en) * 2013-04-27 2013-07-31 华南理工大学 Method for detecting abnormity of OpenStack cloud platform
US20150128111A1 (en) * 2013-08-26 2015-05-07 Tencent Technology (Shenzhen) Company Limited Devices and Methods for Acquiring Abnormal Information
CN105204968A (en) * 2015-11-10 2015-12-30 浪潮(北京)电子信息产业有限公司 Method and device for detecting fault memory
CN105589776A (en) * 2015-12-23 2016-05-18 华为技术有限公司 Fault location method and server

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103198000A (en) * 2013-04-02 2013-07-10 浪潮电子信息产业股份有限公司 Method for positioning faulted memory in linux system
CN103227734A (en) * 2013-04-27 2013-07-31 华南理工大学 Method for detecting abnormity of OpenStack cloud platform
US20150128111A1 (en) * 2013-08-26 2015-05-07 Tencent Technology (Shenzhen) Company Limited Devices and Methods for Acquiring Abnormal Information
CN105204968A (en) * 2015-11-10 2015-12-30 浪潮(北京)电子信息产业有限公司 Method and device for detecting fault memory
CN105589776A (en) * 2015-12-23 2016-05-18 华为技术有限公司 Fault location method and server

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109343993A (en) * 2018-09-28 2019-02-15 郑州云海信息技术有限公司 A kind of error message processing method and processing device of cloud platform
CN113076264A (en) * 2020-01-03 2021-07-06 阿里巴巴集团控股有限公司 Memory management method and device

Similar Documents

Publication Publication Date Title
CN108667725A (en) A kind of industrial AnyRouter and implementation method based on a variety of accesses and edge calculations
CN104092755B (en) A kind of method and device for capturing of cloud service origination data
CN106055608B (en) The method and apparatus of automatic collection and analysis interchanger log
CN105468735A (en) Stream preprocessing system and method based on mass information of mobile internet
CN103488558A (en) Device and method of automatically acquiring application anomalies based on LOG4J logging framework
CN107844325A (en) The acquisition methods and system of a kind of distributed data
CN108632111A (en) Service link monitoring method based on log
CN102355482A (en) Data transmission method and equipment thereof
CN104038821A (en) Method for uniformly gathering fault information of each functional module of Android television
CN104184745A (en) Intelligent front-end equipment communication system
CN105117316B (en) A kind of automatic Inspection and maintenance method and system of server
CN105786683A (en) Customized log collecting system and method
CN110912731A (en) NFV-based system and method for realizing service identification and topology analysis by adopting DPI technology
CN105718299A (en) Virtual machine configuration method, device and system
CN105389314A (en) Log file query system and query method
CN106126364A (en) A kind of fault event memory collection method based on Linux system and system
CN104202328B (en) A kind of method, configuration module and the subscription end of subscription GOOSE/SMV messages
CN103634135B (en) A kind of collecting method based on metadata
CN104038388B (en) Based on distributed Internet of Things Auto-Test System and method of testing
CN111130828B (en) Intelligent network distribution method and device and terminal equipment
CN107870850A (en) A kind of efficient the Internet, applications log system
CN106469115A (en) A kind of telecommunication network management automatic software test method and device
CN106446008A (en) Management method and analysis system for database security event
CN112788145A (en) Cross-domain functional security anomaly detection tracing method based on non-embedded probe
CN104517082B (en) Electric power data acquisition apparatus and method

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20161116

RJ01 Rejection of invention patent application after publication