CN115794524A - Verification method, verification device, electronic equipment and readable storage medium - Google Patents

Verification method, verification device, electronic equipment and readable storage medium Download PDF

Info

Publication number
CN115794524A
CN115794524A CN202211557599.XA CN202211557599A CN115794524A CN 115794524 A CN115794524 A CN 115794524A CN 202211557599 A CN202211557599 A CN 202211557599A CN 115794524 A CN115794524 A CN 115794524A
Authority
CN
China
Prior art keywords
error
memory device
target
information
slot position
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211557599.XA
Other languages
Chinese (zh)
Inventor
刘浩君
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Suzhou Inspur Intelligent Technology Co Ltd
Original Assignee
Suzhou Inspur Intelligent Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Suzhou Inspur Intelligent Technology Co Ltd filed Critical Suzhou Inspur Intelligent Technology Co Ltd
Priority to CN202211557599.XA priority Critical patent/CN115794524A/en
Publication of CN115794524A publication Critical patent/CN115794524A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Debugging And Monitoring (AREA)

Abstract

The embodiment of the invention provides a checking method, a checking device, electronic equipment and a readable storage medium, wherein an initial error report aiming at persistent memory equipment is obtained; selecting target marking error memory equipment from the persistent memory equipment; determining slot position information corresponding to the memory device with the wrong destination label; generating an error command through the slot position information; injecting the error command into the memory device with the target label error according to the slot position information, and generating error information aiming at the memory device with the target label error; the initial error report is verified through the error information, so that whether the initial error report is accurate or not can be judged by checking whether IERR disappears or not after replacement is avoided, and the efficiency of processing PMEM faults is improved.

Description

Verification method, verification device, electronic equipment and readable storage medium
Technical Field
The present invention relates to the field of storage technologies, and in particular, to a verification method, a verification apparatus, an electronic device, and a computer-readable storage medium.
Background
The Memory device PMem (Persistent Memory) is a novel Memory, and has the functions of both a normal Memory and a hard disk.
When the PMEM fails, the PMEM needs to be replaced according to the prompt information to avoid that the failed PMEM cannot execute caching or storage operation, so that server-free maintenance is realized.
Disclosure of Invention
Embodiments of the present invention provide a verification method, an apparatus, an electronic device, and a computer-readable storage medium, so as to solve a problem of how to improve efficiency of processing a PMEM fault.
The embodiment of the invention discloses a checking method, which is applied to a server, wherein the server is provided with a PMEM (persistent memory device), and the method comprises the following steps:
acquiring an initial error report aiming at the PMEM;
selecting a target fault-marked memory device from the PMEM;
determining slot position information corresponding to the memory device with the wrong destination label;
generating an error command through the slot position information;
injecting the error command into the memory device with the target label error according to the slot position information, and generating error information aiming at the memory device with the target label error;
and checking the initial error report through the error information.
Optionally, the persistent memory device PMEM has a corresponding basic input output system BIOS, where the basic input output system BIOS includes an error information injection state for the persistent memory device, and before the step of selecting the target error memory device from the persistent memory device PMEM, the method may further include:
configuring the error information injection state to an adjustable state.
Optionally, the server includes an operating system OS for the persistent memory device PMEM, where the persistent memory device PMEM has a corresponding memory module manager ipmctl, and the step of selecting the target memory device with the fault address from the persistent memory device PMEM may include:
and creating an application direct mode aiming at the persistent memory device PMEM in the operating system OS, and issuing a target mislabeled memory device selection instruction to the memory module management program ipmctl in the application direct mode so as to select the target mislabeled memory device from the persistent memory device PMEM.
Optionally, the step of determining slot location information corresponding to the target-labeled error memory device may include:
and connecting the memory device with the wrong target label, and issuing a slot position determining command aiming at the memory device with the wrong target label to the memory module management program ipmctl when the memory device with the wrong target label is judged to be successfully connected so as to determine slot position information corresponding to the memory device with the wrong target label.
Optionally, the step of injecting the error command into the target memory device with the error through the slot location information may include:
and injecting the error command into the target memory device with the error label in the adjustable state according to the slot position information.
Optionally, the target labeled error memory device has corresponding error log information, and the method may further include:
and checking the error logarithm information, and judging that the error command is successfully injected into the target memory device with the fault when the error logarithm information contains the error command.
Optionally, the server has a central processing unit, the error information injection state includes a disable state, and the step of generating the error information for the target memory device with the fault may include:
when the error information injection state is changed from the adjustable state to the disabled state, a progress state display command is issued to the memory module management program ipmctl so as to trigger the central processing unit to generate an error log which is specific to the target memory device with the error and contains the slot position information.
Optionally, the step of checking the initial error report by the error information may include:
the initial error report is verified by the error log.
Optionally, the persistent memory device PMEM has a corresponding motherboard device, where the motherboard device includes an error warning apparatus for the memory device with the target label error, and the method may further include:
and triggering the mainboard equipment to generate warning device control information aiming at the error warning device.
Optionally, the step of checking the initial error report by the error information may include:
and checking the initial error report through the control information of the warning device.
The embodiment of the invention also discloses a checking device, which is applied to a server, wherein the server is provided with the PMEM, and the checking device comprises:
an initial error report acquisition module, configured to acquire an initial error report for the PMEM;
a target labeling error memory device selecting module, configured to select a target labeling error memory device from the persistent memory device PMEM;
the slot position information determining module is used for determining slot position information corresponding to the memory device with the wrong destination label;
the error command generating module is used for generating an error command according to the slot position information;
the error command injection module is used for injecting the error command into the target memory device with the error label through the slot position information and generating error information aiming at the target memory device with the error label;
and the checking module is used for checking the initial error report through the error information.
Optionally, the persistent memory device PMEM has a corresponding basic input output system BIOS, where the basic input output system BIOS includes an error information injection state for the persistent memory device, and may further include:
and the adjustable state configuration module is used for configuring the error information injection state into an adjustable state.
Optionally, the server includes an operating system OS for the persistent memory device PMEM, where the persistent memory device PMEM has a corresponding memory module management program ipmctl, and the module for selecting the target memory device with an error may include:
and the target labeling error memory device selecting submodule is used for creating an application direct mode aiming at the permanent memory device PMEM in the operating system OS, and issuing a target labeling error memory device selecting instruction to the memory module management program ipmctl in the application direct mode so as to select the target labeling error memory device from the permanent memory device PMEM.
Optionally, the slot position information determining module may include:
and the slot position information determining submodule is used for connecting the memory device with the wrong eye mark, and issuing a slot position determining command aiming at the memory device with the wrong eye mark to the memory module management program ipmctl when the memory device with the wrong eye mark is judged to be successfully connected so as to determine the slot position information corresponding to the memory device with the wrong eye mark.
Optionally, the error command injection module may include:
and the error command injection submodule is used for injecting the error command into the target-labeled error memory device in the adjustable state through the slot position information.
Optionally, the target error memory device has corresponding error log information, and the apparatus may further include:
and the error command injection success judging module is used for checking the error logarithm information, and judging that the error command is successfully injected into the target error memory device when the error logarithm information contains the error command.
Optionally, the server has a central processing unit, the error information injection status includes a disable status, and the error command injection module may include:
and the error log generation submodule is used for issuing a progress state display command to the memory module management program ipmctl when the error information injection state is changed from the adjustable state to the disabled state so as to trigger the central processing unit to generate an error log which is used for marking the wrong memory device and contains the slot position information.
Optionally, the verification module may include:
and the first checking submodule is used for checking the initial error report through the error log.
Optionally, the persistent memory device PMEM has a corresponding motherboard device, the motherboard device includes an error warning apparatus for the memory device with a fault label, and the verification module may include:
the warning lamp control information generation submodule is used for triggering the mainboard equipment to generate warning device control information aiming at the error warning device;
and the second checking submodule is used for checking the initial error report through the control information of the warning device.
The embodiment of the invention also discloses electronic equipment which comprises a processor, a communication interface, a memory and a communication bus, wherein the processor, the communication interface and the memory finish mutual communication through the communication bus;
the memory is used for storing a computer program;
the processor is configured to implement the method according to the embodiment of the present invention when executing the program stored in the memory.
Also disclosed is a computer-readable storage medium having instructions stored thereon, which, when executed by one or more processors, cause the processors to perform a method according to an embodiment of the invention.
The embodiment of the invention has the following advantages:
according to the embodiment of the invention, an initial error report aiming at the PMEM is obtained; selecting a target fault-marked memory device from the PMEM; determining slot position information corresponding to the memory device with the wrong destination label; generating an error command through the slot position information; injecting the error command into the memory device with the target label error according to the slot position information, and generating error information aiming at the memory device with the target label error; the initial error report is verified through the error information, so that whether the initial error report is accurate or not can be judged by checking whether IERR disappears or not after replacement is avoided, and the efficiency of processing PMEM faults is improved.
Drawings
FIG. 1 is a flow chart illustrating the steps of a verification method provided in an embodiment of the present invention;
fig. 2 is a block diagram of a checking apparatus provided in an embodiment of the present invention;
fig. 3 is a block diagram of a hardware configuration of an electronic device provided in embodiments of the present invention;
fig. 4 is a schematic diagram of a computer-readable medium provided in an embodiment of the present invention.
Detailed Description
In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in further detail below.
Referring to fig. 1, a flowchart illustrating steps of a verification method provided in an embodiment of the present invention is shown, which may specifically include the following steps:
step 101, obtaining an initial error report for the persistent memory device;
102, selecting target label error memory equipment from the persistent memory equipment;
step 103, determining slot position information corresponding to the target-labeled error memory device;
104, generating an error command through the slot position information;
step 105, injecting the error command into the target memory device with the error label through the slot position information, and generating error information for the target memory device with the error label;
and 106, checking the initial error report through the error information.
In a specific implementation, the embodiments of the present invention may be applied to a server having a persistent memory device PMEM.
The server provides calculation or application services for other clients (such as terminals like PC, smart phone, ATM and the like and even large equipment like train systems and the like) in the network. The server has high-speed CPU computing capability, long-time reliable operation, strong I/O external data throughput capability and better expansibility.
Generally, a server has the capability of responding to a service request, supporting a service, and guaranteeing the service according to the service provided by the server.
The Persistent Memory device PMEM (Intel Optane Persistent Memory) perfectly combines affordable large capacity with support for data persistence. Users can build larger persistent memory tiers within an affordable cost range, providing flexible (including volatile or non-volatile) high-performance storage tiers for various workloads, including but not limited to in-cloud virtual machines, memory databases, and the like.
The embodiment of the invention can acquire the initial error report aiming at the PMEM, and specifically, the initial error report can be automatically generated by a system or generated by related technicians according to statistics of related parameters.
The embodiment of the invention can also select the target error-marked memory device from the permanent memory device PMEM and then determine the slot position information corresponding to the target error-marked memory device.
In a specific implementation, the target error memory device of the embodiment of the present invention may be a persistent memory device PMEM to which an error command is to be injected.
In a specific implementation, the server may include a motherboard device, where the motherboard device may include a slot for the persistent memory device PMEM, and the slot location information determined in the embodiment of the present invention may be used to express a slot location for a target-labeled faulty memory device.
After the slot position information corresponding to the target error-injection memory device is determined, the embodiment of the invention can generate the error command through the slot position information, and inject the error command into the target error-injection memory device through the slot position information, thereby generating the error information aiming at the target error-injection memory device.
After generating the error information for the target-labeled error memory device, the embodiment of the invention can check the initial error report through the error information.
For example, by triggering the link fail for the target PMEM, active triggering of the IERR for the target PMEM is achieved, which is a catastrophic error reported by the processor, but is typically caused by a device outside the processor core (e.g., memory, PCIe). Whether the previously acquired prompt information (initial error report) is consistent is then determined by actively triggering generation of a relevant resolution log for the IERR, for example, whether the error reporting device in the initial error report is consistent with the device that actually failed, or whether the location of the error reporting device in the initial error report is consistent with the location of the device that actually failed, and so on.
According to the embodiment of the invention, an initial error report aiming at the PMEM is obtained; selecting a target fault-marked memory device from the PMEM; determining slot position information corresponding to the memory device with the wrong destination label; generating an error command through the slot position information; injecting the error command into the memory device with the target label error according to the slot position information, and generating error information aiming at the memory device with the target label error; the initial error report is verified through the error information, so that whether the initial error report is accurate or not can be judged by checking whether IERR disappears or not after replacement is avoided, and the efficiency of processing PMEM faults is improved.
On the basis of the above-described embodiment, a modified embodiment of the above-described embodiment is proposed, and it is to be noted herein that, in order to make the description brief, only the differences from the above-described embodiment are described in the modified embodiment.
In an optional embodiment of the present invention, the persistent memory device PMEM has a corresponding Basic Input Output System BIOS (Basic Input Output System), where the Basic Input Output System BIOS includes an error information injection state for the persistent memory device, and before the step of selecting the memory device with the fault entry from the persistent memory device PMEM, the method may further include:
configuring the error information injection state to an adjustable state.
In order to avoid that an error command cannot be injected into the persistent memory device PMEM, the embodiment of the present invention may configure a corresponding BIOS for the persistent memory device PMEM, where the BIOS may include an error information injection state for the persistent memory device.
For example, after the server is booted, the relevant BIOS option may be set, and the Error information Injection state PMem Error Injection may be set to the adjustable state Enabled.
According to the embodiment of the invention, the error information injection state is configured to be an adjustable state, so that the success rate of injecting the error command can be improved when the PMEM is injected with the error command in the subsequent process.
In an optional embodiment of the present invention, the server includes an operating system OS for the persistent memory device PMEM, the persistent memory device PMEM has a corresponding memory module manager ipmctl, and the step of selecting the memory device with the tag error from the persistent memory device PMEM includes:
and creating an application direct mode aiming at the PMEM in the operating system OS, and issuing an object labeling error memory device selection instruction to the memory module management program ipmctl in the application direct mode so as to select an object labeling error memory device from the PMEM.
In a specific implementation, the server according to the embodiment of the present invention may include an operating system OS for the persistent memory device PMEM, and the persistent memory device PMEM may further have a corresponding memory module hypervisor ipmctl.
An Operating System (OS) is a set of interrelated System software programs that host and control computer operations, use and run hardware, software resources and provide common services to organize user interactions. Operating systems can be classified into desktop operating systems, mobile phone operating systems, server operating systems, embedded operating systems, and the like according to the operating environment.
The IPMCTL may be a utility for managing persistent memory devices PMEM, and in practical applications, the functions supported by the IPMCTL include, but are not limited to, discovering persistent memory modules in the platform, setting platform memory configuration, viewing and updating firmware on the PMEM, configuring static security data on the PMEM, monitoring PMEM operating conditions, tracking the performance of the PMEM, debugging and troubleshooting PMEM failures, and the like.
According to the embodiment of the invention, an application direct mode AppDirect aiming at the persistent memory device PMEM can be established in the operating system OS, and a target-wrongly-labeled memory device selecting instruction is issued to the memory module management program ipmctl under the application direct mode AppDirect, so that the target-wrongly-labeled memory device is selected from the persistent memory device PMEM.
For example, an operating system OS is entered, an AppDirect mode is created for the PMEM, and an ipmctl create-f-high permanent memory location = appdirected failed memory device is issued to the ipmctl through the operating system OS in the AppDirect mode, so as to select a target-marked error memory device from the persistent memory device PMEM.
According to the embodiment of the invention, the direct application mode aiming at the PMEM is created in the operating system OS, and the selection instruction of the memory device with the wrong destination label is sent to the memory module management program ipmctl in the direct application mode, so that the memory device with the wrong destination label is selected from the PMEM, thereby efficiently selecting the memory device with the wrong destination label from the PMEM, and further improving the efficiency of processing PMEM faults.
In an optional embodiment of the present invention, the step of determining slot location information corresponding to the target misprinted memory device includes:
and connecting the memory device with the wrong target label, and issuing a slot position determining command aiming at the memory device with the wrong target label to the memory module management program ipmctl when the memory device with the wrong target label is judged to be successfully connected so as to determine slot position information corresponding to the memory device with the wrong target label.
Illustratively, in the embodiment of the present invention, by connecting the memory device with the wrong destination label, when it is determined that the memory device with the wrong destination label is successfully connected, a slot position determination command PMEM.
Of course, the above examples are merely examples, and those skilled in the art may use any reference numbers to express the position where the PMEM is inserted, and the embodiments of the present invention are not limited thereto.
Optionally, in the embodiment of the present invention, before determining the slot position information, it may be determined whether the target memory device with the wrong label is successfully connected, specifically, it may be determined whether the target memory device with the wrong label is successfully connected by checking the relevant information of the server through the target memory device with the wrong label, for example, when the relevant information of the server is checked through the target memory device with the wrong label, it may be determined that the target memory device with the wrong label is successfully connected.
In the embodiment of the invention, by connecting the memory device with the wrong eye mark, when the memory device with the wrong eye mark is successfully connected, a slot position determining command aiming at the memory device with the wrong eye mark is issued to the memory module management program ipmctl so as to determine the slot position information corresponding to the memory device with the wrong eye mark, so that the slot position information corresponding to the memory device with the wrong eye mark is efficiently determined, and the efficiency of processing PMEM faults is further improved.
In an optional embodiment of the present invention, the step of injecting the error command into the target memory device with the slot location information includes:
and injecting the error command into the target memory device with the error label in the adjustable state according to the slot position information.
Illustratively, in the embodiments of the present invention, when it is determined that the destination-labeled error memory device is successfully connected, a command pmem.show.list () is issued to the memory module manager ipmctl to determine a slot position of the destination-labeled error memory device, so as to obtain slot position information pmemXXX corresponding to the domain-labeled error memory device, and an error command pmemxxx.inject.error (108,0) is generated by using the slot position information pmemXXX, and then a command pmem.show.status () is issued to the memory module manager ipmctl to check that the current state of the PMEM is Enabled, an error command pmxxx.inject.error (108,0) is injected to the destination-labeled error memory device by using the slot position information pmemXXX, where the location of the PMEM expressed by XXX is inserted, for example, 010, 011, and the Link error information 108 may be Link failure error information 3242.
According to the embodiment of the invention, the error command is injected into the target error injection memory device in the adjustable state through the slot position information, so that the error command is efficiently injected into the target error injection memory device, and the efficiency of processing PMEM faults is further improved.
In an optional embodiment of the present invention, the target error memory device has corresponding error log information, and the method further includes:
and checking the error logarithm information, and judging that the error command is successfully injected into the target memory device with the fault when the error logarithm information contains the error command.
In a specific implementation, in order to further improve efficiency of processing a PMEM fault, the embodiment of the present invention may determine whether the error command injected into the target error-injecting memory device is successful after the error command is injected into the target error-injecting memory device.
For example, a pmemxxx. Error _ Log () command may be issued to the memory module manager ipmctl to view an Error Log of the memory device PMEM with a target Error, and when the type of the memory Error with the Error is link fault Error, it is determined that the Error command is successfully injected into the target Error-injected memory device.
In the embodiment of the invention, the error logarithm information is checked, and when the error logarithm information contains the error command, the error command is successfully injected into the target error-labeled memory device, so that the success rate of injecting the error command into the target error-labeled memory device is ensured, and the efficiency of processing PMEM faults is further improved.
In an optional embodiment of the present invention, the server has a central processing unit, the error information injection state includes a disabled state, and the step of generating the error information for the target faulty memory device includes:
when the error information injection state is changed from the adjustable state to the disabled state, a progress state display command is issued to the memory module management program ipmctl so as to trigger the central processing unit to generate an error log which is specific to the target memory device with the error and contains the slot position information.
In a specific implementation, the server according to the embodiment of the present invention may have a Central Processing Unit (CPU), which is a final execution unit for information processing and program operation and serves as an operation and control core of a computer system. Since the production of CPUs, great developments have been made in the extension of logic structures, operation efficiencies, and functions.
Illustratively, in the embodiment of the present invention, when it is determined that the destination-labeled error memory device is successfully connected, the memory module manager ipmctl may issue a slot position determination command pmem.show.list () for the destination-labeled error memory device, so as to obtain slot position information pmemXXX corresponding to the domain-labeled error memory device, and generate an error command pmemxxx.object.error (108,0) by using the slot position information pmemXXX, and then issue a command pmem.show.status () for the memory module manager ipmctl, when the current state of the PMEM is checked as Enabled, the memory module manager ipmctl injects an error command emxxx.object.error (108,0) into the destination-labeled error memory device by using the slot position information pmemXXX, where the position of the inserted XXX represented by the memory module mme is changed to an error state, for example, 001, 011, 108 may be a Link error information indicating that the state of the inserted XXX is in the slot position determination command PMEM, and the status of the memory module manager ipm may be changed to an adjustable state indicating that the slot position information pmxxx is included in the status of the slot position determination command pmemc.
Optionally, the step of checking the initial error report by the error information includes:
the initial error report is verified by the error log.
In the embodiment of the present invention, when the error information injection state is changed from the adjustable state to the disabled state, a progress state display command is issued to the memory module management program ipmctl to trigger the central processing unit to generate an error log including the slot position information for the target-labeled erroneous memory device, and the initial error report is verified by the error log, so that an error log including the slot position information for the target-labeled erroneous memory device is efficiently generated, the error log can be used to verify the initial error report, and efficiency of processing PMEM faults is further improved.
In an optional embodiment of the present invention, the persistent memory device PMEM has a corresponding motherboard device, the motherboard device includes an error warning apparatus for the target memory device with an error, and the step of checking the initial error report by using the error information includes:
triggering the mainboard equipment to generate warning device control information aiming at the error warning device;
and checking the initial error report through the control information of the warning device.
In a specific implementation, the persistent memory device PMEM of the present invention may have a corresponding motherboard device, which is also called a main board (main board), a system board (system board), or a motherboard (thermal board), and is one of the most important components while being the most basic of a computer. The main board is generally a rectangular circuit board, on which the main circuit system forming the computer is mounted, and generally includes elements such as BIOS chip, I/O control chip, keyboard and panel control switch interface, indicator light plug-in unit, expansion slot, main board and dc power supply plug-in unit of plug-in card.
The mainboard device of the embodiment of the present invention may include an error warning device for a memory device with a wrong destination label, and for example, the embodiment of the present invention may be configured to, by connecting the memory device with a wrong destination label, issue a slot position determination command pmem.show.list () for the memory device with a wrong destination label to the memory module management program ipmctl when it is determined that the memory device with a wrong destination label is successfully connected, obtain slot position information pmemXXX corresponding to the memory device with a wrong destination label, generate an error command pmemxxx.injecterror (108,0) by the slot position information pmemXXX, then issue a command pmem.show.status () for the memory module management program ipmctl, and, when the current state of the PMEM is an Enabled, inject an error command pmxxx.inj.afror (108,0) into the memory device with a wrong destination label through the slot position information XXX, where the PMEM indicated by XXX indicates that the error status of the memory device with a wrong destination label is, and the error command pmxx is generated by a control module management program capable of triggering fault injection fault (for example, and when the state of the ram is an adjustable error control command PMEM is failed.
In practical application, the warning lamp control information can be used for controlling the wrong warning lamp to be turned on so as to express the fault of the memory device marked with the wrong object, and the fault of the memory device marked with the wrong object can be judged by judging whether the wrong warning lamp is turned on or not, so that the initial error report is verified.
In the embodiment of the invention, when the error information injection state is changed from the adjustable state to the disabled state, a progress state display command is issued to the memory module management program ipmctl so as to trigger the main board equipment to generate the warning device control information for the error warning device, and the initial error report is verified through the warning device control information, so that the initial error report is verified in multiple ways, and the efficiency of processing PMEM faults is further improved.
In order that those skilled in the art will better understand the embodiments of the present invention, a full and complete example will now be described.
S1, starting a server, setting relevant BIOS options, and setting PMem Error Injection to Enabled, so that errors can be injected into PMEM, otherwise, errors cannot be injected successfully;
s2, entering an OS, creating an AD mode (AppDirect) for the PMEM, and executing the following commands:
ipmctl create-f-goal persistentmemorytype=appdirectnotinterleaved;
s3, connecting the error injection equipment, confirming that the connection is successful, and checking the related information of the server through the equipment;
s4, executing a command on the fault injection equipment to view the position inserted by the PMEM, wherein the executing command is as follows:
pmem.show.list();
s5, checking the state of the current PMEM as Enabled,
pmem.show.status();
s6, selecting one of the inserted PMEM memories to inject a Link fail error, and executing the commands as follows:
pmemXXX.inject.error(108,0)
PMEM inserted positions 001, 010, 011, etc. denoted XXX, and Link fail error denoted 108;
s7, checking Error Log of the current PMEM, and executing the following command:
pmemXXX.error_log()
the type of the memory error of error injection is found to be link fail error;
s8, checking that the state of the current PMEM is changed from Enabled to Disabled, and executing the following commands:
pmemXXX.show.status();
s9, a cutter Error lamp for checking the main board can be lightened to trigger IERR; if the Catter Error is not lightened, the research and development function is not complete, and the function needs to be added;
s10, checking sel and idl logs can trigger IERR error logs of a CPU, and logs PMEM _ IERR _ Status Fault Status asserted and Detail (Location: pmemXXX) of a specific position of PMEM are provided;
s11, checking whether the IERR is triggered by pmem, if the IERR is triggered only, and if the IERR is not triggered by any part, indicating that the research and development do not do a specific positioning function, and needing to research and develop and increase the function; if the part is reported, but the position or the part is not reported, the analysis of the development is problematic, and the development is required to be modified.
It should be noted that, for simplicity of description, the method embodiments are described as a series of acts or combination of acts, but those skilled in the art will recognize that the present invention is not limited by the illustrated order of acts, as some steps may occur in other orders or concurrently in accordance with the embodiments of the present invention. Further, those skilled in the art will appreciate that the embodiments described in the specification are presently preferred and that no particular act is required to implement the invention.
Referring to fig. 2, a block diagram of a structure of a verification apparatus provided in the embodiment of the present invention is shown, and specifically, the structure may include the following modules:
an initial error report acquisition module 201, configured to acquire an initial error report for the PMEM;
a target labeling error memory device selecting module 202, configured to select a target labeling error memory device from the persistent memory device PMEM;
a slot position information determining module 203, configured to determine slot position information corresponding to the memory device with the wrong destination label;
an error command generating module 204, configured to generate an error command according to the slot position information;
an error command injection module 205, configured to inject the error command into the target-labeled error memory device according to the slot position information, and generate error information for the target-labeled error memory device;
a checking module 206, configured to check the initial error report according to the error information.
Optionally, the persistent memory device PMEM has a corresponding basic input output system BIOS, where the basic input output system BIOS includes an error information injection state for the persistent memory device, and the method may further include:
and the adjustable state configuration module is used for configuring the error information injection state into an adjustable state.
Optionally, the server includes an operating system OS for the persistent memory device PMEM, where the persistent memory device PMEM has a corresponding memory module manager ipmctl, and the module for selecting the target memory device with the fault may include:
and the target labeling error memory device selecting submodule is used for creating an application direct mode aiming at the permanent memory device PMEM in the operating system OS, and issuing a target labeling error memory device selecting instruction to the memory module management program ipmctl in the application direct mode so as to select the target labeling error memory device from the permanent memory device PMEM.
Optionally, the slot position information determining module may include:
and the slot position information determining submodule is used for connecting the memory device with the wrong eye mark, and issuing a slot position determining command aiming at the memory device with the wrong eye mark to the memory module management program ipmctl when the memory device with the wrong eye mark is judged to be successfully connected so as to determine the slot position information corresponding to the memory device with the wrong eye mark.
Optionally, the error command injection module may include:
and the error command injection submodule is used for injecting the error command into the target-labeled error memory device in the adjustable state through the slot position information.
Optionally, the target error memory device has corresponding error log information, and the apparatus may further include:
and the error command injection success judging module is used for checking the error logarithm information, and judging that the error command is successfully injected into the target error memory device when the error logarithm information contains the error command.
Optionally, the server has a central processing unit, the error information injection status includes a disable status, and the error command injection module may include:
and the error log generation submodule is used for issuing a progress state display command to the memory module management program ipmctl when the error information injection state is changed from the adjustable state to the disabled state so as to trigger the central processing unit to generate an error log which is used for marking the wrong memory device and contains the slot position information.
Optionally, the verification module may include:
and the first checking submodule is used for checking the initial error report through the error log.
Optionally, the persistent memory device PMEM has a corresponding motherboard device, the motherboard device includes an error warning apparatus for the memory device with a fault label, and the verification module may include:
the warning lamp control information generation submodule is used for triggering the mainboard equipment to generate warning device control information aiming at the error warning device;
and the second checking submodule is used for checking the initial error report through the control information of the warning device.
For the apparatus embodiment, since it is substantially similar to the method embodiment, the description is relatively simple, and reference may be made to the partial description of the method embodiment for relevant points.
In addition, an embodiment of the present invention further provides an electronic device, including: the processor, the memory, and the computer program stored in the memory and capable of running on the processor, where the computer program is executed by the processor to implement the processes of the foregoing verification method embodiment, and can achieve the same technical effects, and are not described herein again to avoid repetition.
The embodiment of the present invention further provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when being executed by a processor, the computer program implements each process of the foregoing verification method embodiment, and can achieve the same technical effect, and is not described herein again to avoid repetition. The computer-readable storage medium may be a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk.
Fig. 3 is a schematic diagram of a hardware structure of an electronic device implementing various embodiments of the present invention.
The electronic device 300 includes, but is not limited to: radio frequency unit 301, network module 302, audio output unit 303, input unit 304, sensor 305, display unit 306, user input unit 307, interface unit 308, memory 309, processor 310, and power supply 311. Those skilled in the art will appreciate that the electronic device configuration shown in fig. 3 does not constitute a limitation of the electronic device, and that the electronic device may include more or fewer components than shown, or combine certain components, or a different arrangement of components. In the embodiment of the present invention, the electronic device includes, but is not limited to, a mobile phone, a tablet computer, a notebook computer, a palm computer, a vehicle-mounted terminal, a wearable device, a pedometer, and the like.
It should be understood that, in the embodiment of the present invention, the radio frequency unit 301 may be used for receiving and sending signals during a message sending and receiving process or a call process, and specifically, receives downlink data from a base station and then processes the received downlink data to the processor 310; in addition, the uplink data is transmitted to the base station. In general, the radio frequency unit 301 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a low noise amplifier, a duplexer, and the like. In addition, the radio frequency unit 301 can also communicate with a network and other devices through a wireless communication system.
The electronic device provides wireless broadband internet access to the user via the network module 302, such as assisting the user in sending and receiving e-mails, browsing web pages, and accessing streaming media.
The audio output unit 303 may convert audio data received by the radio frequency unit 301 or the network module 302 or stored in the memory 309 into an audio signal and output as sound. Also, the audio output unit 303 may also provide audio output related to a specific function performed by the electronic apparatus 300 (e.g., a call signal reception sound, a message reception sound, etc.). The audio output unit 303 includes a speaker, a buzzer, a receiver, and the like.
The input unit 304 is used to receive audio or video signals. The input Unit 304 may include a Graphics Processing Unit (GPU) 3041 and a microphone 3042, and the Graphics processor 3041 processes image data of a still picture or video obtained by an image capturing apparatus (such as a camera) in a video capturing mode or an image capturing mode. The processed image frames may be displayed on the display unit 306. The image frames processed by the graphic processor 3041 may be stored in the memory 309 (or other storage medium) or transmitted via the radio frequency unit 301 or the network module 302. The microphone 3042 may receive sounds and may be capable of processing such sounds into audio data. The processed audio data may be converted into a format output transmittable to a mobile communication base station via the radio frequency unit 301 in case of the phone call mode.
The electronic device 300 also includes at least one sensor 305, such as a light sensor, motion sensor, and other sensors. Specifically, the light sensor includes an ambient light sensor that adjusts the brightness of the display panel 3061 according to the brightness of ambient light, and a proximity sensor that turns off the display panel 3061 and/or the backlight when the electronic device 300 is moved to the ear. As one type of motion sensor, an accelerometer sensor can detect the magnitude of acceleration in each direction (generally three axes), detect the magnitude and direction of gravity when stationary, and can be used to identify the posture of an electronic device (such as horizontal and vertical screen switching, related games, magnetometer posture calibration), and vibration identification related functions (such as pedometer, tapping); the sensors 305 may also include fingerprint sensors, pressure sensors, iris sensors, molecular sensors, gyroscopes, barometers, hygrometers, thermometers, infrared sensors, etc., which are not described in detail herein.
The display unit 306 is used to display information input by the user or information provided to the user. The Display unit 306 may include a Display panel 3061, and the Display panel 3061 may be configured in the form of a Liquid Crystal Display (LCD), an Organic Light-Emitting Diode (OLED), or the like.
The user input unit 307 may be used to receive input numeric or character information and generate key signal inputs related to user settings and function control of the electronic device. Specifically, the user input unit 307 includes a touch panel 3071 and other input devices 3072. Touch panel 3071, also referred to as a touch screen, may collect touch operations by a user on or near it (e.g., operations by a user on or near touch panel 3071 using any suitable object or accessory such as a finger, a stylus, etc.). The touch panel 3071 may include two parts of a touch detection device and a touch controller. The touch detection device detects the touch direction of a user, detects a signal brought by touch operation and transmits the signal to the touch controller; the touch controller receives touch information from the touch sensing device, converts the touch information into touch point coordinates, sends the touch point coordinates to the processor 310, and receives and executes commands sent by the processor 310. In addition, the touch panel 3071 may be implemented using various types, such as resistive, capacitive, infrared, and surface acoustic wave. The user input unit 307 may include other input devices 3072 in addition to the touch panel 3071. Specifically, the other input devices 3072 may include, but are not limited to, a physical keyboard, function keys (such as volume control keys, switch keys, etc.), a trackball, a mouse, and a joystick, which are not described herein.
Further, the touch panel 3071 may be overlaid on the display panel 3061, and when the touch panel 3071 detects a touch operation on or near the touch panel, the touch operation is transmitted to the processor 310 to determine the type of the touch event, and then the processor 310 provides a corresponding visual output on the display panel 3061 according to the type of the touch event. Although the touch panel 3071 and the display panel 3061 are shown in fig. 3 as two separate components to implement the input and output functions of the electronic device, in some embodiments, the touch panel 3071 and the display panel 3061 may be integrated to implement the input and output functions of the electronic device, which is not limited herein.
The interface unit 308 is an interface for connecting an external device to the electronic apparatus 300. For example, the external device may include a wired or wireless headset port, an external power supply (or battery charger) port, a wired or wireless data port, a memory card port, a port for connecting a device having an identification module, an audio input/output (I/O) port, a video I/O port, an earphone port, and the like. The interface unit 308 may be used to receive input (e.g., data information, power, etc.) from an external device and transmit the received input to one or more elements within the electronic apparatus 300 or may be used to transmit data between the electronic apparatus 300 and the external device.
The memory 309 may be used to store software programs as well as various data. The memory 309 may mainly include a program storage area and a data storage area, wherein the program storage area may store an operating system, an application program required by at least one function (such as a sound playing function, an image playing function, etc.), and the like; the storage data area may store data (such as audio data, a phonebook, etc.) created according to the use of the cellular phone, and the like. Further, the memory 309 may include high speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device.
The processor 310 is a control center of the electronic device, connects various parts of the whole electronic device by using various interfaces and lines, performs various functions of the electronic device and processes data by operating or executing software programs and/or modules stored in the memory 309 and calling data stored in the memory 309, thereby performing overall monitoring of the electronic device. Processor 310 may include one or more processing units; preferably, the processor 310 may integrate an application processor, which mainly handles operating systems, user interfaces, application programs, etc., and a modem processor, which mainly handles wireless communications. It will be appreciated that the modem processor described above may not be integrated into the processor 310.
The electronic device 300 may further include a power supply 311 (such as a battery) for supplying power to various components, and preferably, the power supply 311 may be logically connected to the processor 310 through a power management system, so as to implement functions of managing charging, discharging, and power consumption through the power management system.
In addition, the electronic device 300 includes some functional modules that are not shown, and are not described in detail herein.
In yet another embodiment provided by the present invention, as shown in fig. 4, a computer-readable storage medium 401 is further provided, which stores instructions that, when run on a computer, cause the computer to execute the verification method described in the above embodiment.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a … …" does not exclude the presence of another identical element in a process, method, article, or apparatus that comprises the element.
Through the description of the foregoing embodiments, it is clear to those skilled in the art that the method of the foregoing embodiments may be implemented by software plus a necessary general hardware platform, and certainly may also be implemented by hardware, but in many cases, the former is a better implementation. Based on such understanding, the technical solutions of the present invention may be embodied in the form of a software product, which is stored in a storage medium (such as ROM/RAM, magnetic disk, optical disk) and includes instructions for enabling a terminal (such as a mobile phone, a computer, a server, an air conditioner, or a network device) to execute the method according to the embodiments of the present invention.
While the present invention has been described with reference to the embodiments shown in the drawings, the present invention is not limited to the embodiments, which are illustrative and not restrictive, and it will be apparent to those skilled in the art that various changes and modifications can be made therein without departing from the spirit and scope of the invention as defined in the appended claims.
Those of ordinary skill in the art will appreciate that the various illustrative elements and algorithm steps described in connection with the embodiments disclosed herein may be implemented as electronic hardware or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present invention.
It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the above-described systems, apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
In the embodiments provided in the present application, it should be understood that the disclosed apparatus and method may be implemented in other ways. For example, the above-described apparatus embodiments are merely illustrative, and for example, the division of the units is only one type of logical functional division, and other divisions may be realized in practice, for example, multiple units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, devices or units, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one position, or may be distributed on multiple network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit.
The functions, if implemented in the form of software functional units and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U disk, a removable hard disk, a ROM, a RAM, a magnetic disk or an optical disk, and various media capable of storing program codes.
The above description is only for the specific embodiments of the present invention, but the scope of the present invention is not limited thereto, and any person skilled in the art can easily conceive of the changes or substitutions within the technical scope of the present invention, and all the changes or substitutions should be covered within the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (12)

1. A verification method, applied to a server having a persistent memory device, the method comprising:
acquiring an initial error report aiming at the persistent memory device;
selecting target marking error memory equipment from the persistent memory equipment;
determining slot position information corresponding to the memory device with the wrong destination label;
generating an error command through the slot position information;
injecting the error command into the memory device with the target label error according to the slot position information, and generating error information aiming at the memory device with the target label error;
and checking the initial error report through the error information.
2. The method of claim 1, wherein the persistent memory device has a corresponding bios, the bios including an error information injection status for the persistent memory device, and further comprising, prior to the step of selecting the targeted faulty memory device from the persistent memory device:
configuring the error information injection state to an adjustable state.
3. The method of claim 2, wherein the server includes an operating system for the persistent memory device having a corresponding memory module manager, and wherein selecting the memory device with the tag error from the persistent memory device comprises:
and establishing an application direct mode aiming at the persistent memory device in the operating system, and issuing a target-labeled error memory device selection instruction to the memory module management program under the application direct mode so as to select the target-labeled error memory device from the persistent memory device.
4. The method of claim 3, wherein the step of determining slot location information corresponding to the target misprinted memory device comprises:
and connecting the memory device with the wrong target label, and issuing a slot position determining command aiming at the memory device with the wrong target label to the memory module management program when the memory device with the wrong target label is judged to be successfully connected so as to determine the slot position information corresponding to the memory device with the wrong target label.
5. The method according to claim 4, wherein the step of injecting the error command into the target memory device with the slot location information comprises:
marking the target with wrong memory equipment in the adjustable state through the slot position information
Injecting the error command.
6. The method of claim 5, wherein the target error memory device has corresponding error log information, the method further comprising:
and checking the error logarithm information, and judging that the error command is successfully injected into the target memory device with the fault when the error logarithm information contains the error command.
7. The method of claim 5, wherein the server has a central processing unit, wherein the error information injection status comprises a disabled status, and wherein generating the error information for the target faulty memory device comprises:
when the error information injection state is changed from the adjustable state to the disabled state, a progress state display command is issued to the memory module management program to trigger the central processing unit to generate an error log which is specific to the target-labeled error memory device and contains the slot position information.
8. The method of claim 7, wherein the step of checking the initial error report with the error information comprises:
the initial error report is verified by the error log.
9. The method of claim 7, wherein the persistent memory device has a corresponding motherboard device, the motherboard device includes an error alert for the target faulty memory device, and the step of checking the initial error report with the error information comprises:
triggering the mainboard equipment to generate warning device control information aiming at the error warning device;
and checking the initial error report through the control information of the warning device.
10. A verification apparatus, wherein the apparatus is applied to a server, and the server has a persistent memory device, the apparatus comprising:
an initial error report acquisition module, configured to acquire an initial error report for the persistent memory device;
the target labeling error memory device selecting module is used for selecting the target labeling error memory device from the persistent memory device;
the slot position information determining module is used for determining slot position information corresponding to the memory device with the wrong destination label;
the error command generating module is used for generating an error command according to the slot position information;
the error command injection module is used for injecting the error command into the target memory device with the error label through the slot position information and generating error information aiming at the target memory device with the error label;
and the checking module is used for checking the initial error report through the error information.
11. An electronic device, comprising a processor, a communication interface, a memory and a communication bus, wherein the processor, the communication interface and the memory communicate with each other via the communication bus;
the memory is used for storing a computer program;
the processor, when executing a program stored on the memory, implementing the method of any of claims 1-9.
12. A computer-readable storage medium having stored thereon instructions, which when executed by one or more processors, cause the processors to perform the method of any one of claims 1-9.
CN202211557599.XA 2022-12-06 2022-12-06 Verification method, verification device, electronic equipment and readable storage medium Pending CN115794524A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211557599.XA CN115794524A (en) 2022-12-06 2022-12-06 Verification method, verification device, electronic equipment and readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211557599.XA CN115794524A (en) 2022-12-06 2022-12-06 Verification method, verification device, electronic equipment and readable storage medium

Publications (1)

Publication Number Publication Date
CN115794524A true CN115794524A (en) 2023-03-14

Family

ID=85417454

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211557599.XA Pending CN115794524A (en) 2022-12-06 2022-12-06 Verification method, verification device, electronic equipment and readable storage medium

Country Status (1)

Country Link
CN (1) CN115794524A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118132358A (en) * 2024-04-29 2024-06-04 苏州元脑智能科技有限公司 Error injection method, system, upper computer, controller, equipment and storage medium

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118132358A (en) * 2024-04-29 2024-06-04 苏州元脑智能科技有限公司 Error injection method, system, upper computer, controller, equipment and storage medium

Similar Documents

Publication Publication Date Title
CN111143005B (en) Application sharing method, electronic equipment and computer readable storage medium
CN109800179B (en) Method for acquiring data, method for sending data, host and embedded memory
CN111309205B (en) Application sharing method, first electronic device and computer-readable storage medium
WO2021143669A1 (en) Method for acquiring configuration information and electronic device
CN110213313B (en) File uploading method, related device and system
CN115589361A (en) Management equipment firmware updating method and device, electronic equipment and storage medium
CN109788470B (en) Method, device and terminal for identifying eSIM card
CN115794524A (en) Verification method, verification device, electronic equipment and readable storage medium
CN116679967B (en) Firmware upgrading method and device for basic input/output system
CN107832191B (en) Black screen detection method, mobile terminal and computer readable storage medium
CN108089935B (en) Application program management method and mobile terminal
CN109745699A (en) A kind of method and terminal device responding touch control operation
CN106709330B (en) Method and device for recording file execution behaviors
CN111159738B (en) Authority configuration method, application login method and device
CN112596980A (en) ios performance collection method and device, mobile terminal and computer readable storage medium
CN109451011B (en) Information storage method based on block chain and mobile terminal
CN109828702B (en) Interface display method and terminal equipment
US11567822B2 (en) Method of monitoring closed system, apparatus thereof and monitoring device
CN108965577B (en) Interface display method and terminal equipment
CN117806857B (en) Fault location information generation method and device, electronic equipment and storage medium
CN112667868A (en) Data detection method and device
CN116841594A (en) Refreshing method and device for basic input/output system
CN108055377B (en) T card detection method and mobile terminal
CN116483451A (en) Method and device for determining on-board equipment information of on-board equipment of server
CN117056971B (en) Data storage method, device, electronic equipment and readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination