WO2016115661A1

WO2016115661A1 - Memory fault isolation method and device

Info

Publication number: WO2016115661A1
Application number: PCT/CN2015/071008
Authority: WO
Inventors: 刘勇
Original assignee: 华为技术有限公司
Priority date: 2015-01-19
Filing date: 2015-01-19
Publication date: 2016-07-28
Also published as: CN106133704A

Abstract

A memory fault isolation method and device. The method comprises: during the operation of a process, a server monitoring the state of a memory identified by an original physical address mapped by a virtual address of the process; if the memory identified by the original physical address has a fault, the server marking the state of a physical address block to which the original physical address belongs as a fault, so as to realize the on-line isolation of a faulty memory space; and the server allocating the physical address block for a page to which the virtual address belongs, and synchronizing data in a memory interval identified by the physical address block to which the original physical address belongs to the memory interval identified by the re-allocated physical address block. In the above-mentioned method, the virtual address of the process in the whole process is unchanged, so as to ensure that a service is uninterrupted and realize the on-line isolation of a faulty memory space.

Description

Memory fault isolation method and device

Technical field

Embodiments of the present invention relate to computer technologies, and in particular, to a memory fault isolation method and apparatus.

Background technique

A memory failure on the server will cause the server or board to be reset. The server reset will cause the running application to be interrupted, and the server needs to be returned to the manufacturer to replace the memory. It is inconvenient to replace the memory.

The prior art provides an offline fault memory isolation method, which does not need to return to the manufacturer to replace the memory. The method includes: first, detecting the memory through a Basic Input Output System (BIOS) before the server is running, and Obtain the address information of the faulty memory space, save the address information of the faulty memory space to a non-volatile memory (Non-Volatile Memory, NVM for short), and read the address information of the faulty memory space saved in the NVM, and the address information is obtained. The corresponding fault memory space is marked as unavailable, permanently isolating the fault memory space.

The method provided by the above prior art can only perform fault isolation before the server is running, and a memory hardware failure occurs during the running of the server, which still causes service interruption.

Summary of the invention

The embodiment of the invention provides a memory fault isolation method and device, which can perform fault isolation during server operation and avoid service interruption.

A first aspect of the present invention provides a memory fault isolation apparatus, including:

An exception processing module, configured to monitor a state of a memory that is identified by a physical address mapped by a virtual address of the process, where a page to which the virtual address belongs has a mapping relationship with a physical address block to which the original physical address belongs, The physical address block is used to identify a contiguous memory interval allocated to the process;

The exception handling module is further configured to mark a status of the physical address block to which the original physical address belongs as a fault, if the memory identified by the original physical address is faulty;

a memory management module for using a non-faulty memory for the page block to which the virtual address belongs Newly allocated physical address block;

The exception processing module is further configured to synchronize data in a memory interval identified by a physical address block to which the original physical address belongs to a memory interval identified by the reallocated physical address block.

In conjunction with the first aspect of the present invention, in a first possible implementation of the first aspect of the present invention, the exception processing module is further configured to: save the information of the physical address block marked as a fault to a nonvolatile memory in;

The memory management module is specifically configured to:

Determining the non-faulty memory according to information of a failed physical address block stored in the nonvolatile memory;

Reassigning a physical address block from the non-faulty memory to a page to which the virtual address belongs according to the virtual address and the process ID of the process.

With reference to the first possible implementation manner of the first aspect of the present invention, in a second possible implementation manner of the first aspect of the present disclosure, the memory management module is specifically configured to:

Obtaining, according to the virtual address and a process ID of the process, a page to which the virtual address belongs;

And selecting a physical address block from the non-faulty memory, and establishing a mapping relationship between the page to which the virtual address belongs to the selected physical address block.

In conjunction with the first aspect of the present invention, in a third possible implementation of the first aspect of the present invention, the memory management module is further configured to:

Reading information of a failed physical address block held in the nonvolatile memory when initial memory is allocated to the process;

Determining non-faulty memory according to the information of the fault physical address block, where the non-fault memory is a memory other than the fault physical address block in the memory;

The initial memory is allocated to the process from the non-faulty memory.

In conjunction with the first aspect of the present invention and any one of the first to third possible implementations of the first aspect of the present invention, in a fourth possible implementation of the first aspect of the present invention, the exception handling The module is also used to:

When the server is restarted, the memory is fault detected;

If it is detected that the physical address block to which the original physical address belongs has returned to normal, the status of the physical address block to which the original physical address belongs is marked as non-fault.

A second aspect of the present invention provides a memory fault isolation method, including:

The state of the memory identified by the original physical address to which the virtual address of the process is mapped, wherein the page to which the virtual address belongs has a mapping relationship with the physical address block to which the original physical address belongs, and the physical address block is used to identify a contiguous memory interval allocated to the process;

If the memory identified by the original physical address fails, the status of the physical address block to which the original physical address belongs is marked as a fault;

Relocating a physical address block from the non-faulty memory for the page block to which the virtual address belongs, and synchronizing data in the memory interval identified by the physical address block to which the original physical address belongs to the reallocated physical address block In the identified memory interval.

With reference to the second aspect of the present invention, in a first possible implementation manner of the second aspect of the present invention, the method further includes:

Saving the information of the physical address block marked as failed to the non-volatile memory;

Reassigning a physical address block to the page to which the virtual address belongs from the non-faulty memory includes:

With reference to the first possible implementation manner of the second aspect of the present invention, in a second possible implementation manner of the second aspect of the present invention, the The physical memory block is re-allocated in the fault memory for the page to which the virtual address belongs, including:

With reference to the second aspect of the present invention, in a third possible implementation manner of the second aspect of the present invention, the method further includes:

Determining non-faulty memory according to the information of the fault physical address block, the non-faulty memory a memory other than the fault physical address block in the memory;

The initial memory is allocated to the process from the non-faulty memory.

In conjunction with the second aspect of the present invention and any one of the first to third possible implementations of the second aspect of the present invention, in a fourth possible implementation of the second aspect of the present invention, the method further include:

When the server is restarted, the memory is fault detected;

The memory fault isolation method and device of the embodiment of the present invention, during the process running, the server monitors the state of the memory identified by the original physical address mapped by the virtual address of the process, and if the memory identified by the original physical address fails, the server Mark the status of the physical address block to which the original physical address belongs as a fault to isolate the faulty memory space online. And the server allocates a physical address block to the page to which the virtual address belongs, and synchronizes the data in the memory interval identified by the physical address block to which the original physical address belongs to the memory interval identified by the reallocated physical address block. In the method of this embodiment, the virtual address of the process is unchanged during the whole process, so that the service is not interrupted, and the faulty memory space is isolated online.

DRAWINGS

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, a brief description of the drawings used in the embodiments or the prior art description will be briefly described below. Obviously, the drawings in the following description It is a certain embodiment of the present invention, and other drawings can be obtained from those skilled in the art without any inventive labor.

1 is a schematic structural diagram of a server according to an embodiment of the present invention;

2 is a schematic structural diagram of a memory fault isolation device according to Embodiment 1 of the present invention;

FIG. 3 is a flowchart of a memory fault isolation method according to Embodiment 2 of the present invention.

detailed description

In order to make the objectives, technical solutions, and advantages of the embodiments of the present invention more clearly, the technical solutions in the embodiments of the present invention are clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. It is apparent that the described embodiments are a part of the embodiments of the invention, and not all of the embodiments. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.

The method of the embodiment of the invention is mainly applied to an operating system kernel and a VMM kernel having a page table mapping mechanism and an exception handling mechanism. FIG. 1 is a schematic structural diagram of a server according to an embodiment of the present invention. As shown in FIG. 1 , the server includes: a memory, a processor, and an NVM. The processor mainly refers to a central processing unit (CPU), and a processor. It includes a memory management module and an exception handling module. Memory Management Module: Implement virtual machine or process virtual address space and physical address space mapping. The exception handling module is configured to perform related processing on the abnormality triggered by the CPU, and is used to process the memory fault in the embodiment of the present invention. The memory and NVM are two separate physical hardware. The NVM is used to store the physical address of the faulty memory space. Even if the server is powered off, the data stored in the NVM will not be lost. The memory is generally a random access memory (RAM) or a dynamic random access memory (DRAM). The memory generally exists in the form of a memory stick. The size of a single memory stick in the memory of the server is 8GB, 16GB and above, the replacement cost is high, and the virtual machine or process is a business running bearer.

2 is a schematic structural diagram of a memory fault isolation device according to Embodiment 1 of the present invention. The memory fault isolation device provided in this embodiment may be integrated in a server. As shown in FIG. 2, the memory fault isolation device provided in this embodiment is provided. The method includes an exception processing module 11 and a memory management module 12.

The exception processing module 11 is configured to monitor the state of the memory identified by the original physical address mapped by the virtual address of the process, where the page to which the virtual address belongs has a mapping relationship with the physical address block to which the original physical address belongs, and the physical address block Used to identify a contiguous memory interval allocated to a process.

If the memory identified by the original physical address fails, the exception handling module 11 is further configured to mark the status of the physical address block to which the original physical address belongs as a fault.

a memory management module 12, configured to re-allocate a physical address block from a non-faulty memory for a page block to which the virtual address belongs;

The exception handling module 11 is further configured to synchronize data in the memory interval identified by the physical address block to which the original physical address belongs to the memory interval identified by the reallocated physical address block.

When the user starts the program, the server's operating system creates a process for the application, memory The management module 12 is further configured to allocate an initial memory space for the process, that is, allocate a page for the process. In the page table mapping mechanism, each page includes a virtual address interval, and each page corresponds to one physical address block, and each physical address The block is used to identify a contiguous memory interval. Each virtual address in the page corresponds to a physical address in the physical address block, and the mapping relationship between the virtual address and the physical address is maintained by the memory management module 12 of the server.

In this embodiment, the memory management module 12 allocates initial memory for the process, which is: first, the information of the fault physical address block saved in the NVM is read, and then, the non-fault memory is determined according to the information of the fault physical address block, and the non-fault memory is Memory in memory other than the failed physical address block; finally, the initial memory is allocated to the process from non-faulty memory. In this embodiment, the memory management module 12 implements the mapping of the virtual address and the physical address in the form of a page table, and the page table organization form is managed in a level 1 or multi-level manner, for example, the Linux kernel adopts a level 3 page table management method, and each The size of the page may be 4K, 2M or 1gbps, etc. In this embodiment, the management form of the page table is not limited.

The memory management module 12 can avoid the memory identified by the failed physical address block when the initial memory is allocated for the process by reading the information of the failed physical address block in the NVM. During the subsequent running of the process, the exception handling module 11 monitors the state of the memory identified by the original physical address mapped by the virtual address of the process, and the state of the memory identified by the original physical address includes: a fault state and a normal state. Specifically, the memory management module 12 receives the memory access request sent by the process, where the memory access request includes the virtual address of the process, and the memory management module 12 maps the virtual address to the original physical address, and the virtual address and the original physical address. The correspondence is stored in the Translation Look-aside Buffer (TLB). Then, the memory management module 12 sends the original physical address to the memory controller through the memory bus, and the memory controller reads the data according to the original physical address, and if the abnormality cannot read the data in the memory identified by the original physical address, the memory control The device issues an exception access instruction through the memory bus, and the exception handling module 11 determines, according to the abnormal access instruction, that the memory identified by the original physical address is faulty.

In the page table mapping mechanism, a page table is usually used as the minimum operation unit, and a page table corresponds to a physical address block. Therefore, when the memory identified by the original physical address fails, the exception handling module 11 belongs to the original physical address. The status of the physical address block is marked as a failure, isolating the physical address block to which the original physical address belongs. Usually the information of the failed physical address block is recorded in the NVM, so that even if the server is powered down, the information of the failed physical address block stored in the NVM is also It will not be lost. After the server is powered on, the memory management module 12 can still read the information of the fault physical address block from the NVM. When the initial memory is allocated for the process, the memory interval identified by the fault physical address block can be avoided.

In this embodiment, in order to ensure that the running service is not interrupted, the memory management module 12 reallocates the physical address block for the page to which the virtual address belongs, and the physical address mapped by the virtual address changes before and after the allocation, but for the upper application In this case, the virtual address of the process corresponding to the application does not change. As long as the virtual address does not change, the process is not interrupted, thereby ensuring that the user's service is not interrupted.

The memory management module 12 specifically allocates a physical address block for the page to which the virtual address belongs by first: first, determining non-faulty memory according to the information of the failed physical address block stored in the NVM, and the non-faulty memory is the memory-depleted physical address block in the memory. Memory outside of the identified memory. Then, based on the virtual address and the process number of the process, the physical address block is reassigned from the non-faulty memory for the page to which the virtual address belongs. The memory management module 12 re-allocates the physical address block from the non-fault memory to the page to which the virtual address belongs according to the virtual address and the process ID of the process. Specifically, the virtual address belongs to the virtual address and the process ID of the process. The page is then selected from the non-faulty memory to establish a mapping relationship between the page to which the virtual address belongs and the selected physical address block.

After the memory management module 12 reallocates the physical address block for the page block to which the virtual address belongs, the exception processing module 11 is further configured to synchronize the data in the memory interval identified by the physical address block to which the original physical address belongs to the reallocated physical unit. In the memory interval identified by the address block. Specifically, when the memory identified by the original physical address fails, the memory controller generates an abnormal instruction, and the exception processing module 11 performs data recovery according to the abnormal access instruction, where the abnormal access instruction includes an operation code and an operand, and the operation code indicates the The operation type of the exception access instruction is a read operation or a write operation, and the operand includes information of a register to be accessed by a read operation or a write operation, and a physical address of data to be accessed by a read operation or a write operation. If the operation type is a write operation, the exception handling module 11 writes the data to be written in the memory identified by the original physical address into the corresponding location in the reallocated physical address block. If the operation type is a read operation, the exception processing module 11 can perform data recovery on the data to be read according to the backup data of the data to be read in the memory identified by the original physical address, and if the data to be read can be recovered according to the backup data, Then, the recovered data to be read is copied to the corresponding position in the reallocated physical address block. If you want to read If the data cannot be recovered, the exception handling module 11 resets the process. Different from the prior art, in this embodiment, only the currently monitored process needs to be reset without interrupting other processes running on the server. In the prior art, once a process has a memory failure during operation, the server needs to be reset, and all processes running on the server are interrupted, causing all services to be interrupted.

It should be noted that, in this embodiment, although the status of the original physical address block is marked as a fault, in the memory interval identified by the original physical address block, only the memory identified by the original physical address is faulty, and the original physical address block is faulty. The other memory in the identified memory interval is normal. Therefore, when the exception processing module 11 synchronizes the data in the memory interval identified by the physical address block to which the original physical address belongs to the memory interval identified by the reallocated physical address block, The data in the normal memory can be directly copied from the memory interval identified by the original physical address block to the memory interval identified by the reallocated physical address block.

Optionally, if the server is restarted, the exception processing module 11 is further configured to perform fault detection on the memory, and specifically detect whether the faulty memory block in the NVM has returned to normal. If it is detected that the physical address block to which the original physical address belongs has returned to normal, Then, the exception handling module 11 marks the state of the physical address block to which the original physical address belongs as non-fault, and the memory interval identified by the original physical address block can be used for memory allocation. If the physical address block to which the original physical address belongs cannot be recovered, the exception handling module 11 permanently isolates the original physical address block, and the memory interval identified by the original physical address block cannot be used for memory allocation.

In the device of this embodiment, during the running of the process, the exception handling module monitors the state of the memory identified by the original physical address mapped by the virtual address of the process. If the memory identified by the original physical address fails, the exception handling module will The status of the physical address block to which the physical address belongs is marked as a fault to isolate the faulty memory space online. And the exception handling module calls the memory management module to allocate a physical address block for the page to which the virtual address belongs, and synchronizes the data in the memory interval identified by the physical address block to which the original physical address belongs to the memory interval identified by the reallocated physical address block. in. In the device of this embodiment, the virtual address of the process is unchanged during the whole process, so that the service is not interrupted, and the faulty memory space is isolated online.

FIG. 3 is a flowchart of a memory fault isolation method according to Embodiment 2 of the present invention. The method in this embodiment is performed by a server. As shown in FIG. 2, the method in this embodiment may include the following steps:

Step 101: Monitor the state of the memory identified by the original physical address mapped by the virtual address of the process, where the page to which the virtual address belongs has a mapping relationship with the physical address block to which the original physical address belongs, and the physical address block is used to identify the process assigned to the process. A continuous memory interval.

When a user launches a program, the server's operating system creates a process for the application, and the server allocates an initial memory space for the process. The server allocates initial memory for the process, which is: first, the information of the fault physical address block saved in the NVM is read, and then the non-fault memory is determined according to the information of the fault physical address block, and the non-fault memory is the fault physical address block in the memory. External memory; finally, the initial memory is allocated to the process from non-faulty memory. In this embodiment, the memory management module implements mapping of the virtual address and the physical address in the form of a page table, and the page table organization form is managed in a level 1 or multi-level manner, for example, the Linux kernel adopts a level 3 page table management method, and each The size of the page may be 4K, 2M or 1gbps, etc. In this embodiment, the management form of the page table is not limited.

In this embodiment, by reading the information of the fault physical address block in the NVM, the server can avoid the memory identified by the fault physical address block when the initial memory is allocated for the process. During the subsequent running of the process, the server monitors the state of the memory identified by the original physical address mapped by the virtual address of the process, and the state of the memory identified by the original physical address includes: a fault state and a normal state. Specifically, the memory management module of the server receives the memory access request sent by the process, where the memory access request includes a virtual address of the process, and the memory management module maps the virtual address to the original physical address, and the virtual address and the original physical address are The corresponding relationship is stored in the TLB. Then, the memory management module sends the original physical address to the memory controller through the memory bus, and the memory controller reads the data according to the original physical address, and if the abnormality cannot read the data in the memory identified by the original physical address, the memory controller An abnormal access instruction is issued through the memory bus, and the server determines, according to the abnormal access instruction, that the memory identified by the original physical address is faulty.

Step 102: If the memory identified by the original physical address fails, the status of the physical address block to which the original physical address belongs is marked as a fault.

In this embodiment, when the memory identified by the original physical address fails, the server marks the status of the physical address block to which the original physical address belongs as a fault, and isolates the physical address block to which the original physical address belongs. Usually, the information of the fault physical address block is recorded in the NVM, so that even if the server is powered off, the information of the fault physical address block saved in the NVM is not lost, and the server can still read the fault physics from the NVM after power-on. Address block information, at the beginning of the process allocation When starting memory, you can avoid the memory interval identified by the failed physical address block.

Step 103: Reassign the physical address block from the non-fault memory to the page to which the virtual address belongs, and synchronize the data in the memory interval identified by the physical address block to which the original physical address belongs to the memory identified by the reallocated physical address block. In the interval.

In this embodiment, in order to ensure that the running service is not interrupted, the server re-allocates the physical address block to the page to which the virtual address belongs, and the physical address mapped by the virtual address changes before and after the allocation, but for the upper layer application, The virtual address of the process corresponding to the application does not change. As long as the virtual address does not change, the process will not be interrupted, thereby ensuring that the user's service is not interrupted.

Specifically, the server re-allocates the physical address block to the page to which the virtual address belongs by first: first, determining the non-faulty memory according to the information of the failed physical address block saved in the NVM, and the non-faulty memory is the block of the physical address in the memory. Memory outside of the identified memory. Then, based on the virtual address and the process number of the process, the physical address block is reassigned from the non-faulty memory for the page to which the virtual address belongs. The server re-allocates the physical address block from the non-faulty memory to the page to which the virtual address belongs according to the virtual address and the process ID of the process, where: first, the page to which the virtual address belongs is obtained according to the virtual address and the process ID of the process; The physical address block is selected from the non-faulty memory to establish a mapping relationship between the page to which the virtual address belongs and the selected physical address block.

After reallocating the physical address block for the page block to which the virtual address belongs, the server synchronizes the data in the memory interval identified by the physical address block to which the original physical address belongs to the memory interval identified by the reallocated physical address block. For the specific synchronization mode, refer to the related description of the first embodiment, and details are not described herein again.

In the method of the embodiment, during the running of the process, the server monitors the state of the memory identified by the original physical address mapped by the virtual address of the process. If the memory identified by the original physical address fails, the server belongs to the original physical address. The status of the physical address block is marked as faulty to isolate the faulty memory space online. And the server allocates a physical address block to the page to which the virtual address belongs, and synchronizes the data in the memory interval identified by the physical address block to which the original physical address belongs to the memory interval identified by the reallocated physical address block. In the method of this embodiment, the virtual address of the process is unchanged during the whole process, so that the service is not interrupted, and the faulty memory space is isolated online.

On the basis of the second embodiment, if the server is restarted, the server detects the fault of the memory, and the server specifically detects whether the faulty memory block in the NVM has returned to normal. If the server detects that the physical address block to which the original physical address belongs has returned to normal, Then, the server marks the status of the physical address block to which the original physical address belongs as non-fault, and the memory interval identified by the original physical address block can be used for memory allocation. If the physical address block to which the original physical address belongs cannot be recovered, the server permanently isolates the original physical address block, and the memory interval identified by the original physical address block cannot be used for memory allocation.

A person skilled in the art can understand that all or part of the steps of implementing the above method embodiments may be completed by using hardware related to the program instructions. The foregoing program may be stored in a computer readable storage medium, and the program is executed when executed. The foregoing steps include the steps of the foregoing method embodiments; and the foregoing storage medium includes: a medium that can store program codes, such as a ROM, a RAM, a magnetic disk, or an optical disk.

Finally, it should be noted that the above embodiments are merely illustrative of the technical solutions of the present invention, and are not intended to be limiting; although the present invention has been described in detail with reference to the foregoing embodiments, those skilled in the art will understand that The technical solutions described in the foregoing embodiments may be modified, or some or all of the technical features may be equivalently replaced; and the modifications or substitutions do not deviate from the technical solutions of the embodiments of the present invention. range.

Claims

A memory fault isolation device, comprising:

An exception processing module, configured to monitor a state of a memory that is identified by a physical address mapped by a virtual address of the process, where a page to which the virtual address belongs has a mapping relationship with a physical address block to which the original physical address belongs, The physical address block is used to identify a contiguous memory interval allocated to the process;

The exception handling module is further configured to mark a status of the physical address block to which the original physical address belongs as a fault, if the memory identified by the original physical address is faulty;

a memory management module, configured to re-allocate a physical address block from a non-faulty memory for a page block to which the virtual address belongs;

The exception processing module is further configured to synchronize data in a memory interval identified by a physical address block to which the original physical address belongs to a memory interval identified by the reallocated physical address block.
The apparatus according to claim 1, wherein the exception processing module is further configured to: save the information of the physical address block marked as a fault into a non-volatile memory;

The memory management module is specifically configured to:

Determining the non-faulty memory according to information of a failed physical address block stored in the nonvolatile memory;

Reassigning a physical address block from the non-faulty memory to a page to which the virtual address belongs according to the virtual address and the process ID of the process.
The device according to claim 2, wherein the memory management module is specifically configured to:

Obtaining, according to the virtual address and a process ID of the process, a page to which the virtual address belongs;

And selecting a physical address block from the non-faulty memory, and establishing a mapping relationship between the page to which the virtual address belongs to the selected physical address block.
The device according to claim 1, wherein the memory management module is further configured to:

Reading information of a failed physical address block held in the nonvolatile memory when initial memory is allocated to the process;

Determining non-faulty memory according to the information of the fault physical address block, the non-faulty memory a memory other than the fault physical address block in the memory;

The initial memory is allocated to the process from the non-faulty memory.
The apparatus according to any one of claims 1 to 4, wherein the exception processing module is further configured to:

When the server is restarted, the memory is fault detected;

If it is detected that the physical address block to which the original physical address belongs has returned to normal, the status of the physical address block to which the original physical address belongs is marked as non-fault.
A memory fault isolation method, comprising:

The state of the memory identified by the original physical address to which the virtual address of the process is mapped, wherein the page to which the virtual address belongs has a mapping relationship with the physical address block to which the original physical address belongs, and the physical address block is used to identify a contiguous memory interval allocated to the process;

If the memory identified by the original physical address fails, the status of the physical address block to which the original physical address belongs is marked as a fault;

Relocating a physical address block from the non-faulty memory for the page block to which the virtual address belongs, and synchronizing data in the memory interval identified by the physical address block to which the original physical address belongs to the reallocated physical address block In the identified memory interval.
The method of claim 6 wherein the method further comprises:

Saving the information of the physical address block marked as failed to the non-volatile memory;

Reassigning a physical address block to the page to which the virtual address belongs from the non-faulty memory includes:

Determining the non-faulty memory according to information of a failed physical address block stored in the nonvolatile memory;

Reassigning a physical address block from the non-faulty memory to a page to which the virtual address belongs according to the virtual address and the process ID of the process.
The method according to claim 7, wherein the reallocating the physical address block for the page to which the virtual address belongs from the non-faulty memory according to the virtual address and the process number of the process, including :

Obtaining, according to the virtual address and a process ID of the process, a page to which the virtual address belongs;

And selecting a physical address block from the non-faulty memory, and establishing a mapping relationship between the page to which the virtual address belongs to the selected physical address block.
The method of claim 6 wherein the method further comprises:

Reading information of a failed physical address block held in the nonvolatile memory when initial memory is allocated to the process;

Determining non-faulty memory according to the information of the fault physical address block, where the non-fault memory is a memory other than the fault physical address block in the memory;

The initial memory is allocated to the process from the non-faulty memory.
The method of any of claims 6-9, wherein the method further comprises:

When the server is restarted, the memory is fault detected;

If it is detected that the physical address block to which the original physical address belongs has returned to normal, the status of the physical address block to which the original physical address belongs is marked as non-fault.