CN111737176A

CN111737176A - PCIE data-based synchronization device and driving method

Info

Publication number: CN111737176A
Application number: CN202010392108.5A
Authority: CN
Inventors: 林涛
Original assignee: Fuzhou Rockchip Electronics Co Ltd
Current assignee: Fuzhou Rockchip Electronics Co Ltd
Priority date: 2020-05-11
Filing date: 2020-05-11
Publication date: 2020-10-02
Anticipated expiration: 2040-05-11
Also published as: CN111737176B

Abstract

The invention provides a PCIE data-based synchronization device and a driving method, wherein the driving method comprises the following steps: the method comprises the following steps: an MMU unit acquires an operation request to an IOVA; the MMU unit converts the operation request of the IOVA into a virtual channel in the VF unit; the PF unit judges whether a responding virtual channel with higher priority than the virtual channel exists, if so, the PF unit suspends responding to the virtual channel, and enables the virtual channel to enter a waiting queue, and after the response of the virtual channel with the higher priority is finished, the PF unit responds to the operation request of the virtual channel, and if not, the PF unit responds to the operation request of the virtual channel. The technical scheme can ensure the data orderliness, avoid consuming excessive bandwidth of the flow and realize the flow control of each thread.

Description

PCIE data-based synchronization device and driving method

Technical Field

The invention relates to the field of computer communication, in particular to a PCIE data-based synchronization device and a driving method.

Background

PCIe (peripheral component interconnect express) may map a PCIe domain address to a chip storage domain address through an ATS (address translation service), and a Memory Management Unit (MMU) of the chip may map a segment of the storage domain address in a kernel space and a user state space of an operating system to form a corresponding IOVA segment. Because of the data management for the address mapping mode, if different software and threads need to access the region at the same time, a signal synchronization mechanism, such as a software lock, is inevitably required to be added. Such a mechanism, while guaranteeing data ordering, may consume CPU bandwidth unnecessarily, and lock contention may actually reduce the bandwidth of the entire flow. And flow control cannot be performed on each thread, and hard real-time indexes of partial access and the like cannot be guaranteed.

Disclosure of Invention

Therefore, a PCIE data-based synchronization apparatus and a driving method are needed to solve the problem of the utilization rate of the CPU bandwidth.

In order to achieve the above object, the inventor provides a method for driving a synchronization apparatus based on PCIE data, including the following steps:

an MMU unit acquires an operation request to an IOVA;

the MMU unit converts the operation request of the IOVA into a virtual channel in the VF unit;

the PF unit judges whether a responding virtual channel with higher priority than the virtual channel exists, if so, the PF unit suspends responding to the virtual channel, and enables the virtual channel to enter a waiting queue, and after the response of the virtual channel with the higher priority is finished, the PF unit responds to the operation request of the virtual channel, and if not, the PF unit responds to the operation request of the virtual channel.

Further, when the virtual channel is entered into the waiting queue, the method further comprises the following steps:

and the PF unit judges whether the virtual channels in the waiting queue exceed preset time, and if so, the PF unit promotes the priority level of the virtual channels to the highest priority level.

Further, if yes, the PF unit raises the priority of the virtual channel to the highest priority, further including the steps of:

and the VF unit controls the virtual channel which is lifted to the highest priority level to carry out single-stroke transmission.

Further, the method also comprises the following steps:

after the virtual channel with the highest priority is promoted to perform single transmission, the VF unit restores the original priority level of the virtual channel.

Further, the method also comprises the following steps:

the PF unit judges whether a responding virtual channel with lower priority than the virtual channel exists, if so, the PF unit responds to the operation request of the virtual channel, the virtual channel with lower priority enters a waiting queue, after the operation request response of the virtual channel is finished, the PF unit responds to the suspended operation request with lower priority, and if not, the PF unit responds to the operation request of the virtual channel.

the PF unit controls the virtual channel to enter the buffer of the VF unit.

and the real-time control module of the MMU unit closes the translation action of the access request of the corresponding IOVA of the virtual channel.

Further, the method also comprises the following steps:

a page fault message is generated, and the real-time control module shelves the page fault message.

Further, the method also comprises the following steps:

and the real-time control module responds to the translation action of the access request of the corresponding IOVA of the virtual channel and eliminates page fault messages.

The inventor provides a synchronization device based on PCIE data, including an MMU unit, a VF unit, and a PF unit, where the MMU unit, the VF unit, and the PF unit are configured to execute a synchronization management method based on PCIE data according to any one of the embodiments.

Different from the prior art, the technical scheme enables the high-priority virtual channel to operate firstly, and the low-priority virtual channel enters the waiting queue when the high-priority virtual channel responds. The virtual channels with low priority in the waiting queue also determine the order of response according to the order of the priority from high to low. The method can ensure the orderliness of data, avoid consuming excessive bandwidth of the flow and realize the flow control of each thread.

Drawings

Fig. 1 is an architecture diagram of a synchronization management apparatus based on PCIE data according to this embodiment.

Detailed Description

To explain technical contents, structural features, and objects and effects of the technical solutions in detail, the following detailed description is given with reference to the accompanying drawings in conjunction with the embodiments.

Referring to fig. 1, the present embodiment is a method for synchronous management based on PCIE data, including the following steps: a Memory Management Unit (MMU) Unit obtains an operation request for an IOVA (input/output virtual address), and the MMU Unit converts the operation request for the IOVA into a virtual channel in a virtual-Function (VF) Unit. A Physical channel (PF) unit determines whether there is a responding virtual channel with a higher priority than the virtual channel. If yes, the PF unit suspends responding to the virtual channel, enables the virtual channel to enter a waiting queue, and responds to the operation request of the virtual channel after the response of the high-priority virtual channel is finished. And if not, the PF unit responds to the operation request of the virtual channel. It should be noted that the configuration priority may be performed by a Quality of service (QOS) module in the PF unit.

According to the technical scheme, the PF unit configures the priority for each virtual channel, and runs the operation requests (such as writing or reading) of the application or the thread according to the sequence of the priority from high to low. And the high-priority virtual channel is enabled to operate firstly, and the low-priority virtual channel enters a waiting queue when the high-priority virtual channel responds. The virtual channels with low priority in the waiting queue also determine the order of response according to the order of the priority from high to low. The method can ensure the orderliness of data, avoid consuming excessive bandwidth of the flow and realize the flow control of each thread.

An MMU unit of a chip maps a Physical Address (PA) of a storage domain to a plurality of virtual addresses (IOVA) of the storage domain, and then allocates a virtual-Function (VF) to each application or thread that needs to access the Physical Address. Referring to fig. 1, two physical addresses, PA1 and PA2, two IOVA1 and IOVA2, and two virtual channels, VF1 and VF2 are shown. And one virtual channel corresponds to one IOVA through the ATS unit.

The virtual channel has the same address configuration for the PCIe domain, so that the virtual channel corresponds to the physical address PA after the ATS, but the IOVA is different after the virtual channel passes through the MMU, so that the addresses seen from different applications or threads are different, the virtual channel is considered to be different devices, and software does not need to additionally increase a locking mechanism. The software efficiency can be improved, and the bandwidth utilization rate can be fully improved.

In this embodiment, a high priority virtual channel may interrupt the operation of a low priority virtual channel. If a virtual channel is configured with the highest priority (also referred to as real-time transmission mode), other ongoing transmissions with lower priority are stopped. Specifically, the PF unit determines whether there is a responding virtual channel with a lower priority than the virtual channel, and if so, the PF unit responds to the operation request of the virtual channel, and allows the virtual channel with the lower priority to enter a waiting queue, and after the operation request response of the virtual channel is finished, responds to the suspended operation request with the lower priority, and if not, the PF unit responds to the operation request of the virtual channel.

In this embodiment, in order to prohibit the low-priority virtual channel from further receiving the transmission data sent by the software, when the low-priority virtual channel is buffered in the buffer of the VF unit to form the wait queue, the real-time control unit (thread unit) of the memory management unit closes the translation operation of the access request of the virtual channel corresponding to the IOVA. It should be noted that the translation action refers to a process from the correspondence between the physical address and the IOVA to the virtual channel, and by limiting the process, the virtual channel with low priority can be inhibited from receiving the transmission data sent by the software.

In a further embodiment, if a page fault message (page fault) is generated after shutdown, the page fault message is not sent to upper layer software. And the real-time control module responds to the translation action of the access request of the corresponding IOVA of the virtual channel and eliminates page fault messages.

In this embodiment, after the low-priority virtual channel is preempted for a long time by other high-priority virtual channels or virtual channels requiring real-time performance, in order to prevent the user status from being abnormal, a maximum timeout transmission time (preset time) is configured for the virtual channels in the wait queue. The method can be carried out by depending on a quality management unit of a memory management unit, and a timer can be used for exhausting the maximum overtime transmission time and sending out a corresponding request after the maximum overtime transmission time is exhausted. And the PF unit judges whether the virtual channels in the waiting queue exceed preset time, and if so, the PF unit promotes the priority level of the virtual channels to the highest priority level. Then automatically reducing the priority again, waiting for queuing and then allowing the transmission of the queue or waiting for the next maximum timeout time to be exhausted again and then being temporarily allowed to insert a transmission. If not, the queue is continuously arranged in the waiting queue. The method is an emergency measure for the virtual channel with the low priority level, so that the abnormal user state caused by long-time non-running of the process or thread with the low priority level is avoided.

In this embodiment, in order to maintain the ordering of the same priority level, the quality management unit of the memory management unit uses a rotation method to allocate the time slices to a plurality of virtual channels at the same priority level. High priority applications may interrupt low priority in the quality management unit and also occupy a larger transmission time slice. The same-level priority is subjected to a round robin strategy by allocating time slices by the quality management unit. Let a virtual channel run on the CPU for a time slice, such as 100ms (milliseconds) of time, this 100ms interval is called a time slice. When a process runs out of time slices allocated to it, the scheduler stops the process and puts it at the end of the ready queue, letting the next virtual channel also execute a time slice. Of course, the time slice for each virtual channel may be different and set according to the actual requirements of the program or process. Therefore, a plurality of processes or programs can be responded by the system in time, and the running efficiency is improved.

It should be noted that, only two connections between the physical addresses, the IOVA, and the virtual channels are illustrated herein, for example, three, four, or even more connections between the physical addresses, the IOVA, and the virtual channels are still a connection relationship where one physical address corresponds to one IOVA, and one IOVA corresponds to one virtual channel.

The present embodiment provides a synchronization management apparatus based on PCIE data, including an MMU unit, a VF unit, and a PF unit, where the MMU unit, the VF unit, and the PF unit are configured to execute any one of the synchronization management methods based on PCIE data in this embodiment.

It should be noted that, although the above embodiments have been described herein, the invention is not limited thereto. Therefore, based on the innovative concepts of the present invention, the technical solutions of the present invention can be directly or indirectly applied to other related technical fields by making changes and modifications to the embodiments described herein, or by using equivalent structures or equivalent processes performed in the content of the present specification and the attached drawings, which are included in the scope of the present invention.

Claims

1. A driving method of a synchronization device based on PCIE data is characterized by comprising the following steps:

an MMU unit acquires an operation request to an IOVA;

2. The method according to claim 1, wherein when the virtual channel is allowed to enter the wait queue, the method further comprises the following steps:

3. The method according to claim 2, wherein if the PF unit has raised the priority level of the virtual channel to the highest priority level, the method further comprises the following steps:

4. The method according to claim 3, further comprising the following steps:

5. The method according to claim 1, further comprising the following steps:

6. The method according to claim 1, wherein when the virtual channel is allowed to enter the wait queue, the method further comprises the following steps:

the PF unit controls the virtual channel to enter the buffer of the VF unit.

7. The method according to claim 1, wherein when the virtual channel is allowed to enter the wait queue, the method further comprises the following steps:

8. The method according to claim 6, further comprising the following steps:

9. The method according to claim 8, further comprising the following steps:

10. A synchronization apparatus based on PCIE data, comprising an MMU unit, a VF unit, and a PF unit, where the MMU unit, the VF unit, and the PF unit are configured to execute the synchronization management method based on PCIE data according to any one of claims 1 to 9.