WO2019188175A1

WO2019188175A1 - Deadlock avoidance method and deadlock avoidance device

Info

Publication number: WO2019188175A1
Application number: PCT/JP2019/009627
Authority: WO
Inventors: 祐次郎谷; 一嘉石渡
Original assignee: 株式会社デンソー; 株式会社エヌエスアイテクス
Priority date: 2018-03-30
Filing date: 2019-03-11
Publication date: 2019-10-03
Also published as: JP7064367B2; JP2019179412A

Abstract

A processor for executing a program in a graph structure is provided with: a graph structure analysis unit (501) that extracts a provisional deadlock point where input/output buffers loop in buffer units and is seemingly in deadlock; and a deadlock elimination unit (502) that eliminates the deadlock in the provisional deadlock point. The deadlock elimination unit (502) identifies a processing start node at which processing is started in the provisional deadlock point, and carries out a start identification process for executing identification so as to start a series of processing from the processing start node.

Description

Deadlock avoidance method and deadlock avoidance device

Cross-reference of related applications

This application is based on Japanese Patent Application No. 2018-068429 filed on March 30, 2018, and claims the benefit of its priority. Which is incorporated herein by reference.

The present disclosure relates to a deadlock avoidance method and a deadlock avoidance device that avoid a deadlock in a processor that executes a program described in a graph structure.

In a program, a deadlock avoidance method according to the situation where a deadlock occurs is proposed in order to avoid a deadlock in which two or more processing units wait for the end of each other's processing and as a result no processing can proceed further Has been. Patent Document 1 below discloses a method for avoiding a deadlock that occurs with interrupt processing.

International Publication No. 2012/120573

Patent Document 1 discloses that a deadlock is avoided in a processor system that performs processing on a coprocessor by interrupt processing during execution of a coprocessor instruction.

However, in the invention described in Patent Document 1, it is not possible to avoid a deadlock unique to a processor that operates by dividing a program into data and processing into a graph structure and reading it.

This disclosure is intended to provide a deadlock avoidance method and a deadlock avoidance device that avoid a deadlock in a processor that executes a program described in a graph structure.

The present disclosure relates to a deadlock avoidance method for avoiding a deadlock in a processor that executes a program described in a graph structure. In a program having a graph structure, an input / output buffer loops in units of buffers and apparently becomes a deadlock. A graph structure analysis step for extracting a provisional deadlock location; and a deadlock elimination step for eliminating a deadlock at the provisional deadlock location. In the deadlock elimination step, a process start node for starting the process at the provisional deadlock point is specified, and a start specifying process for specifying a series of processes to be started from the process start node is executed.

The present disclosure also relates to a deadlock avoidance device that avoids deadlock in a processor that executes a program described in a graph structure. In a program having a graph structure, an input / output buffer is looped in units of buffers and apparently deadlocks. A graph structure analysis unit for extracting a provisional deadlock location, and a deadlock elimination unit for eliminating a deadlock at the provisional deadlock location. The deadlock elimination unit identifies a process start node that starts processing at the provisional deadlock location, and executes a start identification process that identifies a series of processes to be started from the process start node.

By specifying a process start node and processing so that a series of processes start from that process start node, even if the input / output buffer is looped in units of buffers, define an appropriate process level Can be executed in parallel while avoiding apparent deadlocks.

FIG. 1 is a diagram for explaining parallel processing which is a premise of the present embodiment. FIG. 2 is a diagram showing a system configuration example for executing the parallel processing shown in FIG. FIG. 3 is a diagram illustrating a configuration example of the DFP used in FIG. FIG. 4 is a diagram for explaining a functional configuration example of the compiler. FIG. 5 is a diagram for explaining an example of deadlock avoidance. FIG. 6 is a diagram for explaining an example of deadlock avoidance. FIG. 7 is a diagram for explaining an example of deadlock avoidance. FIG. 8 is a diagram for explaining an example of deadlock avoidance.

Hereinafter, the present embodiment will be described with reference to the accompanying drawings. In order to facilitate the understanding of the description, the same constituent elements in the drawings will be denoted by the same reference numerals as much as possible, and redundant description will be omitted.

FIG. 1A shows program code described in a graph structure, FIG. 1B shows the state of a thread, and FIG. 1C shows the state of parallel processing. .

As shown in FIG. 1A, the program to be processed in this embodiment has a graph structure in which data and processing are divided. This graph structure maintains the task parallelism and graph parallelism of the program.

1) When automatic vectorization and graph structure extraction are performed on the program code shown in FIG. 1A by a compiler, a large number of threads as shown in FIG. 1B can be generated.

1) Parallel execution as shown in FIG. 1C can be performed on a large number of threads shown in FIG. 1B by dynamic register placement and thread scheduling by hardware. By dynamically allocating register resources during execution, a plurality of threads can be executed in parallel for different instruction streams.

Next, a data processing system 2, which is a system configuration example including a DFP (Data Flow Processor) 10 as an accelerator for performing dynamic register placement and thread scheduling, will be described with reference to FIG.

The data processing system 2 includes a DFP 10, an event handler 20, a host CPU 21, a ROM 22, a RAM 23, an external interface 24, and a system bus 25. The host CPU 21 is an arithmetic unit that mainly performs data processing. The host CPU 21 supports the OS. The event handler 20 is a part that generates an interrupt process.

ROM 22 is a read-only memory. The RAM 23 is a read / write memory. The external interface 24 is an interface for exchanging information with the outside of the data processing system 2. The system bus 25 is for transmitting and receiving information between the DFP 10, the host CPU 21, the ROM 22, the RAM 23, and the external interface 24.

The DFP 10 is positioned as an individual master provided to cope with the heavy computation load of the host CPU 21. The DFP 10 is configured to support the interrupt generated by the event handler 20.

Next, the DFP 10 will be described with reference to FIG. As shown in FIG. 3, the DFP 10 includes a command unit 12, a thread scheduler 14, an execution core 16, and a memory subsystem 18.

The command unit 12 is configured to be able to communicate information with the config interface. The command unit 12 also functions as a command buffer.

The thread scheduler 14 is a part that schedules processing of a large number of threads as exemplified in FIG. The thread scheduler 14 can perform scheduling across threads.

The execution core 16 has four processing elements, PE # 0, PE # 1, PE # 2, and PE # 3. The execution core 16 has a number of pipelines that can be scheduled independently.

The memory subsystem 18 includes an arbiter 181, an L1 cache 18a, and an L2 cache 18b. The memory subsystem 18 is configured to allow information communication between the system bus interface and the ROM interface.

Subsequently, a compiler 50 as an example of the deadlock avoidance device of the present disclosure will be described with reference to FIG. The embodiment of the deadlock avoidance device according to the present disclosure is not limited to the compiler 50, and if the program described in the graph structure illustrated in FIG. The data processing system 2 as described above or the DFP 10 as shown in FIG.

The compiler 50 includes a graph structure analysis unit 501 and a deadlock elimination unit 502 as functional components.

The graph structure analysis unit 501 is a part for extracting a provisional deadlock portion where an input / output buffer loops in buffer units and apparently becomes a deadlock in a program described in a graph structure.

This will be described with reference to a process as shown in FIG. In the process shown in FIG. 5, the process of func1 [0] is executed using the data of buf0, and the execution result is held in buf1 [0]. Subsequently, the process of func2 [0] is executed using the data of buf1 [0], and the execution result is held in buf2 [0]. Subsequently, the process of func1 [1] is executed using the data of buf2 [0], and the execution result is held in buf1 [1]. Subsequently, the process of func2 [1] is executed using the data of buf1 [1], and the execution result is held in buf2 [1]. This process is performed N times, and the final calculation result func2 [N] is used as the final output.

If such processing is to be realized for a processor that can be executed in parallel, attention will be paid to the buffer portion of FIG. 5, and it seems that a deadlock has occurred between buf2 and buf1, so parallel processing Cannot be executed.

However, as described above, by appropriately describing the process and changing the buffer index, a process avoiding deadlock as shown in FIG. 6 can be performed. Therefore, the graph structure analysis unit 501 extracts a location as shown in FIG. 5 as a temporary deadlock location where the input / output buffer loops in buffer units and apparently becomes a deadlock (graph structure analysis step).

The deadlock elimination unit 502 is a part for eliminating deadlock at the provisional deadlock location. The deadlock elimination unit 502 identifies a process start node that starts processing at the provisional deadlock location, and executes a start identification process that identifies a series of processes to be started from the process start node (deadlock resolution step). . In this way, the deadlock canceling unit 502 executes the start specifying process, so that the deadlock state is canceled and the process illustrated in FIG. 6 can be performed.

As an example of the start specifying process, as shown in FIG. 7, in the start specifying process, start instruction information for the function start node func1 is added so that the process starts from the process start node func1. A kick command is used as the start instruction information.

As another example of the start specifying process, as shown in FIG. 8, in the start specifying process, process order information for instructing the order of the process is given so as to start the process from func1, which is the process start node.

As described above, the present embodiment is a deadlock avoidance method for avoiding a deadlock in a processor that executes a program described in a graph structure. In the program described in the graph structure, the input / output buffer is a buffer unit. A graph structure analysis step for extracting a provisional deadlock portion that loops and becomes an apparent deadlock, and a deadlock elimination step for eliminating a deadlock at the provisional deadlock portion are provided. In the deadlock elimination step, a process start node for starting the process at the provisional deadlock point is specified, and a start specifying process for specifying a series of processes to be started from the process start node is executed.

If considered as a device, it is a deadlock avoidance device that avoids deadlock in a processor that executes a program described in a graph structure. A graph structure analysis unit 501 that extracts a provisional deadlock location that becomes a deadlock, and a deadlock elimination unit 502 that eliminates a deadlock at the provisional deadlock location. The deadlock elimination unit 502 identifies a process start node that starts processing at the provisional deadlock location, and executes a start identification process that identifies a series of processes to be started from the process start node.

As described with reference to FIG. 7, in the deadlock avoidance method, in the start specifying process, start instruction information for the process start node can be given so as to start the process from the process start node. Similarly, in the deadlock avoidance device, the deadlock elimination unit 502 can give start instruction information to the process start node so that the start specifying process starts the process from the process start node. Thus, by giving start instruction information, a process start node can be specified, and parallel processing becomes possible.

As described with reference to FIG. 8, in the deadlock avoidance method, in the start specifying process, it is possible to add process order information for instructing the process order so that the process is started from the process start node. Similarly, in the deadlock avoidance apparatus, the deadlock elimination unit 502 can add processing order information that instructs the processing order so that the processing is started from the processing start node in the start specifying processing. In this way, by adding the processing order information, the processing order can be specified, and parallel processing becomes possible.

The embodiment has been described above with reference to specific examples. However, the present disclosure is not limited to these specific examples. Those in which those skilled in the art appropriately modify the design of these specific examples are also included in the scope of the present disclosure as long as they have the features of the present disclosure. Each element included in each of the specific examples described above and their arrangement, conditions, shape, and the like are not limited to those illustrated, and can be changed as appropriate. Each element included in each of the specific examples described above can be appropriately combined as long as no technical contradiction occurs.

Claims

A deadlock avoidance method for avoiding a deadlock in a processor that executes a program described in a graph structure,
In a program described in a graph structure, a graph structure analysis step for extracting a provisional deadlock portion where an input / output buffer loops in buffer units and apparently becomes a deadlock;
A deadlock elimination step for eliminating a deadlock at the provisional deadlock location, and
In the deadlock elimination step,
Identify the process start node to start processing at the provisional deadlock location,
A deadlock avoidance method for executing a start specifying process for specifying a series of processes to be started from the process start node.
The deadlock avoidance method according to claim 1,
A deadlock avoidance method of providing start instruction information for the process start node so that the process is started from the process start node in the start specifying process.
The deadlock avoidance method according to claim 1,
In the start specifying process, a deadlock avoidance method of giving processing order information for instructing a processing order so as to start processing from the processing start node.
A deadlock avoidance device for avoiding deadlock in a processor that executes a program described in a graph structure,
In a program described in a graph structure, a graph structure analysis unit (501) for extracting a provisional deadlock portion where an input / output buffer loops in buffer units and apparently becomes a deadlock;
A deadlock elimination unit (502) for eliminating a deadlock at the provisional deadlock location,
The deadlock elimination unit is
Identify the process start node to start processing at the provisional deadlock location,
A deadlock avoidance device that executes a start specifying process for specifying a series of processes to be started from the process start node.
The deadlock avoidance device according to claim 4,
The deadlock avoiding apparatus, wherein the deadlock elimination unit provides start instruction information for the process start node so that the process is started from the process start node in the start specifying process.
The deadlock avoidance device according to claim 4,
The deadlock elimination unit is a deadlock avoidance device that assigns processing order information indicating a processing order so that processing is started from the processing start node in the start specifying processing.