CN111506349A - Computing board card with OODA (Observe-Orient-Decide-Act) multiprocessor - Google Patents

Computing board card with OODA (Observe-Orient-Decide-Act) multiprocessor

Info

Publication number
CN111506349A
CN111506349A (application CN202010363731.8A)
Authority
CN
China
Prior art keywords
ooda
processor
processors
multiprocessor
algorithm
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010363731.8A
Other languages
Chinese (zh)
Inventor
谭光明
邵恩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Western Institute Of Advanced Technology Institute Of Computing Chinese Academy Of Sciences
Original Assignee
Western Institute Of Advanced Technology Institute Of Computing Chinese Academy Of Sciences
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Western Institute Of Advanced Technology Institute Of Computing Chinese Academy Of Sciences filed Critical Western Institute Of Advanced Technology Institute Of Computing Chinese Academy Of Sciences
Priority to CN202010363731.8A priority Critical patent/CN111506349A/en
Publication of CN111506349A publication Critical patent/CN111506349A/en
Priority to CN202010866776.7A priority patent/CN111813453B/en
Pending legal-status Critical Current

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 9/00 Arrangements for program control, e.g. control units
    • G06F 9/06 Arrangements for program control using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F 9/30 Arrangements for executing machine instructions, e.g. instruction decode
    • G06F 9/38 Concurrent instruction execution, e.g. pipeline or look ahead
    • G06F 9/3836 Instruction issuing, e.g. dynamic instruction scheduling or out of order instruction execution
    • G06F 9/3851 Instruction issuing from multiple instruction streams, e.g. multistreaming
    • G06F 9/46 Multiprogramming arrangements
    • G06F 9/50 Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F 9/5005 Allocation of resources to service a request
    • G06F 9/5027 Allocation of resources to service a request, the resource being a machine, e.g. CPUs, servers, terminals
    • G06F 9/54 Interprogram communication
    • G06F 9/544 Buffers; Shared memory; Pipes
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N 20/00 Machine learning
    • G06F 2209/00 Indexing scheme relating to G06F 9/00
    • G06F 2209/50 Indexing scheme relating to G06F 9/50
    • G06F 2209/5011 Pool
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D 10/00 Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Medical Informatics (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Multi Processors (AREA)

Abstract

The invention discloses a computing board card with an OODA (Observe-Orient-Decide-Act) multiprocessor, comprising four OODA processors with different computing functions; a scheduling controller assigns the four processors different workflow jobs. When a streaming job is processed, the four processors are invoked cyclically in turn according to a preset computation order to execute the job. When processing OODA workflow loads, the four processors respectively execute four classes of algorithms corresponding to the observe, orient, decide, and act phases. Because each OODA workload occupies the different processors in sequence, multiple OODA workloads can occupy different processors simultaneously, which improves the execution efficiency of the load pipeline.

Description

Computing board card with OODA (Observe-Orient-Decide-Act) multiprocessor
Technical Field
The invention relates to a computing board card with an OODA (Observe-Orient-Decide-Act) multiprocessor.
Background
OODA Workflow computation tasks, whose steps carry front-to-back dependencies, have gradually become a dominant computational load, and the structural design of computing boards has correspondingly shifted toward processing such loads in a "streaming" (pipelined) manner. However, existing board designs struggle to guarantee that the computing board can execute OODA-class workloads efficiently in a pipelined fashion.
OODA loop theory (OODA Loop), originally proposed by United States Air Force colonel John Boyd in 1966, is the principal model framework for describing the military command decision process.
Disclosure of Invention
The invention aims to provide a computing board card with an OODA multiprocessor, to solve the problem that existing board designs cannot guarantee that the computing board executes OODA workflow loads in an efficient pipelined manner.
To solve this technical problem, the invention provides a computing board card with an OODA multiprocessor, comprising four OODA processors with different computing functions; a scheduling controller assigns the four processors different workflow jobs. When a streaming job is processed, the four processors are invoked cyclically in turn according to a preset computation order to execute the job.
Furthermore, each processor is directly connected with the same shared memory, and data transmission between partitions is performed through the shared memory.
Further, the four processors include a first processor, a second processor, a third processor, and a fourth processor. The first processor is matched with an observation-class algorithm and handles data-matching jobs; the second processor is matched with an adjustment (orientation) class algorithm and handles machine learning training jobs; the third processor is matched with a scenario (decision) class algorithm and handles machine learning inference jobs; and the fourth processor is matched with an action-class algorithm and handles control-algorithm jobs.
Further, the OODA class workload may be divided into a plurality of OODA processing flows, one OODA class workload includes a plurality of sub-jobs, and a single sub-job needs to occupy the first processor, the second processor, the third processor, and the fourth processor in sequence.
Further, when multiple OODA workloads occupy the same OODA multiprocessor computing board card at the same time, the different workflow jobs occupy the processors in turn, stage by stage.
Furthermore, the shared memory of each computing board card is directly connected with three resource pool interconnection interfaces for accessing the communication interfaces of the processors of other computing board cards.
The invention has the following beneficial effects: by letting the four processors share the on-board memory and process the independent steps of an OODA workflow as pipeline stages, the board card achieves "board-level" parallel execution of multiple workflows, improving workflow processing efficiency and realizing parallel pipeline acceleration of workflow tasks.
Drawings
The accompanying drawings, which are included to provide a further understanding of the application and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the application and together with the description serve to explain the application and not to limit the application. In the drawings:
fig. 1 is a schematic structural diagram of a computing board of an OODA multiprocessor according to an embodiment of the present invention;
FIG. 2 is an exemplary diagram of OODA workflow loads according to one embodiment of the invention;
FIG. 3 is an exemplary diagram of a single OODA workflow load comprising a plurality of sub-jobs, in accordance with an embodiment of the present invention.
Detailed Description
As shown in fig. 1, the computing board card with an OODA multiprocessor includes four OODA processors with different computing functions; a scheduling controller assigns the four processors different workflow jobs. When a streaming job is processed, the four processors are invoked cyclically in turn according to a preset computation order to execute the job (that is, execution circulates in the order O1 -> O2 -> D3 -> A4 -> O1).
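The cyclic invocation order described above can be sketched in a few lines of Python. This is an illustrative model only; the processor names O1, O2, D3, A4 follow fig. 1, and the function below stands in for, but is not, the patent's scheduling controller.

```python
import itertools

# Processor names follow fig. 1 of the description.
PIPELINE = ("O1", "O2", "D3", "A4")

def call_sequence(num_steps):
    """Return the cyclic processor order O1 -> O2 -> D3 -> A4 -> O1 -> ...
    for a streaming job of num_steps computation steps."""
    order = itertools.cycle(PIPELINE)
    return [next(order) for _ in range(num_steps)]

print(call_sequence(6))  # ['O1', 'O2', 'D3', 'A4', 'O1', 'O2']
```

A job with more steps than processors simply wraps around to O1 again, which is the "circular" behavior the description names.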
The shared memory of each computing board card is directly connected to three resource-pool interconnect interfaces, which are used to access the communication interfaces of the processors on other computing board cards. Each processor is directly connected to the same shared memory, and inter-partition data transfer is carried out through that shared memory.
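The shared-memory hand-off between partitions can be illustrated with Python's `multiprocessing.shared_memory`: one party creates a named segment, another attaches to it by name and reads the same bytes with no intermediate copy. The segment size and payload here are illustrative assumptions, not values from the patent.

```python
from multiprocessing import shared_memory

# Create a named segment standing in for the board-level shared memory.
shm = shared_memory.SharedMemory(create=True, size=1024)
try:
    shm.buf[:5] = b"frame"                            # one partition writes its output
    peer = shared_memory.SharedMemory(name=shm.name)  # another partition attaches by name
    received = bytes(peer.buf[:5])                    # and reads the same bytes directly
    peer.close()
finally:
    shm.close()
    shm.unlink()

print(received)  # b'frame'
```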
The four processors comprise a first processor O1, a second processor O2, a third processor D3, and a fourth processor A4, whose functions are mutually independent; they are respectively assigned the following different workflow jobs:
As shown in fig. 2, the first processor O1 is matched with an observation-class algorithm for processing data-matching jobs, such as assigning labels to pictures by classification; matching algorithms may include data compression, image tagging, and data cleansing algorithms.
The second processor O2 is matched with an adjustment (orientation) class algorithm for processing machine learning training jobs, carrying out training on the training data obtained from data classification; matching algorithms may include face feature training, sensitive-data feature training, and linear regression algorithms.
The third processor D3 is matched with a scenario (decision) class algorithm for processing machine learning inference jobs, carrying out inference with the trained model; matching algorithms may include data prediction, signal recognition, and image recognition algorithms.
The fourth processor A4 is matched with an action-class algorithm for processing control-algorithm jobs, applying control according to the inference results; matching algorithms may include unmanned aerial vehicle flight control, mechanical arm control, and robot control algorithms.
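The four processor-to-algorithm assignments above amount to a dispatch table. The sketch below is illustrative: the phase names map the description's observation/adjustment/scenario/action classes onto the OODA phases, and the job labels are hypothetical names for the algorithm examples given, not identifiers from the patent.

```python
# Hypothetical dispatch table: processor -> (OODA phase, algorithm class it covers).
DISPATCH = {
    "O1": ("observe", {"data_compression", "image_tagging", "data_cleansing"}),
    "O2": ("orient",  {"face_feature_training", "sensitive_data_training", "linear_regression"}),
    "D3": ("decide",  {"data_prediction", "signal_recognition", "image_recognition"}),
    "A4": ("act",     {"uav_flight_control", "arm_control", "robot_control"}),
}

def route(job_kind):
    """Return the processor whose matched algorithm class covers job_kind."""
    for proc, (_phase, jobs) in DISPATCH.items():
        if job_kind in jobs:
            return proc
    raise ValueError(f"no processor matches {job_kind!r}")

print(route("image_recognition"))  # D3
```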
In addition, the OODA class workflow load may be divided into multiple OODA processing flows.
As shown in fig. 3, an OODA-class workload includes a plurality of sub-jobs (Jobs), and a single sub-job needs to occupy the first processor O1, the second processor O2, the third processor D3, and the fourth processor A4 in that order. When multiple OODA workloads occupy the same OODA multiprocessor computing board card at the same time, the different workflow jobs occupy each processor in turn, which improves pipeline processing efficiency.
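The pipelining effect can be made concrete with a toy schedule in which each stage takes one time step and job j enters stage s at time j + s. This is a standard 4-stage pipeline model offered as an illustration, not the patent's scheduler: with it, at any time step up to four different jobs occupy the four processors simultaneously.

```python
STAGES = ("O1", "O2", "D3", "A4")

def pipeline_schedule(num_jobs):
    """Map (time_step, processor) -> job id, assuming one time step per
    stage and job j entering stage s at time j + s."""
    return {(j + s, proc): j
            for j in range(num_jobs)
            for s, proc in enumerate(STAGES)}

sched = pipeline_schedule(3)
# At time step 2 three jobs are in flight at once:
# job 2 on O1, job 1 on O2, job 0 on D3.
total_steps = max(t for t, _ in sched) + 1
print(total_steps)  # 6 steps, versus 3 * 4 = 12 if the jobs ran back to back
```

Under these assumptions, n jobs finish in n + 3 steps instead of 4n, which is the pipeline speedup the description claims.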
When a workflow job with more than four steps is processed, the design in which the shared memory is directly connected to three resource-pool interconnect interfaces allows the extra job steps, beyond what the board itself handles, to be executed by the processors of other computing board cards, forming a pipeline across boards in parallel; in this way the task scale of the computing board card can be expanded.
Finally, the above embodiments are intended only to illustrate the technical solutions of the present invention, not to limit them. Although the present invention has been described in detail with reference to the preferred embodiments, those skilled in the art should understand that modifications or equivalent substitutions may be made to the technical solutions of the present invention without departing from their spirit and scope, and all such modifications should be covered by the claims of the present invention.

Claims (6)

1. A computing board card with an OODA (Observe-Orient-Decide-Act) multiprocessor, characterized by comprising four OODA processors with different computing functions, wherein a scheduling controller assigns the four processors different workflow jobs; when a streaming job is processed, the four processors are invoked cyclically in turn according to a preset computation order to execute the job.
2. The computing board with OODA multiprocessors of claim 1, wherein each of the processors is directly connected to a same shared memory and performs inter-partition data transfer with each other through the shared memory.
3. The computing board with the OODA multiprocessor of claim 2, wherein the four processors include a first processor, a second processor, a third processor, and a fourth processor; the first processor is matched with an observation-class algorithm and handles data-matching jobs; the second processor is matched with an adjustment (orientation) class algorithm and handles machine learning training jobs; the third processor is matched with a scenario (decision) class algorithm and handles machine learning inference jobs; and the fourth processor is matched with an action-class algorithm and handles control-algorithm jobs.
4. The computing board with the OODA multiprocessor of claim 3, wherein the OODA class workload can be divided into multiple OODA processing flows, one OODA class workload comprises multiple sub-jobs, and a single sub-job needs to occupy the first processor, the second processor, the third processor, and the fourth processor in sequence.
5. The compute board with OODA multiprocessors of claim 4, wherein when multiple OODA class workloads concurrently occupy the same OODA multiprocessor compute board, different workflow jobs are caused to occupy each processor in turn.
6. The computing board with the OODA multiprocessor of claim 1, wherein the shared memory of each computing board is directly connected to three resource pool interconnect interfaces for accessing the communication interfaces of other computing board processors.
CN202010363731.8A 2020-04-30 2020-04-30 Computing board card with OODA (Observe-Orient-Decide-Act) multiprocessor Pending CN111506349A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202010363731.8A CN111506349A (en) 2020-04-30 2020-04-30 Computing board card with OODA (Observe-Orient-Decide-Act) multiprocessor
CN202010866776.7A CN111813453B (en) 2020-04-30 2020-08-25 Computing board card with OODA multiprocessor

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010363731.8A CN111506349A (en) 2020-04-30 2020-04-30 Computing board card with OODA (Observe-Orient-Decide-Act) multiprocessor

Publications (1)

Publication Number Publication Date
CN111506349A true CN111506349A (en) 2020-08-07

Family

ID=71869699

Family Applications (2)

Application Number Title Priority Date Filing Date
CN202010363731.8A Pending CN111506349A (en) 2020-04-30 2020-04-30 Computing board card with OODA (Observe-Orient-Decide-Act) multiprocessor
CN202010866776.7A Active CN111813453B (en) 2020-04-30 2020-08-25 Computing board card with OODA multiprocessor

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN202010866776.7A Active CN111813453B (en) 2020-04-30 2020-08-25 Computing board card with OODA multiprocessor

Country Status (1)

Country Link
CN (2) CN111506349A (en)

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130088352A1 (en) * 2011-10-06 2013-04-11 David Amis Systems and methods utilizing sensory overload to deter, delay, or disrupt a potential threat
WO2016153790A1 (en) * 2015-03-23 2016-09-29 Oracle International Corporation Knowledge-intensive data processing system
CN105892480B (en) * 2016-03-21 2018-12-11 南京航空航天大学 Isomery multiple no-manned plane systematic collaboration, which is examined, beats task self-organizing method
CN107783839A (en) * 2017-09-05 2018-03-09 中国科学院空间应用工程与技术中心 A kind of multi-load data processing method and system
CN109542516A (en) * 2018-11-13 2019-03-29 西安邮电大学 A kind of acceleration arm processor concurrent working system and its working method
CN109901820B (en) * 2019-01-17 2022-03-04 西北工业大学 Optimization method of airborne software agile development process conforming to DO-178B/C
CN209512643U (en) * 2019-02-02 2019-10-18 河南黄烨科技有限公司 Sighting system based on wireless VR/AR/MR technology
CN110034961B (en) * 2019-04-11 2022-02-15 重庆邮电大学 Seepage rate calculation method taking OODA chain as element
CN111080258B (en) * 2019-12-18 2020-11-17 中国人民解放军军事科学院国防科技创新研究院 Group unmanned system cooperative task management subsystem based on role state machine

Also Published As

Publication number Publication date
CN111813453A (en) 2020-10-23
CN111813453B (en) 2023-08-01

Similar Documents

Publication Publication Date Title
US11442785B2 (en) Computation method and product thereof
US9798551B2 (en) Scalable compute fabric
US9146777B2 (en) Parallel processing with solidarity cells by proactively retrieving from a task pool a matching task for the solidarity cell to process
JP2020537784A (en) Machine learning runtime library for neural network acceleration
CN105242954A (en) Mapping method between virtual CPUs (Central Processing Unit) and physical CPUs, and electronic equipment
US11366690B2 (en) Scheduling commands in a virtual computing environment
CN111506349A (en) Computing board card with OODA (Observe-Orient-Decide-Act) multiprocessor
WO2021179222A1 (en) Scheduling device, scheduling method, accelerating system and unmanned aerial vehicle
CN113837369A (en) Dynamic reconfigurable visual computing method and device
CN111813562B (en) Server host with OODA multi-partition IO resource pool mechanism
JP7347537B2 (en) distributed processing system
US20240231946A9 (en) Process allocation control device, process allocation control method, and recording medium storing process allocation control program
US20240134710A1 (en) Process allocation control device, process allocation control method, and recording medium storing process allocation control program
CN111984328B (en) Streaming processor with OODA circular partitioning mechanism
US20240184624A1 (en) Method and system for sequencing artificial intelligence (ai) jobs for execution at ai accelerators
US20120272045A1 (en) Control method and system of multiprocessor
CN117311941A (en) Image processing method and related equipment
WO2019188171A1 (en) Code generation method and code generation device
Yang et al. An efficient parallel ISODATA algorithm based on Kepler GPUs
Ahmed et al. Distributed and configurable architecture for neuromorphic applications on heterogeneous cluster
CN116643886A (en) Task scheduling method, device, electronic equipment and storage medium
CN111586144A (en) Computer group construction method with OODA fractal mechanism

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20200807