CN105892359A - Multi-DSP parallel processing system and method - Google Patents

Multi-DSP parallel processing system and method Download PDF

Info

Publication number
CN105892359A
CN105892359A CN201610273044.0A CN201610273044A CN105892359A CN 105892359 A CN105892359 A CN 105892359A CN 201610273044 A CN201610273044 A CN 201610273044A CN 105892359 A CN105892359 A CN 105892359A
Authority
CN
China
Prior art keywords
dsp
data
fpga
parallel
signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610273044.0A
Other languages
Chinese (zh)
Inventor
秦于华
王晓亮
王岩飞
刘畅
刘飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Electronics of CAS
Original Assignee
Institute of Electronics of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Electronics of CAS filed Critical Institute of Electronics of CAS
Priority to CN201610273044.0A priority Critical patent/CN105892359A/en
Publication of CN105892359A publication Critical patent/CN105892359A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B19/00Programme-control systems
    • G05B19/02Programme-control systems electric
    • G05B19/04Programme control other than numerical control, i.e. in sequence controllers or logic controllers
    • G05B19/042Programme control other than numerical control, i.e. in sequence controllers or logic controllers using digital processors
    • GPHYSICS
    • G05CONTROLLING; REGULATING
    • G05BCONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
    • G05B2219/00Program-control systems
    • G05B2219/20Pc systems
    • G05B2219/25Pc structure of the system
    • G05B2219/25341Single chip programmable controller

Landscapes

  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Automation & Control Theory (AREA)
  • Information Transfer Systems (AREA)

Abstract

The invention provides a multi-DSP parallel processing system and method, and the system is provided with an N-stage signal processing board which improves the signal processing capability. The type of an FPGA is XC6VLX130T, and the type of the DSPs is ADSP-TS201. The data communication between the DSPs is completed through a high-speed link. The system achieves high parallelism, and is strong in data transmission capability. An SDRAM connected with the DSPs are greatly high in data throughout, and the data transmission is achieved through a parallel interface. The rate of data transmission is high.

Description

A kind of multi-DSP parallel procession system and processing method thereof
Technical field
The present invention relates to signal processing technology field, particularly relate to a kind of multi-DSP parallel procession system and Its processing method.
Background technology
Along with the development of Radar Technology, increasing Radar Signal Processing will complete at numeric field, greatly Bandwidth high resolution, the employing of multi-signal processing mode so that the real time signal processing process to data Speed is greatly improved.(designer determines algorithm flow for application background to tradition task based access control, determines phase The system structure answered, then structure is divided into module carries out circuit design) radar signal processor more come More expose its function singleness, poor universality, the shortcoming of software algorithm scaling difficulty.
Superelevation disposal ability, prominent data exchange capability, good Universal and scalability becomes Feature for Modern Radar Signal datatron.Digital Signal Processing (DSP) chip have disposal ability strong, Apply feature flexibly, frequently as the core processor of radar signal processor, but along with to processing energy Improving constantly of the requirement of power, monolithic DSP can not meet the most far away requirement.Therefore, this area is needed badly The data handling system that a kind of fast operation, disposal ability are strong.
Summary of the invention
(1) to solve the technical problem that
In order to solve the problems referred to above that prior art exists, the invention provides a kind of many DSP and locate parallel Reason system and processing method thereof.
(2) technical scheme
The invention provides a kind of multi-DSP parallel procession system, including: N level signal-processing board, phase Adjacent two-stage signal is processed and is connected by data-interface between plate;Wherein, every grade of signal-processing board includes one FPGA and M sheet DSP, described FPGA receive the data of upper level signal-processing board transmission, institute State M sheet DSP and data carried out parallel processing, described FPGA will process after data transmission under One stage signal processes plate, wherein, 1≤N≤100,2≤M≤10.
Preferably, described M takes 4, and the link port of described FPGA and four DPS connects, each Sheet DSP is all connected by the link port of link port with other three DSP;Described FPGA will receive The data arrived are by link port transmission to DSP1, and data are given by described DSP1 by link port transmission DSP2, DSP3 and DSP4, data are carried out also by described DSP1, DSP2, DSP3 and DSP4 Row processes, and the data after described DSP2, DSP3 and DSP4 will process are transferred to described DSP1, Data after DSP1, DSP2, DSP3 and DSP4 are processed by described DSP1 are transferred to described FPGA, Data after described FPGA will process are transmitted to next stage signal-processing board.
Preferably, described FPGA selects XC6VLX130T;And/or described DSP selects ADSP-TS201。
Preferably, described FPGA connects a piece of FLASH chip, is used for storing FPGA program; And/or described DSP4 connects a piece of FLASH chip, it is used for storing DSP program and systematic parameter; Described FLASH chip selects M29W640FT60ZA6;And/or described DSP3 and DSP4 is respectively External one group of SDRAM, respectively expands 4G off-chip memory space, is used for storing ephemeral data.
Preferably, described FPGA connects has a transmission parallel port and to receive parallel port, at upper level signal The parallel port that sends of reason plate connects the reception parallel port of next stage signal-processing board, and FPGA is by receiving parallel port The data of reception upper level signal-processing board transmission, the data after processing are by transmission also port transmission extremely Next stage signal-processing board.
Preferably, described FPGA connects an Asynchronous Serial Interface, and described Asynchronous Serial Interface is with outside Equipment connects, and carries out the storage of video data;And/or described FPGA connects one 422 interfaces, it is achieved And the serial communication between PC;And/or described FPGA connects a PCIE interface and/or a network interface, Data after processing are sent to host computer.
Preferably, described FPGA connects power supply chip, it is achieved the following electric sequence of DSP: DSP Core voltage VDD-A power at first, power on after IO voltage, VDD-DRAM finally powers on.
Preferably, also including CY2308-1H clock module, input clock signal is divided into 8 by it The identical clock signal of phase frequency, wherein four tunnel clock signals are supplied to DSP, wherein two-way Being supplied to SDRAM, wherein a road is supplied to FPGA.
Present invention also offers a kind of many DSP based on above-mentioned multi-DSP parallel procession system to locate parallel Reason method, including: step A: initialize multi-DSP parallel procession system;Step B: to described many DSP parallel processing system (PPS) configures;Step C: the FPGA of every grade of signal-processing board receives upper one The data of level signal-processing board transmission, and transfer data to a DSP of this grade of signal-processing board; And step D: a DSP transfers data to other DSP of this grade of signal-processing board, all DSP carries out parallel processing to data, and the data back after processing is to FPGA;Step E:FPGA Data after processing are transmitted to next stage signal-processing board.
10, multi-DSP parallel procession method as claimed in claim 9, it is characterised in that described step Rapid C specifically includes:
The FPGA of every grade of signal-processing board receives the data of upper level signal-processing board transmission by parallel port, And data are given by link port transmission the DSP1 of this grade of signal-processing board;
Preferably, described step D specifically includes: data are given this level by link port transmission by DSP1 DSP2, DSP3, DSP4 of signal-processing board, DSP1, DSP2, DSP3, DSP4 are to data Carry out parallel processing, and the data back after processing is to FPGA.
(3) beneficial effect
From technique scheme it can be seen that a kind of multi-DSP parallel procession system of the present invention and from Reason method has the advantages that
(1) there is N level signal-processing board, greatly improve signal handling capacity, height can be realized The data of fast and high complexity process;
(2) FPGA selects XC6VLX130T, XC6VLX130T cost performance height, internal resource Abundant, macroelement quantity is many;
(3) DSP selects ADSP-TS201, ADSP-TS201 fast operation, provides high-performance Static superscale DSP operation, special optimization is applicable to communication and needs the application of many dsp processors; DSP algorithm and I/O excellent performance, instruction set is very flexible, supports the DSP architecture of high-level language, It is easy to dsp program;It is extended to multicomputer system, during connection, only needs the lowest communication overhead, pole The earth improves data processing speed and the ability of multi-DSP parallel procession system;
(4) data communication between DSP uses high-speed chain crossing to complete, and bidirectional data transfers rate can Reach 1GB/s, it is achieved seamless link, it is to avoid bus arbitration problem, have process simultaneously a lot of different The ability of task, highly-parallel, data transmission capabilities is strong, and motility is good;
(5) FLASH chip that DSP connects preserves the parameter that data process, and improves system number According to processing speed;
(6) FLASH chip selects M29W640FT 60ZA6, and it is low in energy consumption, capacity is big, fast Degree is fast, can retain information after power down, can write with online programming, can write by page successive byte, Reading speed is high, and compared with EEPROM, it has more superior performance and lower price;
(7) SDRAM of the employing that DSP connects has high data throughout, and DSP has SDRAM memory, simplifies the interface of SDRAM, without external circuits and consideration sequential;
(8) being transmitted by the data between parallel port realizes, message transmission rate is high, it is possible to achieve The message transmission rate of 3.2Gbps;
(9) the PCIE interface of FPGA provides the >=high speed data transmission interface of 3Gbit/s speed, Achieve high speed pipelined data between plate to process;
(10) by clock matches device distribute clock signal, it is to avoid clock signal because of reflection to it He impacts by clock, it is ensured that each device clock synchronizes.
Accompanying drawing explanation
Fig. 1 is the multi-DSP parallel procession system structure schematic diagram of the embodiment of the present invention;
Fig. 2 is the timing relationship figure of the ADSP-TS201 of the embodiment of the present invention;
Fig. 3 is the control software interface figure of the embodiment of the present invention;
Fig. 4 is the PCIe function structure chart of the embodiment of the present invention;
Fig. 5 is the annexation figure of FPGA Yu DPS of the embodiment of the present invention;
Fig. 6 is the link port structure chart of the embodiment of the present invention.
Fig. 7 is the annexation figure of DSP Yu SDRAM of the embodiment of the present invention;
Fig. 8 is the multi-DSP parallel procession method flow diagram of the embodiment of the present invention.
Detailed description of the invention
For making the object, technical solutions and advantages of the present invention clearer, below in conjunction with concrete real Execute example, and referring to the drawings, the present invention is described in more detail.
Seeing Fig. 1, first embodiment of the invention provides a kind of multi-DSP parallel procession system, its bag Including N level signal-processing board, adjacent two-stage signal is processed and is connected by data-interface between plate, every grade of letter Number process plate include a FPGA and M sheet DSP, FPGA receive upper level signal-processing board transmission Data, M sheet DSP carries out parallel processing to data, FPGA will process after data transmission under One stage signal processes plate, wherein, 1≤N≤100,2≤M≤10.
Wherein, the FPGA of first order signal-processing board receives initial data, and M sheet DSP is to data After carrying out parallel processing, the data after FPGA will process are transmitted to second level signal-processing board;N Level signal-processing board receives the data of N-1 level signal-processing board transmission, and data are entered by M sheet DSP Row parallel processing, obtains target data.
Preferably, described M takes 4, and FPGA is connected by data/address bus with the most a piece of DSP, should Three DSP of sheet DSP and other are connected by data/address bus, the data transmission that FPGA will receive Transfer data to other three DSP, four DSP to this sheet DSP, this sheet DSP data are entered Row parallel processing, the data after other three DSP will process are transferred to this sheet DSP, this sheet DSP Data after four DSP are processed be transferred to the data transmission after FPGA, FPGA will process under One stage signal processes plate.
Further, described FPGA select XC6VLX130T, XC6VLX130T cost performance high, Internal resource is abundant, macroelement quantity is many.DSP selects ADSP-TS201, ADSP-TS201 to transport Calculation speed is fast, provide high-performance static state superscale DSP operation, special optimization to be applicable to communication and needs The application of many dsp processors;DSP algorithm and I/O excellent performance, instruction set is very flexible, supports The DSP architecture of high-level language, it is simple to dsp program;It is extended to multicomputer system, during connection Only need the lowest communication overhead, be a high performance dsp chip, drastically increase many DSP The data processing speed of parallel processing system (PPS) and ability.
The link port of FPGA and four DPS connects, every a piece of DSP all by link port and other The link port of three DSP connects;FPGA by the data that receive by link port transmission to DSP1, Data are passed through link port transmission to DSP2, DSP3 and DSP4, DSP1, DSP2, DSP3 by DSP1 With DSP4, data being carried out parallel processing, the data after DSP2, DSP3 and DSP4 will process pass It is defeated by the data after DSP1, DSP2, DSP3 and DSP4 are processed by DSP1, DSP1 to be transferred to Data after FPGA, FPGA will process are transmitted to next stage signal-processing board.FPGA in this enforcement With the annexation of DPS as shown in Figure 5.
In the present embodiment, fpga chip completes electric control and DSP on DSP and carries out link port Communication, PCI Express bus communication, Ethernet interface communication, parallel bus communication, 422 buses are led to Letter, system reset, the function such as ASI bus communication.
ADSP-TS201 has 4 full-duplex link port communications mouths, uses LVDS (Low Voltage Differential Signaling) circuit, it is provided that 4 extra complete two-way I/O abilities, each link Mouth has one to receive passage and a sendaisle, all can be received and transmitted operation independently, I.e. with Double Data Rate (being up to the rising and falling edges of CCLK clock) latch data, i.e. can lead to Cross processor core to access, it is also possible to conduct interviews by the way of DMA.ADSP-TS201 uses LVDS carries out data transmission, and relative to other DSP such as ADSP-TS101, can make link port work Make in higher clock frequency, it is thus achieved that higher message transmission rate, to improve the performance of processor. Link port structure chart is as shown in Figure 6.
Preferably, the link port data width of ADSP-TS201 is set to 1, or The pull-up resistor of 500 Ω is added, by chain between TMROE and VDD_IO of ADSP-TS201 Intersection data width is set to 4.
Preferably, link port recipient is near the terminal build-out resistor of pin configuration 100 Ω so that chain Crossing normally works and carries out link port data transmission at a high speed.
Preferably, on pcb board, link port clock cable should be placed between four groups of LVDS lines, To adapt to the operation of high speed 4-bit;Distance between LVDS line is the biggest, and length is the shortest, and via is most Amount is few;LVDS line is individually placed in one layer, and is put in bottom or the top layer of PCB;Bus plane or Stratum is positioned at below LVDS line, it is also possible to LVDS line is placed in the interlayer of bus plane and ground plane.
First embodiment of the invention provides a kind of multi-DSP parallel procession system, has N level signal Process plate, greatly improve signal handling capacity, can realize at a high speed and the data of high complexity process. Have employed the structure exclusively enjoying bus, external data bus and the address bus of four DSP are separate, Being independent of each other, data communication each other uses high-speed chain crossing to complete so that multicomputer system Being suitable for processing the most different tasks, highly-parallel, motility is strong simultaneously.Present treatment system schema is more Make use of well the feature of chip, make circuit function more abundant comprehensively, be conducive to digital signal The function of processing module.
First embodiment of the invention provides a kind of multi-DSP parallel procession system, between four DSP Data communication use high-speed chain crossing complete, when core clock is 500MHz, one-way data pass The defeated 55MB/s that reaches as high as, bidirectional data transfers rate is up to 1GB/s.And if use high speed outer Bus coupled modes, even if using the SCLK of 100MHz, 64 BITBUS network mouths, with the fastest flowing water Operating protocol, is not added with any latent period, and bidirectional data transfers speed is only 800MB/s, thus may be used Seeing, the point-to-point communication of link port has the highest transmission reliability.Data between four DSP are led to Letter uses high-speed chain crossing to complete, it is only necessary to link port realizes communicating between DSP sheet, it is possible to realize nothing Seam connects, and avoids bus arbitration problem.
Wherein, FPGA connects a piece of FLASH chip, FLASH chip storage FPGA program; DSP4 connects a piece of FLASH chip, FLASH chip storage DSP program and systematic parameter, is somebody's turn to do FLASH chip is the 64Mb FLASH memory of 8Mbit × 8, the chip selection signal of FLASH With DSP4'sWithConnect after carrying out logical AND operation in FPGA.Many DSP locate parallel The DSP of reason system needs to perform a series of Digital Signal Processing computings such as FFT, FIR, these algorithms Needing some butterfly coefficient and window parameters, if again calculated with formula when using, can be substantially reduced The arithmetic speed of system, the FLASH chip that DSP4 connects also is preserved the parameter that data process, is come Raising system overall operation speed.
DSP4 is set to main boot pattern, and DSP4 is by guiding FLASH loading procedure, its pinPin connects low, and its pinWithThe sheet choosing of FLASH is connected respectively with defeated as output Going out to enable, 8 position datawires of FLASH connect DATA0~DATA7 of DSP4, it is achieved DSP4 FLASH load.
DSP1, DSP2 and DSP3 are set to link port bootmode, by link port loading procedure, Any one link port of DSP1, DSP2 and DSP3 could be used for guiding, and link port guides to use and is somebody's turn to do The DMA channel of link port, DSP1, DSP2 and DSP3'sPin connects height,Pin It is high level state when resetting, after reset, enters idle condition, wait for link port and start, from The link port loading procedure of DSP4.
Preferably, FLASH chip selects the M29W640FT of STMicroelectronics company 60ZA6, this chip have low in energy consumption, capacity is big, speed is fast, can retain information after power down, can With advantages such as online programming writes, and can write by page successive byte, reading speed is high, with EEPROM compares, and it has more superior performance and lower price.
Wherein, DSP3 and DSP4 undertakes main data processing task, and its data processing amount is more than DSP1 and DSP2, DSP3 and DSP4 external 8,64 SDRAM of every 512Mbit respectively, Expand 4G off-chip memory space, be used for storing ephemeral data, expand disposal ability.SDRAM Select IS42S16320B;ADSP-TS201'sManagement IS42S16320B, its SDA10 Pin connects the A10 of IS42S16320B, and when normal read-write operation, this pin is address signal, When IS42S16320B refresh count is full, ADSP-TS201 by this pin to IS42S16320B Send refresh command signal, A14, A13 and the IS42S16320B in ADSP-TS201 address wire BA1, BA0 be connected, for the group selection of IS42S16320B.DSP's Yu SDRAM Annexation is as shown in Figure 7.
Sync cap is used, according to system clock frequency relative to traditional DRAM, SDRAM Each periodic transfer once, has high data throughout.ADSP-TS201 has SDRAM Memorizer, is very easy to the Interface design of SDRAM, without external circuits and consideration sequential, Wherein control line and data wire can be directly connected to, and address wire then to be come according to concrete handling characteristics Judge to adjust, it is only necessary to carry out simple Initialize installation to controlling depositor SDRCON.
SDRAM also can be shared by multiple DSP, and the SDRAM of multiple DSP output controls letter The interconnection of number line, the input signal of SDRAM is driven by main DSP, detects input signal from DSP, Make ADSP-TS201 ensure to synchronize to the refresh count of SDRAM, prevent when bus power handing-over Carry out unnecessary refresh operation.
Wherein, FPGA connects has a transmission parallel port and to receive parallel port, upper level signal-processing board Sending parallel port and connect the reception parallel port of next stage signal-processing board, FPGA receives by receiving parallel port One stage signal processes the data of plate transmission, and the data after processing pass through transmission port transmission to next stage Signal-processing board.
Opening up two FIFO inside FPGA and carry out ping-pong operation, for data cached, data are led to After crossing the incoming FPGA in parallel port, cache at the FIFO within FPGA, then send data to DSP1 Carry out data process, it is achieved that the uninterrupted water operation of data.Preferably, employing clock is The parallel port of the 32bit of 100MHz, it is achieved the message transmission rate of 3.2Gbps.
Wherein, FPGA connects an Asynchronous Serial Interface (ASI, Asynchronous Serial Interface) Interface, ASI interface is connected with external equipment (such as recorder), carries out the storage of video data. ASI interface has at a high speed, reliable, accurate, hot swappable advantage.
Wherein, FPGA connects one 422 interfaces, and preferably MAX3491ESD chip is with FPGA even Connect, it is achieved 422 serial communication data transmitting-receivings;PC is communicated with system by 422 interfaces, real Existing master control order loading, observation system state and master control carry out mutual, the state inspection of instruction and data Survey, DSP program loads, parameter is arranged.
Wherein, FPGA connects a PCIE interface, and the data after processing are sent to host computer, Data after processing are further processed by host computer.There is inside M29W640FT PCIe module, This PCIe function structure chart as shown in Figure 4, is the internal module of FPGA in high order end dotted line frame, Be the PCIe module of FPGA in middle solid box, in the dotted line frame of the right for FPGA interconnect upper Machine.PCIe module include PCI Express interface module, GTX transceiver, Block RAM, time Clock module and reseting module, wherein PCI Express interface module is realized by the built-in stone of V6 FPGA, It is responsible for protocol logic;GTX transceiver includes dma controller, sends logic and receive logic module; Block RAM includes sending caching and receiving cache module;Clock module is responsible for providing clock accurately, Reseting module is used for providing reset signal, PCIE interface to provide the >=high-speed data of 3Gbit/s speed Coffret, it is achieved that between plate, high speed pipelined data processes.
Wherein, FPGA connects a network interface, and the data after processing are sent to host computer, upper Data after processing are further processed by machine.Preferably Ethernet interface control chip Ethernet MACs module within 88E1111-XX-RCJ1C000, FPGA and Ethernet interface control Coremaking sheet connects, it is achieved the HVN port transmission of data.
Wherein, every one-level signal-processing board of multi-DSP parallel procession system also includes power supply chip, uses In powering for signal-processing board.
ADSP-TS201 kernel VDD uses low-voltage, the working method of big electric current, core current Up to more than 2A, electric power outputting current ability should be at more than 3A, and now ADSP-TS201 is total Internal power is about 2W, and chip power-consumption is bigger.Through considering, power supply chip selects PTH08T250WAD, the electric current of its monolithic the highest exportable 50A, output voltage range is 0.7-3.6V, meets four DSP and simultaneously works in maximum power dissipation state.
Analog PLL power vd D-A is the phaselocked loop within DSP and frequency multiplier circuit is powered, each Dsp chip all with the addition of decoupling circuit, for being filtered VDD-A.VDD, VSS with VDD-A is connected by a 10uH inductance the most between the two, VDD, VSS and VDD-A Respectively by 1 μ F capacity earth, one or two 1nF high frequency is set near VSS and VDD-A Surface Mount electric capacity, VDD-A lead-in wire on PCB is away from noise source, by above-mentioned decoupling circuit to electricity Source filters, and improves stability and the quality of power supply.
All DSP share VDD-IO, and selecting PTH08T230WAD chip is outside port, chain Circuit, the I/O port of FPGA that crossing is relevant to external terminal output driver and logic drive and supply Electricity, the current value of exterior I/O is about 4A.
The inside DRAM processor of ADSP-TS201 is powered by external voltage source, The voltage request of VDD-DRAM power supply depends on the operation frequency of DSP, for 500MHz speed The equipment of grade, VDD-DRAM needs the power supply of 1.50V, selects PTH08T230WAD to open Close the VDD-DRAM that power supply is four DSP to power.
The power supply of ADSP-TS201 is provided with shunt capacitance decoupling, and shunt capacitance is close proximity to electricity Source pin, the VDD_DRAM power supply of DSP at least places the high frequency of 6 1nF near its pin Shunt capacitance, 2 10nF electric capacity and 4 0.1uF electric capacity.
FPGA connects power supply chip, for the upper electric control of DSP.Use metal-oxide-semiconductor break-make and The charging interval of electric capacity controls the sequencing powered on, and specifically includes: utilize the conducting of metal-oxide-semiconductor Characteristic first powers on as control signal, 1.0V, and the enable that then 1.0V powers on as 2.5V controls, 2.5V is as the upper electric control of 1.5V, it is achieved power on step by step, it is ensured that core voltage VDD-A of DSP 1.0V power at first, power on after the 2.5V of IO voltage VDD-IO, the 1.5V of VDD-DRAM Finally power on.
Wherein, every one-level signal-processing board of multi-DSP parallel procession system also includes reset chip, uses In providing reset signal for signal-processing board.Reset request signal is by ADSP-TS201 chip: Core voltage, PLL power, I/O voltage, system and core clock and to initialize pin the most steady After Ding, keep the low level of 2ms.Selecting MAX708 reset chip, this chip structure is simple, Feature richness, has hand-reset function, can carry out manual reset at any time, be that a cost performance is the highest Reset chip.
Signal-processing board is also provided with external reset button and LED, for identifying system program Running status.
Wherein, every one-level signal-processing board of multi-DSP parallel procession system also includes clock module, choosing With CY2308-1H clock matches device, it is complete that input clock signal is divided into 8 phase frequencies by it Identical clock signal, wherein four tunnel clock signals are supplied to DSP, and wherein two-way is supplied to SDRAM, Wherein a road is supplied to FPGA, and arbitrary clock signal so can be avoided other clocks because of reflection Impact, it is ensured that each device clock synchronizes.
As in figure 2 it is shown, the system clock of ADSP-TS201 has four kinds, they respectively: during system Clock (SCLK), core clock (CCLK), peripheral bus clock (SOCCLK) and link port are defeated Go out clock (LxCLKOUT).Wherein system clock provides clock, and conduct for external bus port The AC specification reference of external bus signal.Core clock be kernel, internal bus, memorizer and Link port provides clock, CCLK=SCLK × SCLKRATx, wherein the value model of SCLKRATx Enclose is 4~12.Peripheral bus clock, i.e. SOC bus operating frequencies, equal to 1/2 × CCLK.Chain Crossing output clock equal to CCLK/CR, wherein CR be LCTLx medium velocity position is set, take it Value scope is 1~4.
Consider according to external bus port speed and the factor such as CCLK, instruction execution speed, In the present embodiment, SCLK takes 100MHz, for arranging the N value of the clock multiplier of PLL by three Individual external pin SCLKRAT2-0 determine, these three pin connect pull-up resistor (inside have weak under Pull-up resistor) whether weld adjustable Clock Multiplier Factor size, this coefficient takes 5 by default, at this moment Core clock can be produced by SCLK frequency multiplication, i.e. core clock CCLK=N × SCLK=500MHz.
As shown in Figure 8, second embodiment of the invention provides a kind of many DSP utilizing first embodiment The multi-DSP parallel procession method of parallel processing system (PPS), comprising:
Step A: initialize multi-DSP parallel procession system.
Step A specifically includes: the signal-processing board of multi-DSP parallel procession system powers on, FPGA Control the electric sequence of DSP, arrangement reset signal and clock signal.
Step B: multi-DSP parallel procession system is configured.
Step B specifically includes: PC is ordered to multi-DSP parallel procession system loads by 422 interfaces Order, arranges multi-DSP parallel procession systematic parameter.System is carried out by PC end by corresponding software And control, software interface figure is as shown in Figure 3.
Step C: the FPGA of every grade of signal-processing board receives the data of upper level signal-processing board transmission, And transfer data to a DSP of this grade of signal-processing board.
Step C specifically includes: the FPGA of every grade of signal-processing board receives upper level signal by parallel port Process the data of plate transmission, and data are given by link port transmission the DSP1 of this grade of signal-processing board.
Wherein, the FPGA of first order signal-processing board receives initial data.
Step D a: DSP transfers data to other DSP of this grade of signal-processing board, all DSP carries out parallel processing to data, and the data back after processing is to FPGA.
Step D specifically includes: data are given this grade of signal-processing board by link port transmission by DSP1 DSP2, DSP3, DSP4, DSP1, DSP2, DSP3, DSP4 carry out parallel processing to data, And the data back after processing is to FPGA.
Data after step E:FPGA will process are transmitted to next stage signal-processing board.
Step E specifically includes: FPGA will process after data by and port transmission to next stage signal Process plate.
Wherein, what N level signal-processing board obtained is target data, and target data is exported by FPGA To host computer.
Wherein, during multi-DSP parallel procession method performs, PC is by passing through 422 interfaces Communicate with system, carry out system mode observation and master control carries out that instruction and data is mutual, state Detection etc..
So far, already in connection with accompanying drawing, the present embodiment has been described in detail.According to above description, this The multi-DSP parallel procession system of the present invention and processing method thereof should have been had clearly by skilled person The understanding of Chu.
It should be noted that in accompanying drawing or description text, the implementation not illustrating or describing, It is form known to a person of ordinary skill in the art in art, is not described in detail.Additionally, The above-mentioned definition to each element is not limited in various concrete structures, shape or the side mentioned in embodiment Formula, it can be changed or replace, such as by those of ordinary skill in the art simply:
(1) FPGA and DSP can also select the chip of other models;
(2) the direction term mentioned in embodiment, such as " on ", D score, "front", "rear", " left ", " right " etc., are only the directions with reference to accompanying drawing, are not used for limiting the scope of the invention;
(3) above-described embodiment can based on design and the consideration of reliability, being mixed with each other collocation use or Using with other embodiment mix and match, the technical characteristic in i.e. different embodiments can be with independent assortment shape Become more embodiment.
In sum, a kind of multi-DSP parallel procession system of the present invention and processing method thereof, there is N Level signal-processing board, greatly improves signal handling capacity, can realize at a high speed and the number of high complexity According to process;FPGA selects XC6VLX130T, XC6VLX130T cost performance height, internal resource Abundant, macroelement quantity is many;DSP select ADSP-TS201, ADSP-TS201 fast operation, High-performance static state superscale DSP operation, special optimization is provided to be applicable to communication and need at many DSP The application of reason device;DSP algorithm and I/O excellent performance, instruction set is very flexible, supports high-level language DSP architecture, it is simple to dsp program;It is extended to multicomputer system, only needs the lowest during connection Communication overhead, drastically increase data processing speed and the ability of multi-DSP parallel procession system; Data communication between DSP uses high-speed chain crossing to complete, bidirectional data transfers rate up to 1GB/s, Realize seamless link, it is to avoid bus arbitration problem, there is the energy simultaneously processing the most different tasks Power, highly-parallel, data transmission capabilities is strong, and motility is good;The FLASH chip that DSP connects is protected There is the parameter that data process, improves system data processing speed;FLASH chip is selected M29W640FT 60ZA6, it is low in energy consumption, capacity is big, speed is fast, can retain information after power down, Can write with online programming, can write by page successive byte, reading speed is high, with EEPROM Comparing, it has more superior performance and lower price;The SDRAM tool of the employing that DSP connects Having high data throughout, DSP has SDRAM memory, simplifies the interface of SDRAM, Without external circuits and consideration sequential;Data transmission between being realized by parallel port, message transmission rate High, it is possible to achieve the message transmission rate of 3.2Gbps;The PCIE interface of FPGA provides >=3Gbit/s The high speed data transmission interface of speed, it is achieved that between plate, high speed pipelined data processes;Pass through clock matches Device distribution clock signal, it is to avoid other clocks are impacted by clock signal because of reflection, it is ensured that Each device clock synchronizes.
Particular embodiments described above, is carried out the purpose of the present invention, technical scheme and beneficial effect Further describe, be it should be understood that the foregoing is only the present invention specific embodiment and , be not limited to the present invention, all within the spirit and principles in the present invention, that is done any repaiies Change, equivalent, improvement etc., should be included within the scope of the present invention.

Claims (10)

1. a multi-DSP parallel procession system, it is characterised in that including: N level signal-processing board, Adjacent two-stage signal is processed and is connected by data-interface between plate;Wherein,
Every grade of signal-processing board includes that a FPGA and M sheet DSP, described FPGA receive upper one The data of level signal-processing board transmission, described M sheet DSP carries out parallel processing to data, described FPGA will process after data transmission to next stage signal-processing board, wherein, 1≤N≤100,2≤ M≤10。
2. multi-DSP parallel procession system as claimed in claim 1, it is characterised in that described M Taking 4, the link port of described FPGA and four DPS connects, and every a piece of DSP all passes through link port It is connected with the link port of other three DSP;
Described FPGA by the data that receive by link port transmission to DSP1, described DSP1 will Data by link port transmission to DSP2, DSP3 and DSP4, described DSP1, DSP2, DSP3 With DSP4, data are carried out parallel processing, described DSP2, DSP3 and DSP4 will process after number According to being transferred to described DSP1, after DSP1, DSP2, DSP3 and DSP4 are processed by described DSP1 Data be transferred to described FPGA, described FPGA will process after data transmit to next stage signal Process plate.
3. multi-DSP parallel procession system as claimed in claim 2, it is characterised in that described FPGA Select XC6VLX130T;And/or described DSP selects ADSP-TS201.
4. multi-DSP parallel procession system as claimed in claim 2, it is characterised in that described FPGA Connect a piece of FLASH chip, be used for storing FPGA program;And/or described DSP4 connects a piece of FLASH chip, is used for storing DSP program and systematic parameter;Described FLASH chip is selected M29W640FT60ZA6;And/or described DSP3 and DSP4 external one group of SDRAM respectively, respectively Expand 4G off-chip memory space, be used for storing ephemeral data.
5. multi-DSP parallel procession system as claimed in claim 1, it is characterised in that described FPGA Connecting and have a transmission parallel port and to receive parallel port, the transmission parallel port of upper level signal-processing board connects next The reception parallel port of level signal-processing board, FPGA receives upper level signal-processing board biography by receiving parallel port Defeated data, the data after processing pass through transmission port transmission to next stage signal-processing board.
6. multi-DSP parallel procession system as claimed in claim 1, it is characterised in that described FPGA Connecting an Asynchronous Serial Interface, described Asynchronous Serial Interface is connected with external equipment, carries out video data Storage;And/or described FPGA connects one 422 interfaces, it is achieved and the serial communication between PC; And/or described FPGA connects a PCIE interface and/or a network interface, the data after processing send To host computer.
7. multi-DSP parallel procession system as claimed in claim 1, it is characterised in that described FPGA Connect power supply chip, it is achieved the following electric sequence of DSP: core voltage VDD-A of DSP is at first Powering on, power on after IO voltage, VDD-DRAM finally powers on.
8. multi-DSP parallel procession system as claimed in claim 4, it is characterised in that also include CY2308-1H clock module, its input clock signal is divided into 8 phase frequencies identical time Clock signal, wherein four tunnel clock signals are supplied to DSP, and wherein two-way is supplied to SDRAM, wherein One tunnel is supplied to FPGA.
9. locate parallel based on the many DSP described in claim any one of claim 1 to 8 for one kind The multi-DSP parallel procession method of reason system, it is characterised in that including:
Step A: initialize multi-DSP parallel procession system;
Step B: described multi-DSP parallel procession system is configured;
Step C: the FPGA of every grade of signal-processing board receives the data of upper level signal-processing board transmission, And transfer data to a DSP of this grade of signal-processing board;And
Step D a: DSP transfers data to other DSP of this grade of signal-processing board, all DSP carries out parallel processing to data, and the data back after processing is to FPGA;
Data after step E:FPGA will process are transmitted to next stage signal-processing board.
10. multi-DSP parallel procession method as claimed in claim 9, it is characterised in that described step Rapid C specifically includes:
The FPGA of every grade of signal-processing board receives the data of upper level signal-processing board transmission by parallel port, And data are given by link port transmission the DSP1 of this grade of signal-processing board;
Described step D specifically includes: data are given this grade of signal processing by link port transmission by DSP1 DSP2, DSP3, DSP4 of plate, data are carried out parallel by DSP1, DSP2, DSP3, DSP4 Process, and will process after data back to FPGA.
CN201610273044.0A 2016-04-28 2016-04-28 Multi-DSP parallel processing system and method Pending CN105892359A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610273044.0A CN105892359A (en) 2016-04-28 2016-04-28 Multi-DSP parallel processing system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610273044.0A CN105892359A (en) 2016-04-28 2016-04-28 Multi-DSP parallel processing system and method

Publications (1)

Publication Number Publication Date
CN105892359A true CN105892359A (en) 2016-08-24

Family

ID=56701803

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610273044.0A Pending CN105892359A (en) 2016-04-28 2016-04-28 Multi-DSP parallel processing system and method

Country Status (1)

Country Link
CN (1) CN105892359A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108519806A (en) * 2018-04-26 2018-09-11 北京比特大陆科技有限公司 A kind of chip controls method, apparatus, computer equipment and computer storage media
CN110196391A (en) * 2019-06-27 2019-09-03 中国兵器工业集团第二一四研究所苏州研发中心 Digital circuit test device and method based on FPGA and DSP architecture
CN110737219A (en) * 2019-10-12 2020-01-31 四川赛狄信息技术股份公司 DSP-based digital signal processing device
CN111506333A (en) * 2020-04-27 2020-08-07 湖北三江航天红峰控制有限公司 double-DSP program online upgrading method and system
CN111553122A (en) * 2020-05-15 2020-08-18 深圳探科技术有限公司 Method, module, storage medium and system for integrated circuit simulation synchronous processing
CN113467506A (en) * 2021-06-28 2021-10-01 北京航天飞腾装备技术有限责任公司 Core board design-based domestic flight controller and design method

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030097511A1 (en) * 2001-11-16 2003-05-22 Samsung Electronics Co., Ltd. Fieldbus interface board
CN101140559A (en) * 2007-10-30 2008-03-12 中国传媒大学 Paralleling calculating machinery based on DSP and system
CN101667169A (en) * 2008-09-03 2010-03-10 中国科学院上海技术物理研究所 Multi-processor parallel processing system for digital signals
CN103885919A (en) * 2014-03-20 2014-06-25 北京航空航天大学 Multi-DSP and multi-FPGA parallel processing system and implement method
CN203773274U (en) * 2014-03-12 2014-08-13 成都智恒博纳科技有限公司 Multi-DSP parallel processing board based on CPCI-E bus

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030097511A1 (en) * 2001-11-16 2003-05-22 Samsung Electronics Co., Ltd. Fieldbus interface board
CN101140559A (en) * 2007-10-30 2008-03-12 中国传媒大学 Paralleling calculating machinery based on DSP and system
CN101667169A (en) * 2008-09-03 2010-03-10 中国科学院上海技术物理研究所 Multi-processor parallel processing system for digital signals
CN203773274U (en) * 2014-03-12 2014-08-13 成都智恒博纳科技有限公司 Multi-DSP parallel processing board based on CPCI-E bus
CN103885919A (en) * 2014-03-20 2014-06-25 北京航空航天大学 Multi-DSP and multi-FPGA parallel processing system and implement method

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108519806A (en) * 2018-04-26 2018-09-11 北京比特大陆科技有限公司 A kind of chip controls method, apparatus, computer equipment and computer storage media
CN110196391A (en) * 2019-06-27 2019-09-03 中国兵器工业集团第二一四研究所苏州研发中心 Digital circuit test device and method based on FPGA and DSP architecture
CN110196391B (en) * 2019-06-27 2022-04-22 中国兵器工业集团第二一四研究所苏州研发中心 Digital circuit testing device and method based on FPGA and DSP framework
CN110737219A (en) * 2019-10-12 2020-01-31 四川赛狄信息技术股份公司 DSP-based digital signal processing device
CN111506333A (en) * 2020-04-27 2020-08-07 湖北三江航天红峰控制有限公司 double-DSP program online upgrading method and system
CN111553122A (en) * 2020-05-15 2020-08-18 深圳探科技术有限公司 Method, module, storage medium and system for integrated circuit simulation synchronous processing
CN113467506A (en) * 2021-06-28 2021-10-01 北京航天飞腾装备技术有限责任公司 Core board design-based domestic flight controller and design method
CN113467506B (en) * 2021-06-28 2023-08-25 北京航天飞腾装备技术有限责任公司 Domestic flight controller based on core board design and design method

Similar Documents

Publication Publication Date Title
CN105892359A (en) Multi-DSP parallel processing system and method
CN101329663B (en) Apparatus and method for implementing pin time-sharing multiplexing
CN104915303B (en) High speed digital I based on PXIe buses/O systems
CN105051706B (en) The equipment of operation for the low-power PHY with PCIE protocol stacks, method and system
CN105183680B (en) Realize that PCIe interface turns the fpga chip and method of CF card interfaces
CN107889529A (en) Share the dynamic data link selection on physical interface
CN105743820A (en) ARM+FPGA-architecture-based Ethernet switch for train
CN110347635A (en) A kind of heterogeneous polynuclear microprocessor based on multilayer bus
CN107562672B (en) System and method for improving data transmission rate of vector network analyzer
CN109189716A (en) A kind of data transmission system and transmission method based on FPGA
EP1535169B1 (en) Improved inter-processor communication system for communication between processors
CN104714918B (en) The reception of high speed FC bus datas and way to play for time under hosted environment
CN106385388A (en) Gigabit Ethernet switch system based on 2FI+CTC5160 configuration
CN209149287U (en) Big data operation acceleration system
CN205692166U (en) Core board based on PowerPC framework central processing unit
CN102752180A (en) Method for achieving controller area network (CAN) bus network nodes
CN109411007A (en) A kind of Common Flash Memory test macro based on FPGA
CN107704413A (en) A kind of reinforcement type parallel information processing platform based on VPX frameworks
CN205545305U (en) Based on dark embedded data interchange device of xilinx FPGA
CN105515673B (en) A kind of optical-fibre channel node card
CN109656856A (en) Multiplex bus and multiplex bus interconnect device and method are realized using FPGA
CN107643993A (en) Bus conversion interface, the method for work of bus conversion interface and communication equipment
CN111581152A (en) Reconfigurable hardware acceleration SOC chip system
CN103914427A (en) On-chip communication method and on-chip communication device on basis of three physical interconnection lines for integrated circuits
CN101778038B (en) Gigabit Ethernet-based high-speed data transmission system of embedded equipment

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20160824