CN104881666A

CN104881666A - Real-time binary image connected domain mark realizing method based on FPGA

Info

Publication number: CN104881666A
Application number: CN201410068052.2A
Authority: CN
Inventors: 王磊
Original assignee: Individual
Current assignee: Chengdu Jingyao Communication Technology Co Ltd
Priority date: 2014-02-27
Filing date: 2014-02-27
Publication date: 2015-09-02
Anticipated expiration: 2034-02-27
Also published as: CN104881666B

Abstract

The invention discloses a real-time binary image connected domain mark realizing method based on FPGA, which belongs to the large scale integrated circuit design, object identification and visual measurement technical field. The invention aims to achieve real-time connected domain marking of large scale moving objects on the basis of field programmable gate array devices in a hardware way. Four symmetrical processing nodes are used. A newest serial exchange architecture (VPX architecture) is adopted. Each processing node comprises XC5VX95TI-2FPGA of the XILINX corporation and a DDR memory cell of a 16G-bit storage capacity. The nodes are interconnected through 16 pairs of 2.5GHz RocketIO. Each processing node is composed of a first-time scanning unit, a main controller module, a correlation memory group, a common scanning unit, a DDR access arbitration module, a DDR controller, and a framing and data output module. Real-time marking of large scale moving objects is achieved through superscalar streamline and dynamic delay streamline technologies. Adaption to various resolutions is further realized. The performance is not affected when the moving object shape and number are changed. Strong robustness is achieved. The calculation result is completely consistent with the MATLAB bwlabel function.

Description

A kind of real-time bianry image connected component labeling implementation method based on FPGA

Technical field

The present invention relates to a kind of bianry image connected component labeling implementation method, especially one is based on FPGA by hardware mode, uses superscalar pipelining line technology to realize carrying out in real time extensive moving target the method for connected component labeling.Belong to large scale integrated circuit design and target identification and technical field of visual measurement.

Background technology

Bianry image connected component labeling in radar system for the Doppler frequency position of moving-target, distance to position, isoparametric the resolving of phase place, amplitude, angle and speed provide important evidence, this technology is also widely used in target identification, target detection, guidance, navigation and field of medical applications in addition.

Bianry image connected component labeling is from the dot matrix image be only made up of " 0 " (expression background dot) and " 1 " (expression impact point), by adjacent Objective extraction out.Its objective is and find destination objects all in image, and all pixels belonging to same destination object are marked by unique mark value.In the algorithm simulating stage, in MATLAB, often use bwlable function to realize this function.At implementation phase, software and hardware is usually had to realize two kinds of modes.

The mode of software simulating uses high performance PC or workstation, realizes with based on software approach.But because CPU performs the serialization feature of instruction, when image resolution ratio, frame per second raising or moving-target quantity become many, the time processed spent by a two field picture can linear increase.When being greater than the frame period time when treated, frame losing can be caused.In order to reduce process time delay, general mode improves algorithm, proposes region growth method, the degree of depth or the various optimized algorithm such as BFS (Breadth First Search) method, wire tag method.Document " a kind of new method of bianry image connected component labeling " (computer engineering and application, 2006; Use region growth method 42(25): 50-51), only need to carry out single pass to image, improve arithmetic speed." a kind of new method of bianry image connected component labeling " (computer utility, 2007; Use the method that wire tag method and region growth method combine 27(11): 2776-2777), its connected component, as the elementary cell detected, greatly reduces the number needing to detect.But this algorithm needs the field repeatedly searching for each connected component, and when can significantly degenerate containing this algorithm execution efficiency when a lot of point-like and vertical line dress connected region in image.By the test result that analysis software implementation obtains, complete resolution be the image connectivity field mark general processing time of 1024x768 hundred milliseconds of magnitudes, not reach in radar system the requirement of process in real time far away.

Hardware implementation mode, based on ASIC or FPGA device, utilizes its parallelization feature to reduce processing delay.Document " FPGA based connected component labeling " (International Conference on Control, Automation and Systems. Seoul, Korea, describe 2007:2313-2317) and a kind ofly realize framework based on hardware mode, can process 200 two field picture per second, but maximumly only support 255 moving targets, in the polarization sensitive synthetic aperture radar system needing simultaneously to process thousands of moving target easily, there is no actual use.

Summary of the invention

The present invention is intended to solve image resolution ratio and is up to 18432x4096, and needs to process the technical barrier of the real-time bianry image connected component labeling that prior art cannot realize in the synthetic-aperture radar of thousands of up to ten thousand moving targets simultaneously.The present invention is based on FPGA and pass through hardware mode, superscalar pipelining line technology is used to achieve the extensive moving target of real-time mark, and the various resolution of this programme energy self-adaptation, its performance can not be affected when moving-target shape and quantity change, strong robustness, the bwlabel function of its operation result and MATLAB is completely the same.

The object of the invention is to be achieved through the following technical solutions.

A kind of real-time bianry image connected component labeling implementation method based on FPGA of the present invention, its implementation algorithm uses four neighbourhood signatures algorithms, is specifically exactly only relevant to the state of the point of its left side and top to certain any process.Processing procedure uses four modes scanned, and first time uses sequential system, from left to right, from top to bottom processes image; Otherwise second time then uses backward mode, processes from right to left, from top to bottom; Third time uses sequential system process again; Last use backward mode scan process image.This kind of Processing Algorithm is more common algorithm, and its calculated amount is relatively large, but can ensure to obtain the result consistent with bwlabel function in any shape with when any quantity moving-target.The starting point of the present invention be use Promethean realize framework and realize thought improve handling property, reach the beneficial effect that high resolving power and high frame rate image are processed in real time.

A kind of real-time its hardware platform of bianry image connected component labeling implementation method based on FPGA of the present invention uses four symmetrical treatment node compositions, adopts VPX framework (high speed serialization architecture for exchanging), interconnected employing HSSI High-Speed Serial Interface.The wherein high-end FPGA of XC5VX95TI-2 of four processing node 100 ~ 103 use XILINX companies; Storage unit 200 ~ 203 employing Micron company model is the 2G bit DDR2 particle of MT47H256M8-3E, and each storage unit uses 16 DDR2 particles, makes the memory capacity of each processing unit be 16G bit.The high-speed serial communication standard of the Rocket IO(XILINX definition of the interconnected employing 16 couples of 2.5GHz between processing unit), its total bandwidth is greater than 1000MB/s, the requirement of the high data throughput that synthetic-aperture radar middle high-resolution brings can be met, also have 4 pairs of difference control lines between processing unit in addition, can be used for communicating of control between processing unit and status signal.

A kind of real-time bianry image connected component labeling implementation method based on FPGA of the present invention, wherein said processing node by first time scanning element 1, main controller module 2, correlativity table memory set 3, plain scan unit 4, DDR access arbitration module 5, DDR controller 6 and framing and data outputting module 7 form.Main controller module 2 respectively with first time scanning element 1, correlativity table memory set 3, plain scan unit 4 be connected with data outputting module 7 with framing, interactive controlling and status information; First time, scanning element 1 was connected with correlativity table memory set 3 with the RAM access interface of plain scan unit 4 by standard, and write port is used for the content of update dependence table, and read port is for reading the value of correlativity table; Scanning element 1, plain scan unit 4 and framing and data outputting module 7 all need to carry out data interaction with DDR, so these three modules are all connected with DDR access arbitration module 5 by self-defining high speed access interface for the first time; The arbitration mode that DDR access arbitration module 5 combines according to poll and self-defined priority is multiple parallel being connected with DDR controller 6 to the request of access that the request of access mapping and multiplexing of DDR is the DDR of serial; The accessing time sequence that DDR controller 6 produces DDR according to request of access directly accesses outside DDR chip; Framing and data outputting module 7 export in mark value result embedding frame according to frame structure.

A kind of real-time bianry image connected component labeling implementation method based on FPGA of the present invention, wherein said first time scanning element 1 by serioparallel exchange module 11, raw data ping-pong buffers 12, first time scanning element processing and control module 13, mark value ping-pong buffers 14 and DDR Write post 15 form; Main controller module 2 is made up of main control state machine 21 and some auxiliary logics; Correlativity table memory set 3 is made up of access handover module 31, impact damper A 32 and impact damper B 33; Plain scan unit 4 is made up of read buffer group 41, plain scan cell processing control module 42, DDR Write post 43 and mark value ping-pong buffers 44; DDR access arbitration module 5 is made up of port selection mode machine 51, port and status register group 52, read data buffering 53 and Read-write Catrol state machine 54; DDR controller 6 is made up of initialization module 61, state of a control machine 62, data channel 63 and phase alignment module 64; Framing and data outputting module 7 are made up of read states machine 71, read data buffering 72, framing steering logic 73 and output timing control module 74.

A kind of real-time bianry image connected component labeling implementation method based on FPGA of the present invention, its realization comprises the steps:

1. in first time scanning element, complete the initial markers of bianry image according to sequential system from left to right, from top to bottom, and the correlation information between mark value be written in correlativity table memory set:

A. serioparallel exchange is done to the binary image data of input, become the data of 8 bit widths, stored in raw data ping-pong buffers.The data of so each reading 8 picture point process, and can reduce reading times;

B. determine whether impact point, then do corresponding process.Having three kinds of situations, if be not impact point, is so 0 the mark value assignment of current location point; If be new impact point, namely the point of its left side and top is not impact point, is so that the maximum mark value used adds 1 the mark value assignment of current location point, and will revises correlativity table; If be not new impact point, need the mark value reading neighbor, then with these mark value for address reads relevant mark value from correlativity table memory set, be the minimum value of mark value read the mark value assignment of this point, finally revise correlativity table;

C. the mark value that calculates needs stored in mark value ping-pong buffers because need the mark value using lastrow when calculating next line, the mark value of lastrow stored in internal storage mainly in order to improve processing speed.Simultaneously also will stored in DDR Write post FIFO, the Read-write Catrol logic for the first time in scanning element starts DDR write operation mark value stored in DDR according to the data volume in FIFO;

2. first time scanning element process completes initial markers to image after completing and incorporates the correlation data of mark value inside correlativity table memory set.Now main controller module notice plain scan unit starts scan operation next time, and notifies that the access handover module in correlativity table memory set does access switching, and read-write operation is afterwards that plain scan unit is initiated;

3. the fusion of mark when the first time scanning of plain scan unit, is completed according to backward mode from right to left, from top to bottom, and the information of update dependence table:

A. from DDR mark-sense value stored in read buffer group;

B. judge mark value, if not being this point of 0 expression is impact point.Its processing mode is completely different with first time scanning element process, first be the mark value reading current process points and the left side and the right point from read buffer group, then from correlativity table memory set, relevant mark value is read as address by these mark value, then the minimum value of the mark value of 3 points of reading is compared, by this minimum value as the mark value of current point and the value in update dependence table memory set;

C., in the mark value write DDR Write post FIFO calculated, be then deposited in DDR;

4. plain scan unit notifies main controller module after completing scanning, and main controller module upgrades the parameter value unlatching scan operation next time of plain scan unit;

5. repeat 3,4 step operations and complete scanning for four times image for twice;

6. after all scan operations complete, main controller module notice framing and data outputting module take out last mark value from the correspondence position of DDR, then generate according to the form of output frame the final data exported, last output timing control module produces suitable sequential and result is sent.

Compared with prior art, beneficial effect of the present invention comprises:

1. processing speed is fast, use the FPGA that XILINX company model is XC5VX95TI, under the condition of processing clock 100MHz, to reach 400M pixel per second for the maximum speed that can process, and can process the mark of bianry image connected domain in synthetic-aperture radar completely in real time;

2. this kind realizes the shape of structure and moving-target and quantity has nothing to do, and different shapes and quantity can not reduce processing speed;

3. parameterizedly realize structure, when image resolution ratio and frame per second larger time only need to revise corresponding buffer space size, and need not revise and realize logic;

4. constant process time delay, easy to use in various different system.

Accompanying drawing explanation

Fig. 1 is hardware structure schematic diagram of the present invention

Fig. 2 is that in present system, processing node realizes configuration diagram

Fig. 3 is pending some Position Design figure

Fig. 4 a is that scanning element streamline realizes sequential chart for the first time

Fig. 4 b is the method for expressing schematic diagram of the value in pending point, corresponding mark value and correlativity table

Fig. 5 is the schematic diagram of pending point dissimilar in first time scanning element

Fig. 6 is that plain scan unit stream waterline realizes sequential chart

Number in the figure illustrates:

100 ~ 103 processing nodes, 200 ~ 203 processing node external memory units, 1 first time scanning element, 2 main controller modules, 3 correlativity table memory set, 4 plain scan unit, 5 DDR access arbitration modules, 6 DDR controllers, 7 framings and data outputting module, 11 serioparallel exchange modules, 12 raw data ping-pong buffers, 13 first time scanning element processing and control module, 14 mark value ping-pong buffers, 15 DDR Write post, 21 main control state machines, 31 access handover modules, 32 impact damper A, 33 impact damper B, 41 read buffer groups, 42 plain scan cell processing control modules, 43 DDR Write post, 44 mark value ping-pong buffers, 51 port selection mode machines, 52 ports and status register group, 53 read data bufferings, 54 Read-write Catrol state machines, 61 initialization modules, 62 state of a control machines, 63 data channel, 64 phase alignment modules, 71 read states machines, 72 read data bufferings, 73 framing steering logics, 74 output timing control modules.

Embodiment

Detailed construction of the present invention, application principle, effect and effect, with reference to accompanying drawing 1-6, be explained by following embodiment.

Hardware platform architecture of the present invention as shown in Figure 1, uses four symmetrical treatment node compositions, adopts VPX framework, interconnected employing HSSI High-Speed Serial Interface.In the present invention, processing node realizes framework as shown in Figure 2, mainly comprises seven main functional modules.These two parts are described in detail in summary of the invention.

The respectively function of each module and specific design method in descriptive system and processing node below, make an explanation to the present invention further, its emphasis is to illustrate how superscalar pipelining line technology embodies in the design, and how to improve system handling property.In addition the details that realizes of some keys is also set forth:

One, the whole system FPGA that use four is identical on hardware is formed, each FPGA are as a processing node, complete the connected component labeling operation of a two field picture.The mode of processing node 100 ~ 103 employing order flowing water works.The view data that front end enters is a treatment cycle according to 4 frames, and i-th ~ i+3 two field picture is assigned in FPGA_A ~ FPGA_D respectively, after these four processing nodes complete scanning, then is aggregated into FPGA_D.From the water operation totally achieving the first order;

Two, in the realization of processing node, adopt the mode of two-stage pipeline processes, have two processing modules, comprise scanning element 1 and plain scan unit 4 for the first time, first one two field picture processes after entering processing node in first time scanning element, give plain scan unit after having processed and do subsequent treatment, now scanning element starts the image processing next frame for the first time.This is the water operation of the second level;

Three, first time scanning element 1, complete the initial markers of bianry image according to sequential system from left to right, from top to bottom, and the correlation information between mark value be written in correlativity table memory set.

Several operation has been needed when processing the point on (m, n) position as shown in Figure 3 when doing preliminary sweep mark:

1) read the two-value data of (m, n), (m, n-1), (m-1, n) three points, determine whether background dot, new impact point or have adjacent impact point;

2) read the mark value of (m, n-1) and (m-1, n), be expressed as g (m, n-1) and g (m-1, n);

3) inside correlativity table, read corresponding mark value using mark value g (m, n-1) and g (m-1, n) as address, be expressed as T (g (m, n-1)) and T (g (m-1, n));

4) compare the size of T (g (m, n-1)) and T (g (m-1, n)), obtain its minimum value, be expressed as T0;

5) mark value of (m, n) is composed as T0;

6) upgrade T (g (m, n))=T0, T (g (m, n-1))=T0, T (g (m, n-1))=T0 in correlativity table, correlativity table is deposited in correlativity table memory set.

According to the structure of FPGA, above-described six operations are each at least needs a clock period to process, if the mode operating all uses order so will reduce processing speed greatly.Use pipeline system process as shown in fig. 4 a in the design, using 8 points as process particle, read 8 points at every turn simultaneously and process.A ~ f in Fig. 4 a represents six operations recited above, brings target to represent respective operations that is last or that next time process; Fig. 4 b describes the method for expressing of the value needed in point to be processed, corresponding mark value and correlativity table, and T represents impact point, and B represents background dot.It is as follows that it performs flow process:

1) at d ', e ', three clock period of f ' read and need 8 raw data d (m to be processed from raw data ping-pong buffers 12, n) ~ d (m, and from mark value ping-pong buffers 14, read the mark value g (m-1 of corresponding lastrow 8 points n+7), n) ~ g (m-1, n+7).Often process 8 pixels only to need to read 1 time.Because there are three clock period, the read latency of the block storage of FPGA inside can be set to 2 clock period, does like this and can reduce critical path optimization process sequential in design.When the clock period 6 ', these data have just been read out;

2) when a with g (m-1, n) for address reads g (m-1 in correlativity table memory set 3, n) corresponding mark value T (g (m-1, n)), the same delay of reading correlativity table memory set 3 can be set to 2 clock period;

3) T (g (m-1 of the point of process points top when c, n) value) has been read out, and the T of left side point (g (m, n-1) be) that the last result processed can directly obtain, so just can compare this two values, little value is labeled as T0;

4) at the point (m that d, e, f tri-clock period obtain c, n) mark value is written in mark value ping-pong buffers 14 and DDR Write post 15, and the value of address g (m, n-1) and g (m-1, n) is T0 in update dependence table memory set 3;

5) when d, e, f tri-clock period update dependence table memory set 3, a of lower some process is reading correlativity table again, adds comparison mechanism and avoids address conflict.

This is the water operation of the third level in design.

The emphasis of first time its design of scanning element is in processing and control module, and according to above-mentioned algorithmic descriptions, the pixel of its process can be divided into 3 kinds of situations, 1) background dot; 2) fresh target point; 3) non-new impact point.As shown in Figure 5,1 be wherein background dot; 2 is new impact point, and 3,4,5 is non-new impact point.Then according to the need of the value of correlativity table reading consecutive point above, several situation shown in following table is divided into again:

According to previously described three kinds of situations and the value whether will reading reading process point top point of proximity from correlativity table memory set 3, the situation of process is always divided into 4 kinds of situations.Comprise 1,2,3 and 5a, 4 and 5b.So processing state machine uses 4 kinds of different treatment states to process all situations in processing and control module 13.The pipeline design mode according to Fig. 4 a, each treatment state continues 3 clock period, can ensure constant processing delay like this.

Four, plain scan unit 4 according to backward, sequentially, the order of backward carries out three scanning to image, can complete the fusion of mark value, the correlation information between mark value is written in correlativity table memory set simultaneously.

Be example with the point on (m, the n) position shown in Fig. 3, it realizes needs and completes following operation:

1) read (m, n), (m, n-1), the mark value g (m, n) of (m-1, n) three points, g (m, n-1), g (m-1, n), determine whether background dot or impact point;

2) with mark value g (m, n), g (m, and g (m-1 n-1), n) inside correlativity table, corresponding mark value is read as address, be expressed as T (g (m, n)), T (g (m, n-1)) and T (g (m-1, n));

3) compare the size of T (g (m, n)), T (g (m, n-1)) and T (g (m-1, n)), obtain its minimum value, be expressed as T0;

4) mark value of (m, n) is composed as T0;

5) T (g (m, n))=T0, T (g (m, n-1))=T0, T (g (m, n-1))=T0 is upgraded in correlativity table.

The operation of plain scan unit only has 5 steps, but the upper difficulty of its design is larger.Because need in the 2nd above-mentioned step to read value corresponding to 3 mark value from correlativity table memory set, need in the 4th step to write 3 values in correlativity table storer.According to the function of FPGA internal storage, a clock period can only carry out a read-write operation simultaneously, even if if do not carry out processing 2 and 4 parallel work-flows at least need 3 clock period yet, data compare 1 clock period of needs, read data and write result parallel work-flow and also need 1 clock period, this just at least needs 5 clock period.And plain scan needs execution 3 times, if according to the pipeline design of the fixed delay in first time scanning element, so processing speed can be slowly.

Use dynamic deferred pipelining to improve processing speed in the present invention.

The pipeline design in plain scan unit as shown in Figure 6.Wherein A represents read pending mark value from read buffer group 41, reads the value of correlativity table to be compared from correlativity table memory set 3; B represents that the value comparing correlativity table obtains the mark value of current process points; C represents the mark value of current process points is written in DDR Write post and mark value ping-pong buffers, the content of update dependence table.

The delay of streamline equals large one of retardation in the delay of A+B or the delay of C as can be known from Fig. 6.Be divided into four kinds of situations in following table:

As can be seen from the above table, when needing when only having when the adjacent impact point of process two and process to read the value of correlativity table from correlativity table memory set, its pipelining delay is 3 clock period, and other situations are all less than 3 clock period.Because when just having thousands of or up to ten thousand moving-targets at last in a two field picture, the quantity of impact point is also little relative to the point of whole image, just substantially increases treatment effeciency by above-mentioned dynamic deferred pipelining.

Processing node 100 ~ 103 in present embodiment can adopt FPGA(Field Programmable Gate Array, field programmable gate array), also ASIC(Application-Specific Integrated Circuit can be used, special IC) realize.

In present embodiment trial run, use the FPGA that XILINX company model is XC5VX95TI, processing clock under being set to the condition of 100MHz the maximum speed that can process to reach 400M pixel per second.After the process of bianry image by native system, the mark value that the bwlabel function of the mark value obtained and MATLAB obtains is completely the same, illustrates and reaches Expected Results.

Above-described specific descriptions; the object of inventing, technical scheme and beneficial effect are further described; be understood that; the foregoing is only specific embodiments of the invention; the protection domain be not intended to limit the present invention; within the spirit and principles in the present invention all, any amendment made, equivalent replacement, improvement etc., all should be included within protection scope of the present invention.

Claims

1. based on a real-time bianry image connected component labeling implementation method of FPGA, it is characterized in that: hardware platform uses four symmetrical treatment node compositions, adopts VPX framework (high speed serialization architecture for exchanging); The wherein XC5VX95TI-2 FPGA of four processing node 100 ~ 103 use XILINX companies; Storage unit 200 ~ 203 adopts 16 capacity to be 2G bit DDR2 particle, makes the memory capacity 32G bit of each processing unit; The Rocket IO of the interconnected employing 16 couples of 2.5GHz between processing unit, also has 4 pairs of difference control lines between processing unit in addition, can be used for communicating of control between processing unit and status signal; Wherein processing node is made up of first time scanning element (1), main controller module (2), correlativity table memory set (3), plain scan unit (4), DDR access arbitration module (5), DDR controller (6) and framing and data outputting module (7); Main controller module (2) is connected with data outputting module (7) with framing with first time scanning element (1), correlativity table memory set (3), plain scan unit (4) respectively, interactive controlling and status information; First time scanning element (1) is connected with correlativity table memory set (3) with the RAM access interface of plain scan unit (4) by standard, and write port is used for the content of update dependence table, and read port is for reading the value of correlativity table; First time scanning element (1), plain scan unit (4) and framing and data outputting module (7) all need to carry out data interaction with DDR, so these three modules are all connected with DDR access arbitration module (5) by self-defining high speed access interface; The arbitration mode that DDR access arbitration module (5) combines according to poll and self-defined priority parallel to be connected with DDR controller (6) to the request of access that the request of access of DDR is mapped as the DDR of serial multiple; The accessing time sequence that DDR controller (6) produces DDR according to request of access directly accesses outside DDR chip; Framing and data outputting module (7) export in mark value result embedding frame according to frame structure.

2. a kind of real-time bianry image connected component labeling implementation method based on FPGA as claimed in claim 1, is characterized in that: described first time scanning element (1) is made up of serioparallel exchange module (11), raw data ping-pong buffers (12), first time scanning element processing and control module (13), mark value ping-pong buffers (14) and DDR Write post (15); Main controller module (2) is made up of main control state machine (21) and some auxiliary logics; Correlativity table memory set (3) is by access handover module (31), impact damper A(32) and impact damper B(33) form; Plain scan unit (4) is made up of read buffer group (41), plain scan cell processing control module (42), DDR Write post (43) and mark value ping-pong buffers (44); DDR access arbitration module (5) cushions (53) by port selection mode machine (51), port and status register group (52), read data and Read-write Catrol state machine (54) is formed; DDR controller (6) is made up of initialization module (61), state of a control machine (62), data channel (63) and phase alignment module (64); Framing and data outputting module (7) cushion (72), framing steering logic (73) and output timing control module (74) by read states machine (71), read data and form.

3. a kind of real-time bianry image connected component labeling implementation method based on FPGA as claimed in claim 1, is characterized in that realization comprises the steps:

A. in first time scanning element (1), complete the initial markers of bianry image according to sequential system from left to right, from top to bottom, and the correlation information between mark value is written in correlativity table memory set (3);

a)serioparallel exchange is done to the binary image data of input, becomes the data of 8 bit widths, inner stored in raw data ping-pong buffers (12), the data at every turn reading 8 picture point are processed, can reading times be reduced;

B) determine whether impact point, then do corresponding process; Having three kinds of situations, if be not impact point, is so 0 the mark value assignment of current location point; If be new impact point, namely the point of its left side and top is not impact point, is so that the maximum mark value used adds 1 the mark value assignment of current location point, and will revises correlativity table; If be not new impact point, need the mark value reading neighbor, then with the mark value that these mark value are relevant for address reads from correlativity table memory set (3), be the minimum value of the mark value read the mark value assignment of this point, finally revise correlativity table;

C) mark value calculated is stored in mark value ping-pong buffers (14) and DDR Write post FIFO(15) in, the Read-write Catrol logic in first time scanning element (1) starts DDR write operation mark value stored in DDR according to the data volume in FIFO;

After first time scanning element (1) process initial markers completed to image and inside correlativity table memory set (3), incorporate the correlation data of mark value; Now main controller module (2) notice plain scan unit (4) starts scan operation next time, and notify that the inner access handover module (31) of correlativity table memory set (3) does access and switches, respond the read-write operation that plain scan unit (4) is initiated afterwards;

C. the fusion of mark is completed during plain scan unit (4) first time scanning according to backward mode from right to left, from top to bottom, and the information of update dependence table;

a)from DDR, mark-sense value is stored in read buffer group (41);

B) judge mark value, if not being this point of 0 expression is impact point; The process of its processing mode and first time scanning element (1) is completely different, first be the mark value reading current process points and the left side and the right point from read buffer group, then from correlativity table memory set (3), relevant mark value is read by these mark value as address, then the minimum value of mark value obtaining 3 points read is compared, by this minimum value as the mark value of current point and the value in update dependence table memory set (3);

C) the mark value write DDR Write post FIFO(43 calculated) in, be then deposited in DDR;

d.plain scan unit (4) notifies main controller module (2) after completing scanning, and main controller module (2) upgrades the parameter value unlatching scan operation next time of plain scan unit (4);

E. repeat C, D walks operation and completes scanning for four times image for twice;

F. after all scan operations complete, main controller module (2) notice framing and data outputting module (7) take out last mark value from the correspondence position of DDR, then generate according to the form of output frame the final data exported, last output timing control module (74) produces suitable sequential and result is sent.

4. a kind of real-time bianry image connected component labeling implementation method based on FPGA as claimed in claim 1, is characterized in that: realization employs superscalar pipelining line technology, be embodied in:

A. the mode of processing node 100 ~ 103 employing order flowing water works; The view data that front end enters is a treatment cycle according to 4 frames, and i-th ~ i+3 two field picture is assigned in FPGA_A ~ FPGA_D respectively, after these four processing nodes complete scanning, then is aggregated into FPGA_D; From the water operation totally achieving the first order;

B. in the realization of processing node, adopt the mode of two-stage pipeline processes, have two processing modules, comprise scanning element 1 and plain scan unit 4 for the first time, first one two field picture processes after entering processing node in first time scanning element, give plain scan unit after having processed and do subsequent treatment, now scanning element starts the image processing next frame for the first time; This is the water operation of the second level;

C. in processing procedure, read data needed to each pixel, has read correlativity table data, compared, update dependence table and write data five steps, this five steps, by pipeline mode process, makes the process of each pixel at most only need three clock period; This is the water operation of the third level.

5. a kind of real-time bianry image connected component labeling implementation method based on FPGA as claimed in claim 1, is characterized in that: in the realization of plain scan unit, use dynamic deferred pipelining; It is embodied in: according to the type of last process points and current process points type, process is divided into four kinds of situations, when needing when only having when the adjacent impact point of process two and process to read the value of correlativity table from correlativity table memory set (3), its pipelining delay is 3 clock period, and other situations are all less than 3 clock period.