CN100510836C

CN100510836C - Systolic array processing circuit for adaptive optics system wavefront control computation

Info

Publication number: CN100510836C
Application number: CNB2007100991061A
Authority: CN
Inventors: 郑文佳; 王春鸿; 姜文汉; 李梅; 唐端午
Original assignee: Institute of Optics and Electronics of CAS
Current assignee: Institute of Optics and Electronics of CAS
Priority date: 2007-05-11
Filing date: 2007-05-11
Publication date: 2009-07-08
Anticipated expiration: 2027-05-11
Also published as: CN101067681A

Abstract

The invention relates to a systolic array processing circuit that implements the wavefront control computation of an adaptive optics system. Multiple processing units are arranged into two linear arrays that perform the convolution and recursion operations respectively, and an addition processing unit links the two arrays to complete the wavefront control computation. The n voltages are computed serially in one array by time-division multiplexing, and adjacent processing units exchange data through shift registers, which avoids memory read and write operations. The arrays feature localized data communication and simple data and control flows, so they save resources and are convenient to implement in hardware.

Description

Systolic array processing circuit for adaptive optics system wavefront control computation

Technical field

The present invention relates to a systolic array processing circuit, and in particular to one suited to the wavefront control computation of the wavefront processor in an adaptive optics system.

Background art

In an adaptive optics system, effectively correcting the dynamic wavefront errors caused by atmospheric turbulence places very high demands on the computing power and real-time performance of the wavefront processor. A general-purpose microcomputer cannot meet these demands, so a dedicated high-speed wavefront processor must be developed according to the characteristics of adaptive optics wavefront processing. The workflow of the wavefront processor comprises five modules: image processing, slope calculation, wavefront reconstruction, control computation and D/A conversion. The wavefront control computation takes the wavefront error vector E output by the reconstruction module and, through an iterative control computation, obtains the control voltages required by the actuators of the tip-tilt mirror and the deformable mirror. The formula is:

V = \frac{b_{0} + b_{1} z^{-1} + b_{2} z^{-2} + b_{3} z^{-3}}{1 - a_{1} z^{-1} - a_{2} z^{-2} - a_{3} z^{-3}} E \qquad (1)

where b₀, b₁, b₂, b₃, a₁, a₂, a₃ are the control parameters of the system. For an n-element adaptive optics system with m effective sub-apertures, V and E are n × 1 vectors. Let e denote one component of the wavefront error vector E and v the corresponding component of the control voltage V; the time-domain formula for the control voltage of a single channel is then:

v_{i}^{k} = b_{0} e_{i}^{k} + b_{1} e_{i}^{k-1} + b_{2} e_{i}^{k-2} + b_{3} e_{i}^{k-3} + a_{1} v_{i}^{k-1} + a_{2} v_{i}^{k-2} + a_{3} v_{i}^{k-3} \qquad (2)

where e_i^k is the error value of channel i in frame k and v_i^k is the voltage value of channel i in frame k (1 ≤ i ≤ n). The control computation is thus iterative: it depends not only on the data of the current frame but also on the data and results of the three preceding frames.
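As a rough illustration of formula (2), the following Python sketch iterates the recursion frame by frame for n channels. The names `make_state` and `control_step`, the deque-based history layout and the sample coefficients are illustrative assumptions and do not come from the patent.

```python
from collections import deque


def make_state(n):
    """Hypothetical helper: zero-filled history of the last three frames."""
    zeros = [0.0] * n
    # Index 0 holds frame k-1, index 1 holds k-2, index 2 holds k-3.
    e_hist = deque([zeros[:] for _ in range(3)], maxlen=3)
    v_hist = deque([zeros[:] for _ in range(3)], maxlen=3)
    return e_hist, v_hist


def control_step(e_k, state, b, a):
    """Apply formula (2) to one frame of errors e_k (length n); return v_k.

    b = (b0, b1, b2, b3) and a = (a1, a2, a3) are the control parameters.
    """
    e_hist, v_hist = state
    v_k = []
    for i, e in enumerate(e_k):
        v = b[0] * e
        for j in range(3):
            v += b[j + 1] * e_hist[j][i]  # b_{j+1} * e_i^{k-(j+1)}
            v += a[j] * v_hist[j][i]      # a_{j+1} * v_i^{k-(j+1)}
        v_k.append(v)
    # The newest frame goes to the front; the oldest drops off the back.
    e_hist.appendleft(list(e_k))
    v_hist.appendleft(v_k)
    return v_k


# Example: a pure integrator (b0 = 1, a1 = 1) accumulates a constant error.
state = make_state(2)
frames = [control_step([1.0, 2.0], state, (1.0, 0.0, 0.0, 0.0), (1.0, 0.0, 0.0))
          for _ in range(3)]
```

With the integrator coefficients chosen here, each channel's voltage grows by its error value every frame, which is the behaviour formula (2) predicts when only b₀ and a₁ are non-zero.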

The iterative computation for these n channels can be executed in parallel by a multiprocessor system made up of several digital signal processors (DSPs). One example is the paper "A high-speed wavefront processor with a frame rate of 2900 Hz" by Wang Chunhong, published in Opto-Electronic Engineering in September 1998, which uses four TMS320C31 devices for the parallel control iteration. This approach has three drawbacks. Each DSP needs a large amount of control circuitry, so device integration is low and large-scale integration is difficult. The computation is essentially carried out in software, which limits its speed. Finally, the computation can start only after the reconstruction of a whole frame of data has finished, that is, reconstruction and control computation run serially, so the computation latency is large.

Summary of the invention

Technical problem solved by the invention: to overcome the deficiencies of the prior art and provide a systolic array processing circuit that implements the wavefront control computation of an adaptive optics wavefront processor with low computation latency, high integration and high speed.

Technical solution of the invention: a systolic array processing circuit for adaptive optics system wavefront control computation, characterized in that it consists of 7 multiply-accumulate processing units PE₁-PE₇, 1 addition processing unit PE₈ and 12 shift registers M₁-M₁₂. The 4 multiply-accumulate processing units PE₁-PE₄ are arranged linearly to form the convolution section, in which two data streams flow in opposite directions: the components of each frame's error vector E enter the array in sequence at the first multiply-accumulate processing unit PE₁, pass through the second, third and fourth multiply-accumulate processing units PE₂, PE₃ and PE₄, and then leave the array, while the convolution result enters the array at PE₄ with an initial value of 0 and leaves it after passing through PE₃, PE₂ and PE₁. The 3 multiply-accumulate processing units PE₅-PE₇ are arranged linearly to form the recursion section, which likewise carries two data streams in opposite directions: the recursion partial result enters the array at the seventh multiply-accumulate processing unit PE₇ with an initial value of 0, passes through the sixth and fifth units PE₆ and PE₅, and is added in the addition processing unit PE₈ to the output of the convolution section; the sum is output as the computed control voltage and is also fed back, passing through PE₅, PE₆ and PE₇ before leaving the array. The processing units are interconnected through shift registers, which transfer data between adjacent units: the data output port of one processing unit is connected to the data input of a shift register, and the data output of that shift register is connected to the data input of the next processing unit.

Principle of the invention: the circuit is divided into two parts, a convolution section and a recursion section, each a linear arrangement of multiply-accumulate processing units (four in the convolution section, three in the recursion section), which perform the convolution and the recursion of the adaptive optics wavefront control computation respectively. All processing units work synchronously, driven by the clock. An addition unit links the two arrays, adds the outputs of the two systolic arrays and outputs the voltage value.

As shown in Fig. 1, the array consists of 8 processing units PE₁-PE₈ and 12 shift registers M₁-M₁₂ of depth n/2 (or (n+1)/2 if n is odd). The array is divided into two parts, the convolution section 101 and the recursion section 102, which perform the convolution of formula (3) and the recursion of formula (4) below; an addition unit then links the two parts to realize the control computation of formula (5).

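y_{i} = b_{0} e_{i} + b_{1} e_{i-1} + b_{2} e_{i-2} + b_{3} e_{i-3} \qquad (3)

It then follows that

v_{i} = a_{1} v_{i-1} + a_{2} v_{i-2} + a_{3} v_{i-3} \qquad (4)

v_{i} = (a_{1} v_{i-1} + a_{2} v_{i-2} + a_{3} v_{i-3}) + (b_{0} e_{i} + b_{1} e_{i-1} + b_{2} e_{i-2} + b_{3} e_{i-3}) \qquad (5)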

PE₁-PE₇ are multiply-accumulate units with local storage; each consists of a register REG, a multiplier MUL and an adder ADD.

Fig. 2 is a structural diagram of the multiply-accumulate processing unit. Its ports are:

MUL_in, MUL_out: input and output ports for the error value or voltage value;

MAD_in, MAD_out: input and output ports for the accumulated value.

Logic function of the multiply-accumulate processing unit:

The multiplier multiplies the data arriving at the MUL_in port by the coefficient pre-stored in register REG. The adder adds the multiplier output to the data arriving at the MAD_in port, and the sum is output at the MAD_out port. The unit thus performs one multiply-accumulate operation.
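The logic of a single multiply-accumulate unit can be sketched behaviourally in Python as below. The class name `MacPE` and the `step` method are hypothetical; the model ignores clocking and word width, and it assumes the MUL_in value is passed unchanged to MUL_out, as the embodiment describes for these two pins.

```python
class MacPE:
    """Behavioural sketch of one multiply-accumulate processing unit."""

    def __init__(self, coeff):
        # REG: the control parameter stored before operation starts.
        self.reg = coeff

    def step(self, mul_in, mad_in):
        """Return (mul_out, mad_out) for one operation."""
        product = self.reg * mul_in  # MUL: input value times stored coefficient
        mad_out = product + mad_in   # ADD: product plus incoming accumulated value
        mul_out = mul_in             # MUL_in is wired straight through to MUL_out
        return mul_out, mad_out


# Example: coefficient 0.5, data value 4.0, incoming partial sum 1.0.
pe = MacPE(0.5)
mul_out, mad_out = pe.step(4.0, 1.0)
```

Here `mad_out` is 0.5 × 4.0 + 1.0 = 3.0, and `mul_out` carries the unchanged data value on to the next unit.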

PE₈ is an addition unit with feedback, shown in Fig. 3; it consists of a single adder ADD. Its ports are:

Y_in: input port for the result y_i of the convolution array 101;

MAD_in: input port for the accumulated value;

V_feedback: voltage feedback port;

V_out: external output port for the voltage value.

Logic function of the addition unit:

The adder adds the data arriving at ports Y_in and MAD_in and outputs the sum at both V_feedback and V_out. It thereby links the two parts of the systolic array, outputs the result externally and feeds it back to the right to take part in subsequent computation.
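A matching behavioural sketch of the addition unit is given below. The class name `AdderPE` and the tuple return order are assumptions made for illustration; timing is not modelled.

```python
class AdderPE:
    """Behavioural sketch of the addition unit PE8 (a single adder)."""

    def step(self, y_in, mad_in):
        """Add the convolution result and the recursion sum.

        Returns (v_out, v_feedback); both carry the same value.
        """
        v = y_in + mad_in
        return v, v


# Example: convolution result 1.5 plus recursion sum 2.0.
v_out, v_feedback = AdderPE().step(1.5, 2.0)
```

Returning the same value twice mirrors the description that one adder output drives both the external port and the feedback path.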

Compared with the prior art, the invention has the following advantages:

(1) Processing units transfer data to their neighbours through input-queue and output-queue registers. The multiplier of each unit takes the data at the head of the unit's input queue directly, and the adder places its result directly at the tail of the unit's output queue. Using queues in this way avoids memory reads and writes: input and output data that are not used in the current computation need not be stored in the unit's memory, so one array remains highly efficient when computing many voltage channels.

(2) When a systolic array computes the voltage of a single channel, half of its processing units are idle at any time. Two independent channel voltages can therefore be computed at the same time in one systolic array by sharing the processing units, which raises processing-unit utilization to 100% and saves hardware resources. For this reason the depth of the input and output queues between processing units is n/2 (or (n+1)/2 if n is odd).

(3) Each PE has a simple structure and only local data communication, which makes it easy to implement in hardware.

(4) The circuit runs in parallel with the reconstruction module, so the computation latency is a single pipeline beat and real-time performance is good.

(5) The invention maximizes the utilization of each processing unit, which helps make the device smaller and lower in power consumption.

The workflow and operating characteristics of the wavefront processor are as follows. The CCD outputs data line by line in pixel order, and each frame of data passes through the five modules of image processing, slope calculation, wavefront reconstruction, control computation and D/A conversion before the voltages are finally output to drive the deformable mirror. The interval between the input of the error vector E^{k-1} of frame k-1 and the input of the error vector E^k of frame k is therefore long: it is the sum of the CCD camera's one-frame pixel readout delay, the slope calculation delay and the wavefront reconstruction delay. The input wavefront error vector E of the control computation, by contrast, is the output of the wavefront reconstruction module, and its error components ... are output in sequence at short intervals. This guarantees that the n voltage channels can be computed one after another before the error vector of the next frame enters the array. Based on these characteristics, the invention organizes a suitable data flow and designs the corresponding circuit: a single systolic array computes the n independent control voltages serially by time-division multiplexing, which raises the utilization of the array's processing units, reduces their number effectively and lowers resource usage.

Description of drawings

Fig. 1 is a block diagram of the invention;

Fig. 2 is a structural diagram of the multiply-accumulate processing unit PE of the invention;

Fig. 3 is a structural diagram of the addition unit of the invention.

Embodiment

The embodiment is described in detail below with reference to Figs. 1 to 3.

As shown in Fig. 1, the invention consists of 7 multiply-accumulate processing units PE₁-PE₇, one addition processing unit PE₈, and 12 shift registers M₁-M₁₂ of depth n/2 (or (n+1)/2 if n is odd). Processing units PE₁-PE₄ with shift registers M₁-M₆, and processing units PE₅-PE₈ with shift registers M₇-M₁₂, are arranged as two linear arrays. Ports of adjacent processing units are interconnected through shift registers: pin MUL_out of the unit on the left is connected to the data input port of a shift register, whose data output port is connected to pin MUL_in of the unit on the right; pin MAD_in of the unit on the left is connected to the data output port of a shift register, whose data input port is connected to pin MAD_out of the unit on the right. Pin MUL_in of processing unit PE₁ is connected to the error data input port E of the array. For processing unit PE₈, pin Y_in is connected to pin MAD_out of processing unit PE₁; pin V_out is connected to the voltage output port V of the array; pin V_feedback is connected to the input port of shift register M₇, whose output port is connected to pin MUL_in of processing unit PE₅; and pin MAD_in is connected to the output port of shift register M₁₀, whose input port is connected to pin MAD_out of processing unit PE₅.
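The shift registers described above can be modelled as fixed-depth queues. The sketch below is a minimal Python illustration; the names `register_depth` and `ShiftRegister` are hypothetical, and the deque-based model says nothing about how the registers are realized in hardware.

```python
from collections import deque


def register_depth(n):
    """Depth of each shift register: n/2 for even n, (n+1)/2 for odd n."""
    return (n + 1) // 2


class ShiftRegister:
    """Fixed-depth queue linking two neighbouring processing units."""

    def __init__(self, depth):
        # Every storage cell starts at zero, as in the initialization step.
        self.cells = deque([0.0] * depth, maxlen=depth)

    def shift(self, value):
        """Push a value in at one end and return the value leaving the other end."""
        out = self.cells[-1]
        self.cells.appendleft(value)  # a full deque drops its last cell
        return out


# Example: a depth-2 register delays each value by two shifts.
sr = ShiftRegister(register_depth(4))
outputs = [sr.shift(x) for x in (1.0, 2.0, 3.0)]
```

A value pushed into a register of depth d reappears at the output d shifts later, which is what lets n channels share one set of processing units in turn.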

As shown in Figs. 1 and 2, each multiply-accumulate processing unit PE₁-PE₇ consists of a register REG, a multiplier MUL and an adder ADD. Register REG is connected to one input of multiplier MUL, and the other input of MUL is connected to pin MUL_in of the unit. The output of MUL is connected to one input of adder ADD, whose other input is connected to pin MAD_in and whose output is connected to pin MAD_out. Pin MUL_in of the unit is connected directly to pin MUL_out.

As shown in Figs. 1 and 3, the addition processing unit PE₈ consists of one adder ADD. One input of ADD is connected to pin Y_in of PE₈, the other input to pin MAD_in, and the output to pins V_out and V_feedback.

The operation of the circuit is described below with reference to Fig. 1:

(1) Before the circuit starts, the control parameters b₀, b₁, b₂, b₃, a₁, a₂, a₃ are pre-stored in the registers REG of PE₁-PE₇ respectively, and every storage cell of every shift register is initialized to zero.

(2) The n components of each error vector are input to the array in sequence. The circuit starts when an error value e_i arrives at port MUL_in of PE₁; this is the zero point of the beat count. Processing units PE₁-PE₇ work synchronously, driven by the clock. In clock beat 1, the multiplier takes from port MUL_in the data in the rightmost cell of shift registers M₁-M₆ and multiplies it by the coefficient pre-stored in register REG. In clock beat 2, the adder takes from port MAD_in the data in the leftmost cell of shift registers M₇-M₁₂ and adds it to the product; the sum is output at port MAD_out. In clock beat 3, the adder of PE₈ adds the MAD_out output of PE₁, taken from port Y_in, to the data in the leftmost cell of M₁₀, taken from port MAD_in; the sum is output at ports V_out and V_feedback. In clock beat 4, the data in shift registers M₁-M₆ move one cell from left to right and the data in shift registers M₇-M₁₂ move one cell from right to left. Each processing unit has then completed one operation, and v_i is obtained at the output port V_out of the array. This sequence is called one pipeline beat.

(3) Each time an error value enters the array, the array runs for one pipeline beat: the data move one cell to the left or right, and the voltage value of that channel is obtained. Every processing unit and shift register repeats the same operations in each pipeline beat. In this way, in the convolution section 101 the error values flow from left to right while the convolution results flow from right to left. In the recursion section 102 the voltage accumulation, with initial value zero, flows from right to left; after the recursion is complete it is added in PE₈ to the result of the convolution section, and the resulting voltage value is output.
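The direction in which the partial sums travel can be illustrated with a purely functional Python sketch for one channel and one output sample. It is not cycle-accurate: shift-register timing and the interleaving of channels are ignored, and the function name `array_output` and its argument layout are assumptions made for illustration.

```python
def array_output(e_taps, v_taps, b, a):
    """Functional sketch of one output v_i of the two-section array.

    e_taps = (e_i, e_{i-1}, e_{i-2}, e_{i-3}) are the values seen by PE1..PE4.
    v_taps = (v_{i-1}, v_{i-2}, v_{i-3}) are the values seen by PE5..PE7.
    b = (b0, b1, b2, b3) and a = (a1, a2, a3) are stored in PE1..PE7.
    """
    # Convolution section 101: the partial sum enters PE4 as 0 and moves
    # PE4 -> PE3 -> PE2 -> PE1, gaining one product at each unit.
    y = 0.0
    for coeff, e in zip(reversed(b), reversed(e_taps)):
        y = coeff * e + y
    # Recursion section 102: the partial sum enters PE7 as 0 and moves
    # PE7 -> PE6 -> PE5 in the same way.
    r = 0.0
    for coeff, v in zip(reversed(a), reversed(v_taps)):
        r = coeff * v + r
    # PE8 adds the two section outputs; the sum is output and fed back.
    return y + r


# Example with arbitrary sample values.
v_i = array_output((1.0, 2.0, 3.0, 4.0), (5.0, 6.0, 7.0),
                   (1.0, 0.5, 0.25, 0.125), (0.5, 0.25, 0.125))
```

In this example the convolution section contributes 3.25 and the recursion section 4.875, so the addition unit outputs 8.125.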

Claims

1. A systolic array processing circuit for implementing the wavefront control computation of an adaptive optics system, characterized in that: it consists of 7 multiply-accumulate processing units PE₁-PE₇, 1 addition processing unit PE₈ and 12 shift registers M₁-M₁₂; the 4 multiply-accumulate processing units PE₁-PE₄ are arranged linearly to form a convolution section in which two data streams flow in opposite directions, the components of each frame's error vector E entering the array in sequence at the first multiply-accumulate processing unit PE₁, passing through the second, third and fourth multiply-accumulate processing units PE₂, PE₃ and PE₄ and then leaving the array, and the convolution result entering the array at the fourth multiply-accumulate processing unit PE₄ with an initial value of 0 and leaving the array after passing through PE₃, PE₂ and PE₁; the 3 multiply-accumulate processing units PE₅-PE₇ are arranged linearly to form a recursion section in which two data streams flow in opposite directions, the recursion partial result entering the array at the seventh multiply-accumulate processing unit PE₇ with an initial value of 0, passing through the sixth and fifth multiply-accumulate processing units PE₆ and PE₅, and being added in the addition processing unit PE₈ to the output of the convolution section, the sum being output as the computed control voltage and also fed back, passing through PE₅, PE₆ and PE₇ before leaving the array; the processing units are interconnected through shift registers, which transfer data between adjacent units, the data output port of one processing unit being connected to the data input of a shift register and the data output of that shift register being connected to the data input of the next processing unit.

2. The systolic array processing circuit for implementing the wavefront control computation of an adaptive optics system according to claim 1, characterized in that: each of the multiply-accumulate processing units PE₁-PE₇ consists of a register REG, a multiplier MUL and an adder ADD; register REG is connected to one input of multiplier MUL, the other input of multiplier MUL is connected to pin MUL_in of the multiply-accumulate processing unit, the output of multiplier MUL is connected to one input of adder ADD, the other input of adder ADD is connected to pin MAD_in of the multiply-accumulate processing unit, the output of adder ADD is connected to pin MAD_out of the multiply-accumulate processing unit, and pin MUL_in of the multiply-accumulate processing unit is connected to pin MUL_out.

3. The systolic array processing circuit for implementing the wavefront control computation of an adaptive optics system according to claim 1, characterized in that: the addition processing unit PE₈ consists of one adder ADD; one input of adder ADD is connected to pin Y_in of the addition processing unit PE₈, the other input is connected to pin MAD_in of the addition processing unit PE₈, and the output of adder ADD is connected to pins V_out and V_feedback of the addition processing unit PE₈.

4. The systolic array processing circuit for implementing the wavefront control computation of an adaptive optics system according to claim 1, characterized in that: when n is even, the depth of the 12 shift registers M₁-M₁₂ is n/2; when n is odd, the depth of the 12 shift registers M₁-M₁₂ is (n+1)/2.