CN102651797A - Novel array video signal processing unit structure - Google Patents

Novel array video signal processing unit structure Download PDF

Info

Publication number
CN102651797A
CN102651797A CN2011100465378A CN201110046537A CN102651797A CN 102651797 A CN102651797 A CN 102651797A CN 2011100465378 A CN2011100465378 A CN 2011100465378A CN 201110046537 A CN201110046537 A CN 201110046537A CN 102651797 A CN102651797 A CN 102651797A
Authority
CN
China
Prior art keywords
operator
arithmetic
bunch
compute cluster
level
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2011100465378A
Other languages
Chinese (zh)
Inventor
李涛
蒋林
刘镇弢
韩俊刚
杜慧敏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xi'an Post & Telecommunication College
Xian University of Posts and Telecommunications
Original Assignee
Xi'an Post & Telecommunication College
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xi'an Post & Telecommunication College filed Critical Xi'an Post & Telecommunication College
Priority to CN2011100465378A priority Critical patent/CN102651797A/en
Publication of CN102651797A publication Critical patent/CN102651797A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention relates to a novel structure of a processing unit in an array video signal processor, which comprises an arithmetic operator, two input selectors, two input registers, a fan-out controller and four configurable buffers. The a base arithmetic unit can be used for implementing an arithmetic operation, a comparison operation, a shifting operation, a conversion operation, a logic operation and other special operations. The operations can be combined into operators and the arithmetic and logic operators can be used for implementing large scale parallel computing by programming. Each operator can be used for carrying out simple operations; the operators can be further combined into a computing cluster; and the operators in the same computing cluster can implement more complex functions by cascading. The processor not only has the advantages of high performance, low power consumption and low cost of an ASIC (Application Specific Integrated Circuit), but also has the advantages of flexible design and development and short development period of a programmable DSP (Digital Signal Processor).

Description

A kind of novel array video signal processing unit structure
1. technical field
The present invention relates to the structure of processing unit in a kind of novel array video signal processor.Invention is used the active demand to high-performance, programmable digital signal processor to vision signal processing and 3-D view processing etc., based on large-scale parallel place column processor, has proposed a kind of novel processing unit architecture.Belong to the IC design field.
2. background technology
The number of arrays word signal processor that the present invention relates to can be widely used in the high-performance video signal processing field.Owing to have high-performance and restructural, programmability simultaneously; So can adapt to the coding and decoding video algorithm of various main flows such as MPEG-2/MPEG-4/H.263/Divx/H.264/AVS/VC-1/RV/MJPEG; Can also be widely used in various video pre-treatments and reprocessing occasion, for electronic multimedia products such as high-performance video converter and server, high-end DVD, DTV provide powerful video/digital signal processing capability.The more important thing is that the number of arrays word signal processor not only has low cost, low-power consumption and high performance advantage also have great adaptability, can realize following potential new video/digital signal processing algorithm.So application prospect is very wide.
The present invention handles towards multimedia and uses; The architecture of the ARRAY PROCESSING unit of a kind of dynamic reconfigurable, easy programming is proposed; Both had ASIC high-performance, low-power consumption, advantage cheaply, had the advantage that Programmable DSPs is designed and developed flexibly, the construction cycle is short again
The present invention can be widely used in consumer electronics market, comprises that high performance digital signal is handled and field programmable logic is used.Be particularly useful for low side and high-end audio/video consumer device.The characteristics of low-power consumption, low cost and high reliability make the present invention have the very strong market competitiveness.
3. summary of the invention
To the effect that of the present invention: a kind of novel AP basic processing unit structure that has proposed is as shown in Figure 1, comprises an arithmetic operator, two input selectors, two input registers, a fan-out controller and four configurable buffers.In the structure shown in Figure 1, can realize reconstruct and programming to arithmetic element through changing configuration register.
Basic processing unit can be realized arithmetical operation operation, compare operation, shifting function, conversion operations, logical operation and other dedicated operations.These operations can be combined into operator, and the arithmetic sum logical operator can realize that large-scale parallel calculates through programming.Each operator can be carried out some shirtsleeve operations, the operation that we have selected suitable video reprocessing and other Digital Signal Processing to use meticulously.
Operator can further be formed compute cluster, and is as shown in Figure 2.Operator in the same compute cluster can realize more complicated function through cascade.What operator was formed bunch interconnects through switching network.A plurality of bunches can constitute the AP of dynamic reconfigurable with switching network.
The basic structure of each processing unit is as shown in Figure 3 in the AP, wherein:
● ALU (ALU, Arithmetic Logic Unit): can carry out the arithmetic sum logical operation as: add, subtract, multiplication and division, comparison, displacement, with or, non-etc..
● local program memory: be used for storing a spot of instruction;
● register: preliminary consideration 8 local registers of design and 4 pairs of registers that are used for port interconnection.
Each basic compute cluster comprises 9 arithmetic elements, and 8 such base varieties and some storage bank connect to form the compute cluster of low level through localized network, and are as shown in Figure 4.Typical storage bank is the 8-12 bit wide, and the 256-2048 position is dark.The compute cluster of low level can constitute high-level compute cluster through interconnection.Compute cluster is a hierarchical structure, and multilayer can be arranged.Equally, also can realize the reconstruct and the programming of compute cluster level in the structure shown in Figure 4 through the interference networks in the configuration bunch.
4. description of drawings
Fig. 1 is the basic processing unit architecture.
Fig. 2 is the architecture that basic processing unit constitutes compute cluster.
Fig. 3 is the architecture of processing unit in the AP.
Fig. 4 is the architecture that processing unit is formed compute cluster.
Fig. 5 is the AP example architecture.
5. embodiment
As everyone knows, the application-specific integrated circuit (ASIC) (ASIC) that designs for the particular video frequency processing capacity has very high performance, generally is dozens or even hundreds of times of Programmable DSPs performance.For this reason, we have anatomized the characteristics that ASIC realizes, recognize: used operation types is limited in the computing, and a few operation is used times without number; The fan-out of operation is fewer often; Through the cascade and the combination of operation, can effectively improve the utilance of resource; Local interlinkage between the unit is effectively, and adopting the large-scale switching network is to waste very much resource.
Based on above-mentioned cognition, we have designed array processor structure as shown in Figure 5.In this structure, a plurality of low levels shown in Figure 4 bunch have constituted high-level bunch through network interconnection.High-level bunch can constitute higher level compute cluster through local interlinkage, thereby forms stratification Structure Calculation bunch.
The AP that we propose is very similar with static data flow processor structure, is the stream handle of a static state on microcosmic, is a dynamic stream handle on macroscopic view.But,, overcome the not high deficiency of conventional flow processor efficient through adopting reconfigurable hardwired and reconfigurable arithmetic unit.

Claims (3)

1. a kind of novel AP basic processing unit structure that has proposed is as shown in Figure 1, comprises an arithmetic operator, two input selectors, two input registers, a fan-out controller and four configurable buffers.In the structure shown in Figure 1, realize reconstruct and programming to arithmetic element through changing configuration register.Basic processing unit can be realized arithmetical operation operation, compare operation, shifting function, conversion operations, logical operation and other dedicated operations.The operation of having selected suitable video reprocessing and other Digital Signal Processing to use meticulously.These operational group are synthesized operator, and the arithmetic sum logical operator realizes that through programming large-scale parallel calculates.
2. operator is combined into compute cluster as shown in Figure 2.Can realize more complicated function with the operator in the cluster through cascade.What operator was formed bunch interconnects through switching network.In this structure, a plurality of low levels bunch have constituted high-level bunch through network interconnection.High-level bunch constitutes higher level compute cluster through local interlinkage, thereby forms stratification Structure Calculation bunch.Multi-level compute cluster and switching network have constituted the AP of dynamic reconfigurable.
3. each basic compute cluster comprises 9 arithmetic elements, and 8 such base varieties and some storage bank connect to form the compute cluster of low level through localized network, and are as shown in Figure 4.Typical storage bank is the 8-12 bit wide, and the 256-2048 position is dark.Also can realize the reconstruct and the programming of compute cluster level in the structure shown in Figure 4 through the interference networks in the configuration bunch.
CN2011100465378A 2011-02-25 2011-02-25 Novel array video signal processing unit structure Pending CN102651797A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2011100465378A CN102651797A (en) 2011-02-25 2011-02-25 Novel array video signal processing unit structure

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2011100465378A CN102651797A (en) 2011-02-25 2011-02-25 Novel array video signal processing unit structure

Publications (1)

Publication Number Publication Date
CN102651797A true CN102651797A (en) 2012-08-29

Family

ID=46693685

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2011100465378A Pending CN102651797A (en) 2011-02-25 2011-02-25 Novel array video signal processing unit structure

Country Status (1)

Country Link
CN (1) CN102651797A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106502923A (en) * 2016-09-30 2017-03-15 西安邮电大学 In array processor, cluster memory storage accesses ranks two-stage switched circuit

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5633808A (en) * 1992-08-13 1997-05-27 Yamaha Corporation Digital signal processor
CN1198630A (en) * 1997-05-06 1998-11-11 松下电器产业株式会社 Image signal processing device
US6052705A (en) * 1995-08-30 2000-04-18 Sony Corporation Video signal processor with triple port memory
CN1836224A (en) * 2003-08-15 2006-09-20 皇家飞利浦电子股份有限公司 Parallel processing array

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5633808A (en) * 1992-08-13 1997-05-27 Yamaha Corporation Digital signal processor
US6052705A (en) * 1995-08-30 2000-04-18 Sony Corporation Video signal processor with triple port memory
CN1198630A (en) * 1997-05-06 1998-11-11 松下电器产业株式会社 Image signal processing device
CN1836224A (en) * 2003-08-15 2006-09-20 皇家飞利浦电子股份有限公司 Parallel processing array

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106502923A (en) * 2016-09-30 2017-03-15 西安邮电大学 In array processor, cluster memory storage accesses ranks two-stage switched circuit
CN106502923B (en) * 2016-09-30 2018-08-24 西安邮电大学 Storage accesses ranks two-stage switched circuit in cluster in array processor

Similar Documents

Publication Publication Date Title
Abbo et al. Xetal-II: a 107 GOPS, 600 mW massively parallel processor for video scene analysis
CA3034597C (en) A homomorphic processing unit (hpu) for accelerating secure computations under homomorphic encryption
Kaul et al. A 320 mv 56 μw 411 gops/watt ultra-low voltage motion estimation accelerator in 65 nm cmos
US8468335B2 (en) Reconfigurable system having plurality of basic function units with each unit having a plurality of multiplexers and other logics for performing at least one of a logic operation or arithmetic operation
Augustine et al. Low-power functionality enhanced computation architecture using spin-based devices
Albaharna et al. On the viability of FPGA-based integrated coprocessors
CN107977191B (en) Low-power-consumption parallel multiplier
US20040019765A1 (en) Pipelined reconfigurable dynamic instruction set processor
US9507753B2 (en) Coarse-grained reconfigurable array based on a static router
US8572151B2 (en) Methods and apparatuses for cordic processing
Venkatesan et al. Spintastic: Spin-based stochastic logic for energy-efficient computing
Kaul et al. A 300 mV 494GOPS/W reconfigurable dual-supply 4-way SIMD vector processing accelerator in 45 nm CMOS
CN101620587B (en) Flexible reconfigurable task processing unit structure
CN111047034B (en) On-site programmable neural network array based on multiplier-adder unit
Lanuzza et al. A new reconfigurable coarse-grain architecture for multimedia applications
CN101031904A (en) Programmable processor system with two kinds of subprocessor to execute multimedia application
CN102651797A (en) Novel array video signal processing unit structure
CN111752529B (en) Programmable logic unit structure supporting efficient multiply-accumulate operation
US20190235863A1 (en) Sort instructions for reconfigurable computing cores
US20080225939A1 (en) Multifunctional video encoding circuit system
CN102298568A (en) Method and device for switching configuration information of dynamic reconfigurable array
CN103390070A (en) Reconfigurable operator array structure
Lin et al. FDR 2.0: A low-power dynamically reconfigurable architecture and its FinFET implementation
Nyländen et al. FPGA based application specific processing for sensor nodes
Penny et al. Power-efficient and memory-aware approximate hardware design for hevc fme interpolator

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C53 Correction of patent of invention or patent application
CB02 Change of applicant information

Address after: 710121 Shaanxi city of Xi'an province Changan District Wei Guolu Xi'an University of Posts and Telecommunications

Applicant after: Xi'an University of Posts & Telecommunications

Address before: 710121 Shaanxi city of Xi'an province Changan District Wei Guolu Xi'an University of Posts and Telecommunications

Applicant before: Xi'an Post & Telecommunication College

COR Change of bibliographic data

Free format text: CORRECT: APPLICANT; FROM: XI AN POST + TELECOMMUNICATION COLLEGE TO: XI AN UNIVERSITY OF POSTS + TELECOMMUNICATIONS

C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20120829