JPS62137668A

JPS62137668A - Picture signal processor using convolution

Info

Publication number: JPS62137668A
Application number: JP27703885A
Authority: JP
Inventors: Mitsuo Kurakake; 鞍掛　三津雄; Shoichi Otsuka; 大塚　昭一
Original assignee: Fanuc Corp
Current assignee: Fanuc Corp
Priority date: 1985-12-11
Filing date: 1985-12-11
Publication date: 1987-06-20

Abstract

PURPOSE:To find convolution in each picture element data at a relative high speed with a few multipliers by performing the multiplication of a corresponding picture element data shifting it in order against a load coefficient of one line, and estimating a result. CONSTITUTION:One line share out of the load coefficients of N-number of lines and N-number of rows in a coefficient memory 41 is set at N-number of registers 421, 422,..., and also, the picture element data of a frame memory 2 corresponding to a set load coefficient is inputted in order to the first shift register 3, and input controls for all of the lines in the coefficient memory 41 are performed. And the multiplication between the value of the first register 3 and the values of the N-number of registers are found at N-number of multipliers 51, 52,..., and outputs are added at an adder 71. The convolution can be obtained by repeating an operation that the output of the adder 71 is inputted to the second shift register 73, and the output of the shift register 73 and the output of the adder 71 are added with an adder register 72, and returned again to the shift register 73.

Description

【発明の詳細な説明】〔産業上の利用分野〕本発明はコンボリューションを用いる画像信号処理装置
に関する。本発明による装置は例えば産業用ロボットに
おける対象物を識別するビジュアルセンサ等に関連して
用いられる。DETAILED DESCRIPTION OF THE INVENTION [Field of Industrial Application] The present invention relates to an image signal processing device using convolution. The device according to the invention is used, for example, in connection with visual sensors for identifying objects in industrial robots.

[Prior art and problems to be solved by the invention]

ＣＣＤカメラのような撮像装置（イメージセンサ）によ
って撮像された画像出力は画像信号としてディジタル情
報に変換される。この画像の鮮鋭化のため、線２輪郭を
抽出する必要があり、画像信号処理の対象となる１フレ
一ム分の画素データの各々に対してコンボリューション
が施される。An image output captured by an imaging device (image sensor) such as a CCD camera is converted into digital information as an image signal. In order to sharpen this image, it is necessary to extract the line 2 contour, and convolution is performed on each pixel data for one frame that is the target of image signal processing.

従来はコンボリューション演算を行う際に乗算器を複数
用いて演算し、積算又は加算するから、メモリの記憶容
量が大となり画像処理装置の容量・寸法が増大し、価格
の上昇が避けられない。Conventionally, when performing a convolution operation, a plurality of multipliers are used to perform the operation and the integration or addition is performed, which increases the storage capacity of the memory, increases the capacity and size of the image processing device, and inevitably increases the price.

従来技術を第３図、第４図を用いて説明する。The prior art will be explained using FIGS. 3 and 4.

先づ３行×３列のコンボリューションは第３図上部に示
される。処理対象画素データＦ（ｉ、ｊ）と荷重係数Ｗ
（ｉ、ｊ）が図の如く配列されている場合、中央部の画
素データＦ（２，２）のコンボリューション演算結果Ｇ
（２，２）は次式にて示される。The first three rows by three columns of convolutions are shown at the top of FIG. Processing target pixel data F (i, j) and weight coefficient W
When (i, j) are arranged as shown in the figure, the convolution calculation result G of the central pixel data F (2, 2)
(2,2) is expressed by the following formula.

葭で説明の便宜上匈（１，１）　Ｘ　Ｆ（１，１）　＋Ｗ（２，１）　Ｘ
　Ｆ（２，１）十讐（３，１）ｘＦ（３，１）　　を第
１コラム演算、Ｗ（１，２）　ＸＦ（１，２）＋Ｋ（２
，２）　ｘ　Ｆ（２，２）＋Ｋ（３，２）ｘＦ（３，２
）　　を第２コラム演算、Ｗ（１，３）　ｘＦ（１，３
）　＋Ｗ（２，３）　ｘＦ（２，３）　＋Ｗ（３，３）
ＸＦ（３，３）　　を第３コラム演算と呼称する。For convenience of explanation, we use 匈 (1, 1) X F (1, 1) + W (2, 1) X
F (2, 1) ten (3, 1) x F (3, 1) is calculated in the first column, W (1, 2) XF (1, 2) + K (2
,2) x F(2,2)+K(3,2)xF(3,2
) is the second column operation, W(1,3) xF(1,3
) +W(2,3) xF(2,3) +W(3,3)
XF(3,3) is called the third column operation.

従来、上記のコンボリューションによる演算は第３図、
第４図に示す装置で実行されている。即ち第３図は一般
的な演算に使用するもので１つの乗算器と一つの加算器
とを使用する。乗算器の第１入力にＦ（ｉ、ｊ）を入力
し、第２入力にＷ（ｉ、ｊ）を順次入力し、Ｆ　（ＩＩ
Ｊ）　ｘＷ（ｉ、ｊ）を求め、３行３列のコンボリュー
ションの場合、合計９個の結果を加算器で積算するもの
である。積算した結果、特徴抽出を３行３列の素子の中
央部Ｆ　（２，２）で把握しようとすればそのコンボリ
ューション結果Ｇ（２，２）が容易に得られる。Conventionally, the calculation using the above convolution is shown in Figure 3.
The system is executed by the device shown in FIG. That is, FIG. 3 is used for general calculations, and uses one multiplier and one adder. F(i, j) is input to the first input of the multiplier, W(i, j) is input to the second input in sequence, and F (II
J) xW(i, j) is calculated, and in the case of a 3-row, 3-column convolution, a total of 9 results are integrated using an adder. As a result of integration, if the feature extraction is to be grasped at the central part F (2, 2) of the element arranged in 3 rows and 3 columns, the convolution result G (2, 2) can be easily obtained.

この第３図の装置では構成部品が少いことは利点として
考えられるが、必要とする一つの画素データのコンポリ
ニージョン演算の結果を求めるまでに長い時間を必要と
し、例えば２５６　Ｘ　２５６画素という多数の画素の
コンボリューション演算を求めるには適さず、またメモ
リの記憶容量も膨大になるという問題点がある。The device shown in Fig. 3 has a small number of components, which can be considered an advantage, but it takes a long time to obtain the result of the composite operation for one pixel data, for example, 256 x 256 pixels. This method has the problem that it is not suitable for calculating convolution operations for a large number of pixels, and that the storage capacity of the memory becomes enormous.

第４図の例では合計９個の乗算器Ｍｌｌ〜Ｍ１９と１個
の加算器Ａｌｌとを設けて、夫々の乗算器の第１入力に
Ｆ（ｉ、ｊ）を入力し、第２入力に夫々異なったＷ（ｉ
、ｊ）を入力して並列処理演算を行い、その各結果を加
算器で加算するものである。In the example of FIG. 4, a total of nine multipliers Mll to M19 and one adder All are provided, F(i, j) is input to the first input of each multiplier, and F(i, j) is input to the second input of each multiplier. Different W(i
, j) are input, parallel processing operations are performed, and the results are added together using an adder.

第４図の装置によれば大約１／９の速度で一つの画素デ
ータのコンボリューション演算を求めることができるが
、この乗算器は、大型でかつ高価なものであって、この
ような乗算器を９個も使用する第４図の装置は一般的に
コスト高となり、しかも大型化するという問題点がある
。According to the apparatus shown in FIG. 4, the convolution operation of one pixel data can be obtained at approximately 1/9 the speed, but this multiplier is large and expensive, and such a multiplier is The apparatus shown in FIG. 4, which uses as many as nine of these, generally has problems in that it is expensive and large in size.

本発明の目的は、フレームメモリに記憶された複数個の
処理対象画素データの各々のコンボリューションを少な
い乗算器を用いて比較的高速にて求め、更にコンボリュ
ーションに必要な荷重係数の種類を少なくするという構
想にもとづき、乗算器用ＲＡＭの記憶容量が節減された
画像信号処理装置を提供することにある。An object of the present invention is to obtain convolution of each of a plurality of pixel data to be processed stored in a frame memory at a relatively high speed using a small number of multipliers, and to further reduce the types of weighting coefficients required for convolution. Based on this idea, it is an object of the present invention to provide an image signal processing device in which the storage capacity of a multiplier RAM is reduced.

〔問題点を解決するための手段、および作用〕本発明に
おいては、イメージセンサの画像信号をディジタル変換
して格納するフレームメモリと、Ｎ行Ｎ列の荷重係数を
予め記憶する係数メモリとを有し、該フレームメモリに
記憶された複数の処理対象画素データの各々のコンボリ
ューションを該係数メモリの荷重係数を用いて行う画像
信号処理装置において、Ｎ個の段数を有する第１のシフ
トレジスタ、Ｎ個の荷重係数がセットされたＮ個のレジ
スタ、前記第１のシフトレジスタの各段の出力に対応す
る前記レジスタの出力を乗算するＮ個のＲＡＭ乗算器、
該Ｎ個の乗算器出力を加算する加算器、前記フレームメ
モリの行方向の処理対象画素データ数に等しい段数を有
する第２のシフトレジスタ、前記加算器の出力と前記第
２のシフトレジスタの出力とを加算記憶し、その加算結
果を該第２のシフトレジスタに加算する加算レジスタ、
を具備し、前記Ｎ個のレジスタへ係数メモリから１行分
の荷重係数がセットされ、該荷重係数に対応するフレー
ムメモリの画素データが荷重係数のすべての行に対して
前記第１のシフトレジスタへ順次入力され、荷重係数の
最後の行について入力が行われるまでに加算レジスタ出
力データを用いて１行分の処理対象画素データの各々の
コンボリューションが行われ、対象マトリクス形式によ
り等しい荷重係数の乗算結果を重複してもたないように
マトリクス・コンボリューションを行い得ることを特徴
とする画像信号処理装置、が提供される。[Means and effects for solving the problem] The present invention includes a frame memory that digitally converts and stores an image signal of an image sensor, and a coefficient memory that stores N rows and N columns of weight coefficients in advance. and an image signal processing device that performs convolution of each of a plurality of pixel data to be processed stored in the frame memory using weighting coefficients of the coefficient memory, a first shift register having N stages; N registers in which N weight coefficients are set; N RAM multipliers for multiplying the output of the register corresponding to the output of each stage of the first shift register;
an adder that adds the outputs of the N multipliers; a second shift register having a number of stages equal to the number of pixel data to be processed in the row direction of the frame memory; an output of the adder and an output of the second shift register; an addition register that adds and stores the addition result and adds the addition result to the second shift register;
A load coefficient for one row is set from the coefficient memory to the N registers, and pixel data of the frame memory corresponding to the load coefficient is transferred to the first shift register for all rows of the load coefficient. By the time the last row of weighting coefficients is input, each row of pixel data to be processed is convolved using the addition register output data, and equal weighting coefficients are An image signal processing device is provided that is capable of performing matrix convolution so as not to have duplicate multiplication results.

本発明による装置において、例えば、３行×３列のコン
ボリューションは下記のように実行される。すなわち、
第１のシフトレジスタ３は３段のレジスタ３１，３２．
３３で構成され、更に３個のレジスタ４２１〜４２３が
使用される。フレームメモリ２に所定の順序で合計２５
６Ｘ　２５６個の画素データＦ（ｘ、ｙ）が配列される
。第２行〜第２５４行目までの画素データを処理対象と
し、所定の荷重係数Ｗ（ｉ、ｊ）が設定される。In the device according to the invention, for example, a 3 row by 3 column convolution is performed as follows. That is,
The first shift register 3 has three stages of registers 31, 32 .
33, and three registers 421 to 423 are used. A total of 25 files are stored in frame memory 2 in a predetermined order.
6×256 pixel data F(x,y) are arranged. The pixel data from the 2nd row to the 254th row is targeted for processing, and a predetermined weighting coefficient W(i, j) is set.

最初、３個のレジスタ４２１〜４２３にＷ（１，１）　
。Initially, W (1, 1) is written in the three registers 421 to 423.
.

Ｗ（２，１）　、　Ｗ（３，１）がセットされ、第１の
シフトレジスタ３の第２段目３２にＦ（０，０）が、第
１段目３３にＦ（１，０）がセットされる。その結果筒
１の加算器７１の出力は、Ｆ（０，１）の第１演算結果
となり、最初の１行の処理中加算レジスタ（７２）は第
２のシフトレジスタ７３の出力を加算しないよう構成さ
ているので、Ｆ（０，１）の第１演算結果が第２のシフ
トレジスタ７３に入力される。W(2,1) and W(3,1) are set, F(0,0) is placed in the second stage 32 of the first shift register 3, and F(1,0) is placed in the first stage 33. is set. As a result, the output of the adder 71 of cylinder 1 becomes the first operation result of F (0, 1), and the first row of processing addition register (72) is configured not to add the output of the second shift register 73. Therefore, the first operation result of F(0,1) is input to the second shift register 73.

次いで第１のシフトレジスタ３の第３段目（３３）にＦ
（２，０）が入力され、第２段目には第１段目の内容が
、第３段目には第２段目の内容が夫々シフト入力される
。この結果、加算器７２の出力はＦ（１，１）の第１演
算結果となり、これが第２のシフトレジスタ７３に入力
される。Next, F is placed in the third stage (33) of the first shift register 3.
(2,0) is input, the contents of the first row are shifted into the second row, and the contents of the second row are shifted into the third row. As a result, the output of the adder 72 becomes the first operation result of F(1,1), which is input to the second shift register 73.

以後Ｆ（３，０）〜Ｆ　（２５５，０）と最後に例えば
Ｏが順々に第１のシフトレジスタ３の第１段目に入力さ
れることにより、第２のシフトレジスタ７３にはＦ　（
０，１）〜Ｆ　（２５５，１）の第１演算結果がセット
される。Thereafter, F(3,0) to F(255,0) and finally, for example, O are sequentially input to the first stage of the first shift register 3, so that F(3,0) to F(255,0) is input to the second shift register 73. (
0,1) to F (255,1) are set.

次に加算レジスタ７２の加算動作を開始させるとともに
３個のレジスタ４２１〜４２３にＷ（１，２）　。Next, the addition operation of the addition register 72 is started, and W(1, 2) is written to the three registers 421 to 423.

Ｗ（２，２）　、　Ｗ（３，２）をセットし第１のシフ
トレジスタ３の第２段目にＦ（０，１）、第１段目にＦ
（１，１）を夫々セットすると、第１の加算器７１の出
力はＦ（０，１）の第２演算結果となり、加算レジスタ
７２において第２のシフトレジスタ７３にセットされて
いたＦ（０，１）の第１演算結果と加算され、この加算
値が再び第２のシフトレジスタ（７３）に戻される。こ
のような操作が第１行目の画素データすべてについて行
われると、第２のシフトレジスタ７３の内容はＦ（０，
１）〜Ｆ　（２５５，１）の第１演算結果と第２演算結
果の和となる。Set W(2,2) and W(3,2), and set F(0,1) to the second stage of the first shift register 3 and F(0,1) to the first stage.
(1, 1) respectively, the output of the first adder 71 becomes the second operation result of F(0, 1), and the addition register 72 outputs F(0, 1), which was set in the second shift register 73. , 1), and this added value is returned to the second shift register (73) again. When such operations are performed on all the pixel data in the first row, the contents of the second shift register 73 become F(0,
1) to F (255, 1) is the sum of the first calculation result and the second calculation result.

次に３個のレジスタ４２１〜４２３にＷ（１，３）　、
　Ｗ（２，３）　、　Ｗ（３，３）をセットし、第１の
シフトレジスタ３の第１段目にＦ（１，２）　、第２段
目にＦ（０，２）をセットすると、第１の加算器（７１
）の出力はＦ（０，１）の第３演算結果となり、加算レ
ジスタ７２において第２のシフトレジスタ７３にセット
されていたＦ（０，１）の第１．第２演算結果の和と加
算され、Ｆ（０，１）のコンボリューション出力Ｇ　（
０，１）が制御回路に入力される。Next, write W(1,3) to the three registers 421 to 423,
If you set W(2,3) and W(3,3) and set F(1,2) in the first stage and F(0,2) in the second stage of the first shift register 3, , the first adder (71
) becomes the third operation result of F(0,1), and the first . It is added to the sum of the second calculation results, and the convolution output G (
0, 1) are input to the control circuit.

同様に第１のシフトレジスタ３の第２段目に順次Ｆ（２
，２）〜Ｆ　（２５５，２）がシフト入力されていくと
、加算レジスタ７２からＦ（０，１）〜Ｆ　（２５５，
１）のコンボリューション出力Ｇ（０，１）〜Ｇ　（２
５５，１）が得られる。Similarly, F(2
, 2) to F (255, 2) are shifted in, F(0, 1) to F (255,
1) convolution output G(0,1)~G(2
55,1) is obtained.

以上のプロセスで第１行目の画素データの各々のコンボ
リューションが完了し、第２行目以後の画素データにつ
いても同様のプロセスで操作演算が行われる。Through the above process, the convolution of each of the pixel data in the first row is completed, and operation calculations are performed on the pixel data in the second and subsequent rows through the same process.

〔Example〕

第１図は本発明の要部構成図である。第２図は産業用ロ
ボットのシュアルセンサの画像信号処理装置として採用
された例であって、ＣＣＤカメラ等のイメージセンサ１
１から送出される画像信号をＡ／Ｄ変換器１２によりデ
ィジタル情報に変換し、その変換器出力をフレームメモ
リ２に格納する。実用上フレームメモリ２は複数のフレ
ームメモリにて構成されることが多く、必要に応じてグ
イナミソクＲＡＭ、　　シリアル・アクセスメモリ（Ｓ
ＡＭ）より構成される。FIG. 1 is a block diagram of the main parts of the present invention. Figure 2 shows an example of an image signal processing device adopted as a real sensor of an industrial robot.
An A/D converter 12 converts the image signal sent from the frame memory 1 into digital information, and the output of the converter is stored in the frame memory 2. In practice, the frame memory 2 is often composed of multiple frame memories, and if necessary, it can be configured with RAM, serial access memory (S
AM).

上記フレームメモリ２に記憶された複数個の処理対象画
素データは、ディジタル化されており、コンボリューシ
ョンにより検出されるべき画像の線や輪郭の抽出が行わ
れ、信号処理により画像の鮮鋭化が得られる。The plurality of pixel data to be processed stored in the frame memory 2 are digitized, lines and contours of the image to be detected are extracted by convolution, and the image is sharpened by signal processing. It will be done.

第１図においてはフレームメモリ２に記憶さた複数個の
処理対象画素データの各々のコンボリューションを予め
係数メモリ４１に記憶されたＮ行Ｎ列の荷重係数を用い
て行う。In FIG. 1, each convolution of a plurality of pieces of pixel data to be processed stored in the frame memory 2 is performed using N rows and N columns of weight coefficients stored in advance in the coefficient memory 41.

レジスタ３１．３２．３３・・・より構成される第１の
シフトレジスタ３はＮ個の段数を有するものである（図
ではＮ＝３としである）。また係数メモリ４１は、Ｎ個
の荷重係数がセットされるＮ個のレジスタ４２１．４２
２．４２３．・・・を介して乗算器５１゜５２．５３．
・・・に接続される。これらの乗算器５１．５２゜５３
、・・・は、第１のシフトレジスタ３の各段３１゜３２
、３３．・・・の出力と対応するレジスタ４２１，４２
２゜４２３、・・・の出力を乗算するものである。また
Ｎ個の乗算器５１．５２．５３．・・・の出力は加算器
７１により加算される。第２のシフトレジスタ７３はフ
レームメモリ２の行方向の処理対象画素データ数に等し
い段数を有し、この第２のシフトレジスタ７３の出力と
加算器７１の出力とは加算され、その方Ｕ算結果を第２
のシフトレジスタ７３に加える加算レジスタ７２が存す
る。第２シフトレジスタ７３の出力は加算器７１に加え
られる前にゲート７４を経由し、禁止信号の付与により
信号の通過を制御する。The first shift register 3 composed of registers 31, 32, 33, . . . has N stages (N=3 in the figure). The coefficient memory 41 also includes N registers 421 and 42 in which N weight coefficients are set.
2.423. . . via multipliers 51, 52, 53, .
...is connected to... These multipliers 51.52゜53
, . . . are each stage 31°32 of the first shift register 3.
, 33. Registers 421 and 42 corresponding to the output of ...
The outputs of 2°423, . . . are multiplied. Also, N multipliers 51.52.53. ... are added by an adder 71. The second shift register 73 has a number of stages equal to the number of pixel data to be processed in the row direction of the frame memory 2, and the output of the second shift register 73 and the output of the adder 71 are added, and Second result
There is an addition register 72 which is added to the shift register 73 of . The output of the second shift register 73 passes through a gate 74 before being applied to the adder 71, and the passage of the signal is controlled by applying an inhibit signal.

Ｎ個のレジスタ４２１．４２２．４２３．・・・へ係数
メモリ４１の１行分の荷重係数をセットするとともに、
このセットした荷重係数に対応するフレームメモＵ　２
の画素データを第１のシフトレジスタ３へ順次入力して
、係数メモリ４１のすべての行について入力制御を行い
、この制御が係数メモリ４１のｉｆ＆の行について行わ
れている間に加算レジスタ７２から出力されるデータを
１行分の処理対象画素データの各々のマトリクス・コン
ボリューション演算結果として出力される。N registers 421.422.423. . . . to set the load coefficient for one row of the coefficient memory 41, and
Frame memo U2 corresponding to this set load coefficient
pixel data are sequentially input to the first shift register 3 to perform input control for all rows of the coefficient memory 41, and while this control is being performed for the if& row of the coefficient memory 41, pixel data from the addition register 72 are inputted sequentially to the first shift register 3. The output data is output as a matrix convolution calculation result for each row of pixel data to be processed.

組合せ部の掛数と変数の乗算した結果をＲＡＭの中に格
納しておき、これを変数によってテーブル・ルックアッ
プすることにより結果を得て乗算動作が行われる。した
がってコンボリューションを１回もしくは連続して行う
場合、その演算に必要な掛数の種類を少なくすれば当然
乗算器用ＲＡＭの記憶容量を節減することができる。The result of multiplying the multiplier of the combination part by the variable is stored in the RAM, and the result is obtained by looking up the result in a table using the variable, and the multiplication operation is performed. Therefore, when convolution is performed once or continuously, the storage capacity of the multiplier RAM can naturally be reduced by reducing the number of multipliers required for the calculation.

−ｍ的な画像処理に使用される線・輪郭抽出用の荷重係
数マトリクスは対称的な配列をするものが多く、この事
実に着目して第２図に示されるようなコラム対コラム方
式（ｃｏｌｕｍｎ　−ｔｏ　−ｃｏｌｕｍｎｐｒｏｃｅ
ｓｓ　）が採用されている。これにより、従来の如＜　
Ｏ’　、　４５°、９０°、１３５°・・・２７５°、
３１５゜の如く各角度に対しベクトルコンボリューショ
ンを施すことができる。Many of the weighting coefficient matrices for line/contour extraction used in -m-type image processing have a symmetrical arrangement. -to -columnproce
ss) has been adopted. As a result,
O', 45°, 90°, 135°...275°,
Vector convolution can be performed on each angle, such as 315°.

第１図装置においては、コラム対コラム方式によりベク
トルコンボリューションを行って、更に時分割でマトリ
クスコンボリューションを行う際に、マトリクスの対称
形により等しい荷重係数の乗算結果を重複してはもたな
いことになる。それにより、乗算器機能を有するＲＡＭ
の記憶容量を節約することができる。In the device shown in Figure 1, when vector convolution is performed using a column-to-column method and then matrix convolution is performed in a time-division manner, the symmetrical shape of the matrix prevents the multiplication results of equal weight coefficients from being duplicated. It turns out. Thereby, RAM with multiplier function
storage capacity can be saved.

第１図装置に関連して、線・輪郭の抽出のために使用さ
れる荷重マトリクスの例を以下に説明する。In connection with the apparatus of FIG. 1, an example of a load matrix used for line/contour extraction will be described below.

マトリクス配列において対称形式の係数が多く、対称形
式の場合には係数不変とみなすことにする。In the matrix array, there are many coefficients in a symmetric form, and in the case of a symmetric form, the coefficients are considered to be unchanged.

又係数が同じ場合には、ロードしなくてすみ、前のま＼
使用することが可能である。乗算器にロードする場合は
インターフェースからバッファを介して乗算器出力に加
えられ、加算器に送られ、またバッファ入力は各入力同
時に印加される。Also, if the coefficients are the same, there is no need to load the previous one.
It is possible to use. When loading the multiplier, it is added to the multiplier output from the interface via the buffer and sent to the adder, and the buffer input is applied to each input simultaneously.

線・輪郭の抽出のための荷重マトリックスの例は下記の
とおりである。An example of a load matrix for line/contour extraction is shown below.

グレディエント（Ｇｒａｄｉｅｎｔ　）の例：Ｇｉｊマ
トリックス　３Ｘ３ｆｘマトリックス　３Ｘ３ｆｙマトリックス　４Ｘ４ｆｘラプラシアン（Ｌａｐｌａｃｉａｎ　）の例：Ｌｉｊマ
トリックス　３×３マトリックス　４×４なお、ベクトルコンボリューションが終了した段階で精
度を更に上げる必要のある場合には、マトリンクスコン
ポリューシランを行い、対称形により等しい掛数の乗算
結果を重複してもたないようにして演算を実施すること
ができる。Gradient example: Gij matrix 3X3fx matrix 3X3fy matrix 4X4fx Laplacian example: Lij matrix 3x3 matrix 4x4 Note that if you need to further improve the accuracy after vector convolution , matrix conpolusion is performed, and the calculation can be carried out by not having duplicate multiplication results with equal multipliers due to symmetry.

第５図にフレームメモリをシリアルアクセスメモリ　（
ＳＡＭ）付のダイナミックＲＡＭ（ｄ−ＲＡ　Ｍ　）で
構成した他の実施例を示す。第５図においてベクトルコ
ンボリューションプロセッサとフレームメモリの間をシ
リアルアクセスメモリ（ＳＡＭ）を経由してＯ°力方向
シーケンシャルアクセスと４５°、９０°、１３５°方
向のｄ−ＲＡＭ経由のＲＡＭの２アクセス方式により結
合するイメージデータバスを設けたもので、これは特に
Ｏ。Figure 5 shows frame memory as serial access memory (
Another embodiment configured with a dynamic RAM (d-RAM) with SAM will be shown. In Fig. 5, there are two accesses between the vector convolution processor and the frame memory: sequential access in the 0° force direction via serial access memory (SAM) and RAM access via d-RAM in the 45°, 90°, and 135° directions. It is equipped with an image data bus that is connected by the O.

方向のコンボリューションを特に高速化したことを特徴
としている。It is characterized by particularly high-speed directional convolution.

従来はフレームメモリがスタティックＲＡＭで構成され
、全方向のベクトルコンボリューションが高速に実施で
きたが非常に高価となるという問題点がある。Conventionally, the frame memory has been configured with a static RAM, and vector convolution in all directions can be performed at high speed, but there is a problem in that it is very expensive.

第５図において、フレームメモリは通常４ないし５画面
分の容量をもつためその記憶容量が太きく、装置のコス
トを下げるために安価なｄ　−ＲＡＭが使用されている
。特にシリアルアクセスメモリ（ＳＡＭ）付きのｄ−Ｒ
ＡＭを使用してシリアルアクセスメモリからシーケンシ
ャルに１行毎にフレームメモリをアクセスすると、高速
Ｓ　−ＲＡＭとぼり同程度のスピードで動作が可能とな
る。In FIG. 5, the frame memory usually has a capacity for four to five screens, so its storage capacity is large, and an inexpensive d-RAM is used to reduce the cost of the device. Especially d-R with serial access memory (SAM)
If the frame memory is sequentially accessed row by row from the serial access memory using AM, it becomes possible to operate at a speed comparable to that of high-speed S-RAM.

したがってシーケンシャルにアクセスできる０゜方向の
アクセスと同じ速度で動作するベクトルコンボリューシ
ョンプロセッサを設置すれば、Ｏ。Therefore, if a vector convolution processor that operates at the same speed as access in the 0° direction that can be accessed sequentially is installed, the processing time will be O.

方向ベクトルコンボリューションを高速に実行できるよ
うになる。０°方向ベクトルコンボリユーシヨンを繰返
して実行し、その和をとる如く加算して行くとマトリッ
クスコンボリューションが実施される。以上の動作は、
コンボリューションプロセッサにとって重要な機能であ
る。Direction vector convolution can be performed quickly. Matrix convolution is performed by repeatedly performing 0° direction vector convolution and adding up the sums. The above operation is
This is an important function for convolution processors.

〔Effect of the invention〕

本発明によれば、フレームメモリに記憶された複数個の
処理対象画素データの各々のコンボリューションが少な
い乗算器を用いて比較的高速で求められ、コンボリュー
ションに必要な荷重係数の種類が少なくなり、乗算器用
ＲＡＭの記憶容量が節減された画像信号処理装置を実現
することができる。According to the present invention, convolution of each of a plurality of pieces of pixel data to be processed stored in a frame memory is obtained at a relatively high speed using a small number of multipliers, and the number of types of weighting coefficients required for convolution is reduced. , it is possible to realize an image signal processing device in which the storage capacity of the multiplier RAM is reduced.

[Brief explanation of drawings]

第１図は本発明の一実施例としての画像信号処理装置の
概略構成を示す図、第２図は第１図装置において用いられるＮＸＮコンボリ
ューション用係数マトリクスの構成を示す図面、第３図および第４図はいずれも従来形のコンボリューシ
ョン用装置の機能を説明する図、第５図は、第１図装置
において用いられるベクトルコンボリューションプロセ
ッサに用いられるフレームメモリの他の例を示す。２・・・フレームメモリ、３・・・第１のシフトレジスタ、３１．３２．３３・・・レジスタ、１１・・・イメージセンサ、１２・・・Ａ／Ｄ変換器、４１・・・係数メモリ、４２・・・マツピングメモリ、４２１．４２２，４２３・・・レジスタ、５１．５２．
５３・・・乗算器、６１・・・インターフェース、７１・・・加算器、FIG. 1 is a diagram showing a schematic configuration of an image signal processing device as an embodiment of the present invention, FIG. 2 is a diagram showing a configuration of a coefficient matrix for NXN convolution used in the device shown in FIG. 1, and FIG. FIG. 4 is a diagram explaining the functions of a conventional convolution device, and FIG. 5 shows another example of a frame memory used in the vector convolution processor used in the device of FIG. 1. 2... Frame memory, 3... First shift register, 31.32.33... Register, 11... Image sensor, 12... A/D converter, 41... Coefficient memory , 42... Mapping memory, 421.422, 423... Register, 51.52.
53... Multiplier, 61... Interface, 71... Adder,

Claims

[Claims]

1. It has a frame memory that digitally converts and stores the image signal of the image sensor, and a coefficient memory that stores N rows and N columns of weighting coefficients in advance, and each of the plurality of processing target pixel data stored in the frame memory. An image signal processing device that performs convolution using weighting coefficients of the coefficient memory, comprising: a first shift register having N stages; N registers in which N weighting coefficients are set; a multiplier using N RAMs that multiplies the output of each stage of the first shift register by the output of the corresponding register; an adder that adds the outputs of the N multipliers; a processing target in the row direction of the frame memory a second shift register having a number of stages equal to the number of pixel data; an addition register that adds and stores the output of the adder and the output of the second shift register, and adds the addition result to the second shift register; , a load coefficient for one row is set from the coefficient memory to the N registers, and pixel data of the frame memory corresponding to the load coefficient is shifted to the first shift for all rows of the load coefficient. Each row of pixel data to be processed is sequentially input to the register, and until the last row of the weighting coefficients is inputted, each row of pixel data to be processed is convolved using the addition register output data, and the weighting factor selection signal is used to convolve each row of pixel data to be processed. 1. An image signal processing device using convolution, characterized in that the coefficients are repeatedly used, and the coefficients are further associated with a RAM-based multiplier by a mapping memory, thereby enabling sequential specification in matrix convolution.