JP5265984B2

JP5265984B2 - Image encoding apparatus and decoding apparatus

Info

Publication number: JP5265984B2
Application number: JP2008199996A
Authority: JP
Inventors: 秀敏松村
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2008-08-01
Filing date: 2008-08-01
Publication date: 2013-08-14
Anticipated expiration: 2028-08-01
Also published as: JP2010041235A

Abstract

<P>PROBLEM TO BE SOLVED: To provide an image encoder and an image decoder which rarely delay and have the fewer number of wiring. <P>SOLUTION: When four image encoding processing parts are provided, respective regions where one image is divided into four are assigned to the respective image encoding processing parts in the direction different from that of raster scanning when a macroblock of one image is disposed two-dimensionally. When a macroblock at the left end or right end of respective regions is processed, the image encoding processing part in charge of the next area is waited for to complete necessary processing of the macroblock, data of adjacent macroblock data are used to process a target macroblock. <P>COPYRIGHT: (C)2010,JPO&INPIT

Description

本発明は、MPEGやH.264などの画像圧縮規格に従った符号化装置および復号装置に関する。 The present invention relates to an encoding device and a decoding device according to an image compression standard such as MPEG or H.264.

ハイビジョン放送、大容量光ディスクなどの普及により、HD(High Definition)TV画像(1920x1080)を扱うことが一般的になってきている。今後、更に大画面のUHD(Ultra High Definition)TV画像(7680x4320、3840x2160)を使用することが検討されており、MPEGのデコード・エンコード装置の更なる処理性能向上が望まれている。処理性能の向上には、「動作周波数の向上」、「処理の並列化」などが考えられるが、動作周波数の向上には、半導体プロセス技術の向上、高速動作可能な回路などが必要である。一方、処理の並列化に関しては、MPEGのデコード処理では、フレーム(ピクチャ)間の予測処理および可変長復号化処理が必要なため、符号圧縮されたビットストリームを前から順番に処理していく必要があり、単純な処理の並列化は難しく、並列化には様々な工夫が行なわれてきた。 With the widespread use of high-definition broadcasting and large-capacity optical discs, it has become common to handle HD (High Definition) TV images (1920x1080). In the future, the use of UHD (Ultra High Definition) TV images (7680x4320, 3840x2160) of larger screens is being studied, and further improvement in processing performance of MPEG decoding / encoding devices is desired. To improve the processing performance, “improvement of operating frequency”, “parallelization of processing”, and the like can be considered. However, improvement of the operating frequency requires improvement of semiconductor process technology and a circuit capable of high-speed operation. On the other hand, with regard to parallel processing, MPEG decoding processing requires prediction processing between frames (pictures) and variable length decoding processing, so it is necessary to process code-compressed bitstreams in order from the front. Therefore, it is difficult to parallelize simple processes, and various ideas have been made for parallelization.

従来技術の並列化手法としては、(1)動きベクトル範囲を考慮して、分割単位の処理開始タイミングをフレーム間予測で必要となる参照画領域の処理が完了した後になるように制御して、フレーム間での並列化を行う手法（特許文献１）、(2) MPEG2におけるスライスなど、フレーム内でのデータの独立性を利用したスライス単位でのフレーム内での並列化を行う手法（特許文献２）、(3)各マクロブロックが必要とする隣接マクロブロックの処理完了後にマクロブロック処理を開始させるように制御することで、処理をマクロブロックラインの単位でフレーム内での並列化を行う手法（特許文献３）、などが提案されている。 As a prior art parallelization method, (1) considering the motion vector range, the processing start timing of the division unit is controlled to be after the processing of the reference image area required for inter-frame prediction is completed, A method of performing parallelization between frames (Patent Document 1), (2) A method of performing parallelization within a frame in units of slices using independence of data within a frame, such as a slice in MPEG2 (Patent Document 1) 2), (3) A method of performing parallel processing within a frame in units of macroblock lines by controlling the macroblock processing to start after completion of processing of adjacent macroblocks required by each macroblock (Patent Document 3) and the like have been proposed.

(1)のフレーム間予測の依存関係を守るように制御し、フレーム間での並列化を行う手法は、UHDTV画像などの大画面に適用しようとした場合、次のような課題がある。この手法では並列処理を行うデコーダ/エンコーダの数が増える毎に入力から出力までの処理遅延が増大する。UHDTVのような大画面を扱うようなH.264デコーダ/エンコーダでは、必要な処理能力の増加のため、必要なデコード/エンコード処理ブロックの数が増えるので、処理遅延も増大する問題があった。 The method of controlling the inter-frame prediction dependency in (1) and performing parallel processing between frames has the following problems when trying to apply to a large screen such as a UHDTV image. In this method, the processing delay from input to output increases as the number of decoders / encoders performing parallel processing increases. An H.264 decoder / encoder that handles a large screen such as UHDTV has a problem that processing delay increases because the number of necessary decoding / encoding processing blocks increases due to an increase in necessary processing capacity.

図２１〜図２３は、従来のフレーム間予測の依存関係を守るように制御し、フレーム間での並列化を行う手法の説明図である。
図２１は、コアとメモリの画面分割と担当領域、図２２は動作タイミング、図２３はブロック構成を示す。 FIG. 21 to FIG. 23 are explanatory diagrams of a method for performing parallel control between frames by controlling so as to keep the dependency relationship between the conventional inter-frame predictions.
FIG. 21 shows the screen division and the assigned area of the core and the memory, FIG. 22 shows the operation timing, and FIG. 23 shows the block configuration.

図２１〜図２３は、4つのデコード処理ブロックと4つのメモリモジュールを使用した場合の構成例である。
図２１に示されるように、２１６０画素ラインある画面の縦方向を４分割する。１マクロブロックラインは、１６画素ラインからなっており、Memory0は、マクロブロックライン０〜３３を、Memory1は、マクロブロックライン３４〜６７、Memory2は、マクロブロックライン６８〜１０１、Memory3は、マクロブロックライン１０２〜１３４を分担して格納する。 21 to 23 are configuration examples in the case of using four decoding processing blocks and four memory modules.
As shown in FIG. 21, the vertical direction of a screen having 2160 pixel lines is divided into four. One macro block line is composed of 16 pixel lines, Memory 0 is macro block lines 0 to 33, Memory 1 is macro block lines 34 to 67, Memory 2 is macro block lines 68 to 101, and Memory 3 is macro block. The lines 102 to 134 are shared and stored.

図２２のように、各Decoder Coreは、１ピクチャ全体の処理を担当する。Decoder Core0は、Ｉ０ピクチャを担当し、最初の６０分の１秒でＩ０ピクチャのマクロライン０〜３３を処理する。この処理が終わると、Ｐ３ピクチャを担当するDecoder Core1がＰ３ピクチャのマクロブロックライン０〜３３の処理を開始し、６０分の１秒でマクロブロックライン３３までの処理を行う。その後、Ｂ１ピクチャを担当するDecoder Core2がＢ１ピクチャのマクロブロックライン０〜３３の処理を開始し、６０分の１秒後にマクロブロックライン３３の処理が終わると、Ｂ２ピクチャを担当するDecoder Core3がＢ２ピクチャのマクロブロックライン０〜３３の処理を開始する。このように、各Decoder Coreは、６０分の１秒ずつ遅れながら、各ピクチャの処理を開始し、以後、並列に処理を行うようにする。 As shown in FIG. 22, each Decoder Core is in charge of processing of the entire picture. Decoder Core0 takes charge of the I0 picture and processes the macrolines 0 to 33 of the I0 picture in the first 1/60 second. When this processing is completed, Decoder Core1 in charge of the P3 picture starts processing the macroblock lines 0 to 33 of the P3 picture, and performs processing up to the macroblock line 33 in 1/60 second. After that, Decoder Core2 in charge of B1 picture starts processing of macroblock lines 0 to 33 of B1 picture, and when processing of macroblock line 33 is finished after 1/60 second, Decoder Core3 in charge of B2 picture is set to B2 Processing of macroblock lines 0 to 33 of the picture is started. In this way, each Decoder Core starts processing each picture while being delayed by 1/60 second, and thereafter performs processing in parallel.

図２３において、Decoder Coreは、４つ設けられており、Memoryも４つ設けられている。そして、この間の信号ラインを切り替えるためのスイッチ１０が設けられる。
このような構成において、各Decoder Core（デコード処理ブロック）は、割り当てられたフレーム処理を担当しており、その処理時間は1フレーム時間の４倍である。要求画像サイズ、フレームレートによってはさらに多くのエンコーダ・デコーダが必要となり、処理遅延は並列度に比例する。エンコーダ・デコーダが利用される分野(放送の分野の放送局と中継現場との間の通信など)によってはこのような大きな遅延は到底許されるものではない。 In FIG. 23, four Decoder Cores are provided, and four Memory are also provided. And the switch 10 for switching the signal line in the meantime is provided.
In such a configuration, each Decoder Core (decoding processing block) is in charge of assigned frame processing, and its processing time is four times as long as one frame time. More encoders / decoders are required depending on the required image size and frame rate, and the processing delay is proportional to the degree of parallelism. Depending on the field in which the encoder / decoder is used (such as communication between a broadcasting station in the broadcasting field and a relay site), such a large delay cannot be allowed.

図２４〜図２６は、従来のスライス単位でフレーム内の並列化を行う手法の説明図である。
図２４は、Decoder Core担当領域、図２５は、動作タイミング、図２６は、ブロック構成を示す。 24 to 26 are explanatory diagrams of a conventional method for performing parallelization in a frame in units of slices.
24 shows the Decoder Core area, FIG. 25 shows the operation timing, and FIG. 26 shows the block configuration.

図２４において、Decoder Coreは、２つ設けられ、それぞれ、マクロブロックライン０〜３３と、マクロブロックライン３４〜６７を担当する。Memoryは、３つ設けられ、それぞれ、マクロブロックライン０〜２３と、マクロブロックライン２４〜４３と、マクロブロックライン４４〜６７を担当する。 In FIG. 24, two Decoder Cores are provided and are in charge of macroblock lines 0 to 33 and macroblock lines 34 to 67, respectively. Three memories are provided, and are in charge of macroblock lines 0 to 23, macroblock lines 24 to 43, and macroblock lines 44 to 67, respectively.

図２５のように、Decoder Core0は、各ピクチャのマクロブロックライン０〜３３を常に担当し、Decoder Core1は、各ピクチャのマクロブロックライン３４〜６７を常に担当する。Decoder Core0と1の動作開始タイミングは同時であり、３０分の１秒の間に、それぞれの担当マクロブロックラインを処理する。 As shown in FIG. 25, Decoder Core0 is always in charge of macroblock lines 0 to 33 of each picture, and Decoder Core1 is always in charge of macroblock lines 34 to 67 of each picture. The operation start timings of Decoder Core 0 and 1 are the same, and each assigned macro block line is processed in 1/30 second.

図２６において、Decoder Coreは、２つ設けられており、Memoryは、３つ設けられている。Decoder CoreとMemory間の信号ラインを切り替えるため、スイッチ１１−１〜１１−３が設けられている。 In FIG. 26, two Decoder Cores are provided, and three Memory are provided. Switches 11-1 to 11-3 are provided to switch the signal line between the Decoder Core and Memory.

本手法は、MPEG2規格がマクロブロックライン毎にスライスヘッダと呼ばれるユニークコードが存在することを利用している。この手法では、Decoder Coreの処理開始タイミングが同時なので、１フレームのエンコード・デコード処理時間は１フレーム時間と同じになり、(1)の手法のような問題は発生しない。しかし別の問題が存在する。H.264などの近年の画像圧縮規格では、フレーム内でそのようなユニークコードは必ずしも存在せず、またデコード処理において、隣接上方のマクロブロックとの依存関係がある。そのため、スライスレベル相当の並列化処理を行うことはできず、H.264に代表される近年の画像圧縮規格には適用できない。 This method uses the fact that the MPEG2 standard has a unique code called a slice header for each macroblock line. In this method, since the processing start timing of the Decoder Core is the same, the encoding / decoding processing time for one frame is the same as the one frame time, and the problem like the method (1) does not occur. But there is another problem. In recent image compression standards such as H.264, such a unique code does not necessarily exist in a frame, and there is a dependency relationship with an adjacent macroblock in the decoding process. For this reason, parallel processing equivalent to the slice level cannot be performed, and it cannot be applied to recent image compression standards represented by H.264.

上記双方の問題を解決したのが、(3)のマクロブロックの隣接情報参照の依存関係を守るように制御し、マクロブロックライン間での並列化を行う手法である。本手法は、H.264を含めた近年の画像圧縮規格にも使用可能であるが、UHDTV画像などの大画面に適用しようとした場合、次のような課題がある。UHDTVのような大画面を扱うようなH.264デコーダ/エンコーダでは、必要な処理能力の増加のため、必要なデコード/エンコード処理ブロックの数が増え、また、画像を格納するメモリモジュールに関しても、必要なメモリ容量およびメモリモジュールとデコード/エンコード処理ブロック間の帯域確保のため、メモリモジュールも複数必要となる。 The solution to both of the above problems is the technique (3) of controlling the macroblock adjacent information reference dependency so that the macroblock lines are parallelized. Although this method can be used for recent image compression standards including H.264, there are the following problems when trying to apply it to large screens such as UHDTV images. In the H.264 decoder / encoder that handles a large screen like UHDTV, the number of necessary decoding / encoding processing blocks increases due to the increase in required processing capacity, and also regarding the memory module that stores images, In order to secure the necessary memory capacity and the bandwidth between the memory module and the decoding / encoding processing block, a plurality of memory modules are required.

図２７〜図２９は、従来のマクロブロックの隣接情報参照の依存関係を守るように制御し、マクロブロックライン間での並列化を行う手法の説明図である。
図２７は、Decoder CoreとMemoryの画面分割と担当領域、図２８は動作タイミング、図２９はブロック構成を示す。 FIG. 27 to FIG. 29 are explanatory diagrams of conventional techniques for performing control so as to protect the dependency relationship of adjacent information reference of macroblocks and performing parallelization between macroblock lines.
FIG. 27 shows the screen division and assigned area of Decoder Core and Memory, FIG. 28 shows the operation timing, and FIG. 29 shows the block configuration.

図２７〜２９は、4つのデコード処理ブロックと4つのメモリモジュールを使用した場合の構成例である。
図２７に示されるように、４つのMemoryは、マクロブロックライン０〜３３、マクロブロックライン３４〜６７、マクロブロックライン６８〜１０１、マクロブロックライン１０２〜１３４をそれぞれ担当する。４つのDecoder Coreは、それぞれ４マクロブロックライン飛ばしで各マクロブロックラインの処理を担当する。図２７においては、Decoder Core3の担当マクロブロックラインが網掛けされて示されている。 27 to 29 are configuration examples in the case of using four decoding processing blocks and four memory modules.
As shown in FIG. 27, the four memories are responsible for macroblock lines 0 to 33, macroblock lines 34 to 67, macroblock lines 68 to 101, and macroblock lines 102 to 134, respectively. Each of the four Decoder Cores is responsible for processing each macroblock line by skipping four macroblock lines. In FIG. 27, the macro block line in charge of Decoder Core 3 is shown shaded.

図２８に示されるように、各Decoder Coreは、自分の担当マクロブロックラインの処理を、MBが参照するMBの処理が上方MBを担当するDecode Coreによって処理が完了してから開始する。1ピクチャの処理時間は、６０分の１秒である。各Decoder Coreは、すべてのピクチャの処理において、決められたマクロブロックラインの処理を固定的に担当する。 As shown in FIG. 28, each Decoder Core starts processing its own macro block line after the processing of the MB referenced by the MB is completed by the Decode Core in charge of the upper MB. The processing time for one picture is 1/60 second. Each Decoder Core is in charge of fixed macroblock line processing in all picture processing.

図２９に示されるように、４つのDecoder Coreと４つのMemoryが設けられ、これらの間の信号ラインの切り替えのために、スイッチ１２が設けられる。
このような構成において、各デコード処理ブロックは、割り当てられたマクロブロックライン処理を担当しており、各メモリにはフレームを4分割したそれぞれの領域を格納しているために、4つのメモリモジュール全てにアクセスする必要がある。そのため、デコード処理ブロックとメモリモジュールの間に4対4の接続が必要となり、メモリバス構成が複雑化・肥大化するといった課題があった。ここでは簡単にするために水平線によって４分割した領域のそれぞれをフレームのメモリへ割り当てたが、この問題はどのような割り当てを行っても発生する。また、この課題は(1)の手法にも存在していた。
特開２００６−１４１１３号公報特許第２８６３０９６号明細書特開２００８−６７０２６号公報 As shown in FIG. 29, four Decoder Cores and four memories are provided, and a switch 12 is provided for switching signal lines between them.
In such a configuration, each decoding processing block is in charge of the assigned macro block line processing, and each memory stores the respective areas divided into four frames. Need to access. For this reason, a 4-to-4 connection is required between the decoding processing block and the memory module, which causes a problem that the memory bus configuration becomes complicated and enlarged. Here, for the sake of simplicity, each of the four areas divided by the horizontal line is allocated to the frame memory, but this problem occurs regardless of the allocation. This problem also existed in the method (1).
JP 2006-14113 A Japanese Patent No. 2863096 JP 2008-67026 A

したがって、複数のデコード/エンコード処理ブロックおよび複数のメモリモジュールを使用するMPEG、H.264などのフレーム間予測を使用した画像圧縮規格のデコーダ/エンコーダ装置において
・処理遅延の増大を避け、低遅延を必要とする用途においても適用可能とすること
・デコード/エンコード処理ブロックとメモリモジュール間のメモリバス構成の複雑化・肥大化と、回路規模/レイアウト難度を低減すること
を実現する必要がある。 Therefore, in an image compression standard decoder / encoder device using inter-frame prediction such as MPEG and H.264 that uses multiple decoding / encoding processing blocks and multiple memory modules. It is necessary to be able to be applied to necessary applications. It is necessary to realize a complicated and enlarged memory bus configuration between a decoding / encoding processing block and a memory module, and to reduce a circuit scale / layout difficulty.

本発明の課題は、低遅延で、配線の数が少ない画像符号化装置、及び、画像復号装置を提供することである。 An object of the present invention is to provide an image encoding device and an image decoding device with low delay and a small number of wires.

本発明の画像符号化装置は、画像データをブロック単位で符号化する画像符号化装置に
おいて、画像データを分割して格納する複数の格納手段と、画像符号化処理を行う複数の画像符号化手段と、該複数の格納手段と該複数の画像符号化手段を、該画像符号化手段がアクセスする必要のある該格納手段と接続する接続手段と、画像データを２次元配列と見た場合に、ブロック単位の画像符号化の進行する方向とは異なった方向に、画像データを前記画像符号化手段の数だけ分割し、それぞれ分割された画像データを前記複数の画像符号化手段に割り当てて、各画像符号化手段に並列に画像符号化処理を行わせる制御手段とを備える。 An image encoding apparatus according to the present invention includes a plurality of storage means for dividing and storing image data and a plurality of image encoding means for performing image encoding processing in an image encoding apparatus for encoding image data in units of blocks. And when the image data is viewed as a two-dimensional array, the connection means for connecting the plurality of storage means and the plurality of image encoding means to the storage means that the image encoding means needs to access, Dividing the image data by the number of the image encoding means in a direction different from the direction in which the image encoding in units of blocks proceeds, assigning each divided image data to the plurality of image encoding means, Control means for causing the image coding means to perform image coding processing in parallel.

本発明の画像復号装置は、画像データを復号する画像復号装置において、可変長符号化された画像データを可変長復号する可変長復号手段と、可変長復号された画像データにブロック単位で画像復号処理を施す複数の画像復号手段と、画像復号処理された画像データを分割して格納する複数の格納手段と、該複数の格納手段と該複数の画像復号手段を、該画像復号手段がアクセスする必要のある該格納手段と接続する接続手段と、画像データを２次元配列と見た場合に、ブロック単位で行う画像復号処理の進行する方向とは異なった方向に、画像データを前記画像復号手段の数だけ分割し、それぞれ分割された画像データを前記複数の画像復号手段に割り当てて、各画像復号手段に並列に画像復号処理を行わせる制御手段とを備える。 An image decoding apparatus according to the present invention includes an image decoding apparatus that decodes image data, variable-length decoding means that performs variable-length decoding on variable-length encoded image data, and image decoding in units of blocks for variable-length decoded image data. The image decoding means accesses the plurality of image decoding means for performing the processing, the plurality of storage means for dividing and storing the image data subjected to the image decoding process, the plurality of storage means and the plurality of image decoding means. The connecting means for connecting to the necessary storage means, and the image decoding means in a direction different from the direction in which the image decoding processing performed in units of blocks is performed when the image data is viewed as a two-dimensional array. And control means for allocating the divided image data to the plurality of image decoding means and causing each image decoding means to perform image decoding processing in parallel.

本発明によれば、低遅延で、配線の数が少ない画像符号化装置、及び、画像復号装置を提供される。 According to the present invention, it is possible to provide an image encoding device and an image decoding device with a low delay and a small number of wires.

本実施形態では、UHDTV画像などの大画面のデコード・エンコードなど、高い処理能力が要求される装置において、複数のデコード・エンコード処理ブロックおよび複数のメモリモジュールを使用する場合の、並列処理構成を示す。 This embodiment shows a parallel processing configuration in the case where a plurality of decoding / encoding processing blocks and a plurality of memory modules are used in an apparatus that requires high processing capability such as decoding / encoding of a large screen such as a UHDTV image. .

本実施形態では、動画像符号化装置を、以下のようなものとする。
・画像をブロックに分割して処理を行う動画像符号化方式に対応している。
・以下の物を有する
（１）1つ以上の可変長符号化部
（２）複数の画像符号化処理部
（３）（２）の出力するローカル復号画像を格納する複数のメモリモジュール
（４）画像符号化処理部とメモリモジュール間に存在するデータセレクト機能。
・各画像符号化処理部はピクチャを垂直線（ピクチャデータを２次元に配列した場合のラスタスキャンの方向あるいはマクロブロックの処理順の方向とは異なる方向）によって分割された領域の符号化処理を担当し、１ピクチャを同時に並列処理する。 In the present embodiment, the moving image encoding apparatus is as follows.
-It corresponds to the moving picture coding system which divides | segments an image into blocks and performs processing.
(1) One or more variable-length encoding units (2) Multiple image encoding processing units (3) Multiple memory modules for storing local decoded images output from (2) (4) A data selection function that exists between the image encoding processing unit and the memory module.
Each image encoding processing unit performs encoding processing of an area divided by a vertical line (a direction different from the direction of raster scanning or the order of processing of macroblocks when picture data is arranged two-dimensionally) Responsible for processing 1 picture in parallel at the same time.

画像符号化処理部とメモリモジュールの物理的な構成として、全ての画像符号化処理部が1つのLSIに配置されるという構成も、複数のLSIに分割して配置されるという構成もある。また、メモリモジュールが外付けのメモリである構成も、On-Chipのメモリ(Embedded DRAMなど)である構成もある。符号化中間データを格納するバッファ-はLSIの内部バッファとして配置しても良いし、LSIに外付けするメモリの位置領域として配置しても良い。 As a physical configuration of the image encoding processing unit and the memory module, there are a configuration in which all the image encoding processing units are arranged in one LSI and a configuration in which the image encoding processing unit is divided into a plurality of LSIs. In addition, there is a configuration in which the memory module is an external memory and a configuration in which the memory module is an on-chip memory (such as embedded DRAM). The buffer for storing the encoded intermediate data may be arranged as an internal buffer of the LSI, or may be arranged as a location area of a memory externally attached to the LSI.

画像符号化処理部間で隣接マクロブロック情報を渡すための手段を有する。
隣接マクロブロック情報を格納する手段としては、LSI内の内部メモリとして配置してもよいし、LSIに外付けするメモリの一領域として配置してもよい。また、隣接する画像符号化処理部間に一つずつ、(画像符号化処理部数-1)個配置しても良いし、符号化装置全体の隣接マクロブロック情報を管理するものを一つ配置しても良い。 Means for passing adjacent macroblock information between image encoding processing units.
The means for storing the adjacent macroblock information may be arranged as an internal memory in the LSI, or may be arranged as an area of a memory externally attached to the LSI. In addition, one (one image encoding processing unit-1) may be arranged between adjacent image encoding processing units, or one unit for managing adjacent macroblock information of the entire encoding apparatus is arranged. May be.

画像符号化処理部の出力する符号化中間データを格納するためのバッファとそのデータ管理機能を有し、このデータ管理機能はバッファ内の符号化中間データのピクチャ内の領域ごとの格納位置を示すポインタを記録する機能を有し、可変長符号化処理部のそれぞれに符号化中間データを振り分けて入力する。 A buffer for storing the encoded intermediate data output from the image encoding processing unit and its data management function are provided, and this data management function indicates the storage position of each area in the picture of the encoded intermediate data in the buffer. It has a function of recording a pointer and distributes and inputs encoded intermediate data to each of the variable length encoding processing units.

中間データバッファは、LSI内の内部バッファとして配置してもよいし、LSIに外付けするメモリの一領域として配置してもよい。
各画像符号化処理部は、符号化処理中のマクロブロックのピクチャ内位置情報を通知する手段を有するとともに、制御部からの指示によりマクロブロック符号化処理の待機、再開の制御を行う。 The intermediate data buffer may be arranged as an internal buffer in the LSI, or may be arranged as an area of a memory externally attached to the LSI.
Each image coding processing unit has means for notifying position information in a picture of a macroblock being coded, and controls waiting and restarting of the macroblock coding processing according to an instruction from the control unit.

制御部は、各画像符号化処理部から通知されるピクチャ内位置情報から、あるマクロブロックの処理開始が、そのマクロブロックが必要な隣接情報を出力するマクロブロックの処理完了後になるように制御する。このとき、各画像符号化処理部の待機、再開の単位は必ずしもマクロブロック単位である必要はなく、より大きな単位、例えば、５マクロブロック単位での待機、再開でも良い。 The control unit controls the start of processing of a certain macroblock from the position information in the picture notified from each image encoding processing unit after completion of the processing of the macroblock that outputs adjacent information necessary for the macroblock. . At this time, the unit of standby and restart of each image encoding processing unit does not necessarily have to be a macroblock unit, and may be a standby or restart of a larger unit, for example, a unit of 5 macroblocks.

画像符号化処理部とメモリモジュールの間にメモリアクセス振り分け部を有し、前記メモリアクセス振り分け部は、画像符号化処理部からのアクセスをメモリマップにしたがって、複数のメモリモジュールのいずれか、もしくは両方に振り分ける機能を有する。 A memory access distribution unit is provided between the image encoding processing unit and the memory module, and the memory access distribution unit performs access from the image encoding processing unit according to a memory map, either one of a plurality of memory modules, or both It has a function to distribute to.

複数の画像符号化処理部が物理的に複数のモジュールに分かれて配置されている場合、モジュール間で画像データおよび、隣接マクロブロック情報、符号化中間データ、マクロブロック符号化位置情報のうちいずれかのデータの受け渡しを行う機能を有する。 When a plurality of image coding processing units are physically divided into a plurality of modules, any one of image data, adjacent macroblock information, coding intermediate data, and macroblock coding position information between modules It has a function to exchange data.

動画像復号装置を、以下のようなものとする。
・画像をブロックに分割して処理を行う動画像符号化方式に対応している。
・以下の物を有する。 The moving picture decoding apparatus is as follows.
-It corresponds to the moving picture coding system which divides | segments an image into blocks and performs processing.
・ Has the following items.

（１）1つ以上の可変長復号部
（２）複数の画像復号処理部
（３）（２）の出力する復号画像を格納する複数のメモリモジュール
（４）画像復号処理部とメモリモジュール間に存在するデータセレクト機能
・各画像復号処理部は、ピクチャを垂直線（ピクチャデータを２次元に配列した場合のラスタスキャンの方向あるいはマクロブロックの処理順の方向とは異なる方向）によって分割された領域の復号処理を担当し、１ピクチャを同時に並列処理する。 (1) One or more variable length decoding units (2) A plurality of image decoding processing units (3) A plurality of memory modules for storing the decoded images output from (2) (4) Between the image decoding processing unit and the memory modules Data selection function that exists ・ Each image decoding processing unit is an area where a picture is divided by a vertical line (a direction different from the direction of raster scanning or the order of processing of macroblocks when picture data is arranged two-dimensionally) In parallel, one picture is processed in parallel.

また、以上のような符号化装置に対応した復号装置も提供する。復号装置の構成は、データの処理の順序が異なるのみで、装置構成の基本は符号化装置と同じである。
H.264/VC-1等のマクロブロック処理では、左、左上、上、右上に隣接するマクロブロックの情報が必要なので、これらの隣接マクロブロックの処理が完了してから、マクロブロック処理を開始することが必要である。 Also provided is a decoding apparatus corresponding to the above-described encoding apparatus. The configuration of the decoding apparatus is different only in the order of data processing, and the basic configuration of the decoding apparatus is the same as that of the encoding apparatus.
Macroblock processing such as H.264 / VC-1 requires information on adjacent macroblocks on the left, top left, top, and top right, so macroblock processing starts after processing of these adjacent macroblocks is complete It is necessary to.

特開2008-67026号公報では、画面をマクロブロックライン単位に分割し、それぞれに別の画像符号化処理部を割り当てることで、隣接情報の依存関係を守ったまま画面内での並列処理を行っていた。本実施形態も同様に隣接情報の依存関係を保ったまま画面内の並列化処理を提供するものであるが、各画像符号化処理部が担当する処理領域が異なる。 In Japanese Patent Laid-Open No. 2008-67026, the screen is divided into macroblock line units, and a separate image encoding processing unit is assigned to each to perform parallel processing within the screen while maintaining the dependency of adjacent information. It was. Similarly, the present embodiment provides parallel processing within a screen while maintaining the dependency relationship of adjacent information, but the processing areas handled by each image encoding processing unit are different.

図１及び図２は、本発明の実施形態の動作を説明する図である。
図１（ａ）において、例えば、画像符号化処理部１のマクロブロックライン１の処理について考える。左端マクロブロックの左隣接、左上隣接マクロブロックは画像符号化処理部０の処理範囲であるが、画像符号化処理部０のマクロブロックライン１の処理を完了してからであれば、これらの隣接マクロブロックの処理は完了している。また、右端マクロブロックの右上マクロブロックは画像符号化処理部２の処理範囲であるが、画像符号化処理部２のマクロブロックライン０の左端マクロブロックの処理完了後であれば、この隣接マクロブロック処理は完了している。その他の隣接マクロブロックは自分（画像符号化処理部１）の処理範囲内となり、画像符号化処理部１は処理範囲内を規格に定められたマクロブロック順序で処理しているので、必要な隣接マクロブロックの処理は完了している。したがって、隣接マクロブロックの処理が完了していれば、処理対象のマクロブロックの処理を開始できる。図１（ｂ）に示すように、画像符号化処理部０〜３は、それぞれ、処理が可能になった順に、自分の担当範囲のマクロブロックを処理する。 1 and 2 are diagrams for explaining the operation of the embodiment of the present invention.
In FIG. 1A, for example, the processing of the macroblock line 1 of the image encoding processing unit 1 is considered. The left adjacent macroblock and the upper left adjacent macroblock in the left end macroblock are within the processing range of the image encoding processing unit 0. However, if the processing of the macroblock line 1 of the image encoding processing unit 0 is completed, Macroblock processing is complete. The upper right macroblock of the rightmost macroblock is the processing range of the image coding processing unit 2, but if the processing of the leftmost macroblock of the macroblock line 0 of the image coding processing unit 2 is completed, this adjacent macroblock Processing is complete. Other adjacent macroblocks are within the processing range of their own (image encoding processing unit 1), and the image encoding processing unit 1 processes the processing range in the order of macroblocks defined in the standard. Macroblock processing is complete. Therefore, if the processing of the adjacent macroblock is completed, the processing of the processing target macroblock can be started. As shown in FIG. 1B, each of the image encoding processing units 0 to 3 processes the macroblocks in its own assigned range in the order in which processing is possible.

一般的な動画像符号化方式では動きベクトルの最大値が定められている。画像符号化処理部０は画像の左から1/4の領域を処理範囲としているので、この領域内のマクロブロックの符号化処理で参照されるローカルデコード画像の領域はこの画像左1/4範囲に動きベクトルの水平方向の最大値幅の領域を合わせた領域となる（図２）。水平方向動きベクトル最大値は規格で定められた値以下であるが、現実のエンコーダではより小さな範囲での動き予測処理が行われることが多く、参照されるローカルデコード画像の領域は、隣の画像符号化処理部の処理担当領域にとどまることが多い。 In a general moving picture coding system, the maximum value of a motion vector is determined. Since the image encoding processing unit 0 has a region of 1/4 from the left of the image as the processing range, the region of the local decoded image referred to in the encoding process of the macroblock in this region is the range of the 1/4 left of this image And the area of the maximum width in the horizontal direction of the motion vector are combined (FIG. 2). The maximum horizontal motion vector value is less than or equal to the value specified in the standard, but in actual encoders, motion prediction processing in a smaller range is often performed, and the area of the locally decoded image referred to is the adjacent image. In many cases, it remains in the processing area of the encoding processing unit.

図３は、デブロッキングフィルタ処理について説明する図である。
また、Writeアクセスに関してはH.264におけるデブロッキングフィルタ処理のために自処理の左側の領域への書き込みが必要となる。図３は、デブロッキングフィルタ処理での左側書き込み領域を示す。デブロッキングフィルタ処理では、ブロック境界にフィルタ処理を行うために、処理マクロブロックの隣接画素のRead/Write処理が必要となる。左隣接について注目すると、H.264規格では、デブロッキングフィルタ処理によって左隣接3画素のデータが変化する可能性があるので、左隣接3画素のライト処理が必要となる。 FIG. 3 is a diagram illustrating the deblocking filter process.
As for the write access, it is necessary to write to the left area of the own process for the deblocking filter process in H.264. FIG. 3 shows the left-side writing area in the deblocking filter process. In the deblocking filter process, in order to perform the filter process on the block boundary, the read / write process of the adjacent pixels of the processing macroblock is required. When attention is paid to the left adjacency, in the H.264 standard, the data of the left adjoining three pixels may change due to the deblocking filter process, and thus the write process of the left adjoining three pixels is required.

図４及び図５に、従来手法と本発明手法での画像符号化処理部と各メモリモジュール(Memory)の間の接続パス数の比較を示す。
ここで、各画像符号化処理部と各Memoryが担当する処理領域(符号化担当領域と格納領域)は1対1に対応しており、各画像符号化処理部の担当符号化領域が必要とする参照画領域は、対応するMemoryとその前後のMemoryに格納されているものとする(すなわち、最大ベクトル範囲が1Memory格納領域の範囲を超えない)。画像符号化処理部4/Memory4、画像符号化処理部8/Memory8の構成で接続パス数は、従来方法と本実施形態について、それぞれ表のようになっている。 4 and 5 show a comparison of the number of connection paths between the image encoding processing unit and each memory module (Memory) in the conventional technique and the technique of the present invention.
Here, the processing area (encoding area and storage area) in charge of each image encoding processing unit and each Memory has a one-to-one correspondence, and the encoding area in charge of each image encoding processing unit is required. It is assumed that the reference image area to be stored is stored in the corresponding Memory and the preceding and following Memory (that is, the maximum vector range does not exceed the range of 1 Memory storage area). The number of connection paths in the configuration of the image encoding processing unit 4 / Memory4 and the image encoding processing unit 8 / Memory8 is as shown in the table for the conventional method and this embodiment.

図６に、各マクロブロックが符号化処理に必要とする隣接マクロブロック情報の位置関係を示す。 FIG. 6 shows the positional relationship of adjacent macroblock information required for each macroblock for encoding processing.

例えば、H.264ではあるマクロブロックを処理する際に、その隣接A(左),B(上),C(右上),D(左上)の情報が必要となる。ここで隣接マクロブロック情報には、各マクロブロックでのマクロブロックタイプ情報、動きベクトル情報、画素値などがある。 For example, when processing a macroblock in H.264, information on adjacent A (left), B (upper), C (upper right), and D (upper left) is required. Here, the adjacent macroblock information includes macroblock type information, motion vector information, pixel values, and the like in each macroblock.

ある画像符号化処理部の処理領域左端のマクロブロックでは左、左上マクロブロックの隣接情報を使用するので、左領域を処理している画像符号化処理部からこれを受け取る必要がある。また、処理領域右端のマクロブロックでは右上隣接マクロブロック情報を使用するので、右領域を処理している画像符号化処理部からこれを受け取る必要がある。 Since the macroblock at the left end of the processing region of a certain image encoding processing unit uses the adjacent information of the left and upper left macroblocks, it needs to be received from the image encoding processing unit processing the left region. Further, since the right upper macro block information is used in the macro block at the right end of the processing region, it is necessary to receive this from the image encoding processing unit processing the right region.

図７及び図８に、隣接マクロブロック情報メモリの例を示す。
この例では、図７に示すように隣接マクロブロックメモリは２つの画像符号化処理部の間に一つずつ配置されている。画像符号化処理部のMB-line0の右端マクロブロック（図８（ｂ）でaの位置にあるマクロブック）の処理を完了すると、このマクロブロックのうち隣接マクロブロック情報として保存する必要のあるものを隣接マクロブロック情報メモリ０に格納する。この情報は、画像符号化処理部1によるMB-line0の左端マクロブロック（図８（ｂ）でdの位置にあるマクロブック）の処理開始時に参照されるが、MB-line1の左端マクロブロック（図８（ｂ）でeの位置にあるマクロブロック）でも使用されるので、消去されてはならない。マクロブロックｄの処理が完了するとこのマクロブロックの情報はマクロブロック情報メモリ０に格納される。このｄの情報はマクロブロックbの処理で参照されると、それ以降画像符号化処理部０の処理範囲では使用されないので隣接マクロブロック情報メモリ０から消去してもよい。よって、ｂのマクロブロック情報によって上書きされる。以下同様であって、図８（ｂ）のｃの位置にあるマクロブロックを画像符号化処理部０が処理する場合には、ｅの位置にあるマクロブロックが必要であって、このマクロブロックは、隣接マクロブロック情報メモリ０に格納されている。また、ｆの位置にあるマクロブロックを画像符号化処理部１が処理する場合には、ｂ及びｃの位置のマクロブロックが必要であって、これは、隣接マクロブロック情報メモリ０に格納されている。図８（ａ）は、以上の動作をタイミングチャートによって示した図である。 7 and 8 show examples of adjacent macroblock information memories.
In this example, as shown in FIG. 7, one adjacent macroblock memory is arranged between two image coding processing units. When the processing of the rightmost macroblock of MB-line0 (macrobook at position a in FIG. 8B) of the image encoding processing unit is completed, the macroblock that needs to be saved as adjacent macroblock information Is stored in the adjacent macroblock information memory 0. This information is referred to when the image encoding processing unit 1 starts processing the leftmost macroblock of MB-line0 (the macrobook at the position d in FIG. 8B), but the leftmost macroblock of MB-line1 ( Since it is also used in the macro block at the position e in FIG. 8B, it must not be erased. When the processing of the macro block d is completed, information on the macro block is stored in the macro block information memory 0. When the information of d is referred to in the processing of the macroblock b, it is not used in the processing range of the image encoding processing unit 0 thereafter, and may be deleted from the adjacent macroblock information memory 0. Therefore, it is overwritten by the macroblock information of b. The same applies to the following. When the image encoding processing unit 0 processes the macro block at the position c in FIG. 8B, the macro block at the position e is necessary. Are stored in the adjacent macroblock information memory 0. Further, when the image encoding processing unit 1 processes the macroblock at the position of f, the macroblocks at the positions of b and c are necessary, which are stored in the adjacent macroblock information memory 0. Yes. FIG. 8A is a timing chart showing the above operation.

この様な制御によって、隣接する領域を処理する画像符号化処理部同士の間で隣接マクロブロック情報が伝達され、並列処理が可能となる。各隣接マクロブロック情報メモリは２マクロブロックの情報を保持する容量を持てばよい。 By such control, adjacent macroblock information is transmitted between image encoding processing units that process adjacent regions, and parallel processing becomes possible. Each adjacent macroblock information memory may have a capacity for holding information of two macroblocks.

可変長符号化処理は通常、ストリームの先頭から順に処理を行う必要があり、並列処理を行うとしても、ユニークコードが存在する箇所から分割しての並列処理となる。スライス分割が必須ではないH.264などの規格において、ユニークコードが必ず存在するのはピクチャ単位であるため、一般ストリームに対応する場合、可変長符号化処理の並列化は、ピクチャ単位で行うこととなる。一方、本実施形態の画像符号化処理部は、ピクチャを領域毎に分けてそれぞれの領域の符号化処理を担当する。本実施形態の構成では、可変長符号化処理部と画像符号化処理部の間に符号化中間データバッファを設け、符号化中間データの格納位置ポインタを記録することによって、画像符号化処理部での並列処理を実現する。 In general, the variable-length encoding processing needs to be performed in order from the beginning of the stream, and even if parallel processing is performed, it is parallel processing by dividing from a portion where a unique code exists. In standards such as H.264 where slice division is not essential, unique codes always exist in units of pictures. Therefore, when supporting general streams, variable-length encoding processing must be performed in units of pictures. It becomes. On the other hand, the image encoding processing unit of the present embodiment divides a picture into regions and takes charge of the encoding processing of each region. In the configuration of the present embodiment, an encoding intermediate data buffer is provided between the variable length encoding processing unit and the image encoding processing unit, and the storage position pointer of the encoding intermediate data is recorded, so that the image encoding processing unit Realization of parallel processing.

図９〜図１１に、中間データバッファの説明図を示す。
図９はブロック構成、図１０は動作タイミング、、図１１は中間データバッファとポインタの関係である。図９〜図１１は、可変長符号化処理部4個、画像符号化処理部4個の場合の例である。 9 to 11 are explanatory diagrams of the intermediate data buffer.
9 is a block configuration, FIG. 10 is an operation timing, and FIG. 11 is a relationship between an intermediate data buffer and a pointer. FIG. 9 to FIG. 11 are examples in the case of four variable-length encoding processing units and four image encoding processing units.

図９に示されるように、４つの画像符号化処理部（Decoder Core）０〜３の出力は、中間データバッファ１５に格納され、４つの可変長符号化処理部０〜３によって読みだされ、可変長符号化される。 As shown in FIG. 9, the outputs of the four image encoding processing units (Decoder Core) 0 to 3 are stored in the intermediate data buffer 15 and read by the four variable length encoding processing units 0 to 3. Variable length coding.

画像符号化処理部０〜３は、担当する領域内のマクロブロックラインを上から符号化していき、中間データを中間データバッファ１５に格納する。このとき各画像符号化処理部０〜３は担当領域内の中間データを処理順に書き込んでゆく。図１０に示されるように、画像符号化処理部０〜３の動作は、完全に並列であり、並列処理によって得られた中間データを中間データバッファ１５に格納する。図１１に示されるように、マクロブロックラインの中間データ毎の開始ポインタは、ピクチャ番号をl、マクロブロックラインをm、画像符号化処理部番号をnとすると、PTR_l_m_nとして記録される。可変長符号化処理部０〜３では、ピクチャ単位で処理を行うものとし、それぞれピクチャ０〜３の処理を行うものとする。可変長符号化処理部０〜３の動作は、処理対象のデータが中間データバッファ１５に格納された後、初めて処理を開始できる。したがって、図１０に示されるように、可変長符号化処理部０〜３の処理は、それぞれ所定タイミングずつずれた並列処理となる。可変長符号化処理部０では中間データをPTR_0_0_0、PTR_0_0_1、PTR_0_0_2、PTR_0_0_3、PTR_0_1_0、PTR_0_1_2、PTR_0_1_3、…と符号化処理を行うべき順番に読み出して符号化処理を行う。他の符号化処理部も同様である。このようにすることで画像符号化処理部はピクチャ内の並列処理を、可変長符号化処理部はピクチャ毎の並列処理が可能になる。 The image encoding processing units 0 to 3 encode the macroblock lines in the area in charge from above and store the intermediate data in the intermediate data buffer 15. At this time, each of the image encoding processing units 0 to 3 writes the intermediate data in the assigned area in the order of processing. As shown in FIG. 10, the operations of the image encoding processing units 0 to 3 are completely parallel, and the intermediate data obtained by the parallel processing is stored in the intermediate data buffer 15. As shown in FIG. 11, the start pointer for each intermediate data of a macroblock line is recorded as PTR_l_m_n, where l is a picture number, m is a macroblock line, and n is an image encoding processing unit number. The variable length coding processing units 0 to 3 perform processing in units of pictures, and perform processing for pictures 0 to 3, respectively. The operations of the variable-length encoding processing units 0 to 3 can start processing only after the data to be processed is stored in the intermediate data buffer 15. Therefore, as shown in FIG. 10, the processes of the variable-length encoding processing units 0 to 3 are parallel processes each shifted by a predetermined timing. The variable length coding processing unit 0 reads out the intermediate data in the order in which the coding processing should be performed, such as PTR_0_0_0, PTR_0_0_1, PTR_0_0_2, PTR_0_0_3, PTR_0_1_0, PTR_0_1_2, PTR_0_1_3,. The same applies to other encoding processing units. In this way, the image encoding processing unit can perform parallel processing within a picture, and the variable length encoding processing unit can perform parallel processing for each picture.

各画像符号化処理部が処理中のマクロブロックの位置情報を制御部に通知し、制御部がエンコード進行状況を知ることにより、各マクロブロックで符号化に必要な隣接マクロブロック情報が生成されたことを把握することができる。各画像符号化処理部が制御部からの指示により、マクロブロック単位で処理の待機・再開を行うようにしておくと、各マクロブロック単位で必要な隣接マクロブロック情報が揃ってからデコードを開始するという制御が可能となる。 Each image encoding processing unit notifies the position information of the macro block being processed to the control unit, and the control unit knows the progress of encoding, thereby generating adjacent macro block information necessary for encoding in each macro block. I can understand that. If each image encoding processing unit waits and resumes processing in units of macroblocks according to instructions from the control unit, decoding starts after the necessary adjacent macroblock information is prepared in units of macroblocks. Control becomes possible.

動画像符号化装置の処理において、参照画読み出しなどのメモリアクセスは通常矩形領域でのメモリアクセスとなるが、1ピクチャの画像データを複数のメモリに領域毎に格納する場合、その境界領域において、1つの矩形アクセスリクエストが2つのメモリにまたがる場合がある。そのためメモリマップに応じて、矩形アクセスリクエストを二つに分ける必要がある。 In the processing of the moving image encoding device, memory access such as reading a reference image is normally memory access in a rectangular area, but when storing image data of one picture for each area in a plurality of memories, in the boundary area, One rectangular access request may span two memories. Therefore, it is necessary to divide the rectangular access request into two according to the memory map.

図１２に、矩形領域単位のアクセス例を示す。Aの領域のアクセスは、Memory0のみのアクセス、Bの領域のアクセスはMemory1のみのアクセスとなるが、Cの領域は左側がMemory0、右側がMemory1に格納されているので、両方のMemoryにアクセスする必要がある。 FIG. 12 shows an example of access in units of rectangular areas. Access to the area A is access only to Memory0, access to the area B is access to only Memory1, but the area C is stored in Memory0 on the left side and Memory1 on the right side. There is a need.

図１３に、振り分けモジュールの構成例を示す。リードアクセスの場合、リクエスト振り分け部でどちらのMemoryにアクセスするか、あるいは、リクエストを分割する必要があるかを判定し、片方のMemoryアクセスの場合は、そのMemoryへのリードアクセス、分割する必要がある場合は、それぞれのMemoryに必要なリードアクセスを行う。各Memoryからのリードデータについては、リードデータ振り分け部２０−３、２０−４でどちらの画像符号化処理部からのリクエストであったに応じて、リクエスト発行元の画像符号化処理部にリードデータを返す。 FIG. 13 shows a configuration example of the distribution module. In the case of read access, the request distribution part determines which memory is accessed or whether the request needs to be divided. In the case of one memory access, read access to that memory needs to be divided. If there is, perform the read access necessary for each Memory. For the read data from each memory, the read data is distributed to the image encoding processing unit that issued the request according to the request from which image encoding processing unit in the read data distribution units 20-3 and 20-4. return it.

ライトアクセスの場合は、リクエスト振り分け部２０−１、２０−２で、どちらのMemoryにアクセスするか、あるいは、リクエストを分割する必要があるかを判定するとともに、画像符号化処理部からライトデータを受け取り、片方のMemoryアクセスの場合は、そのMemoryへのライトアクセス、分割する必要がある場合は、画像符号化処理部から受け取ったライトデータをそれぞれのMemoryに振り分けつつ、ライトアクセスを行う。 In the case of write access, the request distribution units 20-1 and 20-2 determine which memory is to be accessed or whether the request needs to be divided, and write data from the image encoding processing unit. In the case of receiving and memory access on one side, write access to that memory, and if it is necessary to divide, write access is performed while distributing the write data received from the image encoding processing unit to each memory.

上記実施形態のエンコーダにおいて、複数の画像符号化処理部が物理的に複数のモジュールに分かれて配置されていてもよい。この場合、モジュール間で画像データおよび、隣接マクロブロック情報、符号化中間データ、マクロブロック符号化位置情報のうちいずれかのデータの受け渡しを行う機能を持つことにより、複数のモジュールで1つのエンコード装置を構成する。このように複数モジュールで1つのエンコード装置を構成する場合においても、メモリ-画像符号化処理部間の接続パス削減により、モジュール間IF、データ転送量を削減することが可能である。 In the encoder of the above embodiment, a plurality of image encoding processing units may be physically divided into a plurality of modules. In this case, one module is provided with a plurality of modules by having a function of transferring any one of image data and adjacent macroblock information, encoded intermediate data, and macroblock encoded position information between modules. Configure. In this way, even when a single encoding device is configured with a plurality of modules, it is possible to reduce the inter-module IF and the data transfer amount by reducing the connection path between the memory and the image encoding processing unit.

本実施形態をH.264などのデコーダ/エンコーダに適用する場合、上記のように画像符号化復号処理部間の同期制御の制御信号、隣接マクロブロック情報の受け渡し信号などが必要になる場合があるが、これらの機能に必要な回路規模の増加分は、メモリバスの削減効果による回路規模削減に比べて小さい。画像符号化復号処理部とメモリの間のメモリバスは通常、高いデータ転送レートが必要なため、バス幅が大きくなっている(64bitなど)ため、接続パス数の増加に伴い、そのバスセレクト信号などの回路規模も大きくなるが、本実施形態で必要な追加回路分は、制御信号や転送レートの低いデータであるので、回路規模は小さくて済む。本実施形態では、従来技術に比べて、バス幅の広いメモリバスの接続パス数を大幅に削減しているため、本実施形態に必要な制御信号、データパス等を追加しても、全体の回路規模は従来に比べて削減となる。 When this embodiment is applied to a decoder / encoder such as H.264, a control signal for synchronization control between image coding / decoding processing units, a signal for passing adjacent macroblock information, and the like may be required as described above. However, the increase in the circuit scale required for these functions is small compared to the circuit scale reduction due to the memory bus reduction effect. Since the memory bus between the image encoding / decoding processor and the memory usually requires a high data transfer rate, the bus width is increased (64 bits, etc.), so the bus select signal increases as the number of connection paths increases. However, since the additional circuit required in the present embodiment is a control signal and data with a low transfer rate, the circuit scale can be small. In this embodiment, since the number of connection paths of a memory bus having a wide bus width is greatly reduced compared to the prior art, even if control signals, data paths, and the like necessary for this embodiment are added, the whole The circuit scale is reduced compared to the conventional one.

また、画像符号化復号処理部数とメモリ数が増えてもその接続関係が限定される(従来技術では、全画像符号化復号処理部がすべてのメモリにアクセスする必要があるが、本実施形態では、各画像符号化復号処理部は最大で2つまたは3つのメモリにアクセスすればよい)ため、LSIにしたときのI/O端子と画像符号化復号処理部配置の関係を単純化することができ、レイアウト難度を大幅に低減することが可能となる。 Further, even if the number of image coding / decoding processing units and the number of memories increase, the connection relationship is limited (in the conventional technology, all image coding / decoding processing units need to access all memories, but in this embodiment, (Each image encoding / decoding processor only needs to access two or three memories at the maximum), so that the relationship between the I / O terminal and the image encoding / decoding processor arrangement in the LSI can be simplified. This can greatly reduce the layout difficulty.

図１４及び図１５は、本実施形態に従ったエンコーダ装置を説明する図である。
図１４は4Kx2K(3840x2160)のH.264エンコーダに本実施形態を4並列で適用した装置のブロック構成である。 14 and 15 are diagrams for explaining an encoder device according to the present embodiment.
FIG. 14 shows a block configuration of an apparatus in which the present embodiment is applied to a 4Kx2K (3840x2160) H.264 encoder in four parallel manners.

可変長符号化処理部２５−１〜２５−４、画像符号化処理部２６−１〜２６−４をそれぞれ4つずつ有し、可変長符号化処理部２５−１〜２５−４は、ピクチャ単位の並列処理、画像符号化処理部２６−１〜２６−４は、図１５のようにピクチャを4つの領域に分割したそれぞれの領域を担当し、それぞれの領域のデータが対応した4つのメモリに格納されるメモリマップとする。なお本例での水平ベクトル範囲は±512とし、各画像符号化処理部２６−１〜２６−４で処理する領域が参照する領域は、左右のコアで処理する領域を超えることはないものとする。それぞれの可変長符号化部２５−１〜２５−４は、H.264規格にしたがった可変長符号化処理(CAVLC or CABACなど)を実施し、画像符号化処理部２６−１〜２６−４は、H.264規格に画像符号化処理(動きベクトル探索、インター予測、イントラ予測、直交変換、量子化、デブロッキングフィルタ処理など)を行う。また、可変長符号化部２５−１〜２５−４と画像符号化処理部２６−１〜２６−４の間には、中間データを格納する中間データバッファ２７、各画像符号化部の間には隣接マクロブロック情報を受け渡すための隣接マクロブロック情報バッファ２８−１〜２８−３、画像符号化処理部とメモリの間には、メモリアクセスを振り分ける機能を有したメモリ制御部２９、各画像符号化処理部２６−１〜２６−４からの処理マクロブロック位置情報を受け取り、各画像符号化処理部の起動制御を行う画像符号化処理部起動制御部２４を有する。 Each of the variable length coding processing units 25-1 to 25-4 and the image coding processing units 26-1 to 26-4 has four each, and the variable length coding processing units 25-1 to 25-4 The unit parallel processing and image encoding processing units 26-1 to 26-4 are responsible for the respective areas obtained by dividing the picture into four areas as shown in FIG. 15, and the four memories corresponding to the data in the respective areas. The memory map stored in. Note that the horizontal vector range in this example is ± 512, and the region referred to by the region processed by each image encoding processing unit 26-1 to 26-4 does not exceed the region processed by the left and right cores. To do. Each of the variable length encoding units 25-1 to 25-4 performs variable length encoding processing (CAVLC or CABAC or the like) according to the H.264 standard, and the image encoding processing units 26-1 to 26-4. Performs image coding processing (motion vector search, inter prediction, intra prediction, orthogonal transform, quantization, deblocking filter processing, etc.) to the H.264 standard. Between the variable length coding units 25-1 to 25-4 and the image coding processing units 26-1 to 26-4, an intermediate data buffer 27 for storing intermediate data and between each image coding unit. Are adjacent macroblock information buffers 28-1 to 28-3 for passing adjacent macroblock information, a memory control unit 29 having a function of distributing memory access between the image encoding processing unit and the memory, and each image An image encoding processing unit activation control unit 24 that receives processing macroblock position information from the encoding processing units 26-1 to 26-4 and performs activation control of each image encoding processing unit.

図１６に、本実施形態に従った並列動作のタイミングを示す。
前記中間データバッファ、隣接マクロブロック情報バッファ、メモリ制御部、コア起動制御部は、エンコードの並列処理中、上で説明した役割を果たし、H.264デコード処理に
おける本発明の並列動作を可能とする。 FIG. 16 shows the timing of the parallel operation according to the present embodiment.
The intermediate data buffer, the adjacent macroblock information buffer, the memory control unit, and the core activation control unit perform the above-described roles during the parallel encoding process, and enable the parallel operation of the present invention in the H.264 decoding process. .

図１６に示されるように、画像符号化処理部０〜３は、タイミングをずらして、並列に処理を行う。最初、ピクチャ処理が起動されると、画像符号化処理部０が、担当領域のマクロブロックライン（MB-line）０の処理を行う。このマクロブロックラインの処理においては、各マクロブロックの処理が順に起動される。マクロブロックライン０の処理を画像符号化処理部０が終わると、次は、マクロブロックライン１の処理に進むが、同時に、画像符号化処理部１が、担当領域のマクロブロックライン０の処理を開始する。同様にして、画像符号化処理部２、３も処理を開始する。すべての画像符号化処理部がすべてのマクロブロックラインの処理を終了すると、ピクチャの処理が完了する。 As illustrated in FIG. 16, the image encoding processing units 0 to 3 perform processing in parallel at different timings. First, when picture processing is activated, the image encoding processing unit 0 performs processing of the macro block line (MB-line) 0 of the assigned region. In this macro block line processing, the processing of each macro block is activated in turn. When the image encoding processing unit 0 finishes the processing of the macro block line 0, the process proceeds to the processing of the macro block line 1. At the same time, the image encoding processing unit 1 performs the processing of the macro block line 0 in the assigned region. Start. Similarly, the image encoding processing units 2 and 3 also start processing. When all the image encoding processing units have finished processing all the macroblock lines, the picture processing is completed.

本実施形態では、各画像符号化処理部における各マクロブロックの符号化処理開始タイミングが、必要とする隣接マクロブロック情報の生成が完了した後となる制御の一例として、以下のような制御とする。 In the present embodiment, the following control is an example of control that is performed after completion of generation of necessary neighboring macroblock information at the encoding processing start timing of each macroblock in each image encoding processing unit. .

図１７及び図１８は、本発明の実施形態に従った制御フローである。
図１７は、全体制御部が画像処理部を制御して１ピクチャ処理を行うフローを、図１８は各符号化処理部での１ピクチャ処理での処理フローを示す。 17 and 18 are control flows according to the embodiment of the present invention.
FIG. 17 shows a flow in which the overall control unit controls the image processing unit to perform one picture processing, and FIG. 18 shows a processing flow in one picture processing in each encoding processing unit.

図１７において、ステップＳ１０において、画像符号化処理部を起動し、前記画像符号化処理部で１ピクチャの処理が完了するまで待って、処理を終了する。
図１８において、ステップＳ２０において、次に処理するマクロブロックは、担当領域内の左端にあるマクロブロックか否かを判断する。ステップＳ２０の判断がＹｅｓの場合には、ステップＳ２１において、もっとも左の領域を担当する画像符号化処理部が処理を行おうとしているか否かを判断する。ステップＳ２１の判断がＹｅｓの場合には、ステップＳ３０に進む。ステップＳ２１の判断がＮｏの場合には、ステップＳ２２において、左隣接、左上隣接マクロブロック情報は、左の画像符号化処理部との間の隣接情報メモリに存在するか否かを判断し、Ｎｏの場合には、これらの情報が隣接情報メモリに格納されるまで待つ。ステップＳ２２の判断がＹｅｓの場合には、ステップＳ２３において、マクロブロックを処理し、ステップＳ２４において、処理結果を隣接マクロブロック情報として、左の画像符号化処理部との間の隣接情報メモリに格納して、ステップＳ３１に進む。 In FIG. 17, in step S <b> 10, the image encoding processing unit is activated, waits until the processing of one picture is completed in the image encoding processing unit, and the processing ends.
In FIG. 18, in step S20, it is determined whether or not the next macroblock to be processed is the leftmost macroblock in the assigned area. If the determination in step S20 is yes, in step S21, it is determined whether or not the image encoding processing unit in charge of the leftmost region is about to perform processing. If the determination in step S21 is yes, the process proceeds to step S30. If the determination in step S21 is No, it is determined in step S22 whether the left adjacent and upper left adjacent macroblock information exists in the adjacent information memory between the left image encoding processing unit and No. In this case, the process waits until these pieces of information are stored in the adjacent information memory. If the determination in step S22 is Yes, the macroblock is processed in step S23, and the processing result is stored as adjacent macroblock information in the adjacent information memory with the left image encoding processing unit in step S24. Then, the process proceeds to step S31.

ステップＳ２０の判断がＮｏの場合には、ステップＳ２５において、次に処理するマクロブロックは担当領域内の右端のマクロブロックであるか否かを判断する。ステップＳ２５の判断がＮｏの場合には、ステップＳ３０に進む。 If the determination in step S20 is No, it is determined in step S25 whether or not the next macroblock to be processed is the rightmost macroblock in the assigned area. If the determination in step S25 is No, the process proceeds to step S30.

ステップＳ２５の判断がＹｅｓの場合には、ステップＳ２６において、もっとも右の領域の担当の画像符号化処理装置が処理を行おうとしているか否かを判断する。ステップＳ２６の判断がＹｅｓの場合には、ステップＳ３０に進んで、マクロブロックを処理し、ステップＳ３１に進む。ステップＳ２６の判断がＮｏの場合には、ステップＳ２７において、右上隣接マクロブロック情報は、右画像符号化処理部との間の隣接情報メモリに存在するか否かを判断する。ステップＳ２７の判断がＮｏの場合には、当該情報が、隣接情報メモリに格納されるまで待つ。ステップＳ２７の判断がＹｅｓの場合には、ステップＳ２８において、マクロブロックを処理し、ステップＳ２９において、処理結果を隣接マクロブロック情報として、右画像符号化処理部との間の隣接情報メモリに格納し、ステップＳ３１に進む。 If the determination in step S25 is Yes, in step S26, it is determined whether or not the image encoding processing apparatus in charge of the rightmost region is about to perform processing. If the determination in step S26 is yes, the process proceeds to step S30, the macroblock is processed, and the process proceeds to step S31. If the determination in step S26 is No, it is determined in step S27 whether the upper right adjacent macroblock information exists in the adjacent information memory with the right image encoding processing unit. If the determination in step S27 is No, the process waits until the information is stored in the adjacent information memory. If the determination in step S27 is Yes, the macroblock is processed in step S28, and the processing result is stored as adjacent macroblock information in the adjacent information memory with the right image encoding processing unit in step S29. The process proceeds to step S31.

ステップＳ３１では、ピクチャ内の担当マクロブロックの処理が完了したか否かを判断し、Ｎｏの場合には、ステップＳ２０に戻り、Ｙｅｓの場合には、処理を終了する。
図１９は、本実施形態のエンコーダ／デコーダ構成をLSIに適用した場合のチップ構成
例である。 In step S31, it is determined whether or not the processing of the assigned macroblock in the picture is completed. If No, the process returns to step S20, and if Yes, the process ends.
FIG. 19 shows an example of a chip configuration when the encoder / decoder configuration of the present embodiment is applied to an LSI.

LSI実装においては、様々な機能、IFが実装されることが予想されるが、ここでは、エンコード処理に必須である、画像入力IF、ストリーム出力IFのみ図示している。図１９（ａ）は、1チップ構成、図１９（ｂ）は2チップ構成の例である。 In the LSI implementation, various functions and IF are expected to be implemented, but here, only the image input IF and the stream output IF, which are essential for the encoding process, are illustrated. FIG. 19A shows an example of a one-chip configuration, and FIG. 19B shows an example of a two-chip configuration.

図１９（ａ）の1チップ構成では、4つの画像符号化／復号処理部が1つのチップに搭載され、画像格納用の4つのメモリ０〜３が接続されている。また、本図では、ストリーム格納用、中間データ格納用のバッファは別メモリ３０で構成している。このメモリ３０は、画像格納用の4つのメモリ０〜３と別に用意しても良いし、画像格納用に使用している4つのメモリ０〜３の一部に領域を確保して同一メモリとして使用してもよい。 In the one-chip configuration of FIG. 19A, four image encoding / decoding processing units are mounted on one chip, and four memories 0 to 3 for storing images are connected. In this figure, the stream storage buffer and the intermediate data storage buffer are constituted by a separate memory 30. This memory 30 may be prepared separately from the four memories 0 to 3 for storing images, or an area is secured in a part of the four memories 0 to 3 used for storing images as the same memory. May be used.

図１９（ｂ）の2チップ構成では、2つのメモリの間にデータ転送を行うインターフェース３１を設けて、ストリームデータ、中間データ、隣接マクロブロック情報、画像データ等が受け渡し可能なようにしている。このように画像符号化／復号処理部が複数のLSIに分けられ、メモリの接続先が複数のLSIに分かれている場合であっても、データ転送の受け渡しを行い、全体として本実施形態で規定された並列処理を行っている場合、何チップに分けられていたとしても問題はない。 In the two-chip configuration of FIG. 19B, an interface 31 that performs data transfer is provided between two memories so that stream data, intermediate data, adjacent macroblock information, image data, and the like can be transferred. In this way, even when the image encoding / decoding processing unit is divided into a plurality of LSIs and the memory connection destination is divided into a plurality of LSIs, the data transfer is transferred and defined as a whole in this embodiment. When the parallel processing is performed, there is no problem even if it is divided into any number of chips.

図２０は、本実施形態をH.264デコーダに適用した場合の構成図を示す。
図２０は、4Kx2K(3840x2160)のH.264デコーダを4並列で適用した装置である。可変長復号処理部４０−１〜４０−４、画像復号処理部４１−１〜４１−４をそれぞれ4つずつ有し、可変長復号処理部４０−１〜４０−４は、ピクチャ単位の並列処理、画像復号処理部４１−１〜４１−４は、ピクチャを本実施形態にしたがって4つの領域に分割したそれぞれの領域を担当し、それぞれの領域のデータが対応した4つのメモリ０〜３に格納されるメモリマップとする。これらの割り当てはエンコーダの実施形態と同様である。それぞれの可変長復号処理部４０−１〜４０−４は、H.264規格にしたがった可変長復号処理(CAVLC or CABACなど)を実施し、画像復号処理部４１−１〜４１−４は、H.264の画像復号処理に必要な処理(インター予測、イントラ予測、逆直交変換、逆量子化、デブロッキングフィルタ処理など)を行う。また、可変長復号処理部４０−１〜４０−４と画像復号処理部４１−１〜４１−４の間には中間データを格納する中間データバッファ４２、各画像復号処理部４１−１〜４１−４の間には隣接マクロブロック情報を受け渡すための隣接マクロブロック情報バッファ４３−１〜４３−３、画像復号処理部４１−１〜４１−４とメモリ０〜３の間には、メモリアクセスを振り分ける機能を有したメモリ制御部４４、各画像復号処理部４１−１〜１−４からの処理マクロブロック位置情報を受け取り、各コアの起動制御を行う画像復号処理部起動制御部４５を有する。デコーダ処理における画像復号処理部の並列動作の方法は、エンコード処理における画像符号化処理部の並列動作と同様であるので説明を省略する。 FIG. 20 shows a configuration diagram when this embodiment is applied to an H.264 decoder.
FIG. 20 shows an apparatus in which 4Kx2K (3840x2160) H.264 decoders are applied in parallel. Each of the variable length decoding processing units 40-1 to 40-4 and the four image decoding processing units 41-1 to 41-4 includes four variable length decoding processing units 40-1 to 40-4. The processing and image decoding processing units 41-1 to 41-4 are responsible for the respective areas obtained by dividing the picture into four areas according to the present embodiment, and the data in the respective areas correspond to the four memories 0 to 3 corresponding thereto. The stored memory map. These assignments are the same as in the encoder embodiment. Each of the variable length decoding processing units 40-1 to 40-4 performs variable length decoding processing (CAVLC or CABAC or the like) according to the H.264 standard, and the image decoding processing units 41-1 to 41-4 Performs processing necessary for H.264 image decoding (inter prediction, intra prediction, inverse orthogonal transform, inverse quantization, deblocking filter processing, etc.). Further, an intermediate data buffer 42 for storing intermediate data and the image decoding processing units 41-1 to 41-41 are provided between the variable length decoding processing units 40-1 to 40-4 and the image decoding processing units 41-1 to 41-4. -4 between adjacent macroblock information buffers 43-1 to 43-3 for transferring adjacent macroblock information, between the image decoding processing units 41-1 to 41-4 and the memories 0 to 3, the memory A memory control unit 44 having a function of distributing access and an image decoding processing unit activation control unit 45 that receives processing macroblock position information from each of the image decoding processing units 41-1 to 1-4 and performs activation control of each core. Have. The method of the parallel operation of the image decoding processing unit in the decoder process is the same as the parallel operation of the image encoding processing unit in the encoding process, and the description thereof will be omitted.

前記中間データバッファ、隣接マクロブロック情報バッファ、メモリ制御部、コア起動制御部は、デコード処理においても、上で説明したのと同様の役割を果たし、H.264デコード処理における本発明の並列動作を可能とする。 The intermediate data buffer, adjacent macroblock information buffer, memory control unit, and core activation control unit play a role similar to that described above in the decoding process, and perform the parallel operation of the present invention in the H.264 decoding process. Make it possible.

以上の実施形態のほかに、以下の付記を開示する。
（付記１）
画像データをブロック単位で符号化する画像符号化装置において、
前記画像データを分割して格納する複数の格納手段と、
画像符号化処理を行う複数の画像符号化手段と、
該複数の格納手段と該複数の画像符号化手段とを接続する接続手段と、
前記画像データを２次元配列と見た場合に、ブロック単位の画像符号化の進行する方向
とは異なった方向に、前記画像データを前記画像符号化手段の数だけ分割し、それぞれ分割された前記画像データを前記複数の画像符号化手段に割り当てて、各画像符号化手段に並列に画像符号化処理を行わせる制御手段と、
を備えることを特徴とする画像符号化装置。
（付記２）
処理対象の画像データの単位ブロックの隣接単位ブロックの情報を格納する隣接単位ブロック情報格納手段を更に備え、
前記画像符号化手段は、該処理対象の単位ブロックの処理を、処理に必要な隣接単位ブロックの情報が揃ってから開始することを特徴とする付記１に記載の画像符号化装置。
（付記３）
前記画像符号化手段の処理結果である中間データを格納する、該中間データを画像データ内の領域毎の格納位置を示すポインタを有するバッファ手段を更に備えることを特徴とする付記１に記載の画像符号化装置。
（付記４）
前記複数の画像符号化手段が物理的に複数のモジュールに分かれて配置され、
該モジュール間で画像データの処理に必要な情報を通信するためのインターフェース手段を更に備えることを特徴とする付記１に記載の画像符号化装置。
（付記５）
前記画像符号化手段は、符号化処理中の単位ブロックの画像データ内での位置情報を取得し、該位置情報によって該処理対象の単位ブロックの符号化処理の待機、再開を行うことを特徴とする付記１に記載の画像符号化装置。
（付記６）
前記画像符号化処理の進行する方向は、ラスタスキャンの方向であることを特徴とする付記１に記載の画像符号化装置。
（付記７）
前記隣接単位ブロック情報格納手段は、前記２つの画像符号化手段の組に対し１つずつ設けられることを特徴とする付記２に記載の画像符号化装置。
（付記８）
画像データを復号する画像復号装置において、
可変長符号化された画像データを可変長復号する可変長復号手段と、
前記可変長復号された前記画像データにブロック単位で画像復号処理を施す複数の画像復号手段と、
前記画像復号処理された前記画像データを分割して格納する複数の格納手段と、
該複数の格納手段と該複数の画像復号手段とを接続する接続手段と、
画像データを２次元配列と見た場合に、ブロック単位で行う画像復号処理の進行する方向とは異なった方向に、前記画像データを前記画像復号手段の数だけ分割し、それぞれ分割された画像データを前記複数の画像復号手段に割り当てて、各画像復号手段に並列に画像復号処理を行わせる制御手段と、
を備えることを特徴とする画像復号装置。
（付記９）
処理対象の画像データの単位ブロックの隣接単位ブロックの情報を格納する隣接単位ブロック情報格納手段を更に備え、
前記画像復号手段は、該処理対象の単位ブロックの処理を、処理に必要な隣接単位ブロックの情報が揃ってから開始することを特徴とする付記８に記載の画像復号装置。
（付記１０）
前記可変長復号手段の出力する中間データを格納する、該中間データの画像データ内の領域毎の格納位置を示すポインタを記録するバッファ手段を更に備えることを特徴とする付記８に記載の画像復号装置。
（付記１１）
前記複数の画像復号手段が物理的に複数のモジュールに分かれて配置さえれ、
該モジュール間で、画像データの処理に必要な情報を通信するためのインターフェース手段を更に備えることを特徴とする付記８に記載の画像復号装置。
（付記１２）
前記画像復号手段は、復号処理中の単位ブロックの画像データ内での位置情報を取得し、該位置情報によって該処理対象の単位ブロックの復号処理の待機、再開を行うことを特徴とする付記８に記載の画像復号装置。
（付記１３）
前記画像復号処理の進行する方向は、ラスタスキャンの方向であることを特徴とする付記８に記載の画像復号装置。
（付記１４）
前記隣接単位ブロック情報格納手段は、前記２つの画像復号手段の組に対し１つずつ設けられることを特徴とする付記９に記載の画像復号装置。 In addition to the above embodiments, the following supplementary notes are disclosed.
(Appendix 1)
In an image encoding device that encodes image data in units of blocks,
A plurality of storage means for dividing and storing the image data;
A plurality of image encoding means for performing image encoding processing;
Connection means for connecting the plurality of storage means and the plurality of image encoding means;
When the image data is viewed as a two-dimensional array, the image data is divided by the number of the image encoding means in a direction different from the direction in which image encoding progresses in units of blocks, Control means for allocating image data to the plurality of image encoding means and causing each image encoding means to perform image encoding processing in parallel;
An image encoding device comprising:
(Appendix 2)
Further comprising an adjacent unit block information storage means for storing information on adjacent unit blocks of the unit block of the image data to be processed;
2. The image encoding apparatus according to appendix 1, wherein the image encoding unit starts the processing of the unit block to be processed after information on adjacent unit blocks necessary for the processing is gathered.
(Appendix 3)
The image according to claim 1, further comprising buffer means for storing intermediate data as a processing result of the image encoding means, and having a pointer indicating a storage position for each area in the image data. Encoding device.
(Appendix 4)
The plurality of image encoding means are physically divided and arranged in a plurality of modules,
The image encoding apparatus according to claim 1, further comprising an interface unit for communicating information necessary for processing image data between the modules.
(Appendix 5)
The image encoding means acquires position information in the image data of a unit block being encoded, and performs standby and restart of encoding processing of the unit block to be processed based on the position information. The image encoding apparatus according to appendix 1.
(Appendix 6)
The image encoding apparatus according to appendix 1, wherein the direction in which the image encoding process proceeds is a raster scan direction.
(Appendix 7)
The image coding apparatus according to appendix 2, wherein the adjacent unit block information storage unit is provided for each set of the two image coding units.
(Appendix 8)
In an image decoding device for decoding image data,
Variable length decoding means for variable length decoding of variable length encoded image data;
A plurality of image decoding means for performing image decoding processing in block units on the variable length decoded image data;
A plurality of storage means for dividing and storing the image data subjected to the image decoding process;
Connection means for connecting the plurality of storage means and the plurality of image decoding means;
When the image data is viewed as a two-dimensional array, the image data is divided by the number of the image decoding means in a direction different from the direction in which the image decoding process performed in units of blocks is performed. Control means for allocating the image decoding means to the plurality of image decoding means, and causing each image decoding means to perform image decoding processing in parallel,
An image decoding apparatus comprising:
(Appendix 9)
Further comprising an adjacent unit block information storage means for storing information on adjacent unit blocks of the unit block of the image data to be processed;
9. The image decoding apparatus according to appendix 8, wherein the image decoding unit starts the processing of the processing target unit block after information on adjacent unit blocks necessary for the processing is gathered.
(Appendix 10)
9. The image decoding according to appendix 8, further comprising buffer means for storing intermediate data output from the variable length decoding means and for recording a pointer indicating a storage position for each area in the image data of the intermediate data. apparatus.
(Appendix 11)
The plurality of image decoding means can be physically divided into a plurality of modules and arranged,
The image decoding apparatus according to appendix 8, further comprising interface means for communicating information necessary for processing image data between the modules.
(Appendix 12)
The image decoding means acquires position information in the image data of the unit block being decoded, and waits and restarts decoding processing of the unit block to be processed based on the position information. The image decoding apparatus described in 1.
(Appendix 13)
The image decoding apparatus according to appendix 8, wherein the direction in which the image decoding process proceeds is a raster scan direction.
(Appendix 14)
The image decoding apparatus according to appendix 9, wherein the adjacent unit block information storage unit is provided for each set of the two image decoding units.

本発明の実施形態の動作を説明する図（その１）である。It is FIG. (1) explaining operation | movement of embodiment of this invention. 本発明の実施形態の動作を説明する図（その２）である。It is FIG. (2) explaining operation | movement of embodiment of this invention. デブロッキングフィルタ処理について説明する図である。It is a figure explaining a deblocking filter process. 従来手法と本発明手法での画像符号化処理部と各メモリモジュール(Memory)の間の接続パス数の比較を示す図（その１）である。It is FIG. (1) which shows the comparison of the number of connection paths between the image encoding process part and each memory module (Memory) by the conventional method and this invention method. 従来手法と本発明手法での画像符号化処理部と各メモリモジュール(Memory)の間の接続パス数の比較を示す図（その２）である。It is FIG. (2) which shows the comparison of the number of connection paths between the image encoding process part and each memory module (Memory) by the conventional method and this invention method. 各マクロブロックが符号化処理に必要とする隣接マクロブロック情報の位置関係を示す図である。It is a figure which shows the positional relationship of the adjacent macroblock information which each macroblock requires for an encoding process. 隣接マクロブロック情報メモリの例を示す図（その１）である。FIG. 3 is a first diagram illustrating an example of an adjacent macroblock information memory; 隣接マクロブロック情報メモリの例を示す図（その２）である。FIG. 6 is a second diagram illustrating an example of an adjacent macroblock information memory; 中間データバッファの説明図（その１）である。It is explanatory drawing (the 1) of an intermediate data buffer. 中間データバッファの説明図（その２）である。It is explanatory drawing (the 2) of an intermediate data buffer. 中間データバッファの説明図（その３）である。It is explanatory drawing (the 3) of an intermediate data buffer. 矩形領域単位のアクセス例を示す図である。It is a figure which shows the example of an access of a rectangular area unit. 振り分けモジュールの構成例を示す図である。It is a figure which shows the structural example of a distribution module. 本実施形態に従ったエンコーダ装置を説明する図（その１）である。It is FIG. (1) explaining the encoder apparatus according to this embodiment. 本実施形態に従ったエンコーダ装置を説明する図（その２）である。It is FIG. (2) explaining the encoder apparatus according to this embodiment. 本実施形態に従った並列動作のタイミングを示す図である。It is a figure which shows the timing of the parallel operation | movement according to this embodiment. 本発明の実施形態に従った制御フロー（その１）である。It is the control flow (the 1) according to the embodiment of the present invention. 本発明の実施形態に従った制御フロー（その２）である。It is a control flow (the 2) according to embodiment of this invention. 本実施形態のエンコーダ／デコーダ構成をLSIに適用した場合のチップ構成例である。It is an example of a chip configuration when the encoder / decoder configuration of the present embodiment is applied to an LSI. 本実施形態をH.264デコーダに適用した場合の構成図を示す。The block diagram at the time of applying this embodiment to an H.264 decoder is shown. 従来のフレーム間予測の依存関係を守るように制御し、フレーム間での並列化を行う手法の説明図（その１）である。It is explanatory drawing (the 1) of the method which controls so that the dependence relationship of the conventional inter-frame prediction may be protected, and performs parallel between frames. 従来のフレーム間予測の依存関係を守るように制御し、フレーム間での並列化を行う手法の説明図（その２）である。It is explanatory drawing (the 2) of the technique which controls so that the dependence relationship of the conventional inter-frame prediction may be protected, and performs parallel between frames. 従来のフレーム間予測の依存関係を守るように制御し、フレーム間での並列化を行う手法の説明図（その３）である。It is explanatory drawing (the 3) of the method which controls so that the dependence relationship of the conventional inter-frame prediction may be protected, and performs parallel between frames. 従来のスライス単位でフレーム内の並列化を行う手法の説明図（その１）である。It is explanatory drawing (the 1) of the method of performing the parallelization in a frame in the conventional slice unit. 従来のスライス単位でフレーム内の並列化を行う手法の説明図（その２）である。It is explanatory drawing (the 2) of the method of performing the parallelization in a frame in the conventional slice unit. 従来のスライス単位でフレーム内の並列化を行う手法の説明図（その３）である。It is explanatory drawing (the 3) of the method of performing the parallelization in a frame in the conventional slice unit. 従来のマクロブロックの隣接情報参照の依存関係を守るように制御し、マクロブロックライン間での並列化を行う手法の説明図（その１）である。It is explanatory drawing (the 1) of the technique which controls so that the dependence relationship of the adjacent information reference of the conventional macroblock may be protected, and parallelizes between macroblock lines. 従来のマクロブロックの隣接情報参照の依存関係を守るように制御し、マクロブロックライン間での並列化を行う手法の説明図（その２）である。It is explanatory drawing (the 2) of the technique which controls so that the dependence relationship of the adjacent information reference of the conventional macroblock may be protected, and parallelizes between macroblock lines. 従来のマクロブロックの隣接情報参照の依存関係を守るように制御し、マクロブロックライン間での並列化を行う手法の説明図（その３）である。It is explanatory drawing (the 3) of the technique which controls so that the dependence relationship of the adjacent information reference of the conventional macroblock may be protected, and performs parallel between macroblock lines.

Explanation of symbols

１０、１１−１〜１１−３、１２スイッチ
１５中間データバッファ
２０−１、２０−２リクエスト／ライトデータ振り分け部
２０−３、２０−４リードデータ振り分け部
２４画像符号化処理部起動制御部
２５−１〜２５−４可変長符号化処理部
２６−１〜２６−４画像符号化処理部
２７中間データバッファ
２８−１〜２８−４隣接マクロブロック情報バッファ
２９メモリ制御部
３０メモリ
３１通信インターフェース
４０−１〜４０−４可変長復号処理部
４１−１〜４１−４画像復号処理部
４２中間データバッファ
４３−１〜４３−３隣接マクロブロック情報バッファ
４４メモリ制御部
４５画像復号処理部起動制御部 10, 11-1 to 11-3, 12 switch 15 Intermediate data buffer 20-1, 20-2 Request / write data distribution unit 20-3, 20-4 Read data distribution unit 24 Image encoding processing unit activation control unit 25 -1 to 25-4 Variable length coding processing unit 26-1 to 26-4 Image coding processing unit 27 Intermediate data buffer 28-1 to 28-4 Adjacent macroblock information buffer 29 Memory control unit 30 Memory 31 Communication interface 40 -1 to 40-4 Variable length decoding processing unit 41-1 to 41-4 Image decoding processing unit 42 Intermediate data buffer 43-1 to 43-3 Adjacent macroblock information buffer 44 Memory control unit 45 Image decoding processing unit activation control unit

Claims

In an image encoding device that encodes image data in units of blocks,
A plurality of storage means for dividing and storing the image data;
A plurality of image encoding means for performing image encoding processing;
When the image data is viewed as a two-dimensional array, the image data is divided by the number of the image encoding means in a direction different from the direction in which image encoding progresses in units of blocks, Assign each image data to the plurality of image encoding means, stores the divided image data when encoding, the storage unit adjacent to the storing means and the corresponding storage means corresponding to said image coding means Control means for causing each image encoding means to perform image encoding processing in parallel, based on the reference result of the image area ,
Connection means for connecting each of the image encoding means only to storage means for storing an image area to be referred to when encoding the divided image data ;
Further comprising an adjacent unit block information storage means for storing information on adjacent unit blocks of the unit block of the image data to be processed;
The image encoding device is characterized in that the processing of the unit block to be processed is started after information on adjacent unit blocks necessary for the processing is gathered .

The image encoding apparatus according to claim 1, wherein a direction in which the image encoding process proceeds is a raster scan direction.

In an image decoding device for decoding image data,
Variable length decoding means for variable length decoding of variable length encoded image data;
A plurality of image decoding means for performing image decoding processing in block units on the variable length decoded image data;
A plurality of storage means for dividing and storing the image data subjected to the image decoding process ;
When the image data is viewed as a two-dimensional array, the image data is divided by the number of the image decoding means in a direction different from the direction in which the image decoding process performed in units of blocks is performed. respectively assigned to the plurality of image decoding means, wherein when decoding the divided image data, storage means and stored in said storage means adjacent the corresponding storage means image corresponding to the image decoding unit Control means for causing each image decoding means to perform image decoding processing in parallel based on the reference result of the area ;
Connection means for connecting each of the image decoding means only to storage means for storing an image area referred to when the divided image data is decoded ;
Further comprising an adjacent unit block information storage means for storing information on adjacent unit blocks of the unit block of the image data to be processed;
The image decoding device is characterized in that the processing of the unit block to be processed is started after information on adjacent unit blocks necessary for the processing is gathered .

The image decoding apparatus according to claim 3 , wherein a direction in which the image decoding process proceeds is a raster scan direction.