TW201711469A

TW201711469A - Information processing device, information processing method, and information processing program

Info

Publication number: TW201711469A
Application number: TW104135041A
Authority: TW
Inventors: Masaki Oda
Original assignee: Mitsubishi Electric Corp
Priority date: 2015-09-02
Filing date: 2015-10-26
Publication date: 2017-03-16
Also published as: WO2017037900A1; JPWO2017037900A1; JP5944078B1

Abstract

An entropy decoding unit (101) carries out entropy decoding on coded image information which is obtained when image information which is configured of a plurality of macro-blocks is coded, and extracts from the coded image information a plurality of instances of coded information which is provided corresponding to the plurality of macro-blocks, with each of the instances of coded information including at least a motion vector. A region extraction unit (102) extracts regions within the image information wherein motion is present as motion regions, on the basis of the plurality of motion vectors which is included in the plurality of instances of coded information which is extracted by the entropy decoding unit (101).

Description

Information processing device, information processing method and information processing program product

本發明為有關一種擷取在畫像資訊內有運動的運動區域之技術。 The present invention is directed to a technique for extracting a motion region having motion within the portrait information.

以將利用照相機拍攝的畫像資訊之傳送負荷減輕或資料量縮小為目的，將動畫像編碼壓縮產生畫像編碼資訊的技術被廣泛使用。 In order to reduce the transmission load of image information captured by a camera or to reduce the amount of data, techniques for compressing moving image coding to generate image coding information are widely used.

但是在畫像分析畫像編號資訊時必須利用解碼處理還原成編碼前的畫像資訊。 However, when the portrait image number information is analyzed, it is necessary to use the decoding process to restore the image information before encoding.

在專利文獻1中，揭露具有檢測出包含在動畫像之人或車等動物體的分析手法之畫像處理裝置。 Patent Document 1 discloses an image processing apparatus having an analysis method of detecting an animal body such as a person or a car included in a moving image.

[Previous Technical Literature] [Patent Literature]

專利文獻1：日本特開2007-316856號公報 Patent Document 1: Japanese Laid-Open Patent Publication No. 2007-316856

專利文獻1的畫像處理裝置，其為從畫像編號資訊將畫像資訊解碼，分析根據解碼得到的畫像資訊而檢測出動物體。 In the image processing device of Patent Document 1, the image information is decoded from the image number information, and the animal body is detected based on the decoded image information.

如此一來，專利文獻1的畫像處理裝置會有由於解碼畫像資訊而使計算負荷為高的課題。 As described above, the image processing device of Patent Document 1 has a problem that the calculation load is high due to the decoding of the image information.

本發明以解決上述的課題為主要目的，以減低在擷取畫像資訊內有運動的運動區域時之計算負荷為主要目的。 The main object of the present invention is to solve the above-mentioned problems, and to reduce the calculation load when moving a motion region in which image information is captured is the main purpose.

關於本發明之資訊處理裝置，具有：對於將由多個巨集區塊構成的畫像資訊進行編碼而得到之畫像編碼資訊進行解碼，從前述畫像編碼資訊擷取出與前述多個巨集區塊對應設置之各自至少包含運動向量的多個編號資訊之熵解碼部；及依據包含在利用前述熵解碼部擷取出的前述多個編碼資訊之多個運動向量，擷取前述畫像資訊內之有運動的區域作為運動區域之區域擷取部。 The information processing device according to the present invention includes: decoding image code information obtained by encoding image information composed of a plurality of macroblocks, and extracting and setting the plurality of macroblocks from the image code information An entropy decoding unit that includes at least a plurality of pieces of information of a motion vector; and a motion-receiving region in the image information based on a plurality of motion vectors included in the plurality of pieces of encoded information extracted by the entropy decoding unit As the area extraction part of the sports area.

根據本發明，由於不是解碼畫像資訊，而是依據被包含在編碼資訊的運動向量擷取出運動區域，可以減低擷取運動區域時的計算負荷。 According to the present invention, since the image information is not decoded, but the motion region is extracted based on the motion vector included in the encoded information, the calculation load when the motion region is captured can be reduced.

100‧‧‧資訊處理裝置 100‧‧‧Information processing device

101‧‧‧熵解碼部 101‧‧‧ Entropy Decoding Department

102‧‧‧區域擷取部 102‧‧‧Regional Acquisition Department

103‧‧‧像素值轉換部 103‧‧‧Pixel Value Conversion Department

1031‧‧‧區域決定部 1031‧‧‧Regional Decision Department

1032‧‧‧編碼資訊運算部 1032‧‧‧Coded Information Computing Department

1033‧‧‧編碼資訊成像部 1033‧‧‧Coded Information Imaging Department

圖1為顯示有關實施形態1之資訊處理裝置的機能構成例之圖。 Fig. 1 is a view showing an example of a functional configuration of an information processing device according to a first embodiment.

圖2為顯示有關實施形態1之像素值轉換部的內部構成例之圖。 FIG. 2 is a view showing an example of the internal configuration of the pixel value conversion unit according to the first embodiment.

圖3為顯示有關實施形態1之編碼訊的例示之圖。 Fig. 3 is a view showing an example of the coded signal according to the first embodiment.

圖4為顯示有關實施形態1之區域擷取部的動作概要之圖。 Fig. 4 is a view showing an outline of the operation of the area capturing unit according to the first embodiment.

圖5為顯示有關實施形態1之區域擷取部的動作概要之圖。 Fig. 5 is a view showing an outline of the operation of the area capturing unit according to the first embodiment.

圖6為顯示有關實施形態1之區域擷取部的動作例之流程圖。 Fig. 6 is a flow chart showing an operation example of the area capturing unit according to the first embodiment.

圖7為顯示有關實施形態1之像素值轉換部的動作概要之圖。 Fig. 7 is a view showing an outline of the operation of the pixel value conversion unit according to the first embodiment.

圖8為顯示有關實施形態1之像素值轉換部的動作例之流程圖。 Fig. 8 is a flowchart showing an operation example of the pixel value conversion unit according to the first embodiment.

圖9為顯示有關實施形態1之編碼資訊運算部的動作概要之圖。 Fig. 9 is a view showing an outline of the operation of the coded information calculation unit according to the first embodiment.

圖10為顯示有關實施形態1之編碼資訊運算部的動作概要之圖。 Fig. 10 is a view showing an outline of the operation of the coded information calculation unit according to the first embodiment.

圖11為顯示有關實施形態1之編碼資訊成像部的動作概要之圖。 Fig. 11 is a view showing an outline of the operation of the coded information image forming unit according to the first embodiment.

圖12為顯示有關實施形態1之編碼資訊成像部的動作概要之圖。 Fig. 12 is a view showing an outline of the operation of the coded information image forming unit according to the first embodiment.

圖13為顯示有關實施形態1之編碼資訊成像部的動作概要之圖。 Fig. 13 is a view showing an outline of the operation of the coded information image forming unit according to the first embodiment.

圖14為顯示有關實施形態1之資訊處理裝置的硬體構成例之圖。 Fig. 14 is a view showing an example of a hardware configuration of the information processing device according to the first embodiment.

實施形態1. Embodiment 1.

***構成說明*** ***Composition description***

圖1顯示有關實施形態1之資訊處理裝置100的機能構成例。 Fig. 1 shows an example of the functional configuration of the information processing device 100 according to the first embodiment.

如圖1所示，資訊處理裝置100由：熵解碼部101、區域擷取部102及像素值轉換部103構成。 As shown in FIG. 1, the information processing device 100 includes an entropy decoding unit 101, a region capturing unit 102, and a pixel value converting unit 103.

又，像素值轉換部103如圖2所示，其由：區域決定部1031、編號資訊運算部1032及編碼資訊成像部1033構成。 Further, as shown in FIG. 2, the pixel value conversion unit 103 includes an area determination unit 1031, a number information calculation unit 1032, and a coded information image forming unit 1033.

又，後述之資訊處理裝置100的動作相當於資訊處理方法及資訊處理程式的例示。 Further, the operation of the information processing device 100 to be described later corresponds to an example of the information processing method and the information processing program.

在資訊處理裝置100中，如圖14所示，包含：所謂處理器901、記憶裝置902、接收裝置903及傳送裝置904之硬體。 As shown in FIG. 14, the information processing device 100 includes hardware such as a processor 901, a memory device 902, a receiving device 903, and a transmitting device 904.

在記憶裝置902中，記憶有實現熵解碼部101、區域擷取部102及像素值轉換部103的機能之程式。 In the memory device 902, a program for realizing the functions of the entropy decoding unit 101, the area extracting unit 102, and the pixel value converting unit 103 is stored.

接著，處理器901為執行此等程式，進行後述之熵解碼部101、區域擷取部102及像素值轉換部103的動作。 Next, the processor 901 executes the operations of the entropy decoding unit 101, the region extracting unit 102, and the pixel value converting unit 103, which will be described later, in order to execute these programs.

在圖14中，模式顯示處理器901在執行實現熵解碼部101、區域擷取部102及像素值轉換部103的機能之程式時的狀態。 In FIG. 14, the mode display processor 901 performs a state in which the functions of the entropy decoding unit 101, the region extracting unit 102, and the pixel value converting unit 103 are executed.

接收裝置903為接收畫像編碼資訊。 The receiving device 903 receives the image encoding information.

傳送裝置904為將像素值資訊傳送到未圖示之畫像辨識裝置。 The transmitting device 904 transmits the pixel value information to an image recognizing device (not shown).

***動作說明*** *** Action Description ***

其次，說明圖1所示之熵解碼部101、區域擷取部102及像素值轉換部103的動作。 Next, the entropy decoding unit 101 and the area capturing unit 102 shown in FIG. 1 will be described. The operation of the pixel value conversion unit 103.

熵解碼部101透過圖14所示之接收裝置903接收畫像編碼資訊，對於畫像編碼資訊進行熵解碼，從畫像編碼資訊擷取出編碼資訊。 The entropy decoding unit 101 receives the image encoding information through the receiving device 903 shown in FIG. 14, entropy-decodes the image encoding information, and extracts the encoded information from the image encoding information.

畫像編碼資訊為將由多個巨集區塊構成的畫像資訊進行熵編碼而得到的資訊。 The portrait encoding information is information obtained by entropy encoding image information composed of a plurality of macroblocks.

熵解碼部101利用熵解碼，從畫像編碼資訊擷取出與多個巨集區塊對應設置的多個編碼資訊。 The entropy decoding unit 101 extracts a plurality of pieces of coded information set corresponding to the plurality of macroblocks from the picture code information by entropy decoding.

在各編碼資訊中，各自至少包含：運動向量、巨集區塊形態、量子化步驟、及參考畫像資訊。 Each of the encoded information includes at least: a motion vector, a macro block shape, a quantization step, and reference portrait information.

又，熵解碼部101的動作相當於熵解碼處理。 Further, the operation of the entropy decoding unit 101 corresponds to the entropy decoding process.

區域擷取部102將利用熵解碼部101擷取出之包含在多個編碼資訊的多個運動向量依照巨集區塊的順序進行配置，依據多個運動向量的位置，擷取畫像資訊內之有運動的區域作為運動區域。 The area capturing unit 102 arranges a plurality of motion vectors included in the plurality of pieces of encoded information extracted by the entropy decoding unit 101 in the order of the macroblocks, and extracts the image information based on the positions of the plurality of motion vectors. The moving area acts as a moving area.

運動區域為在畫像資訊中描繪動物體的區域。 The motion area is an area in which the animal body is depicted in the portrait information.

更具體而言，區域擷取部102統合多個運動向量之中配置在靠近位置之2以上的運動向量，依據統合後之運動向量的位置擷取出運動區域。 More specifically, the area capturing unit 102 integrates motion vectors arranged at two or more of the plurality of motion vectors among the plurality of motion vectors, and extracts the motion regions based on the positions of the integrated motion vectors.

又，區域擷取部102的動作相當於區域擷取處理。 Further, the operation of the area capturing unit 102 corresponds to the area capturing process.

像素值轉換部103取得與構成利用區域擷取部102擷取出的運動區域之巨集區塊對應之編碼資訊，將已取得之編碼資訊的運動向量、巨集區塊形態、量子化步驟、參考畫像資訊之中的至少任一個轉換為像素值。 The pixel value conversion unit 103 acquires the encoded information corresponding to the macroblock of the motion region extracted by the use region extracting unit 102, and obtains the motion vector of the encoded information, the macroblock shape, the quantization step, and the reference. At least one of the portrait information is converted into a pixel value.

接著，像素值轉換部103將從編碼資訊轉換而來的像素值顯示在每像素之像素值資訊，透過圖14所示之傳送裝置904，傳送到畫像辨識裝置。 Next, the pixel value conversion unit 103 displays the pixel value information converted from the encoded information on the pixel value information per pixel, and transmits it to the image recognition device via the transfer device 904 shown in FIG.

又，將根據像素值轉換部103之編碼資訊的像素值轉換也稱為編碼資訊的成像。 Further, the pixel value conversion based on the encoded information of the pixel value conversion unit 103 is also referred to as imaging of the encoded information.

如前述所示，像素值轉換部103雖然是由：圖2所示之區域決定部1031、編碼資訊運算部1032及編碼資訊成像部1033構成，但是區域決定部1031、編碼資訊運算部1032及編碼資訊成像部1033的詳細說明則於之後加以記述。 As described above, the pixel value conversion unit 103 is composed of the area determination unit 1031, the coded information calculation unit 1032, and the coded information image forming unit 1033 shown in FIG. 2, but the area determination unit 1031, the coded information calculation unit 1032, and the code. The detailed description of the information imaging unit 1033 will be described later.

圖3為顯示利用熵解碼部101對於畫像編碼資訊進行熵解碼而得到的資訊。 FIG. 3 is a view showing information obtained by entropy decoding the image coding information by the entropy decoding unit 101.

根據熵解碼，從畫像編碼資訊得到標頭資訊、編碼資訊及編碼紋理資訊。 According to the entropy decoding, the header information, the encoded information, and the encoded texture information are obtained from the image encoding information.

標頭資訊、編碼資訊及編碼紋理資訊為設置在每個構成畫像資訊之巨集區塊。 The header information, the encoded information, and the encoded texture information are macroblocks that are set in each of the image information.

標頭資訊為顯示例如H.264編碼中之SPS(Sequence Parameter Set；序列參數集)或PPS(Picture Parameter Set；圖片參數集)。 The header information is, for example, an SPS (Sequence Parameter Set) or a PPS (Picture Parameter Set) in H.264 encoding.

在編碼資訊中，包含：所謂巨集區塊形態、量子化步驟、畫面內預測模式、參考畫像資訊、運動向量、畫面內預測成本、畫面間預測成本及巨集區塊符號量之參數。 The coding information includes parameters such as a macro block shape, a quantization step, an intra-picture prediction mode, a reference picture information, a motion vector, an intra-picture prediction cost, an inter-picture prediction cost, and a macro block symbol quantity.

在本實施形態中，可以將巨集區塊形態、量子化步驟、運動向量、參考畫像資訊運用在像素值的轉換。 In the present embodiment, the macro block shape, the quantization step, the motion vector, and the reference portrait information can be used to convert the pixel values.

編碼紋理資訊為編碼中的畫像資訊。 The encoded texture information is the portrait information in the encoding.

根據對於編碼紋理資訊的解碼處理，能夠以巨集區塊單位得到畫像資訊。 According to the decoding process for the encoded texture information, the portrait information can be obtained in a macroblock unit.

在習知技術中，根據對於編碼紋理資訊的解碼處理得到畫像資訊，藉由分析畫像資訊，擷取出畫像資訊內的運動區域。 In the prior art, the portrait information is obtained based on the decoding process for the encoded texture information, and the motion region in the portrait information is extracted by analyzing the portrait information.

在本實施形態中，在不進行對於編碼紋理資訊的解碼處理的狀態下，使區域擷取部102分析包含在編碼資訊的運動向量，擷取出畫像資訊內的運動區域。 In the present embodiment, the region capturing unit 102 analyzes the motion vector included in the encoded information in a state where the decoding process for the encoded texture information is not performed, and extracts the motion region in the image information.

其次，說明區域擷取部102的動作例。 Next, an operation example of the area capturing unit 102 will be described.

區域擷取部102將與多個巨集區塊對應之包含在多個編碼資訊的多個運動向量依照巨集區塊的順序進行配置。 The area capturing unit 102 arranges a plurality of motion vectors included in the plurality of pieces of encoded information corresponding to the plurality of macroblocks in the order of the macroblocks.

接著，區域擷取部102根據依照巨集區塊的順序配置之多個運動向量的位置擷取出運動區域。 Next, the area capturing unit 102 extracts the motion area based on the positions of the plurality of motion vectors arranged in the order of the macroblocks.

運動區域是根據運動向量的有無、運動向量的距離加以決定。 The motion area is determined based on the presence or absence of the motion vector and the distance of the motion vector.

區域擷取部102統合配置在靠近的位置之2以上的運動向量。 The area capturing unit 102 integrates motion vectors of 2 or more at positions close to each other.

換言之，區域擷取部102將包含相互間的距離為臨界值TH_DIST以下之2以上的運動向量之區域指定為候補區域。 In other words, the area capturing unit 102 specifies an area including motion vectors having a distance between two or more of the critical value TH_DIST or less as a candidate area.

接著，區域擷取部102擷取面積為臨界值TH_RANGE以上的候補區域作為運動區域。 Next, the area capturing unit 102 captures a candidate area having an area equal to or greater than the threshold value TH_RANGE as a motion area.

另一方面，區域擷取部102將面積未滿臨界值TH_RANGE的候補區域視為雜訊而予以捨棄。 On the other hand, the area capturing unit 102 discards the candidate area of the area under-complete threshold TH_RANGE as a noise.

圖4及圖5為顯示區域擷取部102的動作概要。 4 and 5 show an outline of the operation of the display area capturing unit 102.

區域擷取部102取得畫像資訊的1訊框份量的編碼資訊。 The area capturing unit 102 acquires the frame number of the image information. Code information.

在圖4(a)中，各列為表示1個巨集區塊之編碼資訊的參數。 In Fig. 4(a), each column is a parameter indicating the coding information of one macroblock.

換言之，運動向量MV1、巨集區塊形態MBT1、量子化步驟ST1、參考畫像資訊INF1為巨集區塊MB1之編碼資訊的參數。 In other words, the motion vector MV1, the macroblock block form MBT1, the quantization step ST1, and the reference picture information INF1 are parameters of the coded information of the macro block MB1.

同樣，運動向量MV2、巨集區塊形態MBT2、量子化步驟ST2、參考畫像資訊INF2為巨集區塊MB2之編碼資訊的參數。 Similarly, the motion vector MV2, the macroblock block form MBT2, the quantization step ST2, and the reference picture information INF2 are parameters of the coded information of the macro block MB2.

針對巨集區塊MB3之後亦相同。 The same is true for the macro block MB3.

其次，如圖4(b)所示，區域擷取部102將運動向量配置成巨集區塊的順序。 Next, as shown in FIG. 4(b), the region extracting unit 102 arranges the motion vectors into the order of the macroblocks.

再者，如圖5(a)所示，區域擷取部102將相互間的距離為臨界值TH_DIST以下的運動向量聚集為1個候補區域。 Further, as shown in FIG. 5(a), the area capturing unit 102 aggregates motion vectors having a distance of a threshold value TH_DIST or less into one candidate area.

接著，如圖5(b)所示，區域擷取部102擷取面積為臨界值TH_RANGE以上的候補區域作為運動區域。 Next, as shown in FIG. 5(b), the area capturing unit 102 captures a candidate area having a threshold value TH_RANGE or more as a motion area.

圖6為顯示區域擷取部102的動作例之流程圖。 FIG. 6 is a flowchart showing an operation example of the display area capturing unit 102.

區域擷取部102首先利用熵解碼部101擷取出之1訊框份量的編碼資訊之運動向量配置成巨集區塊的順序(ST11)。 The area extracting unit 102 first arranges the motion vector of the encoded information of the 1-frame amount extracted by the entropy decoding unit 101 into the order of the macroblocks (ST11).

換言之，區域擷取部102將1訊框份量的運動向量配置成與在將畫像資訊進行解碼情況下的運動向量的配置相同。 In other words, the area capturing unit 102 configures the motion vector of the 1-frame weight to be the same as the configuration of the motion vector in the case of decoding the portrait information.

其次，區域擷取部102判斷配置的所有運動向量是否已調查完成(ST12)。 Next, the area capturing unit 102 judges whether or not all of the configured motion vectors have been investigated (ST12).

在存在有未調查的運動向量之情況下(ST12中為否)，區域擷取部102選擇調查對象之運動向量(ST13)。 When there is a motion vector that has not been investigated (NO in ST12), the region capturing unit 102 selects the motion vector of the investigation target (ST13).

其次，區域擷取部102判斷利用ST13選擇的運動向量、與該運動向量附近的運動向量之距離是否為臨界值TH_DIST以下(ST14)。 Next, the area extracting unit 102 determines whether or not the distance between the motion vector selected by ST13 and the motion vector in the vicinity of the motion vector is equal to or less than the critical value TH_DIST (ST14).

在利用ST13選擇的運動向量、與附近的運動向量之距離為臨界值TH_DIST以下的情況下(ST14為是)，區域擷取部102將包含利用ST13選擇的運動向量與附近的運動向量之區域指定為候補區域，將候補區域儲存在區域儲存緩衝器。 When the motion vector selected by ST13 and the distance to the nearby motion vector are equal to or less than the threshold TH_DIST (YES in ST14), the region extracting unit 102 specifies the region including the motion vector selected by ST13 and the motion vector in the vicinity. For the candidate area, the candidate area is stored in the area storage buffer.

又，區域儲存緩衝器被構成在圖14所示之記憶裝置902。 Further, the area storage buffer is constructed in the memory device 902 shown in FIG.

其次，區域擷取部102將儲存在區域儲存緩衝器之候補區域之中，相互重疊之2以上的候補區域聚集為1個候補區域(ST16)。 Next, the area capturing unit 102 aggregates two or more candidate areas that are stored in the candidate area of the area storage buffer and that are overlapped with each other into one candidate area (ST16).

另一方面，在ST12中，在所有的運動向量的調查都完成的情況下(ST12中為是)，區域擷取部102將區域儲存緩衝器之候補區域之中，面積為未滿臨界值TH_RANGE的候補區域捨棄(ST17)。 On the other hand, in ST12, when all the motion vector investigations are completed (YES in ST12), the area capturing unit 102 sets the area to the under-threshold threshold TH_RANGE among the candidate areas of the area storage buffer. The candidate area is discarded (ST17).

換句話說，區域擷取部102擷取面積為臨界值TH_RANGE的候補區域作為運動區域。 In other words, the area capturing unit 102 captures a candidate area having the area TH_RANGE as the motion area.

其次，區域擷取部102將與被擷取的運動區域對應之編碼資訊儲存在編碼資訊緩衝器(ST18)。 Next, the area capturing unit 102 stores the encoded information corresponding to the captured motion area in the encoded information buffer (ST18).

換言之，區域擷取部102將與構成ST17擷取出的運動區域之巨集區塊對應之編碼資訊儲存在編碼資訊緩衝器。 In other words, the area capturing unit 102 stores the coded information corresponding to the macro block constituting the motion area extracted by ST17 in the coded information buffer.

編碼資訊緩衝器被構成在圖14所示之記憶裝置902。 The coded information buffer is constructed in the memory device 902 shown in FIG.

其次，說明像素值轉換部103。 Next, the pixel value conversion unit 103 will be described.

圖7為顯示像素值轉換部103的動作概要。 FIG. 7 is an outline of the operation of the display pixel value conversion unit 103.

像素值轉換部103從編碼資訊緩衝器取得構成利用區域擷取部102擷取出的運動區域之巨集區塊的編碼資訊。 The pixel value conversion unit 103 acquires the coded information of the macroblocks constituting the motion area extracted by the area extracting unit 102 from the coded information buffer.

接著，像素值轉換部103將各巨集區塊的編碼資訊轉換為像素值。 Next, the pixel value conversion unit 103 converts the encoded information of each macroblock into a pixel value.

像素值轉換部103例如將運動向量之X方向範數與Y方向範數、巨集區塊形態轉換為RGB色彩空間的像素值。 The pixel value conversion unit 103 converts, for example, the X direction norm, the Y direction norm, and the macro block form of the motion vector into pixel values of the RGB color space.

接著，像素值轉換部103將已轉換的像素值依照巨集區塊的配置順序儲存在像素，在每個像素產生顯示像素值的像素值資訊，將已產生的像素值資訊傳送到畫像辨識裝置。 Next, the pixel value conversion unit 103 stores the converted pixel values in pixels according to the arrangement order of the macroblocks, generates pixel value information of the display pixel values in each pixel, and transmits the generated pixel value information to the image recognition device. .

其次，說明像素值轉換部103的構成要素之區域決定部1031、編碼資訊運算部1032及編碼資訊成像部1033。 Next, the region determining unit 1031, the encoding information computing unit 1032, and the encoding information imaging unit 1033 of the components of the pixel value converting unit 103 will be described.

區域決定部1031決定使用於編碼資訊的成像之運動區域的個數。 The area determining unit 1031 determines the number of imaging regions used for imaging information.

編碼資訊運算部1032對於利用區域決定部1031決定的單個或多個運動區域之編碼資訊，決定是否進行運算處理。 The coded information calculation unit 1032 determines whether or not to perform arithmetic processing on the coded information of the single or plurality of motion regions determined by the region determining unit 1031.

編碼資訊運算部1032在對於編碼資訊進行運算處理的情況下，例如進行以下的運算處理。 When the encoding information calculation unit 1032 performs arithmetic processing on the encoded information, for example, the following arithmetic processing is performed.

編碼資訊運算部1032在使用1個運動區域的情況下，算出編碼資訊之巨集區塊每一列的平均值。 When one motion region is used, the coded information calculation unit 1032 calculates the average value of each column of the macroblock of the coded information.

又，編碼資訊運算部1032在使用多個運動區域的情況下，算出編碼資訊之運動區域間的平均值。 Further, when a plurality of motion regions are used, the coded information calculation unit 1032 calculates an average value between motion regions of the coded information.

又，編碼資訊運算部1032在使用多個運動區域的情況下，使用從不同訊框的畫像編碼資訊擷取出的運動區域亦可。 Further, when the plurality of motion regions are used, the coded information computing unit 1032 may use a motion region extracted from image coded information of different frames.

又，編碼資訊運算部1032對於編碼資訊不進行運算處理亦可。 Further, the coded information calculation unit 1032 may not perform arithmetic processing on the coded information.

編碼資訊成像部1033將編碼資訊轉換為像素值。 The encoded information imaging section 1033 converts the encoded information into pixel values.

即，編碼資訊成像部1033決定利用區域決定部1031及編碼資訊運算部1032處理之編碼資訊的配置，將編碼資訊轉換為像素值。 In other words, the coded information image forming unit 1033 determines the arrangement of the coded information processed by the area determining unit 1031 and the coded information calculation unit 1032, and converts the coded information into pixel values.

又，編碼資訊成像部1033在進行像素值轉換時，因應像素值資訊的傳送目的地之畫像辨識裝置的特性，將編碼資訊規格化亦可。 Further, when the coded image forming unit 1033 performs pixel value conversion, the coded information may be normalized in accordance with the characteristics of the image recognition device of the destination of the pixel value information.

編碼資訊運算部1032例如將運動向量與巨集區塊形態規格化亦可。 The coded information calculation unit 1032 may, for example, normalize the motion vector and the macro block form.

又，像素值的形式為彩色、灰階、高動態範圍等亦可，不限定在特定的形式。 Further, the form of the pixel value may be color, gray scale, high dynamic range, or the like, and is not limited to a specific form.

圖8為顯示有關本實施形態之像素值轉換部103的動作例之流程圖。 FIG. 8 is a flowchart showing an operation example of the pixel value conversion unit 103 according to the present embodiment.

圖8的流程是在根據圖6的ST18將與運動區域對應的編碼資訊儲存在編碼資訊緩衝器後才進行的。 The flow of FIG. 8 is performed after the coded information corresponding to the motion area is stored in the coded information buffer according to ST18 of FIG.

首先，區域決定部1031決定使用在編碼資訊的成像之運動區域(ST21)。 First, the region determining unit 1031 determines the imaging region to be used for encoding information (ST21).

其次，編碼資訊運算部1032對於利用ST21決定的運動區域判斷是否進行運算處理(ST22)。 Next, the code information calculation unit 1032 determines whether or not the calculation processing is performed on the motion region determined by ST21 (ST22).

在ST22中判斷為進行運算處理的情況下，編碼資訊運算部1032使用編碼資訊進行運算處理(ST23)。 When it is determined in ST22 that the arithmetic processing is performed, the encoded information computing unit 1032 performs arithmetic processing using the encoded information (ST23).

又，運算處理的例示為參照圖9及圖10於之後記述。 Further, an example of the arithmetic processing will be described later with reference to FIGS. 9 and 10 .

在進行ST23的運算處理後，進行ST24。 After the arithmetic processing of ST23 is performed, ST24 is performed.

另一方面，在ST22判斷為不進行運算處理的情況下，進行ST24。 On the other hand, if it is determined in ST22 that the arithmetic processing is not to be performed, ST24 is performed.

在ST24中，編碼資訊成像部1033判斷是否算出編碼資訊的像素值。 In ST24, the encoded information imaging unit 1033 determines whether or not to calculate the pixel value of the encoded information.

在算出編碼資訊的像素值的情況下，編碼資訊成像部1033算出編碼資訊的像素值(ST25)。 When the pixel value of the encoded information is calculated, the encoded information imaging unit 1033 calculates the pixel value of the encoded information (ST25).

又，像素值算出處理的例示為參照圖11、圖12及圖13於之後記述。 In addition, an example of the pixel value calculation processing will be described later with reference to FIGS. 11 , 12 , and 13 .

其次，編碼資訊成像部1033產生顯示利用ST25算出之編碼資訊的像素值之像素值資訊，將像素值資訊傳送到畫像辨識裝置(ST26)。 Next, the coded information image forming unit 1033 generates pixel value information indicating the pixel value of the coded information calculated by ST25, and transmits the pixel value information to the image recognition device (ST26).

圖9及圖10為顯示在圖8之ST23所進行之根據編碼資訊運算部1032的運算處理例示。 FIG. 9 and FIG. 10 are diagrams showing an arithmetic processing performed by the encoding information computing unit 1032 performed in ST23 of FIG.

圖9為顯示在使用1個運算區域的情況下，編碼資訊運算部1032算出編碼資訊之巨集區塊每一列的平均值，減低資訊量的手法。 FIG. 9 shows a technique for reducing the amount of information by calculating the average value of each column of the macroblock of the encoded information when one arithmetic area is used.

即，在圖9的例示中，編碼資訊運算部1032進行將與(4×4)的巨集區塊對應之16的編碼資訊彙整成4個編碼資訊之運算。 That is, in the example of FIG. 9, the encoding information calculation unit 1032 performs an operation of consolidating the encoded information of 16 corresponding to the (4×4) macroblock into four pieces of encoded information.

接著，在圖9的彙整運算後，編碼資訊成像部1033將彙整後的4個編碼資訊分別轉換為像素值。 Next, after the rounding operation of FIG. 9, the encoded information image forming unit 1033 converts the four pieces of encoded information after the rounding into pixel values.

又，圖10為顯示在使用多個運動區域的情況下，以位於相同位置的巨集區塊算出編碼資訊的平均值，減低資訊量的手法。 Moreover, FIG. 10 shows a method of calculating the average value of the encoded information by using the macroblock located at the same position and reducing the amount of information when a plurality of motion regions are used.

即，在圖10的例示中，編碼資訊運算部1032在2個運動區塊分別為以(4×4)的巨集區塊構成的情況下，進行將{2×(4×4)}的編碼資訊彙整(4×4)個編碼資訊之運算。 In other words, in the example of FIG. 10, when the two motion blocks are each composed of (4 × 4) macroblocks, the coding information calculation unit 1032 performs {2 × (4 × 4)}. The operation of coding information (4 × 4) coded information.

接著，在圖10的彙整運算後，編碼資訊成像部1033將彙整後的16個編碼資訊分別轉換為像素值。 Next, after the rounding operation of FIG. 10, the encoded information image forming unit 1033 converts the 16 pieces of encoded information after the rounding into pixel values.

又，在圖9中，編碼資訊運算部1032雖然是對巨集區塊的每一列算出平均值而彙整編碼資訊，但是利用其他運算方法彙整編碼資訊亦可。 In addition, in FIG. 9, although the coded information calculation unit 1032 calculates the average value for each column of the macroblock and integrates the coded information, the coded information may be aggregated by another calculation method.

例如，編碼資訊運算部1032根據編碼資訊的最大值(或是最小值或是中央值)彙整編碼資訊亦可。 For example, the coded information calculation unit 1032 may integrate the coded information based on the maximum value (or the minimum value or the center value) of the coded information.

即，編碼資訊運算部1032只要是將與構成利用區域擷取部102擷取的運動區域之n個(n為2以上的整數)巨集區塊對應之n個編碼資訊彙整為m個(m為1以上的整數，而且是n的因數)編碼資訊之運算，進行怎樣的運算皆可。 In other words, the coded information calculation unit 1032 aggregates n pieces of coded information corresponding to n (n is an integer of 2 or more) macroblocks of the motion area captured by the use area extracting unit 102 into m pieces (m). It is an operation of encoding information for an integer of 1 or more and a factor of n, and any calculation can be performed.

同樣，在圖10中，編碼資訊運算部1032雖然是算出位於相同位置之巨集區塊的編碼資訊平均值而彙整編碼資訊，但是利用其他的運算方法彙整編碼資訊亦可。 Similarly, in FIG. 10, the coded information computing unit 1032 calculates the average of the coded information of the macroblocks located at the same position and integrates the coded information. However, the coded information may be aggregated by other calculation methods.

例如，編碼資訊運算部1032利用編碼資訊的最大值(或是最小值或是中央值)彙整編碼資訊亦可。 For example, the coded information computing unit 1032 may use the maximum value (or the minimum value or the central value) of the encoded information to summarize the encoded information.

即，編碼資訊運算部1032在利用區域擷取部102擷取出各自以i個(i為1以上的整數)巨集區塊構成之j個(j為2以上的整數)運動區塊的情況下，只要是將與包含在j個運動區塊之(i×j)個巨集區塊對應的(i×j)個編碼資訊彙整為i個編碼資訊之運算，進行怎樣的運算皆可。 In other words, the coding information calculation unit 1032 extracts j (j is an integer of 2 or more) motion blocks each composed of i (i is an integer of 1 or more) macroblocks by the region extraction unit 102. Any calculation may be performed as long as the (i×j) pieces of coded information corresponding to the (i×j) macroblocks included in the j motion blocks are aggregated into i pieces of coded information.

又，在圖10中，顯示了編碼資訊運算部1032對於從1個訊框擷取出的j個(在圖10中j=2)運動區域，將(i×j)個(在圖10中(i×j)=(16×2))編碼資訊彙整為i個(在圖10中i=16)編碼資訊的例示。 Further, in Fig. 10, it is shown that the coding information computing unit 1032 extracts (i × j) motion vectors (j = 2 in Fig. 10) from one frame (in Fig. 10 ( i × j) = (16 × 2)) The coded information is summarized as an example of i (i = 16 in Fig. 10) coded information.

對於此點，編碼資訊運算部1032取得過去利用區域擷取出102所擷取出之(j-1)個運動區域亦可。 At this point, the code information calculation unit 1032 may acquire (j-1) motion regions that have been extracted by the area extraction 102 in the past.

即，編碼資訊運算部1032取得從不同於成為圖4流程對象的訊框之過去的訊框所擷取出的(j-1)個運動區域亦可。 In other words, the coded information calculation unit 1032 may acquire (j-1) motion regions extracted from a frame different from the frame of the frame to be the flow of the flowchart of FIG.

又，(j-1)個運動區域分別為由i個巨集區塊構成者。 Further, (j-1) motion regions are respectively composed of i macroblocks.

接著，編碼資訊運算部1032進行將與包含在結合了已取得之(j-1)個運動區域與利用區域擷取出部102擷取的運動區域(利用圖4的流程擷取出的運動區域)之j個運動區域的(i×j)個巨集區塊對應之(i×j)編碼資訊彙整為i個編碼資訊之運算亦可。 Next, the code information calculation unit 1032 performs a motion region (a motion region extracted by the flow of FIG. 4) included in the (j-1) motion region and the region extraction unit 102 that have been acquired. The (i×j) coded information corresponding to the (i×j) macroblocks of the j motion regions may be aggregated into i coded information operations.

具體而言，編碼資訊運算部1032與圖10相同，算出位於相同位置之巨集區塊的編碼資訊平均值而彙整編碼資訊。 Specifically, the coded information computing unit 1032 calculates the average value of the encoded information of the macroblocks located at the same position and combines the encoded information, as in FIG.

接著，編碼資訊成像部1033在該情況亦同，將彙整後的i個編碼資訊分別轉換為像素值。 Next, in this case, the coded information imaging unit 1033 converts the aggregated i coded information into pixel values.

圖11、圖12及圖13為顯示在圖8的ST25所進行之根據編碼資訊成像部1033的像素值算出處理的例示。 FIG. 11, FIG. 12, and FIG. 13 are diagrams showing the pixel value calculation processing by the coded information image forming unit 1033 performed in ST25 of FIG.

在圖11中，顯示了編碼資訊成像部1033將與(4×4)巨集區塊對應之16個編碼資訊的各個巨集區塊形態、運動向量之X方向範數與Y方向範數轉換為RGB色彩空間的像素值之例示。 In FIG. 11, it is shown that the coded information imaging unit 1033 converts each macroblock shape of the 16 pieces of coded information corresponding to the (4×4) macroblock, and the X-direction norm and the Y-direction norm of the motion vector. An example of the pixel value of the RGB color space.

編碼資訊成像部1033因應像素值資訊的傳送目的地之畫像辨識裝置，對於包含在編碼資訊的值之中，決定轉換成像素值之值。 The coded image forming unit 1033 determines the value converted into the pixel value among the values included in the coded information in accordance with the image recognition device of the transfer destination of the pixel value information.

在例1中，編碼資訊成像部1033將運動向量之X方向範數轉換為R像素值，將運動向量之Y方向範數轉換為G像素值，將巨集區塊形態轉換為B像素值。 In Example 1, the coded information imaging unit 1033 converts the X-direction norm of the motion vector into an R-pixel value, converts the Y-direction norm of the motion vector into a G-pixel value, and converts the macroblock form into a B-pixel value.

在例2中，編碼資訊成像部1033將沒被包含在編碼資訊的固定值轉換為R像素值，將運動向量之Y方向範數轉換為G像素值，將運動向量之X方向範數轉換為B像素值。 In Example 2, the encoded information imaging unit 1033 converts the fixed value not included in the encoded information into an R pixel value, converts the Y-direction norm of the motion vector into a G pixel value, and converts the X-direction norm of the motion vector into B pixel value.

在例3中，編碼資訊成像部1033將巨集區塊形態轉換為RGB所有的像素值。 In Example 3, the encoded information imaging section 1033 converts the macroblock shape into all the pixel values of RGB.

又，編碼資訊成像部1033因應像素值資訊的傳送目的地之畫像辨識裝置，決定像素值的轉換方法。 Further, the coded information image forming unit 1033 determines a method of converting the pixel value in response to the image recognition device of the destination of the pixel value information.

在例4中，編碼資訊成像部1033以0~255之間規格化巨集區塊形態，將規格化後的巨集區塊形態轉換為R像素值，將運動向量之X方向範數轉換為G與B像素值。 In the example 4, the coded information imaging unit 1033 normalizes the macro block form from 0 to 255, converts the normalized macro block form into R pixel values, and converts the X direction norm of the motion vector into G and B pixel values.

在例5中，編碼資訊成像部1033將運動向量之X方向範數與Y方向範數的加總值轉換為R與G像素值，將巨集區塊形態轉換為B像素值。 In Example 5, the coded information imaging unit 1033 converts the total value of the X direction norm and the Y direction norm of the motion vector into R and G pixel values, and converts the macro block form into B pixel values.

又，編碼資訊成像部1033可以使用任意的計算式，從編碼資訊計算出像素值。 Further, the coded information image forming unit 1033 can calculate the pixel value from the coded information using an arbitrary calculation formula.

又，如圖12所示，除了巨集區塊形態、運動向量之外，將量子化步驟值、參考畫像資訊之最前端訊框的編號、最後端訊框的編號轉換為像素值亦可。 Further, as shown in FIG. 12, in addition to the macro block shape and the motion vector, the quantization step value, the number of the leading end frame of the reference portrait information, and the number of the last frame may be converted into pixel values.

又，轉換目的地之色彩空間種類為任意的。 Also, the type of color space of the conversion destination is arbitrary.

即，編碼資訊成像部1033除了將編碼資訊轉換為RGB色彩空間的像素值之外，也可以轉換為YUV色彩空間的像素值或是HSV色彩空間的像素值。 That is, the encoded information imaging unit 1033 can convert to the pixel value of the YUV color space or the pixel value of the HSV color space in addition to converting the encoded information into pixel values of the RGB color space.

又，如圖13所示，編碼資訊成像部1033配合編碼資訊的個數(例如：巨集區塊形態的個數、運動向量的個數)，調整轉換目的地的像素值亦可(圖13的例1)。 Further, as shown in FIG. 13, the coded information image forming unit 1033 adjusts the number of pieces of coded information (for example, the number of macroblock blocks and the number of motion vectors), and adjusts the pixel value of the conversion destination (FIG. 13). Example 1).

又，編碼資訊成像部1033例如在巨集區塊形態的個數與運動向量的個數不同的情況下，複製個數為少的參數，使巨集區塊形態的個數與運動向量的個數一致亦可(圖13的例2)。 Further, for example, when the number of macroblock patterns is different from the number of motion vectors, the coded information imaging unit 1033 copies the number of the number of macroblocks and the number of motion vectors. The number is also consistent (example 2 of Fig. 13).

***實施形態的效果說明*** *** Description of the effect of the implementation form***

如此一來，根據實施形態1，由於不須解碼畫像資訊，就可以依據包含在編碼資訊的運動向量擷取出運動區域，可以減低在擷取運動區域時的計算負荷。 In this way, according to the first embodiment, since it is not necessary to decode the portrait information, the motion region can be extracted based on the motion vector included in the encoded information, and the calculation load at the time of capturing the motion region can be reduced.

又，有關本實施形態之資訊處理裝置100具有從畫像編碼資訊取得編碼資訊的熵解碼部101，由於不須進行熵解碼以外的解碼處理，因此可以減低有關解碼處理的計算負荷。 Further, the information processing device 100 according to the present embodiment has the entropy decoding unit 101 that acquires the encoded information from the image coding information, and since the decoding processing other than the entropy decoding is not required, the calculation load on the decoding processing can be reduced.

又，在區域擷取部102中，因為只利用編碼資訊，因此與畫像資訊相比只要利用較少的資訊量就可以大概確定出動物體存在的區域。 Further, since the area capturing unit 102 uses only the coded information, it is possible to roughly determine the area where the animal body exists by using a smaller amount of information than the image information.

又，利用像素值轉換部103，可以產生適用於畫像辨識裝置之通知運動區域的畫像。 Further, the pixel value conversion unit 103 can generate an image suitable for the motion recognition area of the image recognition device.

再者，藉由利用像素值轉換部103將編碼資訊成像化，與處理畫像辨識裝置解碼的畫像資訊情況相比，可以減低在畫像辨識裝置的計算負荷。 Furthermore, by using the pixel value conversion unit 103 to image the encoded information, The calculation load of the image recognition device can be reduced as compared with the case of the image information decoded by the image recognition device.

與聲音資訊或文字資訊相比，處理畫像資訊情況下的計算負荷為高，而且與解碼的畫像資訊之尺寸增加成正比而使畫像辨識裝置的計算負荷變高，但是根據本實施形態，由於畫像辨識裝置並不是處理畫像資訊而是處理編碼資訊的像素值資訊，可以減低畫像辨識裝置的計算負荷。 Compared with the voice information or the text information, the calculation load in the case of processing the portrait information is high, and the calculation load of the image recognition device is increased in proportion to the size of the decoded image information, but according to the embodiment, the image is generated. The identification device does not process the image information but processes the pixel value information of the encoded information, which can reduce the calculation load of the image recognition device.

***硬體構成說明*** *** Hardware composition description ***

最後，進行資訊處理裝置100之硬體構成的補充說明。 Finally, a supplementary explanation of the hardware configuration of the information processing apparatus 100 is performed.

資訊處理裝置100為電腦。 The information processing device 100 is a computer.

圖14所示之處理器901為進行處理的IC(Integrated Circuit；積體電路)。 The processor 901 shown in FIG. 14 is an IC (Integrated Circuit) for processing.

處理器901為CPU(Central Processing Unit；中央處理單元)、DSP(Digital Signal Processor；數位訊號處理器)等。 The processor 901 is a CPU (Central Processing Unit), a DSP (Digital Signal Processor), or the like.

圖14所示之記憶裝置902為RAM(Random Access Memory；隨機存取記憶體)、ROM(Read Only Memory；唯讀記憶體)、快閃記憶體、HDD(Hard Disk Drive；硬碟驅動器)等。 The memory device 902 shown in FIG. 14 is a RAM (Random Access Memory), a ROM (Read Only Memory), a flash memory, an HDD (Hard Disk Drive), and the like. .

圖14所示之接收裝置903及傳送裝置904分別例如是通訊晶片或NIC(Network Interface Card；網路介面卡)。 The receiving device 903 and the transmitting device 904 shown in FIG. 14 are, for example, a communication chip or a NIC (Network Interface Card).

又，記憶裝置902中也記憶了OS(Operating System；作業系統)。 Further, the OS (Operating System) is also stored in the memory device 902.

接著，OS的至少一部份是藉由處理器901加以執行。 Next, at least a portion of the OS is executed by the processor 901.

處理器901一邊執行OS的至少一部份，一邊執行實現熵解碼部101、區域擷取部102及像素值轉換部103(以下統合此等稱為「部」)的機能之程式。 The processor 901 executes the entropy decoding unit 101, the region extracting unit 102, and the pixel value converting unit 103 while executing at least a part of the OS (the following is integrated) The function program called "part").

在圖14中，雖然是圖示1個處理器，但是資訊處理裝置100為具備多個處理器亦可。 In FIG. 14, although one processor is illustrated, the information processing device 100 may have a plurality of processors.

又，顯示「部」的處理結果之資訊或資料或訊號值或變數值記憶在記憶裝置902、或是處理器901內的電阻器或快取記憶體。 Further, the information or the data or the signal value or the variable value indicating the processing result of the "part" is stored in the memory device 902 or the resistor or the cache memory in the processor 901.

又，實現「部」的機能之程式被記憶在磁碟、軟碟、光碟、壓縮光碟、藍光(登錄商標)光碟、DVD等可攜式記憶媒體亦可。 In addition, the program that realizes the functions of "Min" is stored in a portable memory medium such as a disk, a floppy disk, a compact disc, a compact disc, a Blu-ray (registered trademark) disc, or a DVD.

又，將「部」取代讀為「處理電路」或「電路」或「步驟」或「順序」或「處理」亦可。 Also, it is also possible to replace "section" with "processing circuit" or "circuit" or "step" or "sequence" or "processing".

「處理電路」或「電路」不只是包含處理器901，也包含所謂邏輯IC或GA(Gate Array；閘陣列)或ASIC(Application Specific Integrated Circuit；特殊應用積體電路)或FPGA(Field-Programmable Gate Array；場式可程式閘陣列)之其他種類的處理電路之概念。 The "processing circuit" or "circuit" includes not only the processor 901 but also a so-called logic IC or GA (Gate Array) or an ASIC (Application Specific Integrated Circuit) or an FPGA (Field-Programmable Gate). Array; the concept of other types of processing circuits for field programmable gate arrays.

100‧‧‧資訊處理裝置 100‧‧‧Information processing device

101‧‧‧熵解碼部 101‧‧‧ Entropy Decoding Department

102‧‧‧區域擷取部 102‧‧‧Regional Acquisition Department

103‧‧‧像素值轉換部 103‧‧‧Pixel Value Conversion Department

Claims

An information processing apparatus includes: an entropy decoding unit that performs entropy decoding on image coded information obtained by encoding image information composed of a plurality of macroblocks, and extracts the plurality of macroblocks from the image coded information Each of the block corresponding settings includes at least a plurality of pieces of coded information of the motion vector; and the area extracting unit captures the portrait information according to the plurality of motion vectors included in the plurality of pieces of encoded information extracted by the entropy decoding unit The area with motion inside is used as the movement area.

The information processing device according to claim 1, wherein the information processing device further includes: acquiring encoded information corresponding to a macroblock constituting the motion region extracted by the region capturing unit, and acquiring the acquired information The pixel value conversion unit that converts the encoded information into pixel values.

The information processing device of claim 2, wherein the entropy decoding unit extracts, in addition to the motion vector, a macroblock shape, a quantization step, and a reference image information, respectively, from the image coding information The pixel value conversion unit converts at least one of the motion vector and the macroblock form and the quantization step and the reference image information into pixel values.

The information processing device of claim 1, wherein the area capturing unit configures the plurality of motion vectors included in the plurality of pieces of encoded information in the order of the macroblocks, according to the positions of the plurality of motion vectors. Before taking The area of motion.

The information processing device of claim 4, wherein the region capturing unit integrates motion vectors disposed at or above 2 in the plurality of motion vectors, and extracts the motion according to the position of the integrated motion vector region.

The information processing device according to the second aspect of the invention, wherein the pixel value conversion unit is configured by n (n is an integer of 2 or more) macroblocks in the motion region extracted by the region capturing unit. In this case, the n pieces of coded information corresponding to the n macroblocks are aggregated into m pieces of information (m is an integer of 1 or more and is a factor of n), and the m pieces of coded information to be merged are performed. Converted to pixel values separately.

The information processing device according to the second aspect of the invention, wherein the pixel value conversion unit extracts j (i is an integer of 1 or more) macroblocks by the region extracting unit ( When j is an integer of 2 or more) motion region, (i × j) pieces of coded information corresponding to (i × j) macroblocks included in the j motion regions are aggregated into i codes. The operation of the information converts the aggregated i-coded information into pixel values.

The information processing device according to the second aspect of the invention, wherein the pixel value conversion unit extracts a motion region composed of i (i is an integer of 1 or more) macroblocks by using the region capturing unit (j-1) (j is an integer of 2 or more) motion regions each composed of i macroblocks taken out by the above-mentioned area capturing unit, and obtained and combined with the included (j-1) sports area and utilization The (i×j) pieces of coded information corresponding to the (i×j) macroblocks of the j motion regions of the motion area extracted by the foregoing region extracting unit are aggregated into i coded information operations, and are aggregated The i coded information is converted into pixel values, respectively.

An information processing method for causing a computer to entropy decode image coded information obtained by encoding image information composed of a plurality of macroblocks, and extracting respective settings corresponding to the plurality of macroblocks from the image coded information The computer includes at least a plurality of coding information of the motion vector, and the computer captures the motiond region in the image information as the motion region according to the plurality of motion vectors included in the plurality of coded information.

An information processing program product that performs the following processing on a computer: entropy decoding processing for entropy decoding image coded information obtained by encoding image information composed of a plurality of macroblocks, and encoding information from the image And extracting, from the plurality of macroblocks corresponding to the plurality of macroblocks, a plurality of encoded information including at least a motion vector; and region extracting processing, which is based on the plurality of encoded information that is included in the entropy decoding process. The motion vector captures the moving area in the aforementioned portrait information as the motion area.