Detailed description of the invention
Hereinafter, referring to the drawings embodiment for the present invention is illustrated.It addition, embodiments described below is all
Represent a preferred concrete example of the present invention.The numerical value that represents i.e., in the following embodiments, shape, material, composition are wanted
Element, the configuration of element and connection form, step, the order etc. of step are examples, are not the meaning limiting the present invention.This
Invention is limited only by the appended claims.Thus, about in the element of following embodiment, in expression upper concept
Independent claims in the element that is silent on, be not intended to reach the purpose of the present invention and necessarily need, make
Illustrate for constituting preferred form.
(embodiment 1)
Fig. 1 is the module map of the structure representing the picture coding device about present embodiment.
Picture coding device 100 shown in Fig. 1 possess orthogonal transformation portion 102, quantization unit 103, re-quantization portion 105, inverse just
Transformation component 106, block storage 108, frame memory 109, infra-frame prediction portion 110, inter prediction portion 111, inter prediction is handed over to control
Portion 114, picture/mb-type determination section 113, reference picture list management portion 115, additional judging part 116, variable length code portion
104, subtraction portion 101, adder 107 and switch portion 112.
The orthogonal transformation portion 102 predicted image data to being generated by unit described later and the forecast error of input image sequence
Data, are carried out from image area to the conversion of frequency domain.The quantization unit 103 prediction error data amount of carrying out to transforming in frequency domain
Change processes.Re-quantization portion 105 carries out re-quantization process to by the prediction error data after quantization unit 103 quantification treatment.Inverse orthogonal
Prediction error data after re-quantization is processed by transformation component 106 is carried out from frequency domain to the conversion of image area.
Block storage 108 is used to what the prediction error data after processing according to predicted image data and re-quantization was obtained
The memorizer that decoding image preserves with block unit.Frame memory 109 is used to the memorizer preserved by decoding image with frame unit.
Picture/mb-type determination section 113 determines input picture sequence with which kind of picture/mb-type in I picture, B picture, P picture
Row coding, generates picture type information.Infra-frame prediction portion 110 uses the decoding figure of the block unit being saved in block storage 108
Picture, generates the predicted image data of infra-frame prediction based on coded object block.Inter prediction portion 111 uses and is saved in frame memory
The decoding image of the frame unit in 109, generates the predicted image data of inter prediction based on coded object block.
The reference picture list management portion 115 encoded reference picture distribution reference picture to reference in inter prediction
Index, makes with reference to list (reference picture list) together with DISPLAY ORDER etc..Picture coding device 100 is in order to for B picture
Two are kept with reference to list (L0, L1) with reference to two pictures.
Represent the example with reference to list in fig. 2.The 1st reference picture list (L0) of Fig. 2 is pre-with the bi-directional predicted the 1st
Survey the example of reference picture list corresponding to direction.In the 1st reference picture list of Fig. 2, it is the reference figure of 2 to DISPLAY ORDER
It is the reference picture index of 0 that sheet R1 is assigned with value.Additionally, to DISPLAY ORDER be 1 reference picture R2 to be assigned with value be the reference of 1
Picture indices.Further, to DISPLAY ORDER be 0 reference picture R3 be assigned with value be 2 reference picture index.
That is, in the 1st reference picture list of Fig. 2, reference picture is the nearest with coded object picture on DISPLAY ORDER, then
This reference picture is assigned with the least reference picture index.
On the other hand, the 2nd reference picture list (L1) of Fig. 2 is the reference corresponding with the 2nd bi-directional predicted prediction direction
The example of just list.In the 2nd reference picture list of Fig. 2, to DISPLAY ORDER be 1 reference picture R2 to be assigned with value be 0
Reference picture index.Additionally, to DISPLAY ORDER be 2 reference picture R1 to be assigned with value be the reference picture index of 1.Additionally,
To DISPLAY ORDER be 0 reference picture R3 be assigned with value be 2 reference picture index.
Like this, two sometimes different to the specific reference picture distribution being included in two reference picture lists ginsengs
Photograph and picture index (reference picture R1 of Fig. 2, R2).Additionally, it is sometimes the most specific to be included in two reference picture lists
Reference picture index (reference picture R3 of Fig. 2) that reference picture distribution is identical.
The prediction of the 1st reference picture list (L0) is only used to be referred to as L0 prediction.Only use the 2nd reference picture list (L1)
Prediction is referred to as L1 prediction.The prediction using the 1st reference picture list and the 2nd reference picture list both sides is referred to as bi-directional predicted or double
Prediction.
In L0 predicts, the situation using front as prediction direction is more.In L1 predicts, as prediction direction
The situation using rear is more.That is, the 1st reference picture list is configured to corresponding to the 1st prediction direction, the 2nd reference picture list
It is configured to corresponding to the 2nd prediction direction.
Based on these relations, it was predicted that direction be classified as the 1st prediction direction, the 2nd prediction direction and two-way in a certain kind.
Additionally, in the case of prediction direction is two-way, prediction direction is also embodied by bi-directional predicted or double prediction.
It addition, in the present embodiment, reference picture list management portion 115 utilizes reference picture index and DISPLAY ORDER pipe
Reason reference picture but it also may utilize the management reference pictures such as reference picture index and coded sequence.
Additionally, in following record, the 1st reference picture list is predicted corresponding to L0, and the 2nd reference picture list corresponds to
L1 predicts.Further, the 1st reference picture list corresponds to the 1st prediction direction, and the 2nd reference picture list corresponds to the 2nd prediction side
To.However, it can be that the 1st reference picture list is predicted corresponding to L1, the 2nd reference picture list is predicted corresponding to L0.Equally,
Can also be that the 1st reference picture list corresponds to the 1st prediction direction corresponding to the 2nd prediction direction, the 2nd reference picture list.
Add judging part 116 and use the 1st reference picture list made by reference picture list management portion 115 and the 2nd ginseng
Photograph and picture list, it may be judged whether add candidate's (predicted motion vectors candidates) of predicted motion vector.Specifically, described later
In method, add judging part 116 and judge whether candidate list (the predicted motion vector of the 2nd prediction direction to coded object block
Candidate list) add the motion vector of the 1st prediction direction as the candidate of predicted motion vector.Further, add judging part 116 to set
Fixed adding indicates.
Inter prediction control portion 114 determines the predicted motion vector for coding, to use multiple predicted motion vector to wait
That choose and by the error minimum of the motion vector of motion detection derivation predicted motion vectors candidates carries out motion vector
Coding.Here, error represents the difference value of predicted motion vectors candidates and the motion vector derived by motion detection.
Additionally, inter prediction control portion 114 generates the prediction fortune corresponding with the predicted motion vector determined by each piece
Dynamic vector indexes.And, it was predicted that motion vector index, the control information of predicted motion vectors candidates and reference picture index are sent out
Deliver to variable length code portion 104.
Variable length code portion 104 is to the prediction error data after quantification treatment, inter prediction Directional Sign, reference picture
Index and picture type information carry out variable length code process, thus generate bit stream.
Fig. 3 represents the summary of the handling process of the method for encoding images about present embodiment.Inter prediction control portion 114
Determine coded object block with the prediction direction (S101) in the case of motion vector detection pattern-coding.Then, inter prediction
Control portion 114 judges whether the prediction direction of motion vector detection pattern is bi-directional predicted (S102).
(S102 is) that inter prediction control portion 114 uses method described later in the case of prediction direction is bi-directional predicted
Calculate predicted motion vectors candidates list and the predicted motion vectors candidates list of the 2nd prediction direction of the 1st prediction direction respectively
(S103、S104)。
Then, add judging part 116 to judge whether the motion vector of the 1st prediction direction is appended to the 2nd prediction direction
Predicted motion vectors candidates list (S105).In the case of being judged as additional motion vector (S105 is), inter prediction control
The motion vector of the 1st prediction direction is appended to predicted motion vectors candidates list (S106) of the 2nd prediction direction by portion 114 processed.
Then, inter prediction control portion 114 is from the predicted motion vectors candidates list of the 1st prediction direction and the 2nd prediction side
To predicted motion vectors candidates list in select predicted motion vector the 2nd prediction direction pre-of the 1st prediction direction respectively
Survey motion vector.Further, variable length code portion 104 is by the predicted motion vector corresponding with selected each predicted motion vector
Index coding, is attached in bit stream (S107).
In the case of the prediction direction of motion vector detection pattern is single directional prediction (no in S102), inter prediction controls
Portion 114 calculates predicted motion vectors candidates list (S109) of the prediction direction corresponding with single directional prediction.Inter prediction control portion
114 select predicted motion vector from the predicted motion vectors candidates list of the prediction direction corresponding with single directional prediction.Further, may be used
Elongated degree encoding section 104, by the predicted motion vector index coding corresponding with selected predicted motion vector, is attached to bit
In stream (S110).
Finally, variable length code portion 104 would indicate that the inter prediction direction of prediction direction of motion vector detection pattern
Mark and reference picture index coding, be attached in bit stream (S108).
Then, the handling process of Fig. 4 is used to explain the decision of prediction direction of motion vector detection pattern of Fig. 3
Method (S101).Inter prediction control portion 114 to by the reference picture of the 1st prediction direction index the reference picture that determines and
Indexed, by the reference picture of the 2nd prediction direction, the reference picture determined and carry out motion detection.Further, inter prediction control portion 114
Generate the 1st motion vector the 1 and the 2nd motion vector (S201) relative to two reference pictures.
Here, inter prediction control portion 114 calculates the coded object block in encoded picture and reference figure in motion detection
The difference value of the block in sheet.Further, inter prediction control portion 114 is by multiple pieces in reference picture, difference value minimum
Block is determined as reference block.Further, inter prediction control portion 114 obtains fortune according to the position of coded object block and the position of reference block
Dynamic vector.
Then, inter prediction portion 111 uses the 1st motion vector obtained to generate the prognostic chart picture of the 1st prediction direction.Interframe
PREDICTIVE CONTROL portion 114 is such as calculated by the R-D optimizing model showed by Formula 4 below will coding by this prognostic chart picture
Cost Cost1 (S202) in the case of object block coding.
Cost=D+ λ × R ... (formula 4)
In formula 4, D presentation code distorts.Specifically, the prognostic chart picture generated by certain motion vector will be utilized to compile
Difference absolute value between the original pixel value of code object block encoding and decoding and the pixel value that obtains and coded object block and
Deng be used as D.Additionally, R represents generation code amount.Specifically, by order to the motion vector being used for generating prognostic chart picture is compiled
Code and the code amount that needs etc. are as R.Additionally, λ is Lagrange does not determines multiplier.
Then, inter prediction portion 111 uses the 2nd motion vector obtained to generate the prognostic chart picture of the 2nd prediction direction.And
And, inter prediction control portion 114 calculates Cost2 (S203) according to formula 4.
Then, inter prediction portion 111 uses the 1st motion vector obtained and the 2nd motion vector to generate two-way prognostic chart
Picture.Here, inter prediction portion 111 such as to the prognostic chart picture obtained according to the 1st motion vector and obtains according to the 2nd motion vector
Prognostic chart picture carry out being added average by each pixel, thus generate bidirectional predictive picture.Further, inter prediction control portion 114
CostBi (S204) is calculated according to formula 4.
Then, Cost1, Cost2 and CostBi are compared (S205) by inter prediction control portion 114.In CostBi minimum
In the case of (S205 is), the prediction direction of motion vector detection pattern is determined as bi-directional predicted by inter prediction control portion 114
(S206).In the case of CostBi is not minimum (no in S205), Cost1 with Cost2 is compared by inter prediction control portion 114
(S207)。
(S207 is) that motion vector detection pattern is determined by inter prediction control portion 114 in the case of Cost1 is less
It it is the single directional prediction (S208) of the 1st prediction direction.In the case of Cost1 is no small (no in S207), inter prediction control portion 114
Motion vector detection pattern is determined as the single directional prediction (S209) of the 2nd prediction direction.
It addition, in the present embodiment, inter prediction portion 111 has carried out each pixel when two-way prognostic chart picture generates
Addition average but it also may be weighted being added average etc..
Then, the handling process of Fig. 5 is used to explain the computational methods of predicted motion vectors candidates list of Fig. 3
(S103、S104、S109).Inter prediction control portion 114 determines that the adjacent adjacent block A in the left of coded object block, top are adjacent
Adjacent block B, the adjacent adjacent block C (S301) in upper right side.
Such as, inter prediction control portion 114 will be located in the pixel of left neighbour of pixel of top-left position of coded object block
Affiliated block is determined as adjacent block A.Further, inter prediction control portion 114 will be located in the picture of top-left position of coded object block
The block belonging to pixel of the upper neighbour of element is determined as adjacent block B.Further, inter prediction control portion 114 will be located in coded object block
The block belonging to pixel that the upper right of the pixel of upper-right position is adjacent is determined as adjacent block C.
Then, adjacent block A, B, C are judged whether to meet two condition both sides by inter prediction control portion 114 respectively
(S302).One in these two conditions is, adjacent block N (N be A, B, C some) have with corresponding to coded object block
The motion vector of the prediction direction that the prediction direction of motion vector is identical.Another is that the reference picture of adjacent block N is right with coding
As the reference picture of block is identical.
In the case of meeting two conditions for adjacent block N (in S302 being), inter prediction control portion 114 is by phase
The adjacent motion vectors of adjacent block N is appended in predicted motion vectors candidates list (S303).Additionally, inter prediction control portion 114
Calculate the intermediate value (median) of multiple motion vectors of multiple adjacent block, be appended in predicted motion vectors candidates list
(S304)。
It addition, in the process above, inter prediction control portion 114 will have and the motion vector corresponding to coded object block
The motion vector of adjacent block of the identical prediction direction of prediction direction be appended in predicted motion vectors candidates list.Further,
Inter prediction control portion 114 does not add the motion vector of the adjacent block with different prediction direction.But, inter prediction controls
The motion vector of the adjacent block with different prediction direction can also be appended to predicted motion vector and wait by portion 114 as 0
Select list.
Then, the handling process of Fig. 6 is used to explain the determining method (S105) of additional mark of Fig. 3.
Sometimes the reference picture of reference picture and the 2nd prediction direction represented by reference picture index of the 1st prediction direction
Reference picture represented by index is identical.In the case, the motion vector of the 1st prediction direction and the motion of the 2nd prediction direction
Vector has tendency relatively.
Thus, in the case, inter prediction control portion 114 is in bi-directional predicted, as the fortune of some prediction direction
The predicted motion vectors candidates of dynamic vector and add the motion vector of another prediction direction.Thus, picture coding device 100 energy
Enough the motion vector of some prediction direction is encoded expeditiously.
In the present embodiment, use the motion vector of the 1st prediction direction to the predicted motion vector of the 2nd prediction direction
The example that candidate adds.Alternatively, it is also possible to the motion vector of the 2nd prediction direction to be appended to the predicted motion of the 1st prediction direction
In vectors candidates.
First, add judging part 116 and obtain the reference picture that the motion about Fig. 4 detects the 1st prediction direction of (S201)
Reference picture index (S401, S402) of index and the 2nd prediction direction.Then, add judging part 116 and use the 1st reference picture
List and the 2nd reference picture list judge the reference picture represented by reference picture index and the 2nd prediction side of the 1st prediction direction
To reference picture index represented by reference picture the most identical (S403).
Such as, the reference picture that additional judging part 116 obtains the 1st prediction direction according to the 1st reference picture list indexes institute
The DISPLAY ORDER of the reference picture represented.Additionally, add judging part 116 to obtain the 2nd prediction direction according to the 2nd reference picture list
The DISPLAY ORDER of reference picture represented by reference picture index.Further, judging part 116 is added by the two DISPLAY ORDER ratio
Relatively, if they are identical, then it is judged as that two reference pictures are identical.
In the case of the reference picture of the 1st prediction direction is identical with the reference picture of the 2nd prediction direction (S403 is),
Add judging part 116 to be set as additional mark opening (ON) (S404).Reference picture and the 2nd prediction in the 1st prediction direction
In the case of the reference picture in direction differs (no in S403), add judging part 116 and additional mark is set to off (OFF)
(S405)。
It addition, in the present embodiment, add judging part 116 and use DISPLAY ORDER to judge two reference pictures whether phase
With.But, add judging part 116 coded sequence etc. can also be used to judge two reference pictures are the most identical.
Then, the example of the candidate list generated in the example of Figure 32 is represented in Fig. 7 A, Fig. 7 B.It is assumed here that
Following relation.That is, coded object block has motion vector MvL0 and the motion vector of the 2nd prediction direction of the 1st prediction direction
MvL1.Additionally, adjacent block has motion vector as shown in figure 32.Additionally, in each adjacent block, the reference of the 1st prediction direction
The reference picture of picture and the 2nd prediction direction is identical.Fig. 7 A and Fig. 7 B represents in this relation by the predicted motion vector of Fig. 3
The generation of candidate list processes the example of the predicted motion vectors candidates list that (S103~S106) generates.
In the predicted motion vectors candidates list of the 1st prediction direction of Fig. 7 A, with Median (MvL0_A, MvL0_B,
MvL0_C) corresponding predicted motion vector index is 0.The predicted motion vector index corresponding with motion vector MvL0_A is 1.With
Predicted motion vector index corresponding for motion vector MvL0_B is 2.The predicted motion vector rope corresponding with motion vector MvL0_C
Draw is 3.
Additionally, in the predicted motion vectors candidates list of the 2nd prediction direction of Fig. 7 B, with Median (MvL1_A, 0,
MvL0_C) corresponding predicted motion vector index is 0.The predicted motion vector index corresponding with motion vector MvL0_A is 1.With
Predicted motion vector index corresponding for motion vector MvL0_C is 2.Corresponding for motion vector MvL0 prediction with the 1st prediction direction
Motion vector index is 3.It addition, the method for salary distribution of predicted motion vector index is not limited to this example.
Fig. 8 represents the example of the code table of use when predicted motion vector index carries out variable length code.Prediction fortune
It is the shortest that dynamic vector indexes the least then code length.Inter prediction control portion 114 is to speculating that the candidate allocation that precision of prediction is higher is less
Predicted motion vector index.Thereby, it is possible to make code efficiency improve.
Then, the handling process of Fig. 9 is used to explain the predicted motion vector selection method (S107, S110) of Fig. 3.
Inter prediction control portion 114 sets 0 in count value as initialization, the maximum of setting value in minimum differential motion vector
Value etc. (S501).
Then, inter prediction control portion 114 judges whether to calculate the differential motion of whole predicted motion vectors candidates
Vector (S502).If there remains predicted motion vectors candidates (in S502 being), then inter prediction control portion 114 is by from fortune
Dynamic testing result vector deducts predicted motion vectors candidates, calculates differential motion vector (S503).
Then, inter prediction control portion 114 judges that the differential motion vector obtained is less than minimum differential motion vector
(S504).In the case of differential motion vector is less than minimum differential motion vector (S504 is), inter prediction control portion 114
Minimum differential motion vector and predicted motion vector index are updated (S505).
Then, inter prediction control portion 114 adds 1 (S506) to count value.Further, inter prediction control portion 114 is again
Judge whether ensuing predicted motion vectors candidates (S502).It is being judged as whole predicted motion vectors candidates meters
In the case of having calculated differential motion vector (no in S502), the minimum differential motion that inter prediction control portion 114 will finally determine
Vector and predicted motion vector index send to variable length code portion 104 so that it is coding (S507).
As above, according to present embodiment, as shown in Figure 10 A, represented by the motion vector of the 1st prediction direction
In the case of reference picture is identical with the reference picture represented by the motion vector of the 2nd prediction direction, by the fortune of the 1st prediction direction
Dynamic vector adds to the predicted motion vectors candidates of the 2nd prediction direction.On the other hand, as shown in Figure 10 B, in the 1st prediction direction
The situation different from the reference picture represented by the motion vector of the 2nd prediction direction of the reference picture represented by motion vector
Under, the motion vector of the 1st prediction direction is not added to the predicted motion vectors candidates of the 2nd prediction direction.
So, according to present embodiment, inter prediction control portion 114 is used for obtaining a prediction in bi-directional predicted middle use
The new computational methods of the predicted motion vector in direction.Thus, inter prediction control portion 114 derives and is best suited for coded object figure
The predicted motion vector of the coding of the motion vector of sheet.Thus, code efficiency improves.
Particularly, the ginseng of reference picture and the 2nd prediction direction represented by reference picture index of the 1st prediction direction sometimes
Reference picture represented by photograph and picture index is identical.In the case, inter prediction control portion 114 in bi-directional predicted, as
The predicted motion vectors candidates of the motion vector of some prediction direction and add the motion vector of another prediction direction.By
This, the motion vector of some prediction direction can be encoded by picture coding device 100 expeditiously.
It addition, in the present embodiment, illustrate in the predicted motion vectors candidates list of the 2nd prediction direction, add the
The example of the motion vector of 1 prediction direction but it also may add the in the predicted motion vectors candidates list of the 1st prediction direction
The motion vector of 2 prediction direction.
Additionally, in the case of the reference picture of the 1st prediction direction is different from the reference picture of the 2nd prediction direction, interframe
The motion vector of the 1st prediction direction can also be appended to the candidate list of the 2nd prediction direction by PREDICTIVE CONTROL portion 114.At two
In the case of reference picture difference, the most also improve code efficiency by the increase of candidate's number.
(embodiment 2)
Figure 11 is the module map of the structure representing the picture decoding apparatus about present embodiment.
Picture decoding apparatus 200 shown in Figure 11 possesses length-changeable decoding portion 204, re-quantization portion 205, inverse orthogonal transformation
Portion 206, adder 207, block storage 208, frame memory 209, infra-frame prediction portion 210, inter prediction portion 211, switch portion
212, inter prediction control portion 214, reference picture list management portion 215 and additional judging part 216.
Length-changeable decoding portion 204 carries out length-changeable decoding process to the bit stream of input.Further, length-changeable decoding
Portion 204 generates picture type information, inter-frame forecast mode, inter prediction Directional Sign, skips mark and quantization parameter.Re-quantization
Portion 205 carries out re-quantization process to quantization parameter.Inverse orthogonal transformation portion 206 will carry out the orthogonal transform coefficient after re-quantization process
Convert to image area from frequency domain, generate prediction error image data.
Block storage 208 is used to preserve with block unit prediction error image data be added with predicted image data and give birth to
The memorizer of the image sequence become.Frame memory 209 is used to the memorizer preserved by image sequence with frame unit.
The image sequence of the block unit that infra-frame prediction portion 210 is saved in block storage 208 by use is pre-in performing frame
Survey, generate the predicted image data of decoder object block.
Inter prediction portion 211 uses the image sequence of the frame unit being saved in frame memory 209 to perform inter prediction, by
This generates the predicted image data of decoder object block.Inter prediction control portion 214 is according to inter-frame forecast mode, inter prediction direction
And skip mark, control the motion vector in inter prediction and predicted image data generates method.
The decoded reference picture of reference in inter prediction is distributed reference picture by reference picture list management portion 215
Index, makes with reference to list (as Fig. 2 of embodiment 1) together with DISPLAY ORDER etc..B picture is quilt with reference to two pictures
Decoding.Therefore, reference picture list management portion 215 keeps two with reference to list.
It addition, about reference picture list management portion 215 reference picture index and the DISPLAY ORDER pipe of present embodiment
Reason reference picture.But, reference picture list management portion 215 can also be with reference picture index and coded sequence (decoding order)
Deng management reference picture.
Add judging part 216 and use the 1st reference picture list made by reference picture list management portion 215 and the 2nd ginseng
Photograph and picture list judges whether that the predicted motion vectors candidates list of the 2nd prediction direction to decoder object block adds the 1st prediction
The motion vector in direction.Further, add judging part 216 and set additional mark.It addition, add decision flow process and the embodiment party of mark
Fig. 6 of formula 1 is same, so omitting the description.
Finally, adder 207, by decoded prediction error image data being added with predicted image data, generates and solves
Code image sequence.
Figure 12 represents the summary of the handling process of the picture decoding method about present embodiment.First, inter prediction control
Portion 214 processed judges whether decoded prediction direction is two-way (S601).
Prediction direction after the decoding be two-way in the case of (S601 is), it is pre-that inter prediction control portion 214 calculates the 1st
Survey direction and predicted motion vectors candidates list (S602, S603) of the 2nd prediction direction.It addition, at predicted motion vectors candidates
The computational methods of list use Fig. 5 etc. of embodiment 1.Length-changeable decoding portion 204 goes out the 1st prediction according to bit stream decoding
The reference picture index of direction and the 2nd prediction direction.
Add judging part 216 to select from bit stream decoding from the predicted motion vectors candidates list of the 1st prediction direction
The predicted motion vector represented by predicted motion vector index of the 1st prediction direction.Further, inter prediction control portion 214 is to the 1st
The predicted motion vector of prediction direction is plus the differential motion vector of the 1st prediction direction after bit stream decoding.Thus, interframe
PREDICTIVE CONTROL portion 214 decodes the motion vector (S604) of the 1st prediction direction.
Add judging part 216 and judge whether to be appended to the motion vector of the 1st prediction direction the prediction fortune of the 2nd prediction direction
Dynamic vector candidate list (S605).(S605 is) that inter prediction control portion 214 is by the case of additional mark is on
The motion vector of 1 prediction direction is appended to predicted motion vectors candidates list (S606) of the 2nd prediction direction.It addition, expression is
The additional mark of the no motion vector adding the 1st prediction direction sets in the same manner as waiting with Fig. 6 of embodiment 1.
Inter prediction control portion 214 selects from bit stream solution from the predicted motion vectors candidates list of the 2nd prediction direction
The predicted motion vector represented by predicted motion vector index of the 2nd prediction direction of code.Further, inter prediction control portion 214
Predicted motion vector to the 2nd prediction direction adds the differential motion vector of the 2nd prediction direction gone out from bit stream decoding.By
This, inter prediction control portion 214 decodes the motion vector (S607) of the 2nd prediction direction.
In the case of prediction direction is not two-way (no in S601), i.e. in the case of inter prediction direction is unidirectional,
Inter prediction control portion 214 calculates predicted motion vectors candidates list (S608) of the prediction direction corresponding with single directional prediction.Frame
Between PREDICTIVE CONTROL portion 214 select decoded from the predicted motion vectors candidates list of the prediction direction corresponding with single directional prediction
Predicted motion vector represented by predicted motion vector index.Further, inter prediction control portion 214 calculates corresponding with single directional prediction
The motion vector (S609) of prediction direction.
So, according to present embodiment, inter prediction control portion 214 is used for obtaining a prediction in bi-directional predicted middle use
The new computational methods of the predicted motion vector in direction.Thus, the predicted motion deriving the decoding being best suited for motion vector is vowed
Amount.Additionally, bit stream higher for code efficiency can suitably be decoded by picture decoding apparatus 200.
Particularly, the reference picture represented by reference picture index of the 1st prediction direction and the 2nd prediction direction sometimes
Reference picture represented by reference picture index is identical.In the case, inter prediction control portion 214, in bi-directional predicted, makees
The motion vector of another prediction direction is added for the predicted motion vectors candidates of motion vector of some prediction direction.By
This, picture decoding apparatus 200 can will obtain by encoding the motion vector of some prediction direction expeditiously
Bit stream suitably decode.
It addition, about the inter prediction control portion 214 of the present embodiment predicted motion vectors candidates to the 2nd prediction direction
List adds the motion vector of the 1st prediction direction.But, inter prediction control portion 214 is as coding side, it is also possible to the 1st
The predicted motion vectors candidates list of prediction direction adds the motion vector of the 2nd prediction direction.
Additionally, inter prediction control portion 214 is as coding side, in two reference pictures corresponding with two prediction direction
In the case of difference, it is also possible to the predicted motion vectors candidates list of a prediction direction is added the fortune of another prediction direction
Dynamic vector.
Above, based on multiple embodiments, picture coding device for the present invention and picture decoding apparatus are said
Bright, but the present invention is not limited to these embodiments.These embodiments are implemented the change that those skilled in the art expects
Shape and the form that obtains and the element of these embodiments is at random combined and other form of realizing is also contained in
In the present invention.
For example, it is also possible to the place's reason other process portion performed in specifically process portion performs.In addition it is also possible to change is held
The order that row processes, it is also possible to multiple process are performed in parallel.
Additionally, the present invention can not only realize as picture coding device and picture decoding apparatus, and can as with
The method that processing mechanism is step constituting picture coding device and picture decoding apparatus realizes.Such as, these steps are by calculating
Machine performs.Further, the program that the present invention can be used as making computer perform to comprise step in these methods realizes.Enter
And, the present invention can have the record medium of the embodied on computer readable of the non-transitory of the CD-ROM etc. of this program in fact as record
Existing.
Additionally, picture coding device and picture decoding apparatus combine by will be contained in the element in them, as
Image encoding/decoding device realizes.
Additionally, the multiple elements being included in picture coding device and picture decoding apparatus can also be as integrated electricity
Road i.e. LSI (Large Scale Integration) realizes.These elements both could be separately formed 1 chip, it is possible to
To include that part or all forms 1 chip.Element beyond memorizer such as can also be formed as 1 core
Sheet.Here LSI it is set to, but according to the difference of integrated level, sometimes referred to as IC (Integrated Circuit), system LSI, super
Level LSI or superfine LSI.
Additionally, the method for integrated circuit is not limited to LSI, it is also possible to realized by special circuit or general processor.
The FPGA (Field Programmable Gate Array) that can program can also be used, maybe can reconstruct the electricity within LSI
The connection of road unit and the reconfigurable processor of setting.
And then, if because the integrated circuit of replacement LSI occur in the progress of semiconductor technology or other technology of derivation
Technology, then can certainly use the element that this technology carries out being included in picture coding device and picture decoding apparatus
Integrated circuit.
(embodiment 3)
By realizing the motion image encoding method (method for encoding images) shown in the respective embodiments described above or dynamic by being used for
The program of the structure of state picture decoding method (picture decoding method) recorded in storage medium, it is possible to by the respective embodiments described above
Shown process is implemented in independent computer system simply.Storage medium is disk, CD, photomagneto disk, IC-card, partly leads
Body memories etc., as long as be able to record that the medium of program is the most permissible.
And then, illustrate here the motion image encoding method (method for encoding images) that illustrates in the respective embodiments described above and
The application examples of dynamic image decoding method (picture decoding method) and its system of use.The system is characterized in that, have by
The picture coding that the picture coding device using method for encoding images and the picture decoding apparatus using picture decoding method are constituted
Decoding apparatus.About other structures of system, can according to circumstances and suitably change.
Figure 13 is the integrally-built figure of the content provider system ex100 representing and realizing content distribution service.Communication is taken
Business offer zoning be divided into desired size, be respectively arranged with in each community the base station ex106 as stationary wireless stations,
ex107、ex108、ex109、ex110。
This content provider system ex100 on the Internet ex101 via ISP ex102 and telephone network
Ex104 and base station ex106~ex110 is connected to computer ex111, PDA (Personal Digital Assistant)
Each equipment of ex112, photographing unit ex113, portable phone ex114, game machine ex115 etc..
But, content provider system ex100 is not limited to structure as Figure 13, it is also possible to by some factor combination even
Connect.In addition it is also possible to each equipment is connected directly between telephone network not via base station ex106~ex110 as stationary wireless stations
On ex104.In addition it is also possible to each equipment is directly connected to each other via near radio etc..
Photographing unit ex113 is the equipment that can carry out dynamic image photography of digital camera etc., and photographing unit ex116 is energy
Enough carry out the still image photographing of digital camera etc., the equipment of dynamic image photography.Additionally, portable phone ex114 is GSM
(Global System for Mobile Communications) mode, CDMA (Code Division Multiple
Access) mode, W-CDMA (Wideband-Code Division Multiple Access) mode or LTE (Long
Term Evolution) mode, the portable telephone of HSPA (High Speed Packet Access) or PHS (Personal
Handyphone System) etc., it is which kind of can.
In content provider system ex100, by photographing unit ex113 etc. is connected via base station ex109, telephone network ex104
It is connected on flow medium server ex103, it is possible to carry out on-the-spot distribution etc..At the scene in distribution, user is used photographing unit ex113
The content (image etc. that such as concert is on-the-spot) of photography carries out coded treatment as described above in the embodiments
(that is, the picture coding device as the present invention plays a role), sends to flow medium server ex103.On the other hand, flow medium
Server ex103 carries out stream distribution by sending the content-data come to the client having request.As client, having can be by upper
State the computer ex111 of the decoding of the data after coded treatment, PDAex112, photographing unit ex113, portable phone ex114, game machine
Ex115 etc..In receiving each equipment of data of distribution, the data decoding process received is reproduced (that is, as this
Bright picture decoding apparatus plays a role).
It addition, the coded treatment of the data of photography both can be carried out by photographing unit ex113, it is also possible to by carrying out sending out of data
The flow medium server ex103 sending process is carried out, it is also possible to mutually shares and carrying out.Equally, the decoding process of the data of distribution was both
Can be carried out by client, it is also possible to carried out by flow medium server ex103, it is also possible to mutually share and carrying out.Additionally, do not limit
In photographing unit ex113, it is also possible to by the rest image photographed by photographing unit ex116 and/or dynamic image data via computer
Ex111 sends to flow medium server ex103.Coded treatment in the case of this is situated between by photographing unit ex116, computer ex111, stream
Which of matter server ex103 is carried out can, it is also possible to mutually shares and carrying out.
Additionally, these coding and decodings process general process in the LSIex500 that computer ex111 or each equipment has.
LSIex500 both can be single-chip, it is also possible to be the structure being made up of multiple chips.Alternatively, it is also possible to by moving picture encoding
The software of decoding is encased in some record medium (CD-ROM, floppy disk, hard disk etc.) that can be read by computer ex111 etc.
In, use this software to carry out coding and decoding process.And then, in the case of portable phone ex114 is with photographing unit, it is also possible to
The dynamic image data obtained by this photographing unit is sent.Dynamic image data now is had by portable phone ex114
The data of LSIex500 coded treatment.
In addition it is also possible to be, flow medium server ex103 is multiple server or multiple computer, is data to be disperseed
Process, record and distribute.
As above, in content provider system ex100, client is able to receive that the data of coding reproduce.So,
In content provider system ex100, the information that user sends can be received, decode, reproduce, even not having by client in real time
The user having special right or equipment also is able to realize personal broadcaster.
It addition, be not limited to the example of content provider system ex100, as shown in figure 14, in digital broadcasting system
Ex200 also is able to load at least moving picture encoding device (picture coding device) of above-mentioned embodiment or dynamic image solution
Certain of code device (picture decoding apparatus).Specifically, in the ex201 of broadcasting station, will to image data multiplexing music number
The multiplex data obtained according to waiting transmits to communication or broadcasting satellite ex202 via electric wave.This image data be by above-mentioned respectively
Data after the motion image encoding method coding of explanation in embodiment (that is, are encoded by the picture coding device of the present invention
After data).The broadcasting satellite ex202 receiving these data sends the electric wave of broadcast, it is possible to this electric wave is carried out satellite wide
The antenna ex204 of the family broadcasting reception receives this electric wave, by television set (receiver) ex300 or Set Top Box (STB) ex217 etc.
Device the multiplex data received is decoded and is rendered (that is, the picture decoding apparatus as the present invention plays a role).
Furthermore, it is possible to the multiplex data recorded in the record medium ex215 of DVD, BD etc. is being read and is decoding or is inciting somebody to action
Image data coding writes reader/writer ex218 in record medium ex215 further according to situation and music signal multiplexing
In also be able to the dynamic image decoding device shown in the respective embodiments described above or moving picture encoding device are installed.In this situation
Under, can the signal of video signal that reproduce be shown on monitor ex219, be had the record medium ex215 of multiplex data by record
Signal of video signal can be reproduced in other devices or system.In addition it is also possible to be connected to cable television cable ex203 or
In Set Top Box ex217 on the antenna ex204 of satellite/terrestrial ripple broadcast, dynamic image decoding device is installed, is used television set
Monitor ex219 show.At this time it is also possible to be not to load dynamic image decoding device in television set at Set Top Box.
Figure 15 is to represent to use the dynamic image decoding method and moving picture encoding illustrated in the respective embodiments described above
The figure of television set (receiver) ex300 of method.Television set ex300 possesses via antenna ex204 or the line receiving above-mentioned broadcast
Cable ex203 etc. obtain or have exported image data multiplexing the tuner ex301 of the multiplex data of voice data, will receive
Multiplex data demodulation or be modulated to outside send coded data modulating/demodulating portion ex302 and will demodulation after multiplexing
Data separating be image data, voice data or by signal processing not ex306 coding image data, voice data multiplexing
Multiplexing/separated part ex303.
Additionally, television set ex300 possesses: have and voice data, image data are decoded respectively or respective information is compiled
The sound signal processing portion ex304 and signal of video signal process portion ex305 of code are (that is, as picture coding device or the figure of the present invention
As decoding apparatus plays a role) signal processing part ex306;There is the speaker ex307 exported by decoded acoustical signal
And show output unit ex309 of the display part ex308 of the display etc. of decoded signal of video signal.And then, television set ex300 has
Interface portion ex317 of operation inputting part ex312 etc. of the input of accepted user of getting everything ready operation.And then, television set ex300 has
The power circuit part ex311 merge the control portion ex310 controlling each portion, each portion being supplied electric power.Interface portion ex317 can also be removed
Beyond operation inputting part ex312, also there is the bridge part that the external equipment with reader/writer ex218 etc. is connected
Ex313, be used for installing the record medium ex216 of SD card etc. socket ex314, for Jie such as the external record of hard disk etc.
The modem ex316 etc. that driver ex315 that matter connects is connected with telephone network.It addition, record medium ex216 is can
The structure of the record of information is electrically carried out by the semiconductor memory component of the non-volatile/volatibility of harvesting.Television set
Each portion of ex300 is connected with each other via synchronous bus.
First, the knot multiplex data obtained from outside by antenna ex204 etc. being decoded, reproducing by television set ex300
Structure illustrates.Television set ex300 accepts the user operation from remote controller ex220 etc., based on the control portion with CPU etc.
The control of ex310, separates multiplex data multiplexing/separated part ex303 demodulated by modulating/demodulating portion ex302.And then, electricity
Depending on machine ex300, the voice data sound signal processing portion ex304 separated is decoded, by the image data signal of video signal of separation
Process portion ex305 uses the coding/decoding method decoding illustrated in the respective embodiments described above.By decoded acoustical signal, image letter
Export towards outside from output unit ex309 number respectively.Output time, can temporarily these signals be stored into buffer ex318,
In ex319 etc., so that acoustical signal and signal of video signal reproduced in synchronization.Additionally, television set ex300 may not be from broadcast wait,
And the multiplex data of coding is read from record medium ex215, ex216 of magnetic optical dish, SD card etc..Then, to television set ex300
Acoustical signal or signal of video signal encode, send to outside or are written to record the structure in medium etc. illustrate.Television set
Ex300 accepts the user operation from remote controller ex220 etc., and control based on control portion ex310, by sound signal processing portion
Signal of video signal use, by sound signal encoding, is illustrated in the respective embodiments described above by ex304 by signal of video signal process portion ex305
Coded method coding.By the acoustical signal after coding, signal of video signal multiplexing/separated part ex303 multiplexing, export to outside.
When multiplexing, can temporarily these signals be stored in buffer ex320, ex321 etc., so that acoustical signal and signal of video signal
Reproduced in synchronization.It addition, buffer ex318, ex319, ex320, ex321 both can possess multiple as illustrated, it is also possible to be
Share the structure of more than one buffer.And then, beyond diagram, it is also possible at such as modulating/demodulating portion ex302 or multiple
With also serving as avoiding the overflow of system, the buffer part of underflow to store data in a buffer between/separated part ex303 etc..
Additionally, television set ex300 is in addition to waiting from broadcast or record acquirement voice data, the image datas such as medium, also
Can possess the structure of the AV input accepting mike or photographing unit, the data obtained from them are carried out coded treatment.Separately
Outward, here, television set ex300 is said as the structure that can carry out above-mentioned coded treatment, multiplexing and outside output
Bright but it also may can not to be carried out these and process, and be only capable of carrying out above-mentioned reception, decoding process, the structure of outside output.
Additionally, multiplex data is being read or in the case of write from record medium by reader/writer ex218, on
State decoding process or coded treatment by television set ex300, reader/writer ex218 which carry out can, it is also possible to be
Television set ex300 and reader/writer ex218 are mutually shared and being carried out.
As an example, reading in or information regeneration/record portion ex400 in the case of write of data will be carried out from CD
Representation is in figure 16.Information regeneration/record portion ex400 possess the unit ex401 of following description, ex402, ex403,
ex404、ex405、ex406、ex407.The shaven head ex401 recording surface irradiating laser speckle to recording medium ex215 as CD
And write information, detect the reflection light of the recording surface from record medium ex215 and read in information.Modulation record portion ex402 electricity
Gas ground drives the semiconductor laser being built in shaven head ex401, carries out the modulation of laser according to record data.Reproduce demodulation section
Ex403 obtains electrically being detected by the photodetector being built in shaven head ex401 from the reflection light of recording surface again
Existing signal amplifies, and record signal component in record medium ex215 is separated and demodulated, the information required for reproduction.Buffering
Device ex404 will be used for the information in record medium ex215 that recorded and temporarily keeps from the information recording medium ex215 reproduction.
Table motor ex405 makes record medium ex215 rotate.Servo control portion ex406 is while the rotation controlling table motor ex405 drives one
While make shaven head ex401 move to the information track of regulation, carry out the tracking process of laser spot.Systems control division ex407 carries out letter
The control that breath reproduction/record portion ex400 is overall.The place reason systems control division ex407 of above-mentioned read and write utilizes and keeps
Various information in buffer ex404, carry out the generation of new information further according to needs, add and while make tune
Record portion ex402 processed, reproduce demodulation section ex403, servo control portion ex406 coordination, on one side carry out letter by bare headed ex401
The record of breath reproduces and realizes.Systems control division ex407 is such as made up of microprocessor, is come by the program performing to read write
Perform their process.
Above, it is assumed that shaven head ex401 irradiating laser speckle and be illustrated but it also may be to use to connect near field light and carry out height
The structure of the record of density.
It is denoted as the schematic diagram of the record medium ex215 of CD in fig. 17.Recording surface at record medium ex215
On, spirally it is formed with guide channel (ditch), in information track ex230, has recorded beforehand through the change of shape of ditch
The address information of the absolute position on indicating panel.This address information includes the record block for determining the unit as record data
The information of the position of ex231, reads address believe by being reproduced by information track ex230 in carrying out the device recording and reproducing
Breath, it is possible to determine record block.Additionally, record medium ex215 includes data recording area ex233, inner region ex232, periphery
Region ex234.The region used to record user data is data recording area ex233, is arranged in and compares data recorded area
Specific beyond the record of user data of territory ex233 inner circumferential side or inner region ex232 of periphery and outer region ex234
Purposes.The sound that the data recording area ex233 of such record medium ex215 is encoded by information regeneration/record portion ex400
The read-write of the coded data of these data of sound data, image data or multiplexing.
Above, it is illustrated as a example by the CD of DVD, BD etc. of lifting 1 layer, but is not limited to these, it is also possible to be many
Layer constructs, also is able to beyond surface the CD of record.In addition it is also possible to be the identical local use different wave length at dish
The light recording information of color or from different layer of information of various angle recordings etc., the structure that carries out the record/reproduction of multidimensional
CD.
Additionally, in digital broadcasting with in system ex200, it is also possible to by there is the car ex210 of antenna ex205 from satellite
Dynamic image is reproduced in the display device of the automatic navigator ex211 etc. that ex202 etc. receive data, have at car ex210.Separately
Outward, the structure of automatic navigator ex211 can consider such as to add in the structure shown in Figure 15 the structure in GPS receiver portion,
It is also contemplated that same structure in computer ex111 and portable phone ex114 etc..
Figure 18 A is to represent to use the dynamic image decoding method and moving picture encoding side illustrated in the above-described embodiment
The figure of the portable phone ex114 of method.Portable phone ex114 has by the antenna for receiving and dispatching electric wave between the ex110 of base station
Ex350, can filmed image, the camera section ex365 of rest image, display by the image photographed by camera section ex365, by
The display part ex358 of the liquid crystal display etc. of the decoded data such as the image that antenna ex350 receives.Portable phone ex114
Also have comprise operated key portion ex366 main part, for carry out the speaker etc. of voice output audio output unit ex357,
Be used for carrying out sound input unit ex356 of the mike etc. of sound input, preserve photograph image, rest image, recording
Data after the coding of sound or the image received, rest image, mail etc. or the memorizer portion of decoded data
Ex367 or as with as preserve the socket ex364 of interface portion between the record medium of data.
And then, use Figure 18 B that the structure example of portable phone ex114 is illustrated.Portable phone ex114 is for merging control
Fixture is for display part ex358 and master control part ex360 in each portion of the main part of operated key portion ex366, by power circuit part
Ex361, operation input control portion ex362, signal of video signal process portion ex355, camera interface portion ex363, LCD (Liquid
Crystal Display: liquid crystal display) control portion ex359, modulating/demodulating portion ex352, multiplexing/separated part ex353, sound
Signal processing part ex354, socket ex364, memorizer portion ex367 are connected with each other via bus ex370.
If power circuit part ex361 makes end of conversation and power key become opening by the operation of user, then lead to
Crossing, from set of cells, each portion is supplied electric power, portable phone ex114 starts as can the state of action.
Portable phone ex114 control based on master control part ex360 with CPU, ROM and RAM etc., at voice call mould
During formula, digital sound message will be transformed to by the acoustical signal of sound input unit ex356 collection sound by sound signal processing portion ex354
Number, it is carried out wave spectrum DIFFUSION TREATMENT with modulating/demodulating portion ex352, transmission/reception unit ex351 implements digitaltoanalogconversion
Process and after frequency conversion process through being sent by antenna ex350.Additionally, portable phone ex114 is when voice calling mode, will be by
Frequency conversion process is amplified and implemented to the reception data that antenna ex350 receives and analog-to-digital conversion processes, and uses modulating/demodulating
Portion ex352 carries out wave spectrum counter diffusion process, after being transformed to analoging sound signal by sound signal processing portion ex354, by its warp
Exported by audio output unit ex357.
And then, in the case of sending Email when data communication mode, by by the operated key portion ex366 of main part
Deng operation input Email text data via operation input control portion ex362 send to master control part ex360.Main
Text data modulating/demodulating portion ex352 is carried out wave spectrum DIFFUSION TREATMENT by control portion ex360, real by transmission/reception unit ex351
Execute digitaltoanalogconversion to process and after frequency conversion process, send to base station ex110 via antenna ex350.Receiving electronics postal
In the case of part, the data received are performed the substantially inversely processing of above-mentioned process, and exports display part ex358.
When data communication mode, in the case of sending image, rest image or image and sound, signal of video signal
The signal of video signal supplied from camera section ex365 is compiled by process portion ex355 by the dynamic image shown in the respective embodiments described above
Code method is compressed encoding (that is, the picture coding device as the present invention plays a role), is sent by the image data after coding
Go out to multiplexing/separated part ex353.It addition, sound signal processing portion ex354 is to by camera section ex365 filmed image, static
Encode with the acoustical signal of sound input unit ex356 collection sound during image etc., the voice data after coding is sent
To multiplexing/separated part ex353.
Multiplexing/separated part ex353 is by the way of regulation, to after the coding that signal of video signal process portion ex355 supplies
Image data and the voice data after the coding of sound signal processing portion ex354 supply carry out multiplexing, its result are obtained
Multiplex data modulating/demodulating portion (modulation/demodulation circuit portion) ex352 carries out wave spectrum DIFFUSION TREATMENT, by transmission/reception unit
Ex351 implements digitaltoanalogconversion and processes and after frequency conversion process, sends via antenna ex350.
Receive the data of the dynamic image file being linked to homepage etc. when data communication mode in the case of, or connect
Receive in the case of addition of the Email of image or sound, in order to the multiplex data received via antenna ex350
Being decoded, multiplexing/separated part ex353, by being separated by multiplex data, is divided into the bit stream of image data and voice data
Bit stream, supplies the image data after coding to signal of video signal process portion ex355 via synchronous bus ex370, and will coding
After voice data to sound signal processing portion ex354 supply.Signal of video signal process portion ex355 by with the respective embodiments described above
The dynamic image decoding method that shown motion image encoding method is corresponding is decoded, and is thus decoded signal of video signal
(that is, the picture decoding apparatus as the present invention plays a role), shows such as from display part ex358 via LCD control portion ex359
It is linked to image, the rest image comprised in the dynamic image file of homepage.It addition, sound signal processing portion ex354 is to sound
Signal is decoded, and exports sound from audio output unit ex357.
Additionally, the terminal of above-mentioned portable phone ex114 etc. is as television set ex300, except having encoder, decoder
Beyond both transmitting-receiving type terminals, it is also contemplated that only 3 kinds of the reception terminal of the transmission terminal of encoder, only decoder
Installation form.It addition, in digital broadcasting with in system ex200, it is set to send, receives in image data multiplexing music data
It is illustrated etc. the multiplex data obtained, but the number of multiplexing and the character data etc. of image association in addition to voice data
According to can also, be not multiplex data but image data itself can also.
So, the motion image encoding method represented in the respective embodiments described above or dynamic image decoding method are used in
In which kind of equipment above-mentioned, system can, by such manner, it is possible to the effect that illustrated in the respective embodiments described above.
Additionally, the present invention is not limited to such above-mentioned embodiment, it is possible to carry out without departing from the scope of the present invention
Various deformation or correction.
(embodiment 4)
Can also be by by the motion image encoding method illustrated in the respective embodiments described above or device and foundation
Motion image encoding method or the device of the standard that MPEG-2, MPEG4-AVC, VC-1 etc. are different are the most suitably cut
Change, generate image data.
Here, in the case of generating respectively according to multiple image datas of different standards, when decoding, need to select
Coding/decoding method corresponding to each standard.But, due to can not identify image data to be decoded according to which standard, so produce
The problem that life can not select suitable coding/decoding method.
In order to solve this problem, in image data, the multiplex data of multiplexing voice data etc. uses and comprises expression image
Data are according to the structure of the identification information of which standard.Hereinafter, illustrate to include by moving of illustrating in the respective embodiments described above
The image data that state method for encoding images or device generate is in the concrete structure of interior multiplex data.Multiplex data is MPEG-
The digital stream that 2 transmission are streamed.
Figure 19 is the figure of the structure representing multiplex data.As shown in figure 19, multiplex data by by video flowing, audio stream,
1 multiplexing carried out above in presentation graphic stream (PG), interactive graphic stream and obtain.Video flowing represents main image and the pair of film
Image, audio stream (IG) represents the master voice part of film and the secondary sound mixed with this master voice, and presentation graphic stream represents electricity
The captions of shadow.Here, so-called main image, represent the display common image on picture, so-called subpictures, is in main image
The image shown with less picture.Additionally, interactive graphic stream represents the dialogue made by configuring GUI component on picture
Picture.Video flowing is by the motion image encoding method that illustrates in the respective embodiments described above or device, according to conventional
The motion image encoding method of the standards such as MPEG-2, MPEG4-AVC, VC-1 or device code.Audio stream by Dolby AC-3,
The mode of Dolby Digital Plus, MLP, DTS, DTS-HD or linear PCM etc. encodes.
The each stream being included in multiplex data is identified by PID.Such as, to the video flowing used in the image of film
Distribution 0x1011, distributes 0x1100 to 0x111F to audio stream, distributes 0x1200 to 0x121F to Presentation Graphics, to interactive graphics (IG)
Stream distribution 0x1400 to 0x141F, in the subpictures of film use video flowing distribute 0x1B00 to 0x1B1F, to master
Audio stream distribution 0x1A00 to the 0x1A1F used in the secondary sound of sound mixture.
Figure 20 is to show schematically the figure how multiplex data is re-used.First, the video being made up of multiple frame of video
Stream ex235, the audio stream ex238 being made up of multiple audio frames are transformed to PES packet sequence ex236 and ex239 respectively, and are transformed to
TS bag ex237 and ex240.Equally, the data of presentation graphic stream ex241 and interactive graphics (IG) ex244 are transformed to PES bag sequence respectively
Row ex242 and ex245, then it is transformed to TS bag ex243 and ex246.Multiplex data ex247 is by using 1 by these TS cladding
Stream is constituted.
Figure 21 represents how to preserve video flowing in PES packet sequence in more detail.1st segment table of Figure 21 shows regarding of video flowing
Frequently frame sequence.2nd segment table shows PES packet sequence.Multiple works as shown in arrow yy1, yy2, yy3, yy4 of Figure 21, in video flowing
For Video Presentation Unit I picture, B picture, P picture is divided by each picture and is saved in having of PES bag
In effect load.Each PES bag has PES head, in PES head, preserves the PTS in the display moment as picture
(Presentation Time-Stamp) and the DTS (Decoding Time-Stamp) in the decoding moment as picture.
Figure 22 represents the form of the TS bag being finally written in multiplex data.TS bag is by letters such as the PID with identification stream
The bag of the 188 byte regular lengths that the TS payload of the TS head of 4 bytes of breath and 184 bytes of preservation data is constituted, above-mentioned
PES is coated segmentation and is saved in TS payload.In the case of bd-rom, TS bag is given to the TP_ of 4 bytes
Extra_Header, constitutes the source bag of 192 bytes, is written in multiplex data.ATS is being had described in TP_Extra_Header
(Arrival_Time_Stamp) information such as.ATS represents that this TS bag transfers start time to the pid filter of decoder.?
In multiplex data, source bag arranges as shown in Figure 22 hypomere, and number incremental from the beginning of multiplex data is referred to as SPN, and (source is wrapped
Number).
Additionally, in the TS bag that multiplex data is comprised, in addition to each stream of image, sound, captions etc., also PAT
(Program Association Table)、PMT(Program Map Table)、PCR(Program Clock
Reference) etc..Pat table shows what the PID of the PMT used in multiplex data is, the PID of PAT self is registered as 0.
PMT has PID and the attribute of the stream corresponding with each PID of each stream of image that multiplex data comprised, sound, captions etc.
Information, also has the various descriptors about multiplex data.In descriptor, there is copying of instruction permission/license multiplex data
The copy control information etc. of shellfish.PCR is in order to obtain ATC (Arrival Time Clock) and the conduct of the time shaft as ATS
The synchronization of the STC (System Time Clock) of the time shaft of PTS and DTS, has to be coated with this PCR and transfers to decoder
The information of STC time corresponding for ATS.
Figure 23 is the figure of the data configuration explaining PMT.In the beginning of PMT, it is configured with to describe and is included in this PMT
In the PMT head of length etc. of data.Face behind, is configured with multiple descriptor about multiplex data.Above-mentioned copy controls
Information etc. are recited as descriptor.After descriptor, it is configured with multiple stream letter about each stream being included in multiplex data
Breath.Stream information is by the stream type of the voice compression codecs recorded for identifying stream, the PID of stream, attribute information (the frame speed of stream
Rate, aspect ratio etc.) flow descriptors constitute.There is the quantity of stream present in multiplex data in flow descriptors.
In the case of recorded in record medium etc., above-mentioned multiplex data is remembered together with multiplex data message file
Record.
Multiplex data message file as shown in figure 24, is the management information of multiplex data, the most right with multiplex data
Should, it is made up of multiplex data information, stream attribute information and entry maps.
Multiplex data information as shown in figure 24, is made up of system velocity, reproduction start time, reproduction finish time.System
The maximum to the pid filter of System Target Decoder described later of rate representation multiplex data transfers speed.It is included in multiplexing
The interval of the ATS in data is set as becoming below system velocity.Reproduce the frame of video that start time is the beginning of multiplex data
PTS, reproducing finish time is set as that the PTS of frame of video of the end to multiplex data is plus the value of reproduction interval of 1 frame amount.
Stream attribute information as shown in figure 25, is registered with the attribute about each stream being included in multiplex data by each PID
Information.Attribute information has by video flowing, audio stream, presentation graphic stream, interactive graphic stream and different information.Video flowing attribute
It is many that information has this video flowing by the compression of what kind of voice compression codecs, the resolution of each image data that constitutes video flowing
Less, aspect ratio be how many, frame rate be how many etc. information.Audio stream attribute information has this audio stream by what kind of compression is compiled
The letter that decoder compression, the channel number that is included in this audio stream be how many, is how many etc. corresponding to which kind of language, sample frequency
Breath.In the initialization of these information decoder before reproducing at player etc..
In the present embodiment, stream type in above-mentioned multiplex data, that be included in PMT is used.Additionally, be situated between at record
In the case of record has multiplex data in matter, use the video flowing attribute information being included in multiplex data information.Specifically,
In the motion image encoding method shown in the respective embodiments described above or device, following steps or unit are set, this step or list
Unit is to the stream type being included in PMT or video flowing attribute information, and setting expression is by illustrating in the respective embodiments described above
Motion image encoding method or the intrinsic information of image data that generates of device.By this structure, it is possible to be identified by upper
State the motion image encoding method shown in each embodiment or the image data of device generation and the image according to other standards
Data.
Additionally, represent the step of the dynamic image decoding method of present embodiment in fig. 26.In step exS100, from
Multiplex data obtains the stream type being included in PMT or the video flowing attribute information being included in multiplex data information.Connect
, in step exS101, it is judged that stream type or video flowing attribute information indicate whether it is by the respective embodiments described above
The multiplex data that the motion image encoding method illustrated or device generate.Further, it is being judged as stream type or video flowing attribute letter
In the case of breath is the multiplex data of the motion image encoding method by illustrating in the respective embodiments described above or device generation,
In step exS102, it is decoded by the dynamic image decoding method illustrated in the respective embodiments described above.Additionally, in stream class
Type or video flowing attribute information represent the multiplex data of the standard being based on conventional MPEG-2, MPEG4-AVC, VC-1 etc.
In the case of, in step exS103, by being decoded according to the dynamic image decoding method of conventional standard.
So, by setting new eigenvalue in stream type or video flowing attribute information, can interpolate that when decoding be
The no dynamic image decoding method that can pass through to illustrate in the respective embodiments described above or device decode.Thus, it is being transfused to
In the case of the multiplex data of different standards, it is also possible to select suitable coding/decoding method or device, it is possible to do not send out
Life is decoded mistakenly.Additionally, by the motion image encoding method illustrated in the present embodiment or device or Dynamic Graph
As coding/decoding method or device are used in any of the above described equipment, system.
(embodiment 5)
The motion image encoding method illustrated in the respective embodiments described above and device, dynamic image decoding method and device
Typically can be realized by the LSI as integrated circuit.As an example, represent the knot of the LSIex500 of 1 chip in figure 27
Structure.LSIex500 possess the unit ex501 of following description, ex502, ex503, ex504, ex505, ex506, ex507, ex508,
Ex509, each unit connects via bus ex510.Power circuit part ex505 by the case of being on state at power supply to respectively
Portion's supply electric power, starts as can the state of action.
Such as in the case of carrying out coded treatment, LSIex500 is based on having CPUex502, Memory Controller
The control of the control portion ex501 of ex503, stream controller ex504, driving frequency control portion ex512 etc., by AV I/Oex509
AV signal is inputted from mike ex117 and photographing unit ex113 etc..The AV signal being transfused to is temporarily stored within the outside of SDRAM etc.
Memorizer ex511 in.Control based on control portion ex501, by the data that store according to treating capacity and processing speed suitably
How inferior it is divided into, sends to signal processing part ex507, signal processing part ex507 carries out coding and/or the shadow of acoustical signal
The coding of image signal.Here, the coded treatment of signal of video signal is the coded treatment illustrated in the respective embodiments described above.At signal
In process portion ex507, carry out the process of image data multiplexing etc. of the voice data by coding and coding always according to situation, from
Stream I/Oex506 exports to outside.The bit of this output flows to base station ex107 send or be written to record in medium ex215.
It addition, when multiplexing, can temporarily data be stored in buffer ex508 so that it synchronizes.
It addition, in above-mentioned, if the structure of the outside that memorizer ex511 is LSIex500 is illustrated but it also may
The structure being included in the inside of LSIex500.Buffer ex508 is also not limited to one, it is also possible to possess multiple buffering
Device.Additionally, LSIex500 both can form 1 chip, it is also possible to form multiple chip.
Additionally, in above-mentioned, it is assumed that control portion ex501 has CPUex502, Memory Controller ex503, stream controller
Ex504, driving frequency control portion ex512 etc., but the structure of control portion ex501 is not limited to this structure.For example, it is also possible to be
Signal processing part ex507 is also equipped with the structure of CPU.By being also provided with CPU in the inside of signal processing part ex507, it is possible to enter
One step improves processing speed.Additionally, as other examples, it is also possible to it is that CPUex502 possesses signal processing part ex507 or as letter
The structure in the such as sound signal processing portion of a part of number process portion ex507.In this case, control portion ex501 is
Possesses the structure with signal processing part ex507 or part thereof of CPUex502.
It addition, be set to LSI here, but according to the difference of integrated level, also there are referred to as IC, system LSI, super (super)
LSI, the situation of superfine (ultra) LSI.
Additionally, the method for integrated circuit is not limited to LSI, it is also possible to realized by special circuit or general processor.
The FPGA (Field Programmable Gate Array) that can program after LSI manufactures can also be utilized, maybe can reconstruct
The connection of the circuit unit within LSI and the reconfigurable processor of setting.
And then, if there is replacing the integrated circuit of LSI because of the progress of semiconductor technology or the other technologies of derivation
Technology, then this technology can certainly be used to carry out the integrated of functional module.It is likely to be the application etc. of biotechnology.
(embodiment 6)
At the image data that the motion image encoding method by illustrating in the respective embodiments described above or device are generated
In the case of decoding, it is contemplated that with the feelings by the image data according to standards such as conventional MPEG-2, MPEG4-AVC, VC-1
Condition compares treating capacity can be increased.Therefore, in LSIex500, need to be set as that ratio is by the image data solution according to conventional standard
The higher driving frequency of driving frequency of CPUex502 during code.But, if setting high by driving frequency, then occur to consume electricity
The problem that power uprises.
In order to solve this problem, the dynamic image decoding device of television set ex300, LSIex500 etc. uses and identifies image number
According to according to which standard and according to standard switching driving frequency structure.Figure 28 represents structure ex800 of present embodiment.Drive
Dynamic frequency switching part ex803 is the motion image encoding method by illustrating in the respective embodiments described above or dress at image data
In the case of putting generation, driving frequency is set to height.Further, dynamic image execution illustrated in the respective embodiments described above
Image data is decoded by the decoding process portion ex801 instruction of coding/decoding method.On the other hand, it is based on conventional mark at image data
In the case of accurate image data, it is the motion image encoding method by illustrating in the respective embodiments described above with image data
Or the situation of data that device generates compares, driving frequency is set to low.Further, to the decoding process according to conventional standard
Image data is decoded by portion ex802 instruction.
More particularly, driving frequency switching part ex803 is by the CPUex502 of Figure 27 and driving frequency control portion ex512 structure
Become.Additionally, perform decoding process portion ex801, the Yi Jiyi of the dynamic image decoding method illustrated in the respective embodiments described above
The decoding process portion ex802 of the most past standard is corresponding to the signal processing part ex507 of Figure 27.CPUex502 identifies image data
According to which standard.Further, based on the signal from CPUex502, driving frequency control portion ex512 sets driving frequency.This
Outward, based on the signal from CPUex502, signal processing part ex507 carries out the decoding of image data.Here it is possible to consider at shadow
As the identification of data using such as in the identification information described in embodiment 4.About the information of identification, it is not limited in reality
Execute the information described in mode 4, as long as be capable of identify that image data is the most permissible according to the information of which standard.Such as, at base
In identifying that image data is used in television set and is still used in the external signal of dish etc., it is capable of identify that image data is according to which
In the case of standard, it is also possible to be identified based on such external signal.Additionally, the selection example of the driving frequency of CPUex502
As the standard of image data and driving frequency are set up corresponding look-up table carry out as shown in figure 30 can be considered.By look-up table
Being pre-reserved in the internal storage of buffer ex508 or LSI, CPUex502 is by referring to this look-up table, it is possible to select to drive
Dynamic frequency.
Figure 29 represents the step of the method implementing present embodiment.First, in step exS200, at signal processing part
In ex507, from multiplex data, obtain identification information.Then, in step exS201, in CPUex502, based on identifying letter
Breath identifies whether image data is the coded method by illustrating in the respective embodiments described above or the data of device generation.At shadow
In the case of data are the data generated by the coded method shown in the respective embodiments described above or device, in step
In exS202, CPUex502 sends to driving frequency control portion ex512 and driving frequency is set to high signal.Further, driving
Dynamic frequency control portion ex512 is set as high driving frequency.On the other hand, represent be based on conventional MPEG-2,
In the case of the image data of the standard of MPEG4-AVC, VC-1 etc., in step exS203, CPUex502 is to driving frequency
Control portion ex512 sends and driving frequency is set to low signal.Further, in driving frequency control portion ex512, be set as with
Image data is that the situation of the data of the coded method by illustrating in the respective embodiments described above or device generation is compared lower
Driving frequency.
And then, change LSIex500 by interlocking with the switching of driving frequency or include that the device of LSIex500 applies
Voltage, it is possible to improve further power savings.Such as, driving frequency is set to low in the case of, therewith, permissible
Compared with consideration is set to high situation with by driving frequency, by the voltage to LSIex500 or the device applying including LSIex500
It is set to low.
As long as additionally, driving frequency is set in the case for the treatment of capacity that the establishing method of driving frequency is when decoding is big
Treating capacity high, when decoding little in the case of driving frequency is set to low the most permissible, be not limited to above-mentioned setting
Method.For example, it is possible to consider will be according to the treating capacity of the image data decoding of MPEG4-AVC standard more than will be by upper
In the case of stating the treating capacity of image data decoding of the motion image encoding method shown in each embodiment or device generation,
With the setting that above-mentioned situation is driven frequency on the contrary.
And then, the establishing method of driving frequency is not limited to the structure making driving frequency low.For example, it is also possible to consider
Identification information is the motion image encoding method by illustrating in the respective embodiments described above or the image data of device generation
In the case of, the voltage of LSIex500 or the device applying including LSIex500 will be set to height, be based on conventional in expression
In the case of the image data of the standard of MPEG-2, MPEG4-AVC, VC-1 etc., will be to LSIex500 or include LSIex500
Device apply voltage be set to low.Additionally, as another example, it is also possible to consider to represent it is by above-mentioned in identification information
In the case of the image data that motion image encoding method shown in each embodiment or device generate, do not make CPUex502's
Drive and stop, in the situation of the image data representing the standard being based on conventional MPEG-2, MPEG4-AVC, VC-1 etc.
Under, more than needed owing to having in processes, so that CPUex502 driving suspend.It is also contemplated that represent it is to pass through in identification information
In the case of the image data that the motion image encoding method illustrated in the respective embodiments described above or device generate, as long as also existing
Having in process has more than needed then makes the driving of CPUex502 suspend.In this case, it is possible to consider to be based on conventional with expression
The situation of the image data of the standard of MPEG-2, MPEG4-AVC, VC-1 etc. is compared, and dwell time is set to short.
So, switch driving frequency according to the standard of image data institute foundation, it is possible to realize energy-saving technology.Additionally,
In the case of the device that use battery drives LSIex500 or includes LSIex500, it is possible to extend battery along with economize on electricity
Life-span.
(embodiment 7)
In the above-mentioned equipment such as television set, portable phone, system, sometimes it is transfused to the multiple shadows according to different standards
As data.So, so that also be able to decoding in the case of being transfused to according to multiple image datas of different standards,
The signal processing part ex507 of LSIex500 needs corresponding to multiple standards.But, if be used alone corresponding to each standard
, then there is the problem that circuit scale becomes greatly, cost increases in addition of LSIex500 in signal processing part ex507.
In order to solve this problem, use and will be used for the dynamic image decoding method performing to illustrate in the respective embodiments described above
Decoding process portion and according to conventional MPEG-2, MPEG4-AVC, VC-1 etc. standard a decoding process portion part altogether
Structure.The ex900 of Figure 31 A represents this structure example.Such as, the moving image decoding side illustrated in the respective embodiments described above
The dynamic image decoding method of method and foundation MPEG4-AVC standard is in entropy code, re-quantization, deblocking filter, motion compensation etc.
Process in some process content common.Following structure can be considered: about common process content, share corresponding to
The decoding process portion ex902 of MPEG4-AVC standard, about a mode of the present invention not corresponding to MPEG4-AVC standard
Other process content specific, uses special decoding process portion ex901.Particularly, the present invention is in terms of motion compensation
There is feature, it can be considered to such as use special decoding process portion ex901 for motion compensation, in addition
Some in entropy code, de-blocking filter, re-quantization or whole process, share decoding process portion.About decoding process portion
Share, it is also possible to be following structure: about common process content, share be used for perform to illustrate in the respective embodiments described above
The decoding process portion of dynamic image decoding method, about processing content specific to MPEG4-AVC standard, use special
Decoding process portion.
Additionally, represent will process another example that a part shares with the ex1000 of Figure 31 B.In this embodiment, use use with
One mode of the present invention is specific processes special decoding process portion ex1001 corresponding to content and the conventional mark with other
Special decoding process portion ex1002 corresponding to content and the Dynamic Graph with a mode in the present invention is processed specific to accurate
As the decoding shared that the process content that coding/decoding method is common with in the dynamic image decoding method of other conventional standard is corresponding
The structure of process portion ex1003.Here, special decoding process portion ex1001, ex1002 is not necessarily of the present invention
Process content specific to mode or other conventional standard and become privileged, can be the general place being able to carry out other
The structure of reason.It is further possible to installed the structure of present embodiment by LSIex500.
So, for dynamic image decoding method and the dynamic image solution of conventional standard of a mode in the present invention
Process content common in code method, shares decoding process portion, it is possible to reduce the circuit scale of LSI and reduce cost.
Industrial applicibility
Method for encoding images for the present invention and picture decoding method such as can be at television set, digital video records
Machine, automobile navigation instrument, portable phone, digital camera or digital video camcorder etc. use.
Symbol description
100 picture coding devices
101 subtraction portion
102 orthogonal transformation portions
103 quantization units
104 variable length code portions
105,205 re-quantization portion
106,206 inverse orthogonal transformation portion
107,207 adder
108,208 block storage
109,209 frame memory
110,210 infra-frame prediction portion
111,211 inter prediction portion
112,212 switch portion
113 picture/mb-type determination sections
114,214 inter prediction control portion
115,215 reference picture list management portion
116,216 judging part is added
200 picture decoding apparatus
204 length-changeable decoding portions