CN103004204B - Method for encoding images - Google Patents

Method for encoding images Download PDF

Info

Publication number
CN103004204B
CN103004204B CN201180035076.2A CN201180035076A CN103004204B CN 103004204 B CN103004204 B CN 103004204B CN 201180035076 A CN201180035076 A CN 201180035076A CN 103004204 B CN103004204 B CN 103004204B
Authority
CN
China
Prior art keywords
motion vector
mentioned
reference picture
picture
prediction direction
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201180035076.2A
Other languages
Chinese (zh)
Other versions
CN103004204A (en
Inventor
杉尾敏康
西孝启
柴原阳司
笹井寿郎
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sun Patent Trust Inc
Original Assignee
Sun Patent Trust Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sun Patent Trust Inc filed Critical Sun Patent Trust Inc
Priority claimed from PCT/JP2011/007319 external-priority patent/WO2012090495A1/en
Publication of CN103004204A publication Critical patent/CN103004204A/en
Application granted granted Critical
Publication of CN103004204B publication Critical patent/CN103004204B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The method for encoding images of the predicted motion vector of the raising of the code efficiency that a kind of derivation is suitable for motion vector is provided.Method for encoding images uses the 1st motion vector and the 2nd motion vector to perform prediction, and coded object block is encoded, position in the 1st reference picture that 1st motion vector instruction the 1st reference picture list is comprised, position in the 2nd reference picture that 2nd motion vector instruction the 2nd reference picture list is comprised, this method for encoding images includes: add step, and the candidate list of the predicted motion vector used in the coding of the 2nd motion vector is added the 1st motion vector candidate as predicted motion vector;Select step, from the candidate list comprising the 1st motion vector, select the predicted motion vector used in the coding of the 2nd motion vector;And coding step, use selected predicted motion vector by the 2nd motion vector encoder.

Description

Method for encoding images
Technical field
The present invention relates to perform prediction and method for encoding images and the execution prediction of picture coding are carried out image solution The picture decoding method of code.
Background technology
The direction in space that picture coding device typically utilizes image (including rest image and dynamic image) to be had is timely Between direction redundancy by quantity of information compress.As the method for the redundancy utilizing direction in space, use the conversion to frequency domain. As the method for the redundancy using time orientation, use inter prediction.Inter prediction is also referred to as inter-picture prediction.
Use the picture coding device of inter prediction when by certain coding of graphics, use on DISPLAY ORDER relative to volume Code object picture is that the encoded picture of front or behind is as reference picture.Further, picture coding device detection coding is right As picture is relative to the motion vector of this reference picture.
Then, picture coding device carries out motion compensation based on motion vector, obtains predicted image data.Then, image Code device obtains the difference of predicted image data and the view data of coded object picture.Then, picture coding device is by institute The difference obtained encodes.Thus, the redundancy of time orientation is removed by picture coding device.
About being referred to as the picture coding device of standardized picture coding mode (with reference to non-patent literature 1) H.264 it is The compression of quantity of information and use these 3 kinds of picture/mb-type of I picture, P picture and B picture.This picture coding device is for I picture not Carry out inter prediction.That is, picture coding device carries out infra-frame prediction to I picture.Infra-frame prediction is also referred to as intra-picture prediction.
Additionally, picture coding device is for P picture, with reference to the front or behind that DISPLAY ORDER is coded object picture 1 picture of coding, carries out inter prediction.Additionally, picture coding device is for B picture, it is coded object with reference to DISPLAY ORDER Encoded two picture of the front or behind of picture carries out inter prediction.
In being referred to as picture coding mode H.264, as the coding mode of the coded object block in B picture, there is motion Vector detection pattern.In motion vector detection pattern, picture coding device uses the fortune of reference picture detection coded object block Dynamic vector.Further, picture coding device uses reference picture and motion vector to generate predicted image data.Further, picture coding Device by predicted image data and the difference value of the view data of coded object block and makes in the generation of predicted image data Motion vector encoder.
In motion vector detection pattern, as described above, with reference to the volume of front or behind of coded object picture Two pictures of code generate the bi-directional predicted of prognostic chart picture.Additionally, in motion vector detection pattern, with reference to coded object Encoded 1 picture of the front or behind of picture generates the single directional prediction of prognostic chart picture.Further, for coded object Block, selects a certain kind in bi-directional predicted and single directional prediction.
In motion vector detection pattern, when by motion vector encoder, picture coding device is according to encoded adjacent The motion vector of block etc. generates predicted motion vector.Further, picture coding device is by the difference of motion vector Yu predicted motion vector Divide and encode.Thus, picture coding device cuts down quantity of information.As specific example, illustrate with reference to Figure 32.
In Figure 32, illustrate coded object block, adjacent block A, adjacent block B and adjacent block C.Adjacent block A is relative to volume The coded block that code object block is adjacent in left.Adjacent block B is the coded block the most adjacent relative to coded object block.Phase Adjacent block C is the coded block adjacent in upper right side relative to coded object block.
Additionally, in Figure 32, adjacent block A is encoded with bi-directional predicted, there is the motion vector MvL0_A of the 1st prediction direction Motion vector MvL1_A with the 2nd prediction direction.
Additionally, adjacent block B is encoded with single directional prediction, there is the motion vector MvL0_B of the 1st prediction direction.Additionally, phase Adjacent block C is encoded with bi-directional predicted, has motion vector MvL0_C and the motion vector of the 2nd prediction direction of the 1st prediction direction MvL1_C.Additionally, coded object block is intended to the block with bi-directional predictive coding, have the 1st prediction direction motion vector MvL0 and The motion vector MvL1 of the 2nd prediction direction.
Picture coding device is when encoding the motion vector MvL0 of the 1st prediction direction of coded object block, according to having The adjacent block of the motion vector of the 1st prediction direction generates the predicted motion vector PMvL0 corresponding with the 1st prediction direction.More specifically Say, picture coding device uses the motion vector MvL0_A of adjacent block A, the motion vector MvL0_B of adjacent block B and adjacent The motion vector MvL0_C of block C generates predicted motion vector PMvL0.
That is, picture coding device is when encoding the motion vector MvL0 of the 1st prediction direction of coded object block, uses phase The motion vector of the 1st prediction direction of adjacent block.Further, picture coding device will be as motion vector MvL0 and predicted motion vector The differential motion vector coding of the difference of PMvL0.
Predicted motion vector PMvL0 uses as the intermediate value being used for calculating motion vector MvL0_A, MvL0_B, MvL0_C The Median (MvL0_A, MvL0_B, MvL0_C) of the formula of (median) etc. calculate.Here, Median Formula 1 below~formula 3 performances.
[numerical expression 1]
Median (x, y, z)=x+y+z-Min (x, Min (y, z))-Max (x, Max (y, z)) ... (formula 1)
[numerical expression 2]
[numerical expression 3]
Picture coding device is when encoding the motion vector MvL1 of the 2nd prediction direction of coded object block, according to having The adjacent block of the motion vector of the 2nd prediction direction, generates the predicted motion vector PMvL1 corresponding with the 2nd prediction direction.More specifically Say, picture coding device use adjacent block A motion vector MvL1_A and adjacent block C motion vector MvL1_C generate prediction Motion vector PMvL1.
That is, picture coding device is when encoding the motion vector MvL1 of the 2nd prediction direction of coded object block, uses phase The motion vector of the 2nd prediction direction of adjacent block.Further, picture coding device will be as motion vector MvL1 and predicted motion vector The differential motion vector of the difference of PMvL1 encodes.Predicted motion vector PMvL1 use Median (MvL1_A, 0, MvL0_ Etc. C) calculate.
Prior art literature
Non-patent literature
Non-patent literature 1:ITU-T is H.26403/2010
Brief summary of the invention
The problem that invention is to be solved
But, in the case of the motion vector of identical prediction direction is less, make in the calculating of predicted motion vector The quantity of motion vector also tail off.In this case, the raising of the code efficiency of motion vector is hampered.
As described above, in the computational methods of conventional predicted motion vector, picture coding device is bi-directional predicted In the case of, calculate the 1st prediction direction and the predicted motion vector of the 2nd prediction direction independently.Therefore, at predicted motion vector The motion vector used in calculating is restricted.Thus, it is impossible to derive optimum motion vector, hinder carrying of code efficiency High.
Summary of the invention
So, it is an object of the invention to provide a kind of derivation be suitable for motion vector code efficiency raising prediction fortune The method for encoding images of dynamic vector and picture decoding method.
Means for solving the above
In order to solve the problems referred to above, method for encoding images for the present invention, use the 1st motion vector and the 2nd motion to vow Amount performs prediction, and is encoded by coded object block, the 1st reference that the 1st motion vector instruction the 1st reference picture list is comprised Position in picture, the position in the 2nd reference picture that the 2nd motion vector instruction the 2nd reference picture list is comprised, above-mentioned Method for encoding images includes: add step, to the candidate list of the predicted motion vector of the coding for above-mentioned 2nd motion vector Add above-mentioned 1st motion vector, as the candidate of above-mentioned predicted motion vector;Select step, from comprising above-mentioned 1st motion vector Above-mentioned candidate list in, select the above-mentioned predicted motion vector of coding for above-mentioned 2nd motion vector;And coding step Suddenly, use selected above-mentioned predicted motion vector by above-mentioned 2nd motion vector encoder.
In addition it is also possible to be, in above-mentioned additional step, identical with above-mentioned 2nd reference picture in above-mentioned 1st reference picture In the case of, add above-mentioned 1st motion vector to above-mentioned candidate list.
In addition it is also possible to be, in above-mentioned selection step, select in the multiple candidates being included in above-mentioned candidate list , minimum relative to the error of above-mentioned 2nd motion vector candidate is as above-mentioned predicted motion vector.
In addition it is also possible to be, in above-mentioned additional step, add to above-mentioned candidate list and detected by motion and detect Above-mentioned 1st motion vector.
In addition it is also possible to be, in above-mentioned additional step, with multiple times of multiple index values Yu above-mentioned predicted motion vector The mode of choosing correspondence one to one adds index value and the candidate of above-mentioned predicted motion vector to above-mentioned candidate list;In above-mentioned choosing Select in step, select index value as above-mentioned predicted motion vector from above-mentioned candidate list;In above-mentioned coding step, also with rope Draw the longest mode of the biggest then code length of value by selected above-mentioned index value coding.
In addition it is also possible to be, in above-mentioned additional step, add the left neighbour of above-mentioned coded object block to above-mentioned candidate list The upper right of the motion vector of block, the motion vector of block of upper neighbour of above-mentioned coded object block and above-mentioned coded object block adjacent The motion vector of block as the candidate of above-mentioned predicted motion vector.
Picture decoding method for the present invention, uses the 1st motion vector and the 2nd motion vector to perform prediction, will decoding Object block decodes, the position in the 1st reference picture that the 1st motion vector instruction the 1st reference picture list is comprised, and the 2nd Position in the 2nd reference picture that motion vector instruction the 2nd reference picture list is comprised, above-mentioned picture decoding method includes: Adding step, the candidate list to the predicted motion vector of the decoding for above-mentioned 2nd motion vector adds above-mentioned 1st motion and vows Measure the candidate as above-mentioned predicted motion vector;Select step, select from the above-mentioned candidate list comprising above-mentioned 1st motion vector Select the above-mentioned predicted motion vector of decoding for above-mentioned 2nd motion vector;And decoding step, use selected above-mentioned pre- Survey motion vector by above-mentioned 2nd motion vector decoder.
In addition it is also possible to be, in above-mentioned additional step, identical with above-mentioned 2nd reference picture in above-mentioned 1st reference picture In the case of, above-mentioned candidate list is added above-mentioned 1st motion vector.
In addition it is also possible to be, in above-mentioned additional step, add to above-mentioned candidate list and detected by motion and detect Above-mentioned 1st motion vector.
In addition it is also possible to be, in above-mentioned additional step, with multiple times of multiple index values Yu above-mentioned predicted motion vector The mode of choosing correspondence one to one adds index value and the candidate of above-mentioned predicted motion vector to above-mentioned candidate list;In above-mentioned solution In code step, also by the biggest by index value, the index value decoding encoded in the way of code length is the longest;Above-mentioned selection step is from above-mentioned Candidate list selects the above-mentioned predicted motion vector corresponding with decoded above-mentioned index value.
In addition it is also possible to be, in above-mentioned additional step, add the left neighbour of above-mentioned decoder object block to above-mentioned candidate list The upper right of the motion vector of block, the motion vector of block of upper neighbour of above-mentioned decoder object block and above-mentioned decoder object block adjacent The motion vector of block as the candidate of above-mentioned predicted motion vector.
Invention effect
By the present invention, derive and be suitable for the predicted motion vector that the code efficiency of motion vector improves.Thus, motion is vowed The code efficiency of amount improves.
Accompanying drawing explanation
Fig. 1 is the structure chart of the picture coding device about embodiment 1.
Fig. 2 is the figure of the example representing two reference picture lists about embodiment 1.
Fig. 3 is the flow chart of the action representing the picture coding device about embodiment 1.
Fig. 4 is to represent the flow chart that the decision of the prediction direction about embodiment 1 processes.
Fig. 5 is to represent the flow chart that the calculating of the candidate list about embodiment 1 processes.
Fig. 6 is to represent the flow chart that the judgement of the additional mark about embodiment 1 processes.
Fig. 7 A is the figure of the example of the candidate list representing the 1st prediction direction about embodiment 1.
Fig. 7 B is the figure of the example of the candidate list representing the 2nd prediction direction about embodiment 1.
Fig. 8 is the figure of the example of the symbol representing the predicted motion vector index about embodiment 1.
Fig. 9 is to represent the figure that the selection of the predicted motion vector about embodiment 1 processes.
Figure 10 A be represent two reference pictures about embodiment 1 identical in the case of the figure of example.
Figure 10 B is the figure of the example in the case of representing about two reference picture differences of embodiment 1.
Figure 11 is the structure chart of the picture decoding apparatus about embodiment 2.
Figure 12 is the flow chart of the action representing the picture decoding apparatus about embodiment 2.
Figure 13 is the overall structure figure of the content provider system realizing content distribution service.
Figure 14 is the overall structure figure of digital broadcasting system.
Figure 15 is the module map of the structure example representing television set.
Figure 16 is to represent to record, to as CD, the structure example in information regeneration/record portion that medium carries out the read-write of information Module map.
Figure 17 is the figure of the structure example of the record medium being denoted as CD.
Figure 18 A is the figure of the example representing portable phone.
Figure 18 B is the module map of the structure example representing portable phone.
Figure 19 is the figure of the structure representing multiplex data.
Figure 20 is to show schematically the figure how each stream is re-used in multiplex data.
Figure 21 is to represent the figure that how video flowing is saved in PES packet sequence in more detail.
Figure 22 is the figure of the structure representing the TS bag of multiplex data and source bag.
Figure 23 is the figure of the data structure representing PMT.
Figure 24 is the figure of the internal structure representing multiplex data information.
Figure 25 is the figure of the internal structure representing stream attribute information.
Figure 26 is the figure representing the step identifying image data.
Figure 27 is the integrated electricity of motion image encoding method and the dynamic image decoding method representing and realizing each embodiment The module map of the structure example on road.
Figure 28 is the figure of the structure representing switching driving frequency.
Figure 29 is the figure representing the step identifying image data, switching driving frequency.
Figure 30 is the figure representing the example that the standard of image data establishes corresponding look-up table with driving frequency.
Figure 31 A is the figure of the example representing the structure module of signal processing part shared.
Figure 31 B is the figure of another example representing the structure module of signal processing part shared.
Figure 32 is the figure of the example of presentation code object block and 3 adjacent blocks.
Detailed description of the invention
Hereinafter, referring to the drawings embodiment for the present invention is illustrated.It addition, embodiments described below is all Represent a preferred concrete example of the present invention.The numerical value that represents i.e., in the following embodiments, shape, material, composition are wanted Element, the configuration of element and connection form, step, the order etc. of step are examples, are not the meaning limiting the present invention.This Invention is limited only by the appended claims.Thus, about in the element of following embodiment, in expression upper concept Independent claims in the element that is silent on, be not intended to reach the purpose of the present invention and necessarily need, make Illustrate for constituting preferred form.
(embodiment 1)
Fig. 1 is the module map of the structure representing the picture coding device about present embodiment.
Picture coding device 100 shown in Fig. 1 possess orthogonal transformation portion 102, quantization unit 103, re-quantization portion 105, inverse just Transformation component 106, block storage 108, frame memory 109, infra-frame prediction portion 110, inter prediction portion 111, inter prediction is handed over to control Portion 114, picture/mb-type determination section 113, reference picture list management portion 115, additional judging part 116, variable length code portion 104, subtraction portion 101, adder 107 and switch portion 112.
The orthogonal transformation portion 102 predicted image data to being generated by unit described later and the forecast error of input image sequence Data, are carried out from image area to the conversion of frequency domain.The quantization unit 103 prediction error data amount of carrying out to transforming in frequency domain Change processes.Re-quantization portion 105 carries out re-quantization process to by the prediction error data after quantization unit 103 quantification treatment.Inverse orthogonal Prediction error data after re-quantization is processed by transformation component 106 is carried out from frequency domain to the conversion of image area.
Block storage 108 is used to what the prediction error data after processing according to predicted image data and re-quantization was obtained The memorizer that decoding image preserves with block unit.Frame memory 109 is used to the memorizer preserved by decoding image with frame unit.
Picture/mb-type determination section 113 determines input picture sequence with which kind of picture/mb-type in I picture, B picture, P picture Row coding, generates picture type information.Infra-frame prediction portion 110 uses the decoding figure of the block unit being saved in block storage 108 Picture, generates the predicted image data of infra-frame prediction based on coded object block.Inter prediction portion 111 uses and is saved in frame memory The decoding image of the frame unit in 109, generates the predicted image data of inter prediction based on coded object block.
The reference picture list management portion 115 encoded reference picture distribution reference picture to reference in inter prediction Index, makes with reference to list (reference picture list) together with DISPLAY ORDER etc..Picture coding device 100 is in order to for B picture Two are kept with reference to list (L0, L1) with reference to two pictures.
Represent the example with reference to list in fig. 2.The 1st reference picture list (L0) of Fig. 2 is pre-with the bi-directional predicted the 1st Survey the example of reference picture list corresponding to direction.In the 1st reference picture list of Fig. 2, it is the reference figure of 2 to DISPLAY ORDER It is the reference picture index of 0 that sheet R1 is assigned with value.Additionally, to DISPLAY ORDER be 1 reference picture R2 to be assigned with value be the reference of 1 Picture indices.Further, to DISPLAY ORDER be 0 reference picture R3 be assigned with value be 2 reference picture index.
That is, in the 1st reference picture list of Fig. 2, reference picture is the nearest with coded object picture on DISPLAY ORDER, then This reference picture is assigned with the least reference picture index.
On the other hand, the 2nd reference picture list (L1) of Fig. 2 is the reference corresponding with the 2nd bi-directional predicted prediction direction The example of just list.In the 2nd reference picture list of Fig. 2, to DISPLAY ORDER be 1 reference picture R2 to be assigned with value be 0 Reference picture index.Additionally, to DISPLAY ORDER be 2 reference picture R1 to be assigned with value be the reference picture index of 1.Additionally, To DISPLAY ORDER be 0 reference picture R3 be assigned with value be 2 reference picture index.
Like this, two sometimes different to the specific reference picture distribution being included in two reference picture lists ginsengs Photograph and picture index (reference picture R1 of Fig. 2, R2).Additionally, it is sometimes the most specific to be included in two reference picture lists Reference picture index (reference picture R3 of Fig. 2) that reference picture distribution is identical.
The prediction of the 1st reference picture list (L0) is only used to be referred to as L0 prediction.Only use the 2nd reference picture list (L1) Prediction is referred to as L1 prediction.The prediction using the 1st reference picture list and the 2nd reference picture list both sides is referred to as bi-directional predicted or double Prediction.
In L0 predicts, the situation using front as prediction direction is more.In L1 predicts, as prediction direction The situation using rear is more.That is, the 1st reference picture list is configured to corresponding to the 1st prediction direction, the 2nd reference picture list It is configured to corresponding to the 2nd prediction direction.
Based on these relations, it was predicted that direction be classified as the 1st prediction direction, the 2nd prediction direction and two-way in a certain kind. Additionally, in the case of prediction direction is two-way, prediction direction is also embodied by bi-directional predicted or double prediction.
It addition, in the present embodiment, reference picture list management portion 115 utilizes reference picture index and DISPLAY ORDER pipe Reason reference picture but it also may utilize the management reference pictures such as reference picture index and coded sequence.
Additionally, in following record, the 1st reference picture list is predicted corresponding to L0, and the 2nd reference picture list corresponds to L1 predicts.Further, the 1st reference picture list corresponds to the 1st prediction direction, and the 2nd reference picture list corresponds to the 2nd prediction side To.However, it can be that the 1st reference picture list is predicted corresponding to L1, the 2nd reference picture list is predicted corresponding to L0.Equally, Can also be that the 1st reference picture list corresponds to the 1st prediction direction corresponding to the 2nd prediction direction, the 2nd reference picture list.
Add judging part 116 and use the 1st reference picture list made by reference picture list management portion 115 and the 2nd ginseng Photograph and picture list, it may be judged whether add candidate's (predicted motion vectors candidates) of predicted motion vector.Specifically, described later In method, add judging part 116 and judge whether candidate list (the predicted motion vector of the 2nd prediction direction to coded object block Candidate list) add the motion vector of the 1st prediction direction as the candidate of predicted motion vector.Further, add judging part 116 to set Fixed adding indicates.
Inter prediction control portion 114 determines the predicted motion vector for coding, to use multiple predicted motion vector to wait That choose and by the error minimum of the motion vector of motion detection derivation predicted motion vectors candidates carries out motion vector Coding.Here, error represents the difference value of predicted motion vectors candidates and the motion vector derived by motion detection.
Additionally, inter prediction control portion 114 generates the prediction fortune corresponding with the predicted motion vector determined by each piece Dynamic vector indexes.And, it was predicted that motion vector index, the control information of predicted motion vectors candidates and reference picture index are sent out Deliver to variable length code portion 104.
Variable length code portion 104 is to the prediction error data after quantification treatment, inter prediction Directional Sign, reference picture Index and picture type information carry out variable length code process, thus generate bit stream.
Fig. 3 represents the summary of the handling process of the method for encoding images about present embodiment.Inter prediction control portion 114 Determine coded object block with the prediction direction (S101) in the case of motion vector detection pattern-coding.Then, inter prediction Control portion 114 judges whether the prediction direction of motion vector detection pattern is bi-directional predicted (S102).
(S102 is) that inter prediction control portion 114 uses method described later in the case of prediction direction is bi-directional predicted Calculate predicted motion vectors candidates list and the predicted motion vectors candidates list of the 2nd prediction direction of the 1st prediction direction respectively (S103、S104)。
Then, add judging part 116 to judge whether the motion vector of the 1st prediction direction is appended to the 2nd prediction direction Predicted motion vectors candidates list (S105).In the case of being judged as additional motion vector (S105 is), inter prediction control The motion vector of the 1st prediction direction is appended to predicted motion vectors candidates list (S106) of the 2nd prediction direction by portion 114 processed.
Then, inter prediction control portion 114 is from the predicted motion vectors candidates list of the 1st prediction direction and the 2nd prediction side To predicted motion vectors candidates list in select predicted motion vector the 2nd prediction direction pre-of the 1st prediction direction respectively Survey motion vector.Further, variable length code portion 104 is by the predicted motion vector corresponding with selected each predicted motion vector Index coding, is attached in bit stream (S107).
In the case of the prediction direction of motion vector detection pattern is single directional prediction (no in S102), inter prediction controls Portion 114 calculates predicted motion vectors candidates list (S109) of the prediction direction corresponding with single directional prediction.Inter prediction control portion 114 select predicted motion vector from the predicted motion vectors candidates list of the prediction direction corresponding with single directional prediction.Further, may be used Elongated degree encoding section 104, by the predicted motion vector index coding corresponding with selected predicted motion vector, is attached to bit In stream (S110).
Finally, variable length code portion 104 would indicate that the inter prediction direction of prediction direction of motion vector detection pattern Mark and reference picture index coding, be attached in bit stream (S108).
Then, the handling process of Fig. 4 is used to explain the decision of prediction direction of motion vector detection pattern of Fig. 3 Method (S101).Inter prediction control portion 114 to by the reference picture of the 1st prediction direction index the reference picture that determines and Indexed, by the reference picture of the 2nd prediction direction, the reference picture determined and carry out motion detection.Further, inter prediction control portion 114 Generate the 1st motion vector the 1 and the 2nd motion vector (S201) relative to two reference pictures.
Here, inter prediction control portion 114 calculates the coded object block in encoded picture and reference figure in motion detection The difference value of the block in sheet.Further, inter prediction control portion 114 is by multiple pieces in reference picture, difference value minimum Block is determined as reference block.Further, inter prediction control portion 114 obtains fortune according to the position of coded object block and the position of reference block Dynamic vector.
Then, inter prediction portion 111 uses the 1st motion vector obtained to generate the prognostic chart picture of the 1st prediction direction.Interframe PREDICTIVE CONTROL portion 114 is such as calculated by the R-D optimizing model showed by Formula 4 below will coding by this prognostic chart picture Cost Cost1 (S202) in the case of object block coding.
Cost=D+ λ × R ... (formula 4)
In formula 4, D presentation code distorts.Specifically, the prognostic chart picture generated by certain motion vector will be utilized to compile Difference absolute value between the original pixel value of code object block encoding and decoding and the pixel value that obtains and coded object block and Deng be used as D.Additionally, R represents generation code amount.Specifically, by order to the motion vector being used for generating prognostic chart picture is compiled Code and the code amount that needs etc. are as R.Additionally, λ is Lagrange does not determines multiplier.
Then, inter prediction portion 111 uses the 2nd motion vector obtained to generate the prognostic chart picture of the 2nd prediction direction.And And, inter prediction control portion 114 calculates Cost2 (S203) according to formula 4.
Then, inter prediction portion 111 uses the 1st motion vector obtained and the 2nd motion vector to generate two-way prognostic chart Picture.Here, inter prediction portion 111 such as to the prognostic chart picture obtained according to the 1st motion vector and obtains according to the 2nd motion vector Prognostic chart picture carry out being added average by each pixel, thus generate bidirectional predictive picture.Further, inter prediction control portion 114 CostBi (S204) is calculated according to formula 4.
Then, Cost1, Cost2 and CostBi are compared (S205) by inter prediction control portion 114.In CostBi minimum In the case of (S205 is), the prediction direction of motion vector detection pattern is determined as bi-directional predicted by inter prediction control portion 114 (S206).In the case of CostBi is not minimum (no in S205), Cost1 with Cost2 is compared by inter prediction control portion 114 (S207)。
(S207 is) that motion vector detection pattern is determined by inter prediction control portion 114 in the case of Cost1 is less It it is the single directional prediction (S208) of the 1st prediction direction.In the case of Cost1 is no small (no in S207), inter prediction control portion 114 Motion vector detection pattern is determined as the single directional prediction (S209) of the 2nd prediction direction.
It addition, in the present embodiment, inter prediction portion 111 has carried out each pixel when two-way prognostic chart picture generates Addition average but it also may be weighted being added average etc..
Then, the handling process of Fig. 5 is used to explain the computational methods of predicted motion vectors candidates list of Fig. 3 (S103、S104、S109).Inter prediction control portion 114 determines that the adjacent adjacent block A in the left of coded object block, top are adjacent Adjacent block B, the adjacent adjacent block C (S301) in upper right side.
Such as, inter prediction control portion 114 will be located in the pixel of left neighbour of pixel of top-left position of coded object block Affiliated block is determined as adjacent block A.Further, inter prediction control portion 114 will be located in the picture of top-left position of coded object block The block belonging to pixel of the upper neighbour of element is determined as adjacent block B.Further, inter prediction control portion 114 will be located in coded object block The block belonging to pixel that the upper right of the pixel of upper-right position is adjacent is determined as adjacent block C.
Then, adjacent block A, B, C are judged whether to meet two condition both sides by inter prediction control portion 114 respectively (S302).One in these two conditions is, adjacent block N (N be A, B, C some) have with corresponding to coded object block The motion vector of the prediction direction that the prediction direction of motion vector is identical.Another is that the reference picture of adjacent block N is right with coding As the reference picture of block is identical.
In the case of meeting two conditions for adjacent block N (in S302 being), inter prediction control portion 114 is by phase The adjacent motion vectors of adjacent block N is appended in predicted motion vectors candidates list (S303).Additionally, inter prediction control portion 114 Calculate the intermediate value (median) of multiple motion vectors of multiple adjacent block, be appended in predicted motion vectors candidates list (S304)。
It addition, in the process above, inter prediction control portion 114 will have and the motion vector corresponding to coded object block The motion vector of adjacent block of the identical prediction direction of prediction direction be appended in predicted motion vectors candidates list.Further, Inter prediction control portion 114 does not add the motion vector of the adjacent block with different prediction direction.But, inter prediction controls The motion vector of the adjacent block with different prediction direction can also be appended to predicted motion vector and wait by portion 114 as 0 Select list.
Then, the handling process of Fig. 6 is used to explain the determining method (S105) of additional mark of Fig. 3.
Sometimes the reference picture of reference picture and the 2nd prediction direction represented by reference picture index of the 1st prediction direction Reference picture represented by index is identical.In the case, the motion vector of the 1st prediction direction and the motion of the 2nd prediction direction Vector has tendency relatively.
Thus, in the case, inter prediction control portion 114 is in bi-directional predicted, as the fortune of some prediction direction The predicted motion vectors candidates of dynamic vector and add the motion vector of another prediction direction.Thus, picture coding device 100 energy Enough the motion vector of some prediction direction is encoded expeditiously.
In the present embodiment, use the motion vector of the 1st prediction direction to the predicted motion vector of the 2nd prediction direction The example that candidate adds.Alternatively, it is also possible to the motion vector of the 2nd prediction direction to be appended to the predicted motion of the 1st prediction direction In vectors candidates.
First, add judging part 116 and obtain the reference picture that the motion about Fig. 4 detects the 1st prediction direction of (S201) Reference picture index (S401, S402) of index and the 2nd prediction direction.Then, add judging part 116 and use the 1st reference picture List and the 2nd reference picture list judge the reference picture represented by reference picture index and the 2nd prediction side of the 1st prediction direction To reference picture index represented by reference picture the most identical (S403).
Such as, the reference picture that additional judging part 116 obtains the 1st prediction direction according to the 1st reference picture list indexes institute The DISPLAY ORDER of the reference picture represented.Additionally, add judging part 116 to obtain the 2nd prediction direction according to the 2nd reference picture list The DISPLAY ORDER of reference picture represented by reference picture index.Further, judging part 116 is added by the two DISPLAY ORDER ratio Relatively, if they are identical, then it is judged as that two reference pictures are identical.
In the case of the reference picture of the 1st prediction direction is identical with the reference picture of the 2nd prediction direction (S403 is), Add judging part 116 to be set as additional mark opening (ON) (S404).Reference picture and the 2nd prediction in the 1st prediction direction In the case of the reference picture in direction differs (no in S403), add judging part 116 and additional mark is set to off (OFF) (S405)。
It addition, in the present embodiment, add judging part 116 and use DISPLAY ORDER to judge two reference pictures whether phase With.But, add judging part 116 coded sequence etc. can also be used to judge two reference pictures are the most identical.
Then, the example of the candidate list generated in the example of Figure 32 is represented in Fig. 7 A, Fig. 7 B.It is assumed here that Following relation.That is, coded object block has motion vector MvL0 and the motion vector of the 2nd prediction direction of the 1st prediction direction MvL1.Additionally, adjacent block has motion vector as shown in figure 32.Additionally, in each adjacent block, the reference of the 1st prediction direction The reference picture of picture and the 2nd prediction direction is identical.Fig. 7 A and Fig. 7 B represents in this relation by the predicted motion vector of Fig. 3 The generation of candidate list processes the example of the predicted motion vectors candidates list that (S103~S106) generates.
In the predicted motion vectors candidates list of the 1st prediction direction of Fig. 7 A, with Median (MvL0_A, MvL0_B, MvL0_C) corresponding predicted motion vector index is 0.The predicted motion vector index corresponding with motion vector MvL0_A is 1.With Predicted motion vector index corresponding for motion vector MvL0_B is 2.The predicted motion vector rope corresponding with motion vector MvL0_C Draw is 3.
Additionally, in the predicted motion vectors candidates list of the 2nd prediction direction of Fig. 7 B, with Median (MvL1_A, 0, MvL0_C) corresponding predicted motion vector index is 0.The predicted motion vector index corresponding with motion vector MvL0_A is 1.With Predicted motion vector index corresponding for motion vector MvL0_C is 2.Corresponding for motion vector MvL0 prediction with the 1st prediction direction Motion vector index is 3.It addition, the method for salary distribution of predicted motion vector index is not limited to this example.
Fig. 8 represents the example of the code table of use when predicted motion vector index carries out variable length code.Prediction fortune It is the shortest that dynamic vector indexes the least then code length.Inter prediction control portion 114 is to speculating that the candidate allocation that precision of prediction is higher is less Predicted motion vector index.Thereby, it is possible to make code efficiency improve.
Then, the handling process of Fig. 9 is used to explain the predicted motion vector selection method (S107, S110) of Fig. 3. Inter prediction control portion 114 sets 0 in count value as initialization, the maximum of setting value in minimum differential motion vector Value etc. (S501).
Then, inter prediction control portion 114 judges whether to calculate the differential motion of whole predicted motion vectors candidates Vector (S502).If there remains predicted motion vectors candidates (in S502 being), then inter prediction control portion 114 is by from fortune Dynamic testing result vector deducts predicted motion vectors candidates, calculates differential motion vector (S503).
Then, inter prediction control portion 114 judges that the differential motion vector obtained is less than minimum differential motion vector (S504).In the case of differential motion vector is less than minimum differential motion vector (S504 is), inter prediction control portion 114 Minimum differential motion vector and predicted motion vector index are updated (S505).
Then, inter prediction control portion 114 adds 1 (S506) to count value.Further, inter prediction control portion 114 is again Judge whether ensuing predicted motion vectors candidates (S502).It is being judged as whole predicted motion vectors candidates meters In the case of having calculated differential motion vector (no in S502), the minimum differential motion that inter prediction control portion 114 will finally determine Vector and predicted motion vector index send to variable length code portion 104 so that it is coding (S507).
As above, according to present embodiment, as shown in Figure 10 A, represented by the motion vector of the 1st prediction direction In the case of reference picture is identical with the reference picture represented by the motion vector of the 2nd prediction direction, by the fortune of the 1st prediction direction Dynamic vector adds to the predicted motion vectors candidates of the 2nd prediction direction.On the other hand, as shown in Figure 10 B, in the 1st prediction direction The situation different from the reference picture represented by the motion vector of the 2nd prediction direction of the reference picture represented by motion vector Under, the motion vector of the 1st prediction direction is not added to the predicted motion vectors candidates of the 2nd prediction direction.
So, according to present embodiment, inter prediction control portion 114 is used for obtaining a prediction in bi-directional predicted middle use The new computational methods of the predicted motion vector in direction.Thus, inter prediction control portion 114 derives and is best suited for coded object figure The predicted motion vector of the coding of the motion vector of sheet.Thus, code efficiency improves.
Particularly, the ginseng of reference picture and the 2nd prediction direction represented by reference picture index of the 1st prediction direction sometimes Reference picture represented by photograph and picture index is identical.In the case, inter prediction control portion 114 in bi-directional predicted, as The predicted motion vectors candidates of the motion vector of some prediction direction and add the motion vector of another prediction direction.By This, the motion vector of some prediction direction can be encoded by picture coding device 100 expeditiously.
It addition, in the present embodiment, illustrate in the predicted motion vectors candidates list of the 2nd prediction direction, add the The example of the motion vector of 1 prediction direction but it also may add the in the predicted motion vectors candidates list of the 1st prediction direction The motion vector of 2 prediction direction.
Additionally, in the case of the reference picture of the 1st prediction direction is different from the reference picture of the 2nd prediction direction, interframe The motion vector of the 1st prediction direction can also be appended to the candidate list of the 2nd prediction direction by PREDICTIVE CONTROL portion 114.At two In the case of reference picture difference, the most also improve code efficiency by the increase of candidate's number.
(embodiment 2)
Figure 11 is the module map of the structure representing the picture decoding apparatus about present embodiment.
Picture decoding apparatus 200 shown in Figure 11 possesses length-changeable decoding portion 204, re-quantization portion 205, inverse orthogonal transformation Portion 206, adder 207, block storage 208, frame memory 209, infra-frame prediction portion 210, inter prediction portion 211, switch portion 212, inter prediction control portion 214, reference picture list management portion 215 and additional judging part 216.
Length-changeable decoding portion 204 carries out length-changeable decoding process to the bit stream of input.Further, length-changeable decoding Portion 204 generates picture type information, inter-frame forecast mode, inter prediction Directional Sign, skips mark and quantization parameter.Re-quantization Portion 205 carries out re-quantization process to quantization parameter.Inverse orthogonal transformation portion 206 will carry out the orthogonal transform coefficient after re-quantization process Convert to image area from frequency domain, generate prediction error image data.
Block storage 208 is used to preserve with block unit prediction error image data be added with predicted image data and give birth to The memorizer of the image sequence become.Frame memory 209 is used to the memorizer preserved by image sequence with frame unit.
The image sequence of the block unit that infra-frame prediction portion 210 is saved in block storage 208 by use is pre-in performing frame Survey, generate the predicted image data of decoder object block.
Inter prediction portion 211 uses the image sequence of the frame unit being saved in frame memory 209 to perform inter prediction, by This generates the predicted image data of decoder object block.Inter prediction control portion 214 is according to inter-frame forecast mode, inter prediction direction And skip mark, control the motion vector in inter prediction and predicted image data generates method.
The decoded reference picture of reference in inter prediction is distributed reference picture by reference picture list management portion 215 Index, makes with reference to list (as Fig. 2 of embodiment 1) together with DISPLAY ORDER etc..B picture is quilt with reference to two pictures Decoding.Therefore, reference picture list management portion 215 keeps two with reference to list.
It addition, about reference picture list management portion 215 reference picture index and the DISPLAY ORDER pipe of present embodiment Reason reference picture.But, reference picture list management portion 215 can also be with reference picture index and coded sequence (decoding order) Deng management reference picture.
Add judging part 216 and use the 1st reference picture list made by reference picture list management portion 215 and the 2nd ginseng Photograph and picture list judges whether that the predicted motion vectors candidates list of the 2nd prediction direction to decoder object block adds the 1st prediction The motion vector in direction.Further, add judging part 216 and set additional mark.It addition, add decision flow process and the embodiment party of mark Fig. 6 of formula 1 is same, so omitting the description.
Finally, adder 207, by decoded prediction error image data being added with predicted image data, generates and solves Code image sequence.
Figure 12 represents the summary of the handling process of the picture decoding method about present embodiment.First, inter prediction control Portion 214 processed judges whether decoded prediction direction is two-way (S601).
Prediction direction after the decoding be two-way in the case of (S601 is), it is pre-that inter prediction control portion 214 calculates the 1st Survey direction and predicted motion vectors candidates list (S602, S603) of the 2nd prediction direction.It addition, at predicted motion vectors candidates The computational methods of list use Fig. 5 etc. of embodiment 1.Length-changeable decoding portion 204 goes out the 1st prediction according to bit stream decoding The reference picture index of direction and the 2nd prediction direction.
Add judging part 216 to select from bit stream decoding from the predicted motion vectors candidates list of the 1st prediction direction The predicted motion vector represented by predicted motion vector index of the 1st prediction direction.Further, inter prediction control portion 214 is to the 1st The predicted motion vector of prediction direction is plus the differential motion vector of the 1st prediction direction after bit stream decoding.Thus, interframe PREDICTIVE CONTROL portion 214 decodes the motion vector (S604) of the 1st prediction direction.
Add judging part 216 and judge whether to be appended to the motion vector of the 1st prediction direction the prediction fortune of the 2nd prediction direction Dynamic vector candidate list (S605).(S605 is) that inter prediction control portion 214 is by the case of additional mark is on The motion vector of 1 prediction direction is appended to predicted motion vectors candidates list (S606) of the 2nd prediction direction.It addition, expression is The additional mark of the no motion vector adding the 1st prediction direction sets in the same manner as waiting with Fig. 6 of embodiment 1.
Inter prediction control portion 214 selects from bit stream solution from the predicted motion vectors candidates list of the 2nd prediction direction The predicted motion vector represented by predicted motion vector index of the 2nd prediction direction of code.Further, inter prediction control portion 214 Predicted motion vector to the 2nd prediction direction adds the differential motion vector of the 2nd prediction direction gone out from bit stream decoding.By This, inter prediction control portion 214 decodes the motion vector (S607) of the 2nd prediction direction.
In the case of prediction direction is not two-way (no in S601), i.e. in the case of inter prediction direction is unidirectional, Inter prediction control portion 214 calculates predicted motion vectors candidates list (S608) of the prediction direction corresponding with single directional prediction.Frame Between PREDICTIVE CONTROL portion 214 select decoded from the predicted motion vectors candidates list of the prediction direction corresponding with single directional prediction Predicted motion vector represented by predicted motion vector index.Further, inter prediction control portion 214 calculates corresponding with single directional prediction The motion vector (S609) of prediction direction.
So, according to present embodiment, inter prediction control portion 214 is used for obtaining a prediction in bi-directional predicted middle use The new computational methods of the predicted motion vector in direction.Thus, the predicted motion deriving the decoding being best suited for motion vector is vowed Amount.Additionally, bit stream higher for code efficiency can suitably be decoded by picture decoding apparatus 200.
Particularly, the reference picture represented by reference picture index of the 1st prediction direction and the 2nd prediction direction sometimes Reference picture represented by reference picture index is identical.In the case, inter prediction control portion 214, in bi-directional predicted, makees The motion vector of another prediction direction is added for the predicted motion vectors candidates of motion vector of some prediction direction.By This, picture decoding apparatus 200 can will obtain by encoding the motion vector of some prediction direction expeditiously Bit stream suitably decode.
It addition, about the inter prediction control portion 214 of the present embodiment predicted motion vectors candidates to the 2nd prediction direction List adds the motion vector of the 1st prediction direction.But, inter prediction control portion 214 is as coding side, it is also possible to the 1st The predicted motion vectors candidates list of prediction direction adds the motion vector of the 2nd prediction direction.
Additionally, inter prediction control portion 214 is as coding side, in two reference pictures corresponding with two prediction direction In the case of difference, it is also possible to the predicted motion vectors candidates list of a prediction direction is added the fortune of another prediction direction Dynamic vector.
Above, based on multiple embodiments, picture coding device for the present invention and picture decoding apparatus are said Bright, but the present invention is not limited to these embodiments.These embodiments are implemented the change that those skilled in the art expects Shape and the form that obtains and the element of these embodiments is at random combined and other form of realizing is also contained in In the present invention.
For example, it is also possible to the place's reason other process portion performed in specifically process portion performs.In addition it is also possible to change is held The order that row processes, it is also possible to multiple process are performed in parallel.
Additionally, the present invention can not only realize as picture coding device and picture decoding apparatus, and can as with The method that processing mechanism is step constituting picture coding device and picture decoding apparatus realizes.Such as, these steps are by calculating Machine performs.Further, the program that the present invention can be used as making computer perform to comprise step in these methods realizes.Enter And, the present invention can have the record medium of the embodied on computer readable of the non-transitory of the CD-ROM etc. of this program in fact as record Existing.
Additionally, picture coding device and picture decoding apparatus combine by will be contained in the element in them, as Image encoding/decoding device realizes.
Additionally, the multiple elements being included in picture coding device and picture decoding apparatus can also be as integrated electricity Road i.e. LSI (Large Scale Integration) realizes.These elements both could be separately formed 1 chip, it is possible to To include that part or all forms 1 chip.Element beyond memorizer such as can also be formed as 1 core Sheet.Here LSI it is set to, but according to the difference of integrated level, sometimes referred to as IC (Integrated Circuit), system LSI, super Level LSI or superfine LSI.
Additionally, the method for integrated circuit is not limited to LSI, it is also possible to realized by special circuit or general processor. The FPGA (Field Programmable Gate Array) that can program can also be used, maybe can reconstruct the electricity within LSI The connection of road unit and the reconfigurable processor of setting.
And then, if because the integrated circuit of replacement LSI occur in the progress of semiconductor technology or other technology of derivation Technology, then can certainly use the element that this technology carries out being included in picture coding device and picture decoding apparatus Integrated circuit.
(embodiment 3)
By realizing the motion image encoding method (method for encoding images) shown in the respective embodiments described above or dynamic by being used for The program of the structure of state picture decoding method (picture decoding method) recorded in storage medium, it is possible to by the respective embodiments described above Shown process is implemented in independent computer system simply.Storage medium is disk, CD, photomagneto disk, IC-card, partly leads Body memories etc., as long as be able to record that the medium of program is the most permissible.
And then, illustrate here the motion image encoding method (method for encoding images) that illustrates in the respective embodiments described above and The application examples of dynamic image decoding method (picture decoding method) and its system of use.The system is characterized in that, have by The picture coding that the picture coding device using method for encoding images and the picture decoding apparatus using picture decoding method are constituted Decoding apparatus.About other structures of system, can according to circumstances and suitably change.
Figure 13 is the integrally-built figure of the content provider system ex100 representing and realizing content distribution service.Communication is taken Business offer zoning be divided into desired size, be respectively arranged with in each community the base station ex106 as stationary wireless stations, ex107、ex108、ex109、ex110。
This content provider system ex100 on the Internet ex101 via ISP ex102 and telephone network Ex104 and base station ex106~ex110 is connected to computer ex111, PDA (Personal Digital Assistant) Each equipment of ex112, photographing unit ex113, portable phone ex114, game machine ex115 etc..
But, content provider system ex100 is not limited to structure as Figure 13, it is also possible to by some factor combination even Connect.In addition it is also possible to each equipment is connected directly between telephone network not via base station ex106~ex110 as stationary wireless stations On ex104.In addition it is also possible to each equipment is directly connected to each other via near radio etc..
Photographing unit ex113 is the equipment that can carry out dynamic image photography of digital camera etc., and photographing unit ex116 is energy Enough carry out the still image photographing of digital camera etc., the equipment of dynamic image photography.Additionally, portable phone ex114 is GSM (Global System for Mobile Communications) mode, CDMA (Code Division Multiple Access) mode, W-CDMA (Wideband-Code Division Multiple Access) mode or LTE (Long Term Evolution) mode, the portable telephone of HSPA (High Speed Packet Access) or PHS (Personal Handyphone System) etc., it is which kind of can.
In content provider system ex100, by photographing unit ex113 etc. is connected via base station ex109, telephone network ex104 It is connected on flow medium server ex103, it is possible to carry out on-the-spot distribution etc..At the scene in distribution, user is used photographing unit ex113 The content (image etc. that such as concert is on-the-spot) of photography carries out coded treatment as described above in the embodiments (that is, the picture coding device as the present invention plays a role), sends to flow medium server ex103.On the other hand, flow medium Server ex103 carries out stream distribution by sending the content-data come to the client having request.As client, having can be by upper State the computer ex111 of the decoding of the data after coded treatment, PDAex112, photographing unit ex113, portable phone ex114, game machine Ex115 etc..In receiving each equipment of data of distribution, the data decoding process received is reproduced (that is, as this Bright picture decoding apparatus plays a role).
It addition, the coded treatment of the data of photography both can be carried out by photographing unit ex113, it is also possible to by carrying out sending out of data The flow medium server ex103 sending process is carried out, it is also possible to mutually shares and carrying out.Equally, the decoding process of the data of distribution was both Can be carried out by client, it is also possible to carried out by flow medium server ex103, it is also possible to mutually share and carrying out.Additionally, do not limit In photographing unit ex113, it is also possible to by the rest image photographed by photographing unit ex116 and/or dynamic image data via computer Ex111 sends to flow medium server ex103.Coded treatment in the case of this is situated between by photographing unit ex116, computer ex111, stream Which of matter server ex103 is carried out can, it is also possible to mutually shares and carrying out.
Additionally, these coding and decodings process general process in the LSIex500 that computer ex111 or each equipment has. LSIex500 both can be single-chip, it is also possible to be the structure being made up of multiple chips.Alternatively, it is also possible to by moving picture encoding The software of decoding is encased in some record medium (CD-ROM, floppy disk, hard disk etc.) that can be read by computer ex111 etc. In, use this software to carry out coding and decoding process.And then, in the case of portable phone ex114 is with photographing unit, it is also possible to The dynamic image data obtained by this photographing unit is sent.Dynamic image data now is had by portable phone ex114 The data of LSIex500 coded treatment.
In addition it is also possible to be, flow medium server ex103 is multiple server or multiple computer, is data to be disperseed Process, record and distribute.
As above, in content provider system ex100, client is able to receive that the data of coding reproduce.So, In content provider system ex100, the information that user sends can be received, decode, reproduce, even not having by client in real time The user having special right or equipment also is able to realize personal broadcaster.
It addition, be not limited to the example of content provider system ex100, as shown in figure 14, in digital broadcasting system Ex200 also is able to load at least moving picture encoding device (picture coding device) of above-mentioned embodiment or dynamic image solution Certain of code device (picture decoding apparatus).Specifically, in the ex201 of broadcasting station, will to image data multiplexing music number The multiplex data obtained according to waiting transmits to communication or broadcasting satellite ex202 via electric wave.This image data be by above-mentioned respectively Data after the motion image encoding method coding of explanation in embodiment (that is, are encoded by the picture coding device of the present invention After data).The broadcasting satellite ex202 receiving these data sends the electric wave of broadcast, it is possible to this electric wave is carried out satellite wide The antenna ex204 of the family broadcasting reception receives this electric wave, by television set (receiver) ex300 or Set Top Box (STB) ex217 etc. Device the multiplex data received is decoded and is rendered (that is, the picture decoding apparatus as the present invention plays a role).
Furthermore, it is possible to the multiplex data recorded in the record medium ex215 of DVD, BD etc. is being read and is decoding or is inciting somebody to action Image data coding writes reader/writer ex218 in record medium ex215 further according to situation and music signal multiplexing In also be able to the dynamic image decoding device shown in the respective embodiments described above or moving picture encoding device are installed.In this situation Under, can the signal of video signal that reproduce be shown on monitor ex219, be had the record medium ex215 of multiplex data by record Signal of video signal can be reproduced in other devices or system.In addition it is also possible to be connected to cable television cable ex203 or In Set Top Box ex217 on the antenna ex204 of satellite/terrestrial ripple broadcast, dynamic image decoding device is installed, is used television set Monitor ex219 show.At this time it is also possible to be not to load dynamic image decoding device in television set at Set Top Box.
Figure 15 is to represent to use the dynamic image decoding method and moving picture encoding illustrated in the respective embodiments described above The figure of television set (receiver) ex300 of method.Television set ex300 possesses via antenna ex204 or the line receiving above-mentioned broadcast Cable ex203 etc. obtain or have exported image data multiplexing the tuner ex301 of the multiplex data of voice data, will receive Multiplex data demodulation or be modulated to outside send coded data modulating/demodulating portion ex302 and will demodulation after multiplexing Data separating be image data, voice data or by signal processing not ex306 coding image data, voice data multiplexing Multiplexing/separated part ex303.
Additionally, television set ex300 possesses: have and voice data, image data are decoded respectively or respective information is compiled The sound signal processing portion ex304 and signal of video signal process portion ex305 of code are (that is, as picture coding device or the figure of the present invention As decoding apparatus plays a role) signal processing part ex306;There is the speaker ex307 exported by decoded acoustical signal And show output unit ex309 of the display part ex308 of the display etc. of decoded signal of video signal.And then, television set ex300 has Interface portion ex317 of operation inputting part ex312 etc. of the input of accepted user of getting everything ready operation.And then, television set ex300 has The power circuit part ex311 merge the control portion ex310 controlling each portion, each portion being supplied electric power.Interface portion ex317 can also be removed Beyond operation inputting part ex312, also there is the bridge part that the external equipment with reader/writer ex218 etc. is connected Ex313, be used for installing the record medium ex216 of SD card etc. socket ex314, for Jie such as the external record of hard disk etc. The modem ex316 etc. that driver ex315 that matter connects is connected with telephone network.It addition, record medium ex216 is can The structure of the record of information is electrically carried out by the semiconductor memory component of the non-volatile/volatibility of harvesting.Television set Each portion of ex300 is connected with each other via synchronous bus.
First, the knot multiplex data obtained from outside by antenna ex204 etc. being decoded, reproducing by television set ex300 Structure illustrates.Television set ex300 accepts the user operation from remote controller ex220 etc., based on the control portion with CPU etc. The control of ex310, separates multiplex data multiplexing/separated part ex303 demodulated by modulating/demodulating portion ex302.And then, electricity Depending on machine ex300, the voice data sound signal processing portion ex304 separated is decoded, by the image data signal of video signal of separation Process portion ex305 uses the coding/decoding method decoding illustrated in the respective embodiments described above.By decoded acoustical signal, image letter Export towards outside from output unit ex309 number respectively.Output time, can temporarily these signals be stored into buffer ex318, In ex319 etc., so that acoustical signal and signal of video signal reproduced in synchronization.Additionally, television set ex300 may not be from broadcast wait, And the multiplex data of coding is read from record medium ex215, ex216 of magnetic optical dish, SD card etc..Then, to television set ex300 Acoustical signal or signal of video signal encode, send to outside or are written to record the structure in medium etc. illustrate.Television set Ex300 accepts the user operation from remote controller ex220 etc., and control based on control portion ex310, by sound signal processing portion Signal of video signal use, by sound signal encoding, is illustrated in the respective embodiments described above by ex304 by signal of video signal process portion ex305 Coded method coding.By the acoustical signal after coding, signal of video signal multiplexing/separated part ex303 multiplexing, export to outside. When multiplexing, can temporarily these signals be stored in buffer ex320, ex321 etc., so that acoustical signal and signal of video signal Reproduced in synchronization.It addition, buffer ex318, ex319, ex320, ex321 both can possess multiple as illustrated, it is also possible to be Share the structure of more than one buffer.And then, beyond diagram, it is also possible at such as modulating/demodulating portion ex302 or multiple With also serving as avoiding the overflow of system, the buffer part of underflow to store data in a buffer between/separated part ex303 etc..
Additionally, television set ex300 is in addition to waiting from broadcast or record acquirement voice data, the image datas such as medium, also Can possess the structure of the AV input accepting mike or photographing unit, the data obtained from them are carried out coded treatment.Separately Outward, here, television set ex300 is said as the structure that can carry out above-mentioned coded treatment, multiplexing and outside output Bright but it also may can not to be carried out these and process, and be only capable of carrying out above-mentioned reception, decoding process, the structure of outside output.
Additionally, multiplex data is being read or in the case of write from record medium by reader/writer ex218, on State decoding process or coded treatment by television set ex300, reader/writer ex218 which carry out can, it is also possible to be Television set ex300 and reader/writer ex218 are mutually shared and being carried out.
As an example, reading in or information regeneration/record portion ex400 in the case of write of data will be carried out from CD Representation is in figure 16.Information regeneration/record portion ex400 possess the unit ex401 of following description, ex402, ex403, ex404、ex405、ex406、ex407.The shaven head ex401 recording surface irradiating laser speckle to recording medium ex215 as CD And write information, detect the reflection light of the recording surface from record medium ex215 and read in information.Modulation record portion ex402 electricity Gas ground drives the semiconductor laser being built in shaven head ex401, carries out the modulation of laser according to record data.Reproduce demodulation section Ex403 obtains electrically being detected by the photodetector being built in shaven head ex401 from the reflection light of recording surface again Existing signal amplifies, and record signal component in record medium ex215 is separated and demodulated, the information required for reproduction.Buffering Device ex404 will be used for the information in record medium ex215 that recorded and temporarily keeps from the information recording medium ex215 reproduction. Table motor ex405 makes record medium ex215 rotate.Servo control portion ex406 is while the rotation controlling table motor ex405 drives one While make shaven head ex401 move to the information track of regulation, carry out the tracking process of laser spot.Systems control division ex407 carries out letter The control that breath reproduction/record portion ex400 is overall.The place reason systems control division ex407 of above-mentioned read and write utilizes and keeps Various information in buffer ex404, carry out the generation of new information further according to needs, add and while make tune Record portion ex402 processed, reproduce demodulation section ex403, servo control portion ex406 coordination, on one side carry out letter by bare headed ex401 The record of breath reproduces and realizes.Systems control division ex407 is such as made up of microprocessor, is come by the program performing to read write Perform their process.
Above, it is assumed that shaven head ex401 irradiating laser speckle and be illustrated but it also may be to use to connect near field light and carry out height The structure of the record of density.
It is denoted as the schematic diagram of the record medium ex215 of CD in fig. 17.Recording surface at record medium ex215 On, spirally it is formed with guide channel (ditch), in information track ex230, has recorded beforehand through the change of shape of ditch The address information of the absolute position on indicating panel.This address information includes the record block for determining the unit as record data The information of the position of ex231, reads address believe by being reproduced by information track ex230 in carrying out the device recording and reproducing Breath, it is possible to determine record block.Additionally, record medium ex215 includes data recording area ex233, inner region ex232, periphery Region ex234.The region used to record user data is data recording area ex233, is arranged in and compares data recorded area Specific beyond the record of user data of territory ex233 inner circumferential side or inner region ex232 of periphery and outer region ex234 Purposes.The sound that the data recording area ex233 of such record medium ex215 is encoded by information regeneration/record portion ex400 The read-write of the coded data of these data of sound data, image data or multiplexing.
Above, it is illustrated as a example by the CD of DVD, BD etc. of lifting 1 layer, but is not limited to these, it is also possible to be many Layer constructs, also is able to beyond surface the CD of record.In addition it is also possible to be the identical local use different wave length at dish The light recording information of color or from different layer of information of various angle recordings etc., the structure that carries out the record/reproduction of multidimensional CD.
Additionally, in digital broadcasting with in system ex200, it is also possible to by there is the car ex210 of antenna ex205 from satellite Dynamic image is reproduced in the display device of the automatic navigator ex211 etc. that ex202 etc. receive data, have at car ex210.Separately Outward, the structure of automatic navigator ex211 can consider such as to add in the structure shown in Figure 15 the structure in GPS receiver portion, It is also contemplated that same structure in computer ex111 and portable phone ex114 etc..
Figure 18 A is to represent to use the dynamic image decoding method and moving picture encoding side illustrated in the above-described embodiment The figure of the portable phone ex114 of method.Portable phone ex114 has by the antenna for receiving and dispatching electric wave between the ex110 of base station Ex350, can filmed image, the camera section ex365 of rest image, display by the image photographed by camera section ex365, by The display part ex358 of the liquid crystal display etc. of the decoded data such as the image that antenna ex350 receives.Portable phone ex114 Also have comprise operated key portion ex366 main part, for carry out the speaker etc. of voice output audio output unit ex357, Be used for carrying out sound input unit ex356 of the mike etc. of sound input, preserve photograph image, rest image, recording Data after the coding of sound or the image received, rest image, mail etc. or the memorizer portion of decoded data Ex367 or as with as preserve the socket ex364 of interface portion between the record medium of data.
And then, use Figure 18 B that the structure example of portable phone ex114 is illustrated.Portable phone ex114 is for merging control Fixture is for display part ex358 and master control part ex360 in each portion of the main part of operated key portion ex366, by power circuit part Ex361, operation input control portion ex362, signal of video signal process portion ex355, camera interface portion ex363, LCD (Liquid Crystal Display: liquid crystal display) control portion ex359, modulating/demodulating portion ex352, multiplexing/separated part ex353, sound Signal processing part ex354, socket ex364, memorizer portion ex367 are connected with each other via bus ex370.
If power circuit part ex361 makes end of conversation and power key become opening by the operation of user, then lead to Crossing, from set of cells, each portion is supplied electric power, portable phone ex114 starts as can the state of action.
Portable phone ex114 control based on master control part ex360 with CPU, ROM and RAM etc., at voice call mould During formula, digital sound message will be transformed to by the acoustical signal of sound input unit ex356 collection sound by sound signal processing portion ex354 Number, it is carried out wave spectrum DIFFUSION TREATMENT with modulating/demodulating portion ex352, transmission/reception unit ex351 implements digitaltoanalogconversion Process and after frequency conversion process through being sent by antenna ex350.Additionally, portable phone ex114 is when voice calling mode, will be by Frequency conversion process is amplified and implemented to the reception data that antenna ex350 receives and analog-to-digital conversion processes, and uses modulating/demodulating Portion ex352 carries out wave spectrum counter diffusion process, after being transformed to analoging sound signal by sound signal processing portion ex354, by its warp Exported by audio output unit ex357.
And then, in the case of sending Email when data communication mode, by by the operated key portion ex366 of main part Deng operation input Email text data via operation input control portion ex362 send to master control part ex360.Main Text data modulating/demodulating portion ex352 is carried out wave spectrum DIFFUSION TREATMENT by control portion ex360, real by transmission/reception unit ex351 Execute digitaltoanalogconversion to process and after frequency conversion process, send to base station ex110 via antenna ex350.Receiving electronics postal In the case of part, the data received are performed the substantially inversely processing of above-mentioned process, and exports display part ex358.
When data communication mode, in the case of sending image, rest image or image and sound, signal of video signal The signal of video signal supplied from camera section ex365 is compiled by process portion ex355 by the dynamic image shown in the respective embodiments described above Code method is compressed encoding (that is, the picture coding device as the present invention plays a role), is sent by the image data after coding Go out to multiplexing/separated part ex353.It addition, sound signal processing portion ex354 is to by camera section ex365 filmed image, static Encode with the acoustical signal of sound input unit ex356 collection sound during image etc., the voice data after coding is sent To multiplexing/separated part ex353.
Multiplexing/separated part ex353 is by the way of regulation, to after the coding that signal of video signal process portion ex355 supplies Image data and the voice data after the coding of sound signal processing portion ex354 supply carry out multiplexing, its result are obtained Multiplex data modulating/demodulating portion (modulation/demodulation circuit portion) ex352 carries out wave spectrum DIFFUSION TREATMENT, by transmission/reception unit Ex351 implements digitaltoanalogconversion and processes and after frequency conversion process, sends via antenna ex350.
Receive the data of the dynamic image file being linked to homepage etc. when data communication mode in the case of, or connect Receive in the case of addition of the Email of image or sound, in order to the multiplex data received via antenna ex350 Being decoded, multiplexing/separated part ex353, by being separated by multiplex data, is divided into the bit stream of image data and voice data Bit stream, supplies the image data after coding to signal of video signal process portion ex355 via synchronous bus ex370, and will coding After voice data to sound signal processing portion ex354 supply.Signal of video signal process portion ex355 by with the respective embodiments described above The dynamic image decoding method that shown motion image encoding method is corresponding is decoded, and is thus decoded signal of video signal (that is, the picture decoding apparatus as the present invention plays a role), shows such as from display part ex358 via LCD control portion ex359 It is linked to image, the rest image comprised in the dynamic image file of homepage.It addition, sound signal processing portion ex354 is to sound Signal is decoded, and exports sound from audio output unit ex357.
Additionally, the terminal of above-mentioned portable phone ex114 etc. is as television set ex300, except having encoder, decoder Beyond both transmitting-receiving type terminals, it is also contemplated that only 3 kinds of the reception terminal of the transmission terminal of encoder, only decoder Installation form.It addition, in digital broadcasting with in system ex200, it is set to send, receives in image data multiplexing music data It is illustrated etc. the multiplex data obtained, but the number of multiplexing and the character data etc. of image association in addition to voice data According to can also, be not multiplex data but image data itself can also.
So, the motion image encoding method represented in the respective embodiments described above or dynamic image decoding method are used in In which kind of equipment above-mentioned, system can, by such manner, it is possible to the effect that illustrated in the respective embodiments described above.
Additionally, the present invention is not limited to such above-mentioned embodiment, it is possible to carry out without departing from the scope of the present invention Various deformation or correction.
(embodiment 4)
Can also be by by the motion image encoding method illustrated in the respective embodiments described above or device and foundation Motion image encoding method or the device of the standard that MPEG-2, MPEG4-AVC, VC-1 etc. are different are the most suitably cut Change, generate image data.
Here, in the case of generating respectively according to multiple image datas of different standards, when decoding, need to select Coding/decoding method corresponding to each standard.But, due to can not identify image data to be decoded according to which standard, so produce The problem that life can not select suitable coding/decoding method.
In order to solve this problem, in image data, the multiplex data of multiplexing voice data etc. uses and comprises expression image Data are according to the structure of the identification information of which standard.Hereinafter, illustrate to include by moving of illustrating in the respective embodiments described above The image data that state method for encoding images or device generate is in the concrete structure of interior multiplex data.Multiplex data is MPEG- The digital stream that 2 transmission are streamed.
Figure 19 is the figure of the structure representing multiplex data.As shown in figure 19, multiplex data by by video flowing, audio stream, 1 multiplexing carried out above in presentation graphic stream (PG), interactive graphic stream and obtain.Video flowing represents main image and the pair of film Image, audio stream (IG) represents the master voice part of film and the secondary sound mixed with this master voice, and presentation graphic stream represents electricity The captions of shadow.Here, so-called main image, represent the display common image on picture, so-called subpictures, is in main image The image shown with less picture.Additionally, interactive graphic stream represents the dialogue made by configuring GUI component on picture Picture.Video flowing is by the motion image encoding method that illustrates in the respective embodiments described above or device, according to conventional The motion image encoding method of the standards such as MPEG-2, MPEG4-AVC, VC-1 or device code.Audio stream by Dolby AC-3, The mode of Dolby Digital Plus, MLP, DTS, DTS-HD or linear PCM etc. encodes.
The each stream being included in multiplex data is identified by PID.Such as, to the video flowing used in the image of film Distribution 0x1011, distributes 0x1100 to 0x111F to audio stream, distributes 0x1200 to 0x121F to Presentation Graphics, to interactive graphics (IG) Stream distribution 0x1400 to 0x141F, in the subpictures of film use video flowing distribute 0x1B00 to 0x1B1F, to master Audio stream distribution 0x1A00 to the 0x1A1F used in the secondary sound of sound mixture.
Figure 20 is to show schematically the figure how multiplex data is re-used.First, the video being made up of multiple frame of video Stream ex235, the audio stream ex238 being made up of multiple audio frames are transformed to PES packet sequence ex236 and ex239 respectively, and are transformed to TS bag ex237 and ex240.Equally, the data of presentation graphic stream ex241 and interactive graphics (IG) ex244 are transformed to PES bag sequence respectively Row ex242 and ex245, then it is transformed to TS bag ex243 and ex246.Multiplex data ex247 is by using 1 by these TS cladding Stream is constituted.
Figure 21 represents how to preserve video flowing in PES packet sequence in more detail.1st segment table of Figure 21 shows regarding of video flowing Frequently frame sequence.2nd segment table shows PES packet sequence.Multiple works as shown in arrow yy1, yy2, yy3, yy4 of Figure 21, in video flowing For Video Presentation Unit I picture, B picture, P picture is divided by each picture and is saved in having of PES bag In effect load.Each PES bag has PES head, in PES head, preserves the PTS in the display moment as picture (Presentation Time-Stamp) and the DTS (Decoding Time-Stamp) in the decoding moment as picture.
Figure 22 represents the form of the TS bag being finally written in multiplex data.TS bag is by letters such as the PID with identification stream The bag of the 188 byte regular lengths that the TS payload of the TS head of 4 bytes of breath and 184 bytes of preservation data is constituted, above-mentioned PES is coated segmentation and is saved in TS payload.In the case of bd-rom, TS bag is given to the TP_ of 4 bytes Extra_Header, constitutes the source bag of 192 bytes, is written in multiplex data.ATS is being had described in TP_Extra_Header (Arrival_Time_Stamp) information such as.ATS represents that this TS bag transfers start time to the pid filter of decoder.? In multiplex data, source bag arranges as shown in Figure 22 hypomere, and number incremental from the beginning of multiplex data is referred to as SPN, and (source is wrapped Number).
Additionally, in the TS bag that multiplex data is comprised, in addition to each stream of image, sound, captions etc., also PAT (Program Association Table)、PMT(Program Map Table)、PCR(Program Clock Reference) etc..Pat table shows what the PID of the PMT used in multiplex data is, the PID of PAT self is registered as 0. PMT has PID and the attribute of the stream corresponding with each PID of each stream of image that multiplex data comprised, sound, captions etc. Information, also has the various descriptors about multiplex data.In descriptor, there is copying of instruction permission/license multiplex data The copy control information etc. of shellfish.PCR is in order to obtain ATC (Arrival Time Clock) and the conduct of the time shaft as ATS The synchronization of the STC (System Time Clock) of the time shaft of PTS and DTS, has to be coated with this PCR and transfers to decoder The information of STC time corresponding for ATS.
Figure 23 is the figure of the data configuration explaining PMT.In the beginning of PMT, it is configured with to describe and is included in this PMT In the PMT head of length etc. of data.Face behind, is configured with multiple descriptor about multiplex data.Above-mentioned copy controls Information etc. are recited as descriptor.After descriptor, it is configured with multiple stream letter about each stream being included in multiplex data Breath.Stream information is by the stream type of the voice compression codecs recorded for identifying stream, the PID of stream, attribute information (the frame speed of stream Rate, aspect ratio etc.) flow descriptors constitute.There is the quantity of stream present in multiplex data in flow descriptors.
In the case of recorded in record medium etc., above-mentioned multiplex data is remembered together with multiplex data message file Record.
Multiplex data message file as shown in figure 24, is the management information of multiplex data, the most right with multiplex data Should, it is made up of multiplex data information, stream attribute information and entry maps.
Multiplex data information as shown in figure 24, is made up of system velocity, reproduction start time, reproduction finish time.System The maximum to the pid filter of System Target Decoder described later of rate representation multiplex data transfers speed.It is included in multiplexing The interval of the ATS in data is set as becoming below system velocity.Reproduce the frame of video that start time is the beginning of multiplex data PTS, reproducing finish time is set as that the PTS of frame of video of the end to multiplex data is plus the value of reproduction interval of 1 frame amount.
Stream attribute information as shown in figure 25, is registered with the attribute about each stream being included in multiplex data by each PID Information.Attribute information has by video flowing, audio stream, presentation graphic stream, interactive graphic stream and different information.Video flowing attribute It is many that information has this video flowing by the compression of what kind of voice compression codecs, the resolution of each image data that constitutes video flowing Less, aspect ratio be how many, frame rate be how many etc. information.Audio stream attribute information has this audio stream by what kind of compression is compiled The letter that decoder compression, the channel number that is included in this audio stream be how many, is how many etc. corresponding to which kind of language, sample frequency Breath.In the initialization of these information decoder before reproducing at player etc..
In the present embodiment, stream type in above-mentioned multiplex data, that be included in PMT is used.Additionally, be situated between at record In the case of record has multiplex data in matter, use the video flowing attribute information being included in multiplex data information.Specifically, In the motion image encoding method shown in the respective embodiments described above or device, following steps or unit are set, this step or list Unit is to the stream type being included in PMT or video flowing attribute information, and setting expression is by illustrating in the respective embodiments described above Motion image encoding method or the intrinsic information of image data that generates of device.By this structure, it is possible to be identified by upper State the motion image encoding method shown in each embodiment or the image data of device generation and the image according to other standards Data.
Additionally, represent the step of the dynamic image decoding method of present embodiment in fig. 26.In step exS100, from Multiplex data obtains the stream type being included in PMT or the video flowing attribute information being included in multiplex data information.Connect , in step exS101, it is judged that stream type or video flowing attribute information indicate whether it is by the respective embodiments described above The multiplex data that the motion image encoding method illustrated or device generate.Further, it is being judged as stream type or video flowing attribute letter In the case of breath is the multiplex data of the motion image encoding method by illustrating in the respective embodiments described above or device generation, In step exS102, it is decoded by the dynamic image decoding method illustrated in the respective embodiments described above.Additionally, in stream class Type or video flowing attribute information represent the multiplex data of the standard being based on conventional MPEG-2, MPEG4-AVC, VC-1 etc. In the case of, in step exS103, by being decoded according to the dynamic image decoding method of conventional standard.
So, by setting new eigenvalue in stream type or video flowing attribute information, can interpolate that when decoding be The no dynamic image decoding method that can pass through to illustrate in the respective embodiments described above or device decode.Thus, it is being transfused to In the case of the multiplex data of different standards, it is also possible to select suitable coding/decoding method or device, it is possible to do not send out Life is decoded mistakenly.Additionally, by the motion image encoding method illustrated in the present embodiment or device or Dynamic Graph As coding/decoding method or device are used in any of the above described equipment, system.
(embodiment 5)
The motion image encoding method illustrated in the respective embodiments described above and device, dynamic image decoding method and device Typically can be realized by the LSI as integrated circuit.As an example, represent the knot of the LSIex500 of 1 chip in figure 27 Structure.LSIex500 possess the unit ex501 of following description, ex502, ex503, ex504, ex505, ex506, ex507, ex508, Ex509, each unit connects via bus ex510.Power circuit part ex505 by the case of being on state at power supply to respectively Portion's supply electric power, starts as can the state of action.
Such as in the case of carrying out coded treatment, LSIex500 is based on having CPUex502, Memory Controller The control of the control portion ex501 of ex503, stream controller ex504, driving frequency control portion ex512 etc., by AV I/Oex509 AV signal is inputted from mike ex117 and photographing unit ex113 etc..The AV signal being transfused to is temporarily stored within the outside of SDRAM etc. Memorizer ex511 in.Control based on control portion ex501, by the data that store according to treating capacity and processing speed suitably How inferior it is divided into, sends to signal processing part ex507, signal processing part ex507 carries out coding and/or the shadow of acoustical signal The coding of image signal.Here, the coded treatment of signal of video signal is the coded treatment illustrated in the respective embodiments described above.At signal In process portion ex507, carry out the process of image data multiplexing etc. of the voice data by coding and coding always according to situation, from Stream I/Oex506 exports to outside.The bit of this output flows to base station ex107 send or be written to record in medium ex215. It addition, when multiplexing, can temporarily data be stored in buffer ex508 so that it synchronizes.
It addition, in above-mentioned, if the structure of the outside that memorizer ex511 is LSIex500 is illustrated but it also may The structure being included in the inside of LSIex500.Buffer ex508 is also not limited to one, it is also possible to possess multiple buffering Device.Additionally, LSIex500 both can form 1 chip, it is also possible to form multiple chip.
Additionally, in above-mentioned, it is assumed that control portion ex501 has CPUex502, Memory Controller ex503, stream controller Ex504, driving frequency control portion ex512 etc., but the structure of control portion ex501 is not limited to this structure.For example, it is also possible to be Signal processing part ex507 is also equipped with the structure of CPU.By being also provided with CPU in the inside of signal processing part ex507, it is possible to enter One step improves processing speed.Additionally, as other examples, it is also possible to it is that CPUex502 possesses signal processing part ex507 or as letter The structure in the such as sound signal processing portion of a part of number process portion ex507.In this case, control portion ex501 is Possesses the structure with signal processing part ex507 or part thereof of CPUex502.
It addition, be set to LSI here, but according to the difference of integrated level, also there are referred to as IC, system LSI, super (super) LSI, the situation of superfine (ultra) LSI.
Additionally, the method for integrated circuit is not limited to LSI, it is also possible to realized by special circuit or general processor. The FPGA (Field Programmable Gate Array) that can program after LSI manufactures can also be utilized, maybe can reconstruct The connection of the circuit unit within LSI and the reconfigurable processor of setting.
And then, if there is replacing the integrated circuit of LSI because of the progress of semiconductor technology or the other technologies of derivation Technology, then this technology can certainly be used to carry out the integrated of functional module.It is likely to be the application etc. of biotechnology.
(embodiment 6)
At the image data that the motion image encoding method by illustrating in the respective embodiments described above or device are generated In the case of decoding, it is contemplated that with the feelings by the image data according to standards such as conventional MPEG-2, MPEG4-AVC, VC-1 Condition compares treating capacity can be increased.Therefore, in LSIex500, need to be set as that ratio is by the image data solution according to conventional standard The higher driving frequency of driving frequency of CPUex502 during code.But, if setting high by driving frequency, then occur to consume electricity The problem that power uprises.
In order to solve this problem, the dynamic image decoding device of television set ex300, LSIex500 etc. uses and identifies image number According to according to which standard and according to standard switching driving frequency structure.Figure 28 represents structure ex800 of present embodiment.Drive Dynamic frequency switching part ex803 is the motion image encoding method by illustrating in the respective embodiments described above or dress at image data In the case of putting generation, driving frequency is set to height.Further, dynamic image execution illustrated in the respective embodiments described above Image data is decoded by the decoding process portion ex801 instruction of coding/decoding method.On the other hand, it is based on conventional mark at image data In the case of accurate image data, it is the motion image encoding method by illustrating in the respective embodiments described above with image data Or the situation of data that device generates compares, driving frequency is set to low.Further, to the decoding process according to conventional standard Image data is decoded by portion ex802 instruction.
More particularly, driving frequency switching part ex803 is by the CPUex502 of Figure 27 and driving frequency control portion ex512 structure Become.Additionally, perform decoding process portion ex801, the Yi Jiyi of the dynamic image decoding method illustrated in the respective embodiments described above The decoding process portion ex802 of the most past standard is corresponding to the signal processing part ex507 of Figure 27.CPUex502 identifies image data According to which standard.Further, based on the signal from CPUex502, driving frequency control portion ex512 sets driving frequency.This Outward, based on the signal from CPUex502, signal processing part ex507 carries out the decoding of image data.Here it is possible to consider at shadow As the identification of data using such as in the identification information described in embodiment 4.About the information of identification, it is not limited in reality Execute the information described in mode 4, as long as be capable of identify that image data is the most permissible according to the information of which standard.Such as, at base In identifying that image data is used in television set and is still used in the external signal of dish etc., it is capable of identify that image data is according to which In the case of standard, it is also possible to be identified based on such external signal.Additionally, the selection example of the driving frequency of CPUex502 As the standard of image data and driving frequency are set up corresponding look-up table carry out as shown in figure 30 can be considered.By look-up table Being pre-reserved in the internal storage of buffer ex508 or LSI, CPUex502 is by referring to this look-up table, it is possible to select to drive Dynamic frequency.
Figure 29 represents the step of the method implementing present embodiment.First, in step exS200, at signal processing part In ex507, from multiplex data, obtain identification information.Then, in step exS201, in CPUex502, based on identifying letter Breath identifies whether image data is the coded method by illustrating in the respective embodiments described above or the data of device generation.At shadow In the case of data are the data generated by the coded method shown in the respective embodiments described above or device, in step In exS202, CPUex502 sends to driving frequency control portion ex512 and driving frequency is set to high signal.Further, driving Dynamic frequency control portion ex512 is set as high driving frequency.On the other hand, represent be based on conventional MPEG-2, In the case of the image data of the standard of MPEG4-AVC, VC-1 etc., in step exS203, CPUex502 is to driving frequency Control portion ex512 sends and driving frequency is set to low signal.Further, in driving frequency control portion ex512, be set as with Image data is that the situation of the data of the coded method by illustrating in the respective embodiments described above or device generation is compared lower Driving frequency.
And then, change LSIex500 by interlocking with the switching of driving frequency or include that the device of LSIex500 applies Voltage, it is possible to improve further power savings.Such as, driving frequency is set to low in the case of, therewith, permissible Compared with consideration is set to high situation with by driving frequency, by the voltage to LSIex500 or the device applying including LSIex500 It is set to low.
As long as additionally, driving frequency is set in the case for the treatment of capacity that the establishing method of driving frequency is when decoding is big Treating capacity high, when decoding little in the case of driving frequency is set to low the most permissible, be not limited to above-mentioned setting Method.For example, it is possible to consider will be according to the treating capacity of the image data decoding of MPEG4-AVC standard more than will be by upper In the case of stating the treating capacity of image data decoding of the motion image encoding method shown in each embodiment or device generation, With the setting that above-mentioned situation is driven frequency on the contrary.
And then, the establishing method of driving frequency is not limited to the structure making driving frequency low.For example, it is also possible to consider Identification information is the motion image encoding method by illustrating in the respective embodiments described above or the image data of device generation In the case of, the voltage of LSIex500 or the device applying including LSIex500 will be set to height, be based on conventional in expression In the case of the image data of the standard of MPEG-2, MPEG4-AVC, VC-1 etc., will be to LSIex500 or include LSIex500 Device apply voltage be set to low.Additionally, as another example, it is also possible to consider to represent it is by above-mentioned in identification information In the case of the image data that motion image encoding method shown in each embodiment or device generate, do not make CPUex502's Drive and stop, in the situation of the image data representing the standard being based on conventional MPEG-2, MPEG4-AVC, VC-1 etc. Under, more than needed owing to having in processes, so that CPUex502 driving suspend.It is also contemplated that represent it is to pass through in identification information In the case of the image data that the motion image encoding method illustrated in the respective embodiments described above or device generate, as long as also existing Having in process has more than needed then makes the driving of CPUex502 suspend.In this case, it is possible to consider to be based on conventional with expression The situation of the image data of the standard of MPEG-2, MPEG4-AVC, VC-1 etc. is compared, and dwell time is set to short.
So, switch driving frequency according to the standard of image data institute foundation, it is possible to realize energy-saving technology.Additionally, In the case of the device that use battery drives LSIex500 or includes LSIex500, it is possible to extend battery along with economize on electricity Life-span.
(embodiment 7)
In the above-mentioned equipment such as television set, portable phone, system, sometimes it is transfused to the multiple shadows according to different standards As data.So, so that also be able to decoding in the case of being transfused to according to multiple image datas of different standards, The signal processing part ex507 of LSIex500 needs corresponding to multiple standards.But, if be used alone corresponding to each standard , then there is the problem that circuit scale becomes greatly, cost increases in addition of LSIex500 in signal processing part ex507.
In order to solve this problem, use and will be used for the dynamic image decoding method performing to illustrate in the respective embodiments described above Decoding process portion and according to conventional MPEG-2, MPEG4-AVC, VC-1 etc. standard a decoding process portion part altogether Structure.The ex900 of Figure 31 A represents this structure example.Such as, the moving image decoding side illustrated in the respective embodiments described above The dynamic image decoding method of method and foundation MPEG4-AVC standard is in entropy code, re-quantization, deblocking filter, motion compensation etc. Process in some process content common.Following structure can be considered: about common process content, share corresponding to The decoding process portion ex902 of MPEG4-AVC standard, about a mode of the present invention not corresponding to MPEG4-AVC standard Other process content specific, uses special decoding process portion ex901.Particularly, the present invention is in terms of motion compensation There is feature, it can be considered to such as use special decoding process portion ex901 for motion compensation, in addition Some in entropy code, de-blocking filter, re-quantization or whole process, share decoding process portion.About decoding process portion Share, it is also possible to be following structure: about common process content, share be used for perform to illustrate in the respective embodiments described above The decoding process portion of dynamic image decoding method, about processing content specific to MPEG4-AVC standard, use special Decoding process portion.
Additionally, represent will process another example that a part shares with the ex1000 of Figure 31 B.In this embodiment, use use with One mode of the present invention is specific processes special decoding process portion ex1001 corresponding to content and the conventional mark with other Special decoding process portion ex1002 corresponding to content and the Dynamic Graph with a mode in the present invention is processed specific to accurate As the decoding shared that the process content that coding/decoding method is common with in the dynamic image decoding method of other conventional standard is corresponding The structure of process portion ex1003.Here, special decoding process portion ex1001, ex1002 is not necessarily of the present invention Process content specific to mode or other conventional standard and become privileged, can be the general place being able to carry out other The structure of reason.It is further possible to installed the structure of present embodiment by LSIex500.
So, for dynamic image decoding method and the dynamic image solution of conventional standard of a mode in the present invention Process content common in code method, shares decoding process portion, it is possible to reduce the circuit scale of LSI and reduce cost.
Industrial applicibility
Method for encoding images for the present invention and picture decoding method such as can be at television set, digital video records Machine, automobile navigation instrument, portable phone, digital camera or digital video camcorder etc. use.
Symbol description
100 picture coding devices
101 subtraction portion
102 orthogonal transformation portions
103 quantization units
104 variable length code portions
105,205 re-quantization portion
106,206 inverse orthogonal transformation portion
107,207 adder
108,208 block storage
109,209 frame memory
110,210 infra-frame prediction portion
111,211 inter prediction portion
112,212 switch portion
113 picture/mb-type determination sections
114,214 inter prediction control portion
115,215 reference picture list management portion
116,216 judging part is added
200 picture decoding apparatus
204 length-changeable decoding portions

Claims (6)

1. a method for encoding images, uses the 1st motion vector and the 2nd motion vector to perform prediction, and is compiled by coded object block Code, the position in the 1st reference picture that the 1st motion vector instruction the 1st reference picture list is comprised, the 2nd motion vector Indicating the position in the 2nd reference picture that the 2nd reference picture list comprised, above-mentioned method for encoding images includes:
Adding step, the candidate list to the predicted motion vector of the coding for above-mentioned 2nd motion vector adds above-mentioned 1st fortune Dynamic vector is as the candidate of above-mentioned predicted motion vector;
Select step, from the above-mentioned candidate list comprising above-mentioned 1st motion vector, select for above-mentioned 2nd motion vector The above-mentioned predicted motion vector of coding;And
Coding step, uses selected above-mentioned predicted motion vector by above-mentioned 2nd motion vector encoder.
2. method for encoding images as claimed in claim 1,
In above-mentioned additional step, in the case of above-mentioned 1st reference picture is identical with above-mentioned 2nd reference picture, to above-mentioned time List is selected to add above-mentioned 1st motion vector.
3. method for encoding images as claimed in claim 1 or 2,
In above-mentioned selection step, select in multiple candidates that above-mentioned candidate list comprised, vow relative to above-mentioned 2nd motion The candidate of the error minimum of amount is as above-mentioned predicted motion vector.
4. method for encoding images as claimed in claim 1 or 2,
In above-mentioned additional step, add above-mentioned 1st motion vector detected and detect by motion to above-mentioned candidate list.
5. method for encoding images as claimed in claim 1 or 2,
In above-mentioned additional step, with the side that multiple index values are the most corresponding with multiple candidates of above-mentioned predicted motion vector Formula adds index value and the candidate of above-mentioned predicted motion vector to above-mentioned candidate list;
In above-mentioned selection step, select index value as above-mentioned predicted motion vector from above-mentioned candidate list;
In above-mentioned coding step, also by index value the biggest in the way of then code length is the longest by selected above-mentioned index value coding.
6. method for encoding images as claimed in claim 1 or 2,
In above-mentioned additional step, add to above-mentioned candidate list the block of the left neighbour of above-mentioned coded object block motion vector, on State the motion vector of the block of the motion vector of the block of the upper neighbour of coded object block and the upper right neighbour of above-mentioned coded object block, make Candidate for above-mentioned predicted motion vector.
CN201180035076.2A 2010-12-27 2011-12-27 Method for encoding images Active CN103004204B (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201061427240P 2010-12-27 2010-12-27
US61/427,240 2010-12-27
PCT/JP2011/007319 WO2012090495A1 (en) 2010-12-27 2011-12-27 Image encoding method and image decoding method

Publications (2)

Publication Number Publication Date
CN103004204A CN103004204A (en) 2013-03-27
CN103004204B true CN103004204B (en) 2016-12-14

Family

ID=

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1507751A (en) * 2002-01-18 2004-06-23 株式会社东芝 Video coding method and apparatus and decoding method and apparatus
CN1943244A (en) * 2004-04-14 2007-04-04 三星电子株式会社 Inter-frame prediction method in video coding, video encoder, video decoding method, and video decoder

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1507751A (en) * 2002-01-18 2004-06-23 株式会社东芝 Video coding method and apparatus and decoding method and apparatus
CN1943244A (en) * 2004-04-14 2007-04-04 三星电子株式会社 Inter-frame prediction method in video coding, video encoder, video decoding method, and video decoder

Similar Documents

Publication Publication Date Title
CN106878749B (en) Picture decoding method, picture decoding apparatus
CN103444181B (en) Dynamic image encoding method, dynamic image encoding device, dynamic image decoding method, moving image decoding apparatus and moving image encoding decoding apparatus
CN104041042B (en) Image encoding method, picture decoding method, picture coding device and picture decoding apparatus
CN103548351B (en) Dynamic image decoding method and moving image decoding apparatus
CN103688545B (en) Image encoding method, image decoding method, image encoding device, and image decoding device
CN103563384B (en) Dynamic image encoding method, dynamic image encoding device, dynamic image decoding method, moving image decoding apparatus and moving image coding and decoding device
CN103858428B (en) Method for encoding images, picture coding device, picture decoding method and picture decoding apparatus
CN103718558B (en) Video encoding method and apparatus, video decoding method and apparatus, and video encoding/decoding apparatus
CN106851269B (en) Picture decoding method and device, image encoding method and device, coding and decoding device
CN103348679B (en) Image decoding method and image decoding device
CN103563385B (en) Image encoding method, image encoding device, image decoding method, image decoding device, and image encoding/decoding device
CN103609110B (en) Picture decoding method, method for encoding images, picture decoding apparatus, picture coding device and image encoding/decoding device
CN103339941B (en) Motion image encoding method, moving picture encoding device, dynamic image decoding method, dynamic image decoding device and moving picture encoding decoding apparatus
CN103563373B (en) Method for encoding images and picture coding device
CN103202020B (en) Video decoding method and video decoding apparatus
CN103650507A (en) Video image coding method, video image coding device, video image decoding method, video image decoding device and video image coding/decoding device
CN106878742A (en) Dynamic image coding and decoding device
CN103650497B (en) Method for encoding images, picture decoding method, picture coding device, picture decoding apparatus and image encoding/decoding device
CN103477637A (en) Moving picture encoding method, moving picture encoding device, moving picture decoding method, moving picture decoding device, and moving picture encoding decoding device
CN103828373A (en) Image encoding method, image encoding device, image decoding method, image decoding device, and image encoding/decoding device
CN103518376A (en) Image encoding method, image encoding device, image decoding method, image decoding device, and image encoding/decoding device
CN105592317B (en) Picture decoding method and picture decoding apparatus
CN103069804B (en) Motion vector calculation method, image decoding method, motion vector calculation device and image decoding device
CN105284110A (en) Image coding method, and image coding device
CN103004204B (en) Method for encoding images

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20141010

Address after: Seaman Avenue Torrance in the United States of California No. 2000 room 200

Applicant after: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA

Address before: Osaka Japan

Applicant before: Matsushita Electric Industrial Co.,Ltd.

CB02 Change of applicant information

Address after: Seaman Avenue Torrance in the United States of California No. 20000 room 200

Applicant after: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA

Address before: Seaman Avenue Torrance in the United States of California No. 2000 room 200

Applicant before: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA

TA01 Transfer of patent application right

Effective date of registration: 20160926

Address after: American New York

Applicant after: Sun patent hosting Co.

Address before: Seaman Avenue Torrance in the United States of California No. 20000 room 200

Applicant before: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA

GR01 Patent grant