WO2016191915A1 - System and method for video processing - Google Patents

System and method for video processing Download PDF

Info

Publication number
WO2016191915A1
WO2016191915A1 PCT/CN2015/080230 CN2015080230W WO2016191915A1 WO 2016191915 A1 WO2016191915 A1 WO 2016191915A1 CN 2015080230 W CN2015080230 W CN 2015080230W WO 2016191915 A1 WO2016191915 A1 WO 2016191915A1
Authority
WO
WIPO (PCT)
Prior art keywords
pixel
value
prediction
target
imaging device
Prior art date
Application number
PCT/CN2015/080230
Other languages
French (fr)
Inventor
Xing Chen
Zisheng Cao
Lei Zhu
Original Assignee
SZ DJI Technology Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SZ DJI Technology Co., Ltd. filed Critical SZ DJI Technology Co., Ltd.
Priority to CN201580080404.9A priority Critical patent/CN107925771B/en
Priority to JP2017552077A priority patent/JP6607956B2/en
Priority to PCT/CN2015/080230 priority patent/WO2016191915A1/en
Priority to EP15874401.1A priority patent/EP3152907B1/en
Publication of WO2016191915A1 publication Critical patent/WO2016191915A1/en
Priority to US15/824,581 priority patent/US10893300B2/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/90Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using coding techniques not provided for in groups H04N19/10-H04N19/85, e.g. fractals
    • H04N19/91Entropy coding, e.g. variable length coding [VLC] or arithmetic coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/105Selection of the reference unit for prediction within a chosen coding or prediction mode, e.g. adaptive choice of position and number of pixels used for prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/13Adaptive entropy coding, e.g. adaptive variable length coding [AVLC] or context adaptive binary arithmetic coding [CABAC]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction

Definitions

  • the disclosed embodiments relate generally to video imaging and more particularly, but not exclusively, to systems and methods for processing a video.
  • image compression can be categorized into lossy compression and lossless compression.
  • Representative lossy compression methods are Joint Photographic Experts Group (“JPEG”) and H.26X, with JPEG having a compression rate between ten to forty times.
  • JPEG Joint Photographic Experts Group
  • H.26X has a higher compression rate, which typically can be as high as two hundred times. But the high compression rates are at an expense of losing certain image information, in addition to complex implementations. Therefore, an image quality of an image created via lossy compression is not as good as an image quality of an image created via lossless compression.
  • Major lossless compression methods include JPEG lossless compression, arithmetic coding, Huffman coding, variable-length + Huffman coding, Lempel-Ziv-Weich (“LZW”) coding, etc.
  • JPEG lossless coding is greater than arithmetic coding, which is greater than Huffman coding, which is greater than variable-length + Huffman coding, which is greater than LZW coding.
  • JPEG lossless compression combines prediction values within a frame with Huffman coding that needs a frequency table of the frame image.
  • the image is coded with variable-length coding. If the coding is realized with an Application-Specific Integrated Circuit (“ASIC”), a whole frame of image needs to be buffered (or stored), which requires a large storage space within a camera chip. In most cases, it is almost impossible to buffer a whole frame of image within the chip. A storage chip outside of the ASIC chip is needed, creating extra cost and thereby increasing a difficulty for implementation of compression.
  • ASIC Application-Specific Integrated Circuit
  • obtaining the prediction table comprises obtaining a prediction Huffman table.
  • coding the one or more target frames comprises choosing any frame of the video appearing after the reference frame.
  • obtaining the prediction Huffman table comprises generating a Huffman table of a difference value of each reference pixel of the reference frame.
  • generating the Huffman table comprises determining a prediction value for each of the reference pixels.
  • Exemplary embodiments of the disclosed methods further comprise determining the prediction value of the reference pixel based on respective pixel values of one or more pixels adjacent to the reference pixel.
  • the prediction value of the reference pixel is a constant value when the reference pixel is located in a first row and a first column of the reference frame.
  • the constant value is half of a maximum value of a coding value.
  • the constant value is five hundred and twelve when the maximum value of the coding value is one thousand and twenty-four.
  • Exemplary embodiments of the disclosed methods further comprise selecting the pixels adjacent to the reference pixel from a group of pixels consisting of a first pixel preceding the reference pixel in a same row, a second pixel preceding the reference pixel in a same column, a third pixel adjacent to the first and second pixels, and, when the reference pixel is neither in the first row nor in the first column of the frame, any arithmetic combination of the first, second and third pixels.
  • Exemplary embodiments of the disclosed methods further comprise determining the prediction value of the reference pixel by the first pixel when the reference pixel is a pixel of the first row.
  • Exemplary embodiments of the disclosed methods further comprise determining the prediction value of the reference pixel by the second pixel when the reference pixel is a pixel of the first column.
  • Exemplary embodiments of the disclosed methods further comprise determining the difference value based upon a difference of an actual value of the reference pixel and the prediction value of the reference pixel.
  • generating the prediction Huffman table comprises generating a frequency table of an absolute value of the difference value.
  • generating the frequency table comprises determining statistics of frequencies of the absolute values to form the frequency table.
  • generating the prediction Huffman table comprises generating the prediction Huffman table based on the frequency table.
  • coding the one or more target frames further comprises determining an absolute difference value for each target pixel of the one or more target frames.
  • Exemplary embodiments of the disclosed methods further comprise determining the absolute difference value based upon an absolute value of a difference between an actual value of each target pixel and a prediction value of the target pixel.
  • Exemplary embodiments of the disclosed methods further comprise determining the prediction value of each target pixel for the one or more target frames.
  • Exemplary embodiments of the disclosed methods further comprise determining the prediction value of the target pixel based on respective pixel values of one or more pixels adjacent to the target pixel.
  • the prediction value of the target pixel is a constant value when the target pixel is located in a first row and a first column of one of the one or more target frames.
  • the constant value is half of a maximum value of a coding value.
  • the constant value is five hundred and twelve, when the maximum value of the coding value is one thousand and twenty-four.
  • Exemplary embodiments of the disclosed methods further comprise selecting the pixels adjacent to the target pixel from a group of pixels consisting of a first pixel preceding the target pixel in a same row, a second pixel preceding the target pixel in a same column, a third pixel adjacent to the first and second pixels, and, when the target pixel is neither in the first row nor in the first column of the frame, any arithmetic combination of the first, second and third pixels.
  • Exemplary embodiments of the disclosed methods further comprise determining the prediction value of the target pixel by the first pixel when the target pixel is a pixel of the first row.
  • Exemplary embodiments of the disclosed methods further comprise comprising determining the prediction value of the target pixel by the second pixel when the target pixel is a pixel of the first column.
  • coding the one or more target frames comprises coding the frames based on the prediction Huffman table of the reference frame and the absolute difference values of the one or more target frames to generate variable-length codes.
  • coding the one or more target frames comprises transforming the variable-length codes to fixed-length codes.
  • transforming the variable-length codes comprises converting the variable-length codes to the fixed-length codes using a first-in-first-out service to generate the fixed-length codes.
  • converting the variable-length code comprises inserting a preselected hexadecimal character after every instance of a special character in accordance with a Joint Photographic Experts Group (“JPEG”) protocol.
  • JPEG Joint Photographic Experts Group
  • inserting the preselected hexadecimal character comprises inserting a hexadecimal zero after every instance of hexadecimal two hundred fifty-five.
  • Exemplary embodiments of the disclosed methods further comprise compensating the frequency table to generate a compensated frequency table.
  • compensating the frequency table comprises widening a coding width of the frequency table.
  • compensating the frequency table comprises replacing at least one zero-value each with a non-zero value.
  • replacing comprises replacing two or more zeros each with a one.
  • a video processing system configured to perform the video processing process in accordance with any one of previous embodiments of the disclosed methods.
  • a imaging device comprising:
  • a sensor for acquiring a sequence of image frames of a video
  • a processor for obtaining a prediction table for a received reference frame selected from the sequence of image frames and to code one or more target frames of the video based on the prediction table.
  • the prediction table is a prediction Huffman table.
  • the one or more target frames comprise any frame appearing after the reference frame.
  • the prediction Huffman table is a Huffman table of a difference value for each reference pixel of the reference frame.
  • the Huffman table is generated based upon a prediction value of each of the reference pixels.
  • the prediction value of the reference pixel is determined based on respective pixel values of one or more pixels adjacent to the reference pixel.
  • the prediction value of the reference pixel is a constant value when the reference pixel is located at a first row and a first column of the reference frame.
  • the constant value is half of a maximum value of a coding value.
  • the constant value is five hundred and twelve when the maximum value of the coding value is one thousand and twenty-four.
  • the pixels adjacent to the reference pixel are selected from a group of pixels consisting of a first pixel preceding the reference pixel in a same row, a second pixel preceding the reference pixel in a same column, a third pixel adjacent to the first and second pixels, and, when the reference pixel is neither in the first row nor in the first column of the frame, any arithmetic combination of the first, second and third pixels.
  • the prediction value of the reference pixel is determined by the first pixel when the reference pixel is a pixel of the first row.
  • the prediction value of the reference pixel is determined the second pixel when the reference pixel is a pixel of the first column.
  • the difference value is determined based upon a difference of an actual value of the reference pixel and the prediction value of the reference pixel.
  • the prediction Huffman table is generated with a frequency table of an absolute value of the difference value.
  • the frequency table is determined upon statistics of frequencies of the absolute values to form the frequency table.
  • the prediction Huffman table is generated based on the frequency table.
  • the processor is configured to determine an absolute difference value for each target pixel of the one or more target frames.
  • the processor is configured to determine the absolute difference value based upon an absolute value of a difference between an actual value of each target pixel and a prediction value of the target pixel.
  • the processor is configured to determine the prediction value of each target pixel for the one or more target frames.
  • the prediction value of the target pixel is determined based on respective pixel values of one or more pixels adjacent to the target pixel.
  • the prediction value of the target pixel is a constant value when the target pixel is located in a first row and a first column of one of the one or more target frames.
  • the constant value is half of a maximum value of a coding value.
  • the constant value is five hundred and twelve when the maximum value of the coding value is one thousand and twenty-four.
  • the pixels adjacent to the target pixel is selected from a group of pixels consisting of a first pixel preceding the target pixel in a same row, a second pixel preceding the target pixel in a same column, a third pixel adjacent to the first and second pixels, and, when the target pixel is neither in the first row nor in the first column of the frame, any arithmetic combination of the first, second and third pixels.
  • the processor is configured to determine the prediction value of the target pixel by the first pixel when the target pixel is a pixel of the first row.
  • the processor is configured to determine the prediction value of the target pixel by the second pixel when the target pixel is a pixel of the first column.
  • the one or more target frames are coded based on the prediction Huffman table of the reference frame and the absolute difference values of the one or more target frames to generate variable-length codes.
  • the processor is configured to transform the variable-length codes to fixed-length codes.
  • variable-length codes are transformed from the variable-length codes to the fixed-length codes using a first-in-first-out service.
  • the processor further configured to insert a preselected hexadecimal character after every instance of a special character in accordance with a Joint Photographic Experts Group (“JPEG”) protocol.
  • JPEG Joint Photographic Experts Group
  • the special character is a hexadecimal two hundred fifty-five.
  • the processor is configured to compensate the frequency table to generate a compensated frequency table.
  • the compensated frequency table is compensated by widening a coding width of the frequency table.
  • the coding width is widened by replacing at least one zero-value each with a non-zero value.
  • the non-zero value is a one.
  • Exemplary embodiments of the disclosed imaging devices further comprise a memory for storing the one or more frames of the video after coding.
  • the memory is configured to store the prediction table of the received reference frame.
  • Fig. 1 is an exemplary top level block diagram illustrating a video that comprises a frame sequence of image frames, wherein Huffman tables are associated with certain frames of the video and are used for compressing target frames of the video.
  • Fig. 2 is an exemplary top level flowchart illustrating an embodiment of a method for compressing the video of Fig. 1, wherein the method includes generating a prediction Huffman table for coding the target frames based on a reference frame of the video.
  • Fig. 3 is an exemplary flowchart illustrating an alternative embodiment of the method of Fig. 2, wherein the method includes additional detail for coding the target frames.
  • Fig. 4 is an exemplary flowchart illustrating another alternative embodiment of the method of Fig. 2, wherein the method includes operations being performed on both the reference frame and the target frame.
  • Fig. 5 is an exemplary flowchart illustrating still another alternative embodiment of the method of Fig. 2, wherein the method further comprises a loop for coding multiple frames.
  • Fig. 6 is an exemplary flowchart illustrating an alternative embodiment of the method of Fig. 3, wherein the method obtains the prediction Huffman table based on the reference frame.
  • Fig. 7 is an exemplary detail diagram illustrating an embodiment a partial frame of the reference frame of Fig. 6, wherein a partial frame is transformed for obtaining better prediction values for a target frame.
  • Fig. 8 is an exemplary detail diagram illustrating an alternative embodiment of the reference frame of Fig. 6, wherein a selected pixel is predicted by preceding pixel values.
  • Fig. 9 is an exemplary detail diagram, illustrating another alternative embodiment of the reference frame of Fig. 6, wherein a selected pixel located in a first row or first column is predicted.
  • Fig. 10 is an exemplary flowchart illustrating another alternative embodiment of the method of Fig. 3, wherein the method includes coding the target frame based on the prediction Huffman table, which is based on a reference frame and prediction values of the target frame.
  • Fig. 11 is an exemplary detail diagram illustrating an embodiment of the manner by which a FIFO service can be used for transforming variable-length codes to fixed-length codes.
  • Fig. 12 is an exemplary flowchart illustrating still another embodiment of the method of Fig. 3, wherein the method includes compensating the target frame by broadening a length of the frequency table.
  • Fig. 13 is an exemplary top-level block diagram illustrating an embodiment of an imaging device for implementing the method of Fig. 2.
  • Fig. 14 is an exemplary top-level block diagram, illustrating an alternative embodiment of the imaging device of Fig. 13, wherein the imaging device comprises a memory for storing the video of Fig. 1.
  • a lossless compression system and method that uses a prediction table, such as a Huffman table, based on a reference frame for coding a target frame can prove desirable and provide a basis for a simpler efficient system. This result can be achieved according to one embodiment disclosed in Fig. 1.
  • Fig. 1 shows a video 150 consisting of a frame sequence of N frames 151.
  • the video 150 is compressible.
  • prediction Huffman tables 152 can be associated with certain frames 151 of the video 150 for compressing one or more target frames 151t of the video 150.
  • the target frames 151t can be frames 151 of the video 150 that follow a reference frame 151r in the frame sequence, and the Huffman tables 152p can be created based upon the reference frame 151r.
  • Fig. 1 shows that a target frame 151t, such as an ith frame 151I, can be coded based on a prediction Huffman table 152p of an (i-1)th frame 151H to form a target frame 151t.
  • an (i+1)th frame 151J can be coded based on a Huffman table 152I of the ith frame 151I.
  • a first frame 151A that has no preceding frame 151.
  • Fig. 1 shows that a constant prediction Huffman code 152c can be used in coding the first frame 151A.
  • the constant Huffman code 152c can be any value within a range of Huffman codes that can be deducted from any of Huffman tables 152.
  • Huffman table 152p of the reference frame 151r for coding a single target frame 151t immediately following the reference frame 151r for purposed of illustration only
  • two or more target frames 151t in the frame sequence after the reference frame 151r can be coded based on the prediction Huffman table 152p.
  • Huffman tables 152 for predicting the target frames 152c for purposes of illustration only
  • other suitable forms of prediction tables can also be used in the present disclosure.
  • Fig. 2 illustrates an exemplary embodiment of a method 100 for compressing a video and is described with reference to the video 150 of Fig. 1 for purposes of illustration only.
  • the method 100 can compress the video 150 by using the prediction table 152p of the reference frame 151r, such as the frame 151H of the video 150 for coding a target frame 151t, such as the frame 151I (collectively shown in Fig. 1).
  • the target frame 151t is shown in Fig. 1 as being adjacent to the reference frame 151r.
  • a Huffman table 152p for the reference frame 151r can be obtained, at 110, for coding a target frame 151t.
  • the Huffman tables 152p for each frame 151 can be used for coding one or more target frames 151t that appear in the frame sequence after the reference frame 151r, such as the (i-1)th frame 151H. Additional detail of constructing the Huffman table 152p for the reference frame 151r will be shown and described below with reference to Fig. 6
  • the target frame 151t can be coded based on the prediction Huffman table 152p, i.e., the Huffman table 152p generated for the reference frame 151r, such as the (i-1)th frame 151H, can be used to code for the ith frame 151I of the video 150. Additional detail of coding for target frames 151t will be shown and described in more detail below with reference to Figs. 10. As described above with reference to Fig. 1, any of the frames 151 that appear in the frame sequence after the reference frame 151r can be coded based on the Huffman table 152p. In one embodiment, a compressing speed of the frames 151 can be slower than an imaging speed of the video 150. The frames 151 after the reference frame 151r, according to this embodiment, can be skipped to help ensure the compressing speed, and the Huffman table 152p can be used to code a target frame 151t that does not immediately follow the reference frame 151r.
  • the Huffman table 152p can be used to code
  • an already-received video frame 151 can serve as a reference frame 151r for providing prediction values that can be used to construct the prediction Huffman table 152p.
  • the prediction Huffman table 152p can be used for coding one or more target frames 151t appearing after the reference frame 151r.
  • Such dynamic table generation can release a requirement for buffering video frames 151.
  • the method 100 can lower an implementation difficulty and/or costs, while efficiently ensuring a coding compression ratio.
  • the method 100 does not require storage of a whole frame of raw image data for the compression. Instead, only a prediction Huffman table 152p is stored at any time for compressing the target frames 151t. Therefore, memory space requirement of a camera (not shown) for purposes of video compression can be greatly lowered, making compression feasible to be implemented with the camera.
  • the method 100 can use the prediction Huffman table 152p to implement a lossless compression of the target frames 151t. Therefore, a predetermined quality level for the compression can be ensured.
  • any other frames 151 can be used as the reference frame 151r.
  • Fig. 3 illustrates an alternative embodiment of the method 100.
  • the coding of the target frame 151t, at 130 is shown with additional detail.
  • a prediction Huffman table 152p for a reference frame 151r such as the ith frame 151I (shown in Fig. 1)
  • a target frame 151t such as the (i+1)th frame 151J, can be coded at 130 based on the prediction Huffman table 152p.
  • the prediction Huffman table 152p can be used to a generate Huffman code for each pixel of the target frame 151t.
  • the Huffman code is not used alone for coding the target frame 151t. Instead, the Huffman code can be combined with certain information of the target frame 151t, such as absolute difference values for target pixels of the target frame 151t.
  • a realization process for coding the target frame 151t can be performed, at 132.
  • the codes for the target frame 151t can be constructed based on the prediction Huffman table 152, or the Huffman codes, and the absolute difference values of the target frame 151t. Additional detail regarding the code realization process will be shown and described in greater detail below with reference to Fig. 10.
  • a frequency table 531 (shown in Fig. 6) can be compensated to enhance an accuracy of the prediction values.
  • the frequency table can be used to create the prediction Huffman table 152p. Additional detail regarding compensating the target frame 151t will be shown and described below with reference to Fig. 7.
  • Fig. 4 shows an alternative embodiment of the method 100.
  • obtaining of a prediction Huffman table 152p based on a reference frame 151r can be conducted, at 110, in the manner discussed above with reference to the method 100 of Fig. 2.
  • prediction values can be determined, at 510, for each pixel of the reference frame 151r.
  • a frequency table can be generated based on the prediction values for each pixel of the reference frame 151r.
  • the prediction Huffman table 152p can be constructed based on the frequency table. Additional detail regarding determining the prediction values, at 510, generating the frequency table, at 530, and constructing the prediction Huffman table 152p, at 540, is shown and described in additional detail below with reference to Fig. 6.
  • coding of the frame 151t can be conducted, at 130, in the manner discussed above with reference to the method 100 of Fig. 2.
  • absolute difference values for each pixel, i.e. each target pixel, of the target frame 151t can be determined, at 122, as illustrated in Fig. 4.
  • the absolute difference values can be absolute differences for each and every pixels of the target frame 151t.
  • Each of the absolute difference values can be an absolute value of a difference between an actual value of each pixel and a respective prediction value of the pixel. Additional detail regarding how to determine the absolute difference values of the target frame 151t will be shown and described below with reference to Fig. 10.
  • the absolute difference values for the target frame 151t and the prediction Huffman table 152p can be combined to code the target frame 151t, at 124. Addition detail regarding coding the target frames 151t, at 130, based on the combination of the prediction Huffman table 152p and the absolute difference values of the target frame 151 will be shown and described in greater detail below with reference to Fig. 10.
  • Fig. 5 shows another alterative embodiment of the method 100, wherein the method 100 further comprises a loop 199 for coding multiple frames 151.
  • the target frame 151t and/or the reference frame 151r can be replaced with other frames 151, at 140.
  • the reference frame 151r can remain as a new reference frame 151r for a new target frame 151t.
  • the reference frame 151r can be replaced with another frame 151 following the reference frame 151r.
  • the new reference frame 151r can be replaced by the target frame 151t that has just compressed for efficiency purposes because absolute difference values of the target frame 151t have already been completed.
  • the target frame 151t can be replaced with another frame 151, which can become a new target frame 151t.
  • the new target frame 151t can be any frame 151 appearing after the new reference frame 151r. This process repeats till all target frames 151t of the video 150 are coded.
  • any other frames 151 within the video can be selected as the reference frame 151r and/or the target frames 151t.
  • Fig. 6 shows an embodiment of obtaining a prediction Huffman table 152p based on a reference frame 151r, as part of the method 100.
  • the reference frame 151r can be processed for purposes of better coding and/or better compression.
  • a purpose for processing the reference frame 151r is shown Fig. 7.
  • an exemplary typical pixel composition of the reference frame 151r is shown with a partial frame 710 that can be transformed for obtaining better prediction values for target frame 151t.
  • Fig. 7 shows the partial frame 710 has sixteen pixels 701 in four rows 711A, 711B, 711C, 711D and four columns 721A, 721B, 721C, 721D.
  • a pixel with a blue (“B”) color component can be located in the second row 711B and the second column 721B.
  • the B color pixel can be predicted by a preceding pixel value red(“G”) in the same row 711B, a preceding pixel value green(“G”) in the same column 721B, and a pixel value red (“R”) in a preceding row 721A and preceding column 711A.
  • the red-red-green (“GGR”) components can be of little relevance with the B color pixel.
  • a combination process can be provided, at 750, to transform the partial frame 710 into a new partial frame 730.
  • every two rows of pixels are combined into a single row.
  • the rows 711A, 711B can be combined into a new row 731A
  • the rows 711C, 711D can be combined into a new row 731B.
  • the new partial frame 730 can have eight columns 741A to 741H, each having two pixels.
  • the pixels 701 can be rearranged such that the relevance for the prediction pixels 701 can be enhanced by regrouping pixels 701 with likely relevant color components together.
  • the pixel 701 located in the second row 731B and second column 741B is a G color in the transformed partial frame 730.
  • the available prediction pixels preceding the G color pixel are RRG that are more relevant than those preceding pixels 701, which are GGR, in the partial frame 710.
  • each and every pixel 701 of the reference frame 151r can be determined with a prediction value Px (shown in Equations 1-5).
  • Px shown in Equations 1-5.
  • each pixel 701 of the reference frame 151r can be referred as a selected pixel 701, which can also be referred as a reference pixel for the reference frame 151r.
  • part or all pixels 701 of the reference frame 151r can be selected as selected pixels, or reference pixels, for calculating the prediction values Px.
  • a prediction value Px can be calculated according to at least one of the preceding pixels adjacent to the selected pixel 701.
  • three pixels preceding the selected pixel 701 can be referred as a first pixel, a second pixel and a third pixel.
  • the preceding adjacent pixels can include the first pixel preceding the selected pixel in a same row with the selected pixel, such as pixel A 801 for pixel D, the second pixel preceding the selected pixel in a same column with the selected pixel 701, such as pixel B 802 for pixel D 805.
  • the adjacent pixels to the selected pixel 701 can also include the third pixel in the same row with the first pixel and same column with the second pixel, such as pixel C 803 for pixel D 805.
  • the prediction value of the selected pixel 701 can be determined with any one of the actual pixel values pixel A, pixel B or pixel C.
  • the prediction value of the selected pixel 701 can also be determined via one of the following equations:
  • P x denotes the prediction value of the selected pixel 701
  • a denotes an actual value of the first pixel
  • pixel A preceding the selected pixel
  • b denotes an actual value of the second pixel
  • pixel B preceding the selected pixel
  • c denotes an actual value of the third pixel, pixel C, preceding the selected pixel.
  • pixels 701 located in a first row 731A and/or in a first column 741A can be predicted by a constant value and/or by an actual value of one or more pixels 701 available in a preceding row or column.
  • pixel A 701A can be a first pixel of a frame 740, which pixel A 701A does not have any preceding pixel 701.
  • the constant C can be half of a maximum pixel value, such as five hundred and twelve (512) when the maximum pixel value is one thousand and twenty-four (1024).
  • a prediction value P B1 for pixel B1 can be an actual value of pixel A 701A
  • a prediction value P C1 for pixel C1 can be an actual value of pixel B1
  • a prediction value P D1 for pixel D1 can be an actual value of pixel C1.
  • a prediction value P B2 for pixel B2 can be the actual value of pixel A 701A
  • a prediction value P C2 for pixel C2 can be the actual value of pixel B2
  • a prediction value P D2 for pixel D2 can be the actual value of pixel C2.
  • Pixel A 701A has an actual value of 400 and that pixel B1 has an actual value of 200, and that pixel C1 has an actual value of 300.
  • Values for P B2 , P C2 and P D2 can be determined in a similar manner.
  • a result of the calculation, at 510, is a prediction value 511, for each selected pixel 701, that normally can have a difference with the actual value of the selected pixel 701.
  • a prediction value 511 for each selected pixel 701 that normally can have a difference with the actual value of the selected pixel 701.
  • a difference value between the prediction value 511 and the actual value of the selected pixel 701 can be calculated.
  • an absolute value of the difference value can be taken, at 520. Therefore, a result of the calculation, at 520, can be an absolute difference value of the difference value 521 between the actual value of the selected pixel and the prediction value that is the result of 520.
  • this result can be determined with the following equation:
  • diff denotes the absolute difference value 521 between the actual value of the selected pixel 701 and the prediction value
  • d denotes the actual value of the selected pixel 701
  • Px denotes the prediction value of the selected pixel 701.
  • any other suitable arithmetic difference values can be provided to reflect the difference under this disclosure.
  • a frequency table 531 can be constructed in a form of a histogram based on the calculated absolute difference values 521 for each and every pixel 701 of the reference frame 151r.
  • the frequency table 531 can be constructed by counting frequencies for each of the absolute difference values of the pixels 701.
  • the counting of the frequencies can be conducted by counting appearances of each difference value.
  • each difference value can consist of ten bit of ones or zeros.
  • the counting of frequencies can be conducted by counting the appearance of a highest non-zero value, i.e. a highest one, in each difference value.
  • the result of counting frequency for each difference value, at 530, can be the frequency table 531 in the form of histogram (not shown).
  • the frequency table 531 can have two sets of information: one set represents a value dataset of the available absolute difference values 521, such as ⁇ d 1 , d 2 , ..., d i ,..., d n ⁇ ; and the other set is a frequency dataset represents frequencies corresponding to each of the absolute difference values 521, such as ⁇ f 1 , f 2 , ..., f i ,..., f n ⁇ .
  • the value dataset can be ⁇ d 1 , d 2 , ..., d i ,..., d 10 ⁇ and the frequency dataset can be ⁇ f 1 , f 2 , ..., f i ,..., f 10 ⁇ .
  • the method 100 can use extended coding of twelve bit and/or sixteen bit coding systems.
  • a prediction Huffman table 152p for the reference frame 151r can be generated, in a form of a binary tree 541, based on the frequency table 531.
  • the binary tree 541 can be created in any ordinary ways of constructing a Huffman table 152p.
  • One way of constructing the Huffman table 152p can include working from bottom up of the binary tree 541, sorting the value dataset by the frequency dataset, making two-lowest elements into leaves, and creating a parent mode with a frequency being a sum of the frequencies of the two lower elements.
  • the Huffman table 152p can carry information of coding values and corresponding bit string for each coding value, which string unambiguously represents the coding value.
  • the prediction Huffman table 152p can carry Huffman codes for each of the absolute difference values 521.
  • the Huffman codes are normally variable-length because of differences among the frequencies of each of the absolute difference values 521.
  • the variable-length codes can be further processed with additional detail being shown and described below with reference to Fig. 10.
  • Fig. 10 shows an embodiment of coding a target frame 151t based on the prediction Huffman table 152p for the method 100, which prediction Huffman table 152p is based on a reference frame 151r and prediction values of the target frame 151t.
  • absolute difference values, diff of Equation 6, for each selected pixel 701 of the target frame 151t can be determined.
  • the absolute difference values of the target frame 151t can be determined in a same manner as shown and described above for the absolute difference values for the reference frame 151r, at 510, with reference to Fig. 6 and Equations 1-6.
  • each selected pixel 701 of the target frame 151r can be referred as a target pixel for the target frame 151t.
  • the result of the calculation at 122 can be absolute difference values 811 for each pixel of the target frame 151t.
  • a Huffman code for each pixel of the target frame 151t can be generated based on the prediction Huffman table 152p of the reference frame 151r and the absolute difference values 811.
  • the Huffman code 821 for a selected pixel 701 of the target frame 151t can be generated by combining the Huffman code 541 represented in the prediction Huffman table 152p for the selected pixel 701 and the absolute difference value 811 obtained from the calculation process, at 122.
  • the combined Huffman codes have variable-lengths because both of the Huffman codes generated from the prediction Huffman table 541 of the reference frame 151r and the absolute difference values 811 of the selected pixels 701 can vary in length.
  • variable-length Huffman codes 821 can be difficult to process. Therefore, a transformation from the variable-length Huffman codes to fixed-length codes can be provided under the present disclosure.
  • the transformation from the variable-length code 821 to fixed-length code 831 comprises using a first-in-first-out service (“FIFO”) 850, at 812, to convert the variable-length codes.
  • FIFO first-in-first-out service
  • the variable length codes can be streamlined into the FIFO service 850.
  • FIG. 11 An exemplary embodiment of the FIFO service 850 is shown in Fig. 11, which service can be used for transforming variable-length codes to fixed-length codes.
  • a series of input codes, in_code 1 to in_code 8 and more can be input into the FIFO service 850.
  • the input codes, in_code 1 to in_code 8 can represent the variable-length codes 821.
  • the input codes are transformed into output codes, out_code 1 to out_code 4 and more, which are in fixed-length of sixteen bits.
  • variable-length codes Although shown and described in Fig. 11 as using the FIFO service 850 for transforming the variable-length codes for purposes of illustration only, other forms of services can be applied to transform the variable-length codes, including but not limited to padding zeros before and/or after each code to generate a fixed-length code.
  • certain bytes of the target frame 151t may coincide with special characters defined under the JPEG protocol.
  • those bytes can be marked with identification characters, at 814.
  • identification characters can comprise a hexadecimal zero, 0x00.
  • the identification character can be added immediately after those bytes, or be added immediately before each appearance of the special bytes.
  • a hexadecimal zero, 0x00 can be added after each appearance of hexadecimal two hundred and fifty-five, 0XFF.
  • the output of the transformation 830 can be a fixed-length code 831.
  • the identification character 0x00 to mark each appearance of special characters for purposes of illustration only, other suitable forms of identification approaches can also be applied under the present disclosure.
  • Fig. 12 shows an embodiment of a compensation approach, at 134, for coding a target frame 151t, by broadening a length of a frequency table that is used to generate a prediction Huffman table 152p of a reference frame 151r.
  • scenarios being video recorded can change slowly, and relevance between frames 151 can be relatively greater.
  • prediction values for each target frame 151t can be more accurate.
  • the scenarios being videoed can change rapidly or abruptly, and the relevance between frames 151 can be relatively less.
  • significant differences between a frequency table 531 (shown in Fig. 6) of the reference frame 151r and the frequency table 531 of the target frame 151t can exist.
  • a length of the frequency table 531 of the reference frame 151r can be broadened with extra numbers, at 910.
  • such broadening can be realized by replacing zero values with non-zero values, such as ones in the frequency table 531 of the reference frame 151r, at 912.
  • zeros immediately after the non-zero values can be replaced by non-zeros.
  • two zeros immediately after the non-zero values can be replaced by non-zeros, e.g. by ones.
  • the frequency table 531 is: 15, 15, 15, 15, 15, 15, 15, 15, 15, 0, 0, 0, 0.
  • a new frequency table can be obtained as: 15, 15, 15, 15, 15, 15, 15, 15, 15, 15, 15, 1, 1, 0, 0, 0.
  • Fig. 13 shows an exemplary embodiment of an imaging device 200 for implementing the method of Fig. 2.
  • the imaging device 200 can be a video and/or still camera.
  • the imaging device 200 can comprise a lens 951, an image sensor 954, and/or a processor 955.
  • the lens 951 can receive light from a scene 958.
  • the lens 951 can be configured to focus the received light onto the image sensor 954, which generates images of the scene 958.
  • the images generated can be the frames 151 within the video 150 (collectively shown in Fig. 1).
  • the processor 955 can then process the frames for compressing the video in accordance with the method 100 shown and described in more detail above with reference to any one of Figs. 2-5, 6, 10, and 12.
  • An exemplary embodiment of the lens 951 can be a digital single-lens reflex (“DSLR”) lens; however, the lens 951 can comprise any conventional type of lens.
  • Exemplary suitable lenses as the lens 951 can include one or more of a pin-hole lens, a biological lens, a simple convex glass lens, or the like, without limitation.
  • the lens 951 can be configured with certain imaging properties such as one or more of a macro lens, zoom lens, telephoto lens, fisheye lens, wide-angle lens, or the like, without limitation.
  • the image sensor 954 can receive the light from the lens 951 and form an image based on the light received.
  • the image sensor 954 can be a charge coupled sensor (“CCD”), complementary metal-oxide-semiconductor (“CMOS”) sensor, N-type metal-oxide-semiconductor (“NMOS”) sensor, and hybrids/variants thereof, an electro-optical sensor, a thermal/infrared sensor, a color or monochrome sensor, a multi-spectral imaging sensor, a spectrophotometer, a spectrometer, a thermometer, and/or an illuminometer.
  • CCD charge coupled sensor
  • CMOS complementary metal-oxide-semiconductor
  • NMOS N-type metal-oxide-semiconductor
  • the processor 955 can comprise any commercially-available graphic chip that chips can be used in currently available video equipment.
  • the processor 955 can also be a custom-designed graphic chips specially produced for the imaging device 200.
  • the processor 955 can also comprise additional chips for accelerating rendering of 2D graphics and/or 3D scenes, MPEG-2/MPEG-4 decoding, TV output, or an ability to connect multiple displays.
  • the processor 955 can operate under a VGA standard.
  • the processor 955 can include one or more general purpose microprocessors (for example, single or multi-core processors), application-specific integrated circuits, application-specific instruction-set processors, graphics processing units, physics processing units, digital signal processing units, coprocessors, network processing units, audio processing units, encryption processing units, and the like.
  • the processor 955 can be configured to perform any of the methods described herein, including but not limited to, a variety of operations relating to image/frame processing.
  • the processor 955 can include specialized hardware for processing specific operations relating to imaging processing.
  • the processor 955 can usually be operably connected to the image sensor 954. The connection can be via a wired and/or wireless link.
  • the processor 955 can process a non-coded image/frame received by the image sensor 955 and can code the non-coded image/frame image automatically in accordance with the method 200 disclosed herein.
  • the processor 955 can perform any one or more of the processes of method 100 shown and described with reference to any one of Figs. 2-5, 6, 10, and 12.
  • an imaging device 200 can also contain a memory 957.
  • the processor 955 of the imaging device 200 can be operatively connected to the memory 957.
  • the memory 957 optionally can be provided for storing non-coded images (or frames) from the image sensor 954 and/or coded/compressed images (or frames) from the processor 955.
  • the memory 957 can be linked to the processor 955 via wired or wireless connections.
  • the memory 957 can also be linked (not shown) to any other components of the imaging device 200, such as the image sensor 954.
  • Exemplary examples of the memory 957 can be a random access memory (“RAM”), static RAM, dynamic RAM, read-only memory (“ROM”), programmable ROM, erasable programmable ROM, electrically erasable programmable ROM, flash memory, secure digital (“SD”) card, and the like.
  • the imaging device 200 can be a video camera.
  • the memory 957 can be used to store a compressed video 150 (shown in Fig. 1).

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

A system for processing a video and methods for making and using the same. The system can obtain a prediction table for a reference frame of the video and code one or more target frames of the video based on the prediction table. The prediction table can be a Huffman table of a difference value for each reference pixel of the reference frame. The difference value is determined based on an actual value of the reference pixel and a prediction value that can be determined based on respective pixel values of one or more pixels adjacent to the reference pixel. The target frames can be coded based on the Huffman table of the reference frame and prediction values of the target frames. The system advantageously does not need to save a whole frame during a course of encoding and, therefore, can save valuable memory space for video processing.

Description

SYSTEM AND METHOD FOR VIDEO PROCESSING FIELD
The disclosed embodiments relate generally to video imaging and more particularly, but not exclusively, to systems and methods for processing a video.
BACKGROUND
Early video cameras saved uncompressed video directly because no compression mechanism was employed, which requires high bandwidth and big memory space with a hard disk. Particularly, nowadays, four thousand pixel (4K pixel) images are becoming widely popular. It is difficult to directly store uncompressed raw video data acquired from camera sensors because of a requirement of higher imaging rates. For this reason, more and more video cameras employ image compression technologies for compressing images before saving into the hard disk.
Based on levels (or layers) of compression, image compression can be categorized into lossy compression and lossless compression. Representative lossy compression methods are Joint Photographic Experts Group (“JPEG”) and H.26X, with JPEG having a compression rate between ten to forty times. H.26X has a higher compression rate, which typically can be as high as two hundred times. But the high compression rates are at an expense of losing certain image information, in addition to complex implementations. Therefore, an image quality of an image created via lossy compression is not as good as an image quality of an image created via lossless compression.
Major lossless compression methods include JPEG lossless compression, arithmetic coding, Huffman coding, variable-length + Huffman coding, Lempel-Ziv-Weich (“LZW”) coding, etc. As for compression rate, JPEG lossless coding is greater than arithmetic coding, which is greater than Huffman coding, which is greater than variable-length + Huffman coding, which is greater than LZW coding.
JPEG lossless compression combines prediction values within a frame with Huffman coding that needs a frequency table of the frame image. The image is coded with variable-length coding. If the coding is realized with an Application-Specific Integrated Circuit (“ASIC”), a whole frame of image needs to be buffered (or stored), which requires a large storage space within a camera chip. In most cases, it is almost impossible to buffer a whole frame of image within the chip. A storage chip outside of the ASIC chip is needed, creating extra cost and thereby increasing a difficulty for implementation of compression.
In view of the foregoing reasons, there is a need for systems and methods for lossless compressing frames in a video without extra expense of outside storage chip.
SUMMARY
In accordance with a first aspect disclosed herein, there is set forth a method for processing a video, comprising:
obtaining a prediction table for a received reference frame of the video; and
coding one or more target frames of the video based on the prediction table.
In an exemplary embodiment of the disclosed methods, obtaining the prediction table comprises obtaining a prediction Huffman table.
In another exemplary embodiment of the disclosed methods, coding the one or more target frames comprises choosing any frame of the video appearing after the reference frame.
In another exemplary embodiment of the disclosed methods, obtaining the prediction Huffman table comprises generating a Huffman table of a difference value of each reference pixel of the reference frame.
In another exemplary embodiment of the disclosed methods, generating the Huffman table comprises determining a prediction value for each of the reference pixels.
Exemplary embodiments of the disclosed methods further comprise determining the prediction value of the reference pixel based on respective pixel values of one or more pixels adjacent to the reference pixel.
In another exemplary embodiment of the disclosed methods, the prediction value of the reference pixel is a constant value when the reference pixel is located in a first row and a first column of the reference frame.
In another exemplary embodiment of the disclosed methods, wherein the constant value is half of a maximum value of a coding value.
In another exemplary embodiment of the disclosed methods, the constant value is five hundred and twelve when the maximum value of the coding value is one thousand and twenty-four.
Exemplary embodiments of the disclosed methods further comprise selecting the pixels adjacent to the reference pixel from a group of pixels consisting of a first pixel preceding the reference pixel in a same row, a second pixel preceding the reference pixel in a same column, a third pixel adjacent to the first and second pixels, and, when the reference pixel is neither in the first row nor in the first column of the frame, any arithmetic combination of the first, second and third pixels.
Exemplary embodiments of the disclosed methods further comprise determining the prediction value of the reference pixel by the first pixel when the reference pixel is a pixel of the first row.
Exemplary embodiments of the disclosed methods further comprise determining the prediction value of the reference pixel by the second pixel when the reference pixel is a pixel of the first column.
Exemplary embodiments of the disclosed methods further comprise determining the difference value based upon a difference of an actual value of the reference pixel and the prediction value of the reference pixel.
In another exemplary embodiment of the disclosed methods, generating the prediction Huffman table comprises generating a frequency table of an absolute value of the difference value.
In another exemplary embodiment of the disclosed methods, generating the frequency table comprises determining statistics of frequencies of the absolute values to form the frequency table.
In another exemplary embodiment of the disclosed methods, generating the prediction Huffman table comprises generating the prediction Huffman table based on the frequency table.
In another exemplary embodiment of the disclosed methods, coding the one or more target frames further comprises determining an absolute difference value for each target pixel of the one or more target frames.
Exemplary embodiments of the disclosed methods further comprise determining the absolute difference value based upon an absolute value of a difference between an actual value of each target pixel and a prediction value of the target pixel.
Exemplary embodiments of the disclosed methods further comprise determining the prediction value of each target pixel for the one or more target frames.
Exemplary embodiments of the disclosed methods further comprise determining the prediction value of the target pixel based on respective pixel values of one or more pixels adjacent to the target pixel.
In another exemplary embodiment of the disclosed methods, the prediction value of the target pixel is a constant value when the target pixel is located in a first row and a first column of one of the one or more target frames.
In another exemplary embodiment of the disclosed methods, the constant value is half of a maximum value of a coding value.
In another exemplary embodiment of the disclosed methods, the constant value is five hundred and twelve, when the maximum value of the coding value is one thousand and twenty-four.
Exemplary embodiments of the disclosed methods further comprise selecting the pixels adjacent to the target pixel from a group of pixels consisting of a first pixel preceding the target pixel in a same row, a second pixel preceding the target pixel in a same column, a third pixel adjacent to the first and second pixels, and, when the target pixel is neither in the first row nor in the first column of the frame, any arithmetic combination of the first, second and third pixels.
Exemplary embodiments of the disclosed methods further comprise determining the prediction value of the target pixel by the first pixel when the target pixel is a pixel of the first row.
Exemplary embodiments of the disclosed methods further comprise comprising determining the prediction value of the target pixel by the second pixel when the target pixel is a pixel of the first column.
In another exemplary embodiment of the disclosed methods, coding the one or more target frames comprises coding the frames based on the prediction Huffman table of the reference frame and the absolute difference values of the one or more target frames to generate variable-length codes.
In another exemplary embodiment of the disclosed methods, coding the one or more target frames comprises transforming the variable-length codes to fixed-length codes.
In another exemplary embodiment of the disclosed methods, transforming the variable-length codes comprises converting the variable-length codes to the fixed-length codes using a first-in-first-out service to generate the fixed-length codes.
In another exemplary embodiment of the disclosed methods, converting the variable-length code comprises inserting a preselected hexadecimal character after every instance of a special character in accordance with a Joint Photographic Experts Group (“JPEG”) protocol.
In another exemplary embodiment of the disclosed methods, inserting the preselected hexadecimal character comprises inserting a hexadecimal zero after every instance of hexadecimal two hundred fifty-five.
Exemplary embodiments of the disclosed methods further comprise compensating the frequency table to generate a compensated frequency table.
In another exemplary embodiment of the disclosed methods, compensating the frequency table comprises widening a coding width of the frequency table.
In another exemplary embodiment of the disclosed methods, compensating the frequency table comprises replacing at least one zero-value each with a non-zero value.
In another exemplary embodiment of the disclosed methods, replacing comprises replacing two or more zeros each with a one.
In accordance with another aspect disclosed herein, there is set forth a video processing system configured to perform the video processing process in accordance with any one of previous embodiments of the disclosed methods.
In accordance with another aspect disclosed herein, there is set forth a computer program product comprising instructions for processing a video in accordance with any one of previous embodiments of the disclosed methods.
In accordance with another aspect disclosed herein, there is set forth a imaging device, comprising:
a sensor for acquiring a sequence of image frames of a video; and
a processor for obtaining a prediction table for a received reference frame selected from the sequence of image frames and to code one or more target frames of the video based on the prediction table.
In an exemplary embodiment of the disclosed imaging devices, the prediction table is a prediction Huffman table.
In another exemplary embodiment of the disclosed imaging devices, the one or more target frames comprise any frame appearing after the reference frame.
In another exemplary embodiment of the disclosed imaging devices, the prediction Huffman table is a Huffman table of a difference value for each reference pixel of the reference frame.
In another exemplary embodiment of the disclosed imaging devices, the Huffman table is generated based upon a prediction value of each of the reference pixels.
In another exemplary embodiment of the disclosed imaging devices, the prediction value of the reference pixel is determined based on respective pixel values of one or more pixels adjacent to the reference pixel.
In another exemplary embodiment of the disclosed imaging devices, the prediction value of the reference pixel is a constant value when the reference pixel is located at a first row and a first column of the reference frame.
In another exemplary embodiment of the disclosed imaging devices, the constant value is half of a maximum value of a coding value.
In another exemplary embodiment of the disclosed imaging devices, the constant value is five hundred and twelve when the maximum value of the coding value is one thousand and twenty-four.
In another exemplary embodiment of the disclosed imaging devices, the pixels adjacent to the reference pixel are selected from a group of pixels consisting of a first pixel preceding the reference pixel in a same row, a second pixel preceding the reference pixel in a same column, a third pixel adjacent to the first and second pixels, and, when the reference pixel is neither in the first row nor in the first column of the frame, any arithmetic combination of the first, second and third pixels.
In another exemplary embodiment of the disclosed imaging devices, the prediction value of the reference pixel is determined by the first pixel when the reference pixel is a pixel of the first row.
In another exemplary embodiment of the disclosed imaging devices, the prediction value of the reference pixel is determined the second pixel when the reference pixel is a pixel of the first column.
In another exemplary embodiment of the disclosed imaging devices, the difference value is determined based upon a difference of an actual value of the reference pixel and the prediction value of the reference pixel.
In another exemplary embodiment of the disclosed imaging devices, the prediction Huffman table is generated with a frequency table of an absolute value of the difference value.
In another exemplary embodiment of the disclosed imaging devices, the frequency table is determined upon statistics of frequencies of the absolute values to form the frequency table.
In another exemplary embodiment of the disclosed imaging devices, the prediction Huffman table is generated based on the frequency table.
In another exemplary embodiment of the disclosed imaging devices, the processor is configured to determine an absolute difference value for each target pixel of the one or more target frames.
In another exemplary embodiment of the disclosed imaging devices, the processor is configured to determine the absolute difference value based upon an absolute value of a difference between an actual value of each target pixel and a prediction value of the target pixel.
In another exemplary embodiment of the disclosed imaging devices, the processor is configured to determine the prediction value of each target pixel for the one or more target frames.
In another exemplary embodiment of the disclosed imaging devices, the prediction value of the target pixel is determined based on respective pixel values of one or more pixels adjacent to the target pixel.
In another exemplary embodiment of the disclosed imaging devices, the prediction value of the target pixel is a constant value when the target pixel is located in a first row and a first column of one of the one or more target frames.
In another exemplary embodiment of the disclosed imaging devices, the constant value is half of a maximum value of a coding value.
In another exemplary embodiment of the disclosed imaging devices, the constant value is five hundred and twelve when the maximum value of the coding value is one thousand and twenty-four.
In another exemplary embodiment of the disclosed imaging devices, the pixels adjacent to the target pixel is selected from a group of pixels consisting of a first pixel preceding the target pixel in a same row, a second pixel preceding the target pixel in a same column, a third pixel adjacent to the first and second pixels, and, when the target pixel is neither in the first row nor in the first column of the frame, any arithmetic combination of the first, second and third pixels.
In another exemplary embodiment of the disclosed imaging devices, the processor is configured to determine the prediction value of the target pixel by the first pixel when the target pixel is a pixel of the first row.
In another exemplary embodiment of the disclosed imaging devices, the processor is configured to determine the prediction value of the target pixel by the second pixel when the target pixel is a pixel of the first column.
In another exemplary embodiment of the disclosed imaging devices, the one or more target frames are coded based on the prediction Huffman table of the reference frame and the absolute difference values of the one or more target frames to generate variable-length codes.
In another exemplary embodiment of the disclosed imaging devices, the processor is configured to transform the variable-length codes to fixed-length codes.
In another exemplary embodiment of the disclosed imaging devices, the variable-length codes are transformed from the variable-length codes to the fixed-length codes using a first-in-first-out service.
In another exemplary embodiment of the disclosed imaging devices, the processor further configured to insert a preselected hexadecimal character after every instance of a special character in accordance with a Joint Photographic Experts Group (“JPEG”) protocol.
In another exemplary embodiment of the disclosed imaging devices, the special character is a hexadecimal two hundred fifty-five.
In another exemplary embodiment of the disclosed imaging devices, the processor is configured to compensate the frequency table to generate a compensated frequency table.
In another exemplary embodiment of the disclosed imaging devices, the compensated frequency table is compensated by widening a coding width of the frequency table.
In another exemplary embodiment of the disclosed imaging devices, the coding width is widened by replacing at least one zero-value each with a non-zero value.
In another exemplary embodiment of the disclosed imaging devices, the non-zero value is a one.
Exemplary embodiments of the disclosed imaging devices further comprise a memory for storing the one or more frames of the video after coding.
In another exemplary embodiment of the disclosed imaging devices, the memory is configured to store the prediction table of the received reference frame.
BRIEF DESCRIPTION OF THE DRAWINGS
Fig. 1 is an exemplary top level block diagram illustrating a video that comprises a frame sequence of image frames, wherein Huffman tables are associated with certain frames of the video and are used for compressing target frames of the video.
Fig. 2 is an exemplary top level flowchart illustrating an embodiment of a method for compressing the video of Fig. 1, wherein the method includes generating a prediction Huffman table for coding the target frames based on a reference frame of the video.
Fig. 3 is an exemplary flowchart illustrating an alternative embodiment of the method of Fig. 2, wherein the method includes additional detail for coding the target frames.
Fig. 4 is an exemplary flowchart illustrating another alternative embodiment of the method of Fig. 2, wherein the method includes operations being performed on both the reference frame and the target frame.
Fig. 5 is an exemplary flowchart illustrating still another alternative embodiment of the method of Fig. 2, wherein the method further comprises a loop for coding multiple frames.
Fig. 6 is an exemplary flowchart illustrating an alternative embodiment of the method of Fig. 3, wherein the method obtains the prediction Huffman table based on the reference frame.
Fig. 7 is an exemplary detail diagram illustrating an embodiment a partial frame of the reference frame of Fig. 6, wherein a partial frame is transformed for obtaining better prediction values for a target frame.
Fig. 8 is an exemplary detail diagram illustrating an alternative embodiment of the reference frame of Fig. 6, wherein a selected pixel is predicted by preceding pixel values.
Fig. 9 is an exemplary detail diagram, illustrating another alternative embodiment of the reference frame of Fig. 6, wherein a selected pixel located in a first row or first column is predicted.
Fig. 10 is an exemplary flowchart illustrating another alternative embodiment of the method of Fig. 3, wherein the method includes coding the target frame based on the prediction Huffman table, which is based on a reference frame and prediction values of the target frame.
Fig. 11 is an exemplary detail diagram illustrating an embodiment of the manner by which a FIFO service can be used for transforming variable-length codes to fixed-length codes.
Fig. 12 is an exemplary flowchart illustrating still another embodiment of the method of Fig. 3, wherein the method includes compensating the target frame by broadening a length of the frequency table.
Fig. 13 is an exemplary top-level block diagram illustrating an embodiment of an imaging device for implementing the method of Fig. 2.
Fig. 14 is an exemplary top-level block diagram, illustrating an alternative embodiment of the imaging device of Fig. 13, wherein the imaging device comprises a memory for storing the video of Fig. 1.
It should be noted that the figures are not drawn to scale and that elements of similar structures or functions are generally represented by like reference numerals for illustrative purposes throughout the figures. It also should be noted that the figures are only intended to facilitate the description of the preferred embodiments. The figures do not illustrate every aspect of the described embodiments and do not limit the scope of the present disclosure.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
Since currently-available systems for compression of a video are complicated, require large memory spaces, may result in damages and have limited application in video imaging, a lossless compression system and method that uses a prediction table, such as a Huffman table, based on a reference frame for coding a target frame can prove desirable and provide a basis for a simpler efficient system. This result can be achieved according to one embodiment disclosed in Fig. 1.
Fig. 1 shows a video 150 consisting of a frame sequence of N frames 151. The video 150 is compressible. As shown in Fig. 1, prediction Huffman tables 152 can be associated with certain frames 151 of the video 150 for compressing one or more target frames 151t of the video 150. The target frames 151t can be frames 151 of the video 150 that follow a reference frame 151r in the frame sequence, and the Huffman tables 152p can be created based upon the reference frame 151r. Fig. 1 shows that a target frame 151t, such as an ith frame 151I, can be coded based on a prediction Huffman table 152p of an (i-1)th frame 151H to form a target frame 151t. Similarly, an (i+1)th frame 151J can be coded based on a Huffman table 152I of the ith frame 151I. There can be one exception for a first frame 151A, that has no preceding frame 151. Fig. 1 shows that a constant prediction Huffman code 152c can be used in coding the first frame 151A. The constant Huffman code 152c can be any value within a range of Huffman codes that can be deducted from any of Huffman tables 152.
Although shown and described as using a Huffman table 152p of the reference frame 151r for coding a single target frame 151t immediately following the reference frame 151r for purposed of illustration only, two or more target frames 151t in the frame sequence after the reference frame 151r can be coded based on the prediction Huffman table 152p. In addition, although shown and described as using Huffman tables 152 for predicting the target frames 152c for purposes of illustration only, other suitable forms of prediction tables can also be used in the present disclosure.
Fig. 2 illustrates an exemplary embodiment of a method 100 for compressing a video and is described with reference to the video 150 of Fig. 1 for purposes of illustration only. The method 100 can compress the video 150 by using the prediction table 152p of the reference frame 151r, such as the frame 151H of the video 150 for coding a target frame 151t, such as the frame 151I (collectively shown in Fig. 1). The target frame 151t is shown in Fig. 1 as being adjacent to the reference frame 151r. As shown in Fig. 2, a Huffman table 152p for the reference frame 151r can be obtained, at 110, for coding a target frame 151t. The Huffman tables 152p for each frame 151, except a last frame 151N, can be used for coding one or more target frames 151t that appear in the frame sequence after the reference frame 151r, such as the (i-1)th frame 151H. Additional detail of constructing the Huffman table 152p for the reference frame 151r will be shown and described below with reference to Fig. 6
At 130, the target frame 151t can be coded based on the prediction Huffman table 152p, i.e., the Huffman table 152p generated for the reference frame 151r, such as the (i-1)th frame 151H, can be used to code for the ith frame 151I of the video 150. Additional detail of coding for target frames 151t will be shown and described in more detail below with reference to Figs. 10. As described above with reference to Fig. 1, any of the frames 151 that appear in the frame sequence after the reference frame 151r can be coded based on the Huffman table 152p. In one embodiment, a compressing speed of the frames 151 can be slower than an imaging speed of the video 150. The frames 151 after the reference frame 151r, according to this embodiment, can be skipped to help ensure the compressing speed, and the Huffman table 152p can be used to code a target frame 151t that does not immediately follow the reference frame 151r.
Although shown and described as coding one target frame 151t based on the prediction Huffman table 152p for illustrative purposes only, more than one frame 151 can be chosen to be coded based on the Huffman table 152p of the reference frame 151r.
In an exemplary embodiment, an already-received video frame 151 can serve as a reference frame 151r for providing prediction values that can be used to construct the prediction Huffman table 152p. The prediction Huffman table 152p can be used for coding one or more target frames 151t appearing after the reference frame 151r. Such dynamic table generation can release a requirement for buffering video frames 151. In addition, the method 100 can lower an implementation difficulty and/or costs, while efficiently ensuring a coding compression ratio.
Advantageously, the method 100 does not require storage of a whole frame of raw image data for the compression. Instead, only a prediction Huffman table 152p is stored at any time for compressing the target frames 151t. Therefore, memory space requirement of a camera (not shown) for purposes of video compression can be greatly lowered, making compression feasible to be implemented with the camera. In addition, the method 100 can use the prediction Huffman table 152p to implement a lossless compression of the target frames 151t. Therefore, a predetermined quality level for the compression can be ensured.
Although shown and described as using the (i-1)th frame 151H as the reference frame 151r for purposes of illustration only, any other frames 151, except the last frame 151N, can be used as the reference frame 151r.
Fig. 3 illustrates an alternative embodiment of the method 100. In Fig. 3, the coding of the target frame 151t, at 130, is shown with additional detail. After obtaining a prediction Huffman table 152p for a reference frame 151r, such as the ith frame 151I (shown in Fig. 1), at 110, a target frame 151t, such as the (i+1)th frame 151J, can be coded at 130 based on the prediction Huffman table 152p.
The prediction Huffman table 152p can be used to a generate Huffman code for each pixel of the target frame 151t. In an exemplary embodiment, the Huffman code is not used alone for coding the target frame 151t. Instead, the Huffman code can be combined with certain information of the target frame 151t, such as absolute difference values for target pixels of the target frame 151t. In a preferred embodiment, in order to code for the target frame 151t based on the Huffman table 152p, a realization process for coding the target frame 151t can be performed, at 132. The codes for the target frame 151t can be constructed based on the prediction Huffman table 152, or the Huffman codes, and the absolute difference values of the target frame 151t. Additional detail regarding the code realization process will be shown and described in greater detail below with reference to Fig. 10.
In real world, scenarios in a video 150 (shown in Fig. 1) can change either slowly and/or abruptly. For purposes of addressing an issue of abrupt scenario changes in the video 150, a frequency table 531 (shown in Fig. 6) can be compensated to enhance an accuracy of the prediction values. The frequency table can be used to create the prediction Huffman table 152p. Additional detail regarding compensating the target frame 151t will be shown and described below with reference to Fig. 7.
Fig. 4 shows an alternative embodiment of the method 100. Turning to Fig. 4, obtaining of a prediction Huffman table 152p based on a reference frame 151r can be conducted, at 110, in the manner discussed above with reference to the method 100 of Fig. 2. In order to generate the prediction Huffman table 152p, prediction values can be determined, at 510, for each pixel of the reference frame 151r. At 530, a frequency table can be generated based on the prediction values for each pixel of the reference frame 151r. At 540, the prediction Huffman table 152p can be constructed based on the frequency table. Additional detail regarding determining the prediction values, at 510, generating the frequency table, at 530, and constructing the prediction Huffman table 152p, at 540, is shown and described in additional detail below with reference to Fig. 6.
For the target frame 151t, coding of the frame 151t can be conducted, at 130, in the manner discussed above with reference to the method 100 of Fig. 2. To code the target frame 151t, absolute difference values for each pixel, i.e. each target pixel, of the target frame 151t can be determined, at 122, as illustrated in Fig. 4. The absolute difference values can be absolute differences for each and every pixels of the target frame 151t. Each of the absolute difference values can be an absolute value of a difference between an actual value of each pixel and a respective prediction value of the pixel. Additional detail regarding how to determine the absolute difference values of the target frame 151t will be shown and described below with reference to Fig. 10. The absolute difference values for the target frame 151t and the prediction Huffman table 152p can be combined to code the target frame 151t, at 124. Addition detail regarding coding the target frames 151t, at 130, based on the combination of the prediction Huffman table 152p and the absolute difference values of the target frame 151 will be shown and described in greater detail below with reference to Fig. 10.
Fig. 5 shows another alterative embodiment of the method 100, wherein the method 100 further comprises a loop 199 for coding multiple frames 151. When one target frame 151t is coded in the manner shown in Fig. 2, the target frame 151t and/or the reference frame 151r can be replaced with other frames 151, at 140. For example, in certain cases, the reference frame 151r can remain as a new reference frame 151r for a new target frame 151t. In other cases, the reference frame 151r can be replaced with another frame 151 following the reference frame 151r. In a preferred embodiment, the new reference frame 151r can be replaced by the target frame 151t that has just compressed for efficiency purposes because absolute difference values of the target frame 151t have already been completed.
At 140, the target frame 151t can be replaced with another frame 151, which can become a new target frame 151t. The new target frame 151t can be any frame 151 appearing after the new reference frame 151r. This process repeats till all target frames 151t of the video 150 are coded.
Although shown and described with reference to Fig. 5 as being repeated by frames 151 immediately following the reference frame 151r and the target frame 151t as newly a selected reference frame 151r and newly selected target frame 151t for purposes of illustration only, any other frames 151 within the video can be selected as the reference frame 151r and/or the target frames 151t.
Turning to Figs. 6 and 7, Fig. 6 shows an embodiment of obtaining a prediction Huffman table 152p based on a reference frame 151r, as part of the method 100. In Fig. 6, at 502, the reference frame 151r can be processed for purposes of better coding and/or better compression. A purpose for processing the reference frame 151r is shown Fig. 7. In Fig. 7, an exemplary typical pixel composition of the reference frame 151r is shown with a partial frame 710 that can be transformed for obtaining better prediction values for target frame 151t. Fig. 7 shows the partial frame 710 has sixteen pixels 701 in four rows 711A, 711B, 711C, 711D and four columns 721A, 721B, 721C, 721D. In the typical pixel composition of the partial frame 710, a pixel with a blue (“B”) color component can be located in the second row 711B and the second column 721B. According to an exemplary embodiment shown and described below with reference to Fig. 8, the B color pixel can be predicted by a preceding pixel value red(“G”) in the same row 711B, a preceding pixel value green(“G”) in the same column 721B, and a pixel value red (“R”) in a preceding row 721A and preceding column 711A. The red-red-green (“GGR”) components can be of little relevance with the B color pixel.
In order to address the relevance issue described above, in Fig. 7, a combination process can be provided, at 750, to transform the partial frame 710 into a new partial frame 730. With the combination process, every two rows of pixels are combined into a single row. For example, the rows 711A, 711B can be combined into a new row 731A, and/or the rows 711C, 711D can be combined into a new row 731B. The new partial frame 730 can have eight columns 741A to 741H, each having two pixels. As shown in Fig. 7, the pixels 701 can be rearranged such that the relevance for the prediction pixels 701 can be enhanced by regrouping pixels 701 with likely relevant color components together. For example, the pixel 701 located in the second row 731B and second column 741B is a G color in the transformed partial frame 730. The available prediction pixels preceding the G color pixel are RRG that are more relevant than those preceding pixels 701, which are GGR, in the partial frame 710.
Although shown and described as being combining every two rows of the frame 710 for purposes of illustration only, other suitable combinations can also be provided for purposes of enhancing the accuracy of the predictions, including but not limited to combining two or more columns and/or combining every three or more rows to form the new frame 730.
Turning to Figs. 6, 8 and 9, in order to generate a prediction Huffman table 152p for the reference frame 151r, such as the (i-1)th frame (shown in Fig. 1), at 110, each and every pixel 701 of the reference frame 151r can be determined with a prediction value Px (shown in Equations 1-5). For purposes of illustrating this disclosure, each pixel 701 of the reference frame 151r can be referred as a selected pixel 701, which can also be referred as a reference pixel for the reference frame 151r. In some exemplary embodiments, part or all pixels 701 of the reference frame 151r can be selected as selected pixels, or reference pixels, for calculating the prediction values Px. At 510, for one selected pixel 701, such as pixel D 805 (as shown in Fig. 8), a prediction value Px can be calculated according to at least one of the preceding pixels adjacent to the selected pixel 701. For purposes of illustrating the present disclosure, three pixels preceding the selected pixel 701 can be referred as a first pixel, a second pixel and a third pixel. In Fig. 8, the preceding adjacent pixels can include the first pixel preceding the selected pixel in a same row with the selected pixel, such as pixel A 801 for pixel D, the second pixel preceding the selected pixel in a same column with the selected pixel 701, such as pixel B 802 for pixel D 805. In addition, the adjacent pixels to the selected pixel 701 can also include the third pixel in the same row with the first pixel and same column with the second pixel, such as pixel C 803 for pixel D 805.
When the selected value is not in a first row or in a first column of the frame 151r, the prediction value of the selected pixel 701, such as the pixel D 805, can be determined with any one of the actual pixel values pixel A, pixel B or pixel C. In the same case, the prediction value of the selected pixel 701 can also be determined via one of the following equations:
Px = a + b - c Equation (1)
Px = a + ( b – c) * (1/2) Equation (2)
Px = a + ( c – b) * (1/2) Equation (3)
Px = b + ( a – c) * (1/2) Equation (4)
Px = ( a + b) * (1/2) Equation (5)
wherein, Px denotes the prediction value of the selected pixel 701, “a” denotes an actual value of the first pixel, pixel A, preceding the selected pixel, “b” denotes an actual value of the second pixel, pixel B, preceding the selected pixel, and “c” denotes an actual value of the third pixel, pixel C, preceding the selected pixel.
In special cases, such as the exemplary special case shown in Fig. 9, for pixels 701 located in a first row 731A and/or in a first column 741A, the pixels 701 can be predicted by a constant value and/or by an actual value of one or more pixels 701 available in a preceding row or column. In Fig. 9, in an exemplary embodiment, pixel A 701A can be a first pixel of a frame 740, which pixel A 701A does not have any preceding pixel 701. Pixel A 701A can have a constant C as its prediction value, such that: PA = C. The constant C can be half of a maximum pixel value, such as five hundred and twelve (512) when the maximum pixel value is one thousand and twenty-four (1024). A prediction value PB1 for pixel B1 can be an actual value of pixel A 701A, a prediction value PC1 for pixel C1 can be an actual value of pixel B1, and a prediction value PD1 for pixel D1 can be an actual value of pixel C1. In a similar manner, for the column 741A, a prediction value PB2 for pixel B2 can be the actual value of pixel A 701A, a prediction value PC2 for pixel C2 can be the actual value of pixel B2, and a prediction value PD2 for pixel D2 can be the actual value of pixel C2.
As an illustrative example, let us assume that pixel A 701A has an actual value of 400 and that pixel B1 has an actual value of 200, and that pixel C1 has an actual value of 300. Then, Pixel A can have a constant prediction value, e.g. five hundred and twelve (512), i.e. PA = 512. The prediction value for pixel B1, PB1, can be the actual value of Pixel A 701A, which is 400, i.e. PB1= 400. The prediction value for pixel C1, PC1, can be the actual value of Pixel B1, which is 200, i.e. PC1= 200. In addition, the prediction value for pixel D1, PD1, can be the actual value of Pixel C1, which is 300, i.e. PD1= 300. Values for PB2, PC2 and PD2 can be determined in a similar manner.
In Fig. 6, a result of the calculation, at 510, is a prediction value 511, for each selected pixel 701, that normally can have a difference with the actual value of the selected pixel 701. Although shown and described as using an immediate preceding pixel for predictions of any pixels 701 for purpose of illustration only, any suitable preceding pixels 701 in the partial frame 710 can be used for the predictions.
At 520, a difference value between the prediction value 511 and the actual value of the selected pixel 701 can be calculated. In order to achieve a positive difference value, an absolute value of the difference value can be taken, at 520. Therefore, a result of the calculation, at 520, can be an absolute difference value of the difference value 521 between the actual value of the selected pixel and the prediction value that is the result of 520. Generally, this result can be determined with the following equation:
diff = |d-Px| Equation (6)
where diff denotes the absolute difference value 521 between the actual value of the selected pixel 701 and the prediction value; d denotes the actual value of the selected pixel 701; and Px denotes the prediction value of the selected pixel 701.
Although shown and described as being a simple difference between the actual value of the selected pixel 701 and the prediction value for purposes of illustration only, any other suitable arithmetic difference values can be provided to reflect the difference under this disclosure.
In exemplary embodiments of the present disclosure, at 530, a frequency table 531 can be constructed in a form of a histogram based on the calculated absolute difference values 521 for each and every pixel 701 of the reference frame 151r. The frequency table 531 can be constructed by counting frequencies for each of the absolute difference values of the pixels 701. In an exemplary embodiment, the counting of the frequencies can be conducted by counting appearances of each difference value. In a typical ten-bit coding system, each difference value can consist of ten bit of ones or zeros. In a preferred embodiment, the counting of frequencies can be conducted by counting the appearance of a highest non-zero value, i.e. a highest one, in each difference value.
The result of counting frequency for each difference value, at 530, can be the frequency table 531 in the form of histogram (not shown). The frequency table 531 can have two sets of information: one set represents a value dataset of the available absolute difference values 521, such as {d1, d2, …, di,…, dn}; and the other set is a frequency dataset represents frequencies corresponding to each of the absolute difference values 521, such as {f1, f2, …, fi,…, fn}. In the preferred embodiments, when a ten-bit coding system is used and the highest non-zero value is counted, the value dataset can be {d1, d2, …, di,…, d10} and the frequency dataset can be {f1, f2, …, fi,…, f10}.
Although shown and described as using ten-bit coding for purposes of illustration only, the method 100 can use extended coding of twelve bit and/or sixteen bit coding systems.
At 540, a prediction Huffman table 152p for the reference frame 151r can be generated, in a form of a binary tree 541, based on the frequency table 531. The binary tree 541 can be created in any ordinary ways of constructing a Huffman table 152p. One way of constructing the Huffman table 152p can include working from bottom up of the binary tree 541, sorting the value dataset by the frequency dataset, making two-lowest elements into leaves, and creating a parent mode with a frequency being a sum of the frequencies of the two lower elements. The Huffman table 152p can carry information of coding values and corresponding bit string for each coding value, which string unambiguously represents the coding value.
The prediction Huffman table 152p, or the binary tree 541, can carry Huffman codes for each of the absolute difference values 521. The Huffman codes are normally variable-length because of differences among the frequencies of each of the absolute difference values 521. The variable-length codes can be further processed with additional detail being shown and described below with reference to Fig. 10.
Fig. 10 shows an embodiment of coding a target frame 151t based on the prediction Huffman table 152p for the method 100, which prediction Huffman table 152p is based on a reference frame 151r and prediction values of the target frame 151t. At 122, absolute difference values, diff of Equation 6, for each selected pixel 701 of the target frame 151t can be determined. The absolute difference values of the target frame 151t can be determined in a same manner as shown and described above for the absolute difference values for the reference frame 151r, at 510, with reference to Fig. 6 and Equations 1-6. For purposes of this disclosure, each selected pixel 701 of the target frame 151r can be referred as a target pixel for the target frame 151t. As shown and described for the absolute difference values at 521, the result of the calculation at 122 can be absolute difference values 811 for each pixel of the target frame 151t.
At 820, a Huffman code for each pixel of the target frame 151t can be generated based on the prediction Huffman table 152p of the reference frame 151r and the absolute difference values 811. In a preferred embodiment, the Huffman code 821 for a selected pixel 701 of the target frame 151t can be generated by combining the Huffman code 541 represented in the prediction Huffman table 152p for the selected pixel 701 and the absolute difference value 811 obtained from the calculation process, at 122. Generally, the combined Huffman codes have variable-lengths because both of the Huffman codes generated from the prediction Huffman table 541 of the reference frame 151r and the absolute difference values 811 of the selected pixels 701 can vary in length.
Although shown and described with reference to Fig. 10 as using simple combing the Huffman code and the absolute difference value 811 for generating the combined Huffman code for purposes of illustration only, any other suitable combinations of the Huffman code and the absolute difference value 811 can be applied under the present disclosure.
The combined variable-length Huffman codes 821 can be difficult to process. Therefore, a transformation from the variable-length Huffman codes to fixed-length codes can be provided under the present disclosure.
In Fig. 10, at 830, the transformation from the variable-length code 821 to fixed-length code 831 comprises using a first-in-first-out service (“FIFO”) 850, at 812, to convert the variable-length codes. At 812, the variable length codes can be streamlined into the FIFO service 850.
An exemplary embodiment of the FIFO service 850 is shown in Fig. 11, which service can be used for transforming variable-length codes to fixed-length codes. In Fig. 11, a series of input codes, in_code 1 to in_code 8 and more, can be input into the FIFO service 850. The input codes, in_code 1 to in_code 8 can represent the variable-length codes 821. In Fig. 11, after the FIFO service 850, the input codes are transformed into output codes, out_code 1 to out_code 4 and more, which are in fixed-length of sixteen bits.
Although shown and described in Fig. 11 as using the FIFO service 850 for transforming the variable-length codes for purposes of illustration only, other forms of services can be applied to transform the variable-length codes, including but not limited to padding zeros before and/or after each code to generate a fixed-length code.
In a course of the transformation process 830, certain bytes of the target frame 151t may coincide with special characters defined under the JPEG protocol. In such case, those bytes can be marked with identification characters, at 814. Such addition of identification characters can comprise a hexadecimal zero, 0x00. The identification character can be added immediately after those bytes, or be added immediately before each appearance of the special bytes. In an exemplary embodiment, a hexadecimal zero, 0x00, can be added after each appearance of hexadecimal two hundred and fifty-five, 0XFF.
Returning to Fig. 10, the output of the transformation 830 can be a fixed-length code 831. Although shown and described with reference to Fig. 10 as using the identification character 0x00 to mark each appearance of special characters for purposes of illustration only, other suitable forms of identification approaches can also be applied under the present disclosure.
Fig. 12 shows an embodiment of a compensation approach, at 134, for coding a target frame 151t, by broadening a length of a frequency table that is used to generate a prediction Huffman table 152p of a reference frame 151r. In some cases, scenarios being video recorded can change slowly, and relevance between frames 151 can be relatively greater. In such case, prediction values for each target frame 151t can be more accurate. However, in some other cases, the scenarios being videoed can change rapidly or abruptly, and the relevance between frames 151 can be relatively less. In such cases, significant differences between a frequency table 531 (shown in Fig. 6) of the reference frame 151r and the frequency table 531 of the target frame 151t can exist. To address this issue, a length of the frequency table 531 of the reference frame 151r can be broadened with extra numbers, at 910.
In one exemplary embodiment, such broadening can be realized by replacing zero values with non-zero values, such as ones in the frequency table 531 of the reference frame 151r, at 912. In a preferred embodiment, zeros immediately after the non-zero values can be replaced by non-zeros. In an alternative embodiment, two zeros immediately after the non-zero values can be replaced by non-zeros, e.g. by ones. For example, assume the frequency table 531 is: 15, 15, 15, 15, 15, 15, 15, 0, 0, 0, 0, 0. By replacing the two zeros immediately after the last 15 with ones, a new frequency table can be obtained as: 15, 15, 15, 15, 15, 15, 15, 1, 1, 0, 0, 0.
Fig. 13 shows an exemplary embodiment of an imaging device 200 for implementing the method of Fig. 2. The imaging device 200 can be a video and/or still camera. The imaging device 200 can comprise a lens 951, an image sensor 954, and/or a processor 955. The lens 951 can receive light from a scene 958. The lens 951 can be configured to focus the received light onto the image sensor 954, which generates images of the scene 958. In case of the video camera, the images generated can be the frames 151 within the video 150 (collectively shown in Fig. 1). The processor 955 can then process the frames for compressing the video in accordance with the method 100 shown and described in more detail above with reference to any one of Figs. 2-5, 6, 10, and 12.
An exemplary embodiment of the lens 951 can be a digital single-lens reflex (“DSLR”) lens; however, the lens 951 can comprise any conventional type of lens. Exemplary suitable lenses as the lens 951 can include one or more of a pin-hole lens, a biological lens, a simple convex glass lens, or the like, without limitation. Additionally and/or alternatively, the lens 951 can be configured with certain imaging properties such as one or more of a macro lens, zoom lens, telephoto lens, fisheye lens, wide-angle lens, or the like, without limitation.
The image sensor 954 can receive the light from the lens 951 and form an image based on the light received. The image sensor 954 can be a charge coupled sensor (“CCD”), complementary metal-oxide-semiconductor (“CMOS”) sensor, N-type metal-oxide-semiconductor (“NMOS”) sensor, and hybrids/variants thereof, an electro-optical sensor, a thermal/infrared sensor, a color or monochrome sensor, a multi-spectral imaging sensor, a spectrophotometer, a spectrometer, a thermometer, and/or an illuminometer.
The processor 955 can comprise any commercially-available graphic chip that chips can be used in currently available video equipment. The processor 955 can also be a custom-designed graphic chips specially produced for the imaging device 200. The processor 955 can also comprise additional chips for accelerating rendering of 2D graphics and/or 3D scenes, MPEG-2/MPEG-4 decoding, TV output, or an ability to connect multiple displays. In one of the embodiments, the processor 955 can operate under a VGA standard. Additionally and/or alternatively, the processor 955 can include one or more general purpose microprocessors (for example, single or multi-core processors), application-specific integrated circuits, application-specific instruction-set processors, graphics processing units, physics processing units, digital signal processing units, coprocessors, network processing units, audio processing units, encryption processing units, and the like. The processor 955 can be configured to perform any of the methods described herein, including but not limited to, a variety of operations relating to image/frame processing. In some embodiments, the processor 955 can include specialized hardware for processing specific operations relating to imaging processing.
The processor 955 can usually be operably connected to the image sensor 954. The connection can be via a wired and/or wireless link. The processor 955 can process a non-coded image/frame received by the image sensor 955 and can code the non-coded image/frame image automatically in accordance with the method 200 disclosed herein. The processor 955 can perform any one or more of the processes of method 100 shown and described with reference to any one of Figs. 2-5, 6, 10, and 12.
Turning to Fig. 14, an imaging device 200 can also contain a memory 957. The processor 955 of the imaging device 200 can be operatively connected to the memory 957. The memory 957 optionally can be provided for storing non-coded images (or frames) from the image sensor 954 and/or coded/compressed images (or frames) from the processor 955. The memory 957 can be linked to the processor 955 via wired or wireless connections. The memory 957 can also be linked (not shown) to any other components of the imaging device 200, such as the image sensor 954.
Exemplary examples of the memory 957 can be a random access memory (“RAM”), static RAM, dynamic RAM, read-only memory (“ROM”), programmable ROM, erasable programmable ROM, electrically erasable programmable ROM, flash memory, secure digital (“SD”) card, and the like. In a preferred embodiment, as described above with reference to Fig. 13, the imaging device 200 can be a video camera. In such a case, the memory 957 can be used to store a compressed video 150 (shown in Fig. 1).
The described embodiments are susceptible to various modifications and alternative forms, and specific examples thereof have been shown by way of example in the drawings and are herein described in detail. It should be understood, however, that the described embodiments are not to be limited to the particular forms or methods disclosed, but to the contrary, the present disclosure is to cover all modifications, equivalents, and alternatives.

Claims (68)

  1. A method for processing a video, comprising:
    obtaining a prediction table for a received reference frame of the video; and
    coding one or more target frames of the video based on the prediction table.
  2. The method of claim 1, wherein said obtaining the prediction table comprises obtaining a prediction Huffman table.
  3. The method of claim 1 or claim 2, wherein said coding the one or more target frames comprises choosing any frame of the video appearing after the reference frame.
  4. The method of claim 2 or claim 3, wherein said obtaining the prediction Huffman table comprises generating a Huffman table of a difference value of each reference pixel of the reference frame.
  5. The method of claim 4, wherein said generating the Huffman table comprises determining a prediction value for each of the reference pixels.
  6. The method of claim 5, further comprising determining the prediction value of the reference pixel based on respective pixel values of one or more pixels adjacent to the reference pixel.
  7. The method of claim 6, wherein the prediction value of the reference pixel is a constant value when the reference pixel is located in a first row and a first column of the reference frame.
  8. The method of claim 7, wherein the constant value is half of a maximum value of a coding value.
  9. The method of any one of claims 6-8, further comprising selecting the pixels adjacent to the reference pixel from a group of pixels consisting of a first pixel preceding the reference pixel in a same row, a second pixel preceding the reference pixel in a same column, a third pixel adjacent to the first and second pixels, and, when the reference pixel is neither in the first row nor in the first column of the frame, any arithmetic combination of the first, second and third pixels.
  10. The method of claim 9, further comprising determining the prediction value of the reference pixel by the first pixel when the reference pixel is a pixel of the first row.
  11. The method of claim 10, further comprising determining the prediction value of the reference pixel by the second pixel when the reference pixel is a pixel of the first column.
  12. The method of any one of claims 6-11, further comprising determining the difference value based upon a difference of an actual value of the reference pixel and the prediction value of the reference pixel.
  13. The method of claim 12, wherein said generating the prediction Huffman table comprises generating a frequency table of an absolute value of the difference value.
  14. The method of claim 13, wherein said generating the frequency table comprises determining statistics of frequencies of the absolute values to form the frequency table.
  15. The method of claim 14, wherein said generating the prediction Huffman table comprises generating the prediction Huffman table based on the frequency table.
  16. The method of any one of claims 1-3, wherein said coding the one or more target frames further comprises determining an absolute difference value for each target pixel of the one or more target frames.
  17. The method of claim 16, further comprising determining the absolute difference value based upon an absolute value of a difference between an actual value of each target pixel and a prediction value of the target pixel.
  18. The method of claim 17, further comprising determining the prediction value for each target pixel of the one or more target frames.
  19. The method of claim 18, further comprising determining the prediction value of the target pixel based on respective pixel values of one or more pixels adjacent to the target pixel.
  20. The method of claim 19, wherein the prediction value of the target pixel is a constant value when the target pixel is located in a first row and a first column of one of the one or more target frames.
  21. The method of claim 20, wherein the constant value is half of a maximum value of a coding value.
  22. The method of any one of claims 19-21, further comprising selecting the pixels adjacent to the target pixel from a group of pixels consisting of a first pixel preceding the target pixel in a same row, a second pixel preceding the target pixel in a same column, a third pixel adjacent to the first and second pixels, and, when the target pixel is neither in the first row nor in the first column of the frame, any arithmetic combination of the first, second and third pixels.
  23. The method of any one of claims 19-21, further comprising determining the prediction value of the target pixel by the first pixel when the target pixel is a pixel of the first row.
  24. The method of any one of claims 19-21, further comprising determining the prediction value of the target pixel by the second pixel when the target pixel is a pixel of the first column.
  25. The method of claim 17, wherein said coding the one or more target frames comprises coding the frames based on the prediction Huffman table of the reference frame and the absolute difference values of the one or more target frames to generate variable-length codes.
  26. The method of claim 25, wherein said coding the one or more target frames comprises transforming the variable-length codes to fixed-length codes.
  27. The method of claim 26, wherein said transforming the variable-length codes comprises converting the variable-length codes to the fixed-length codes using a first-in-first-out service to generate the fixed-length codes.
  28. The method of claim 27, wherein said inserting the preselected hexadecimal character comprises inserting a hexadecimal zero after every instance of hexadecimal two hundred fifty-five.
  29. The method of any one of claims 13-15, further comprising compensating the frequency table to generate a compensated frequency table.
  30. The method of claim 29, wherein said compensating the frequency table comprises widening a coding width of the frequency table.
  31. The method of claim 30, wherein said compensating the frequency table comprises replacing at least one zero-value each with a non-zero value.
  32. The method of claim 31, wherein said replacing comprises replacing two or more zeros each with a one.
  33. A video processing system configured to perform the video processing process in accordance with any one of claims 1-32.
  34. A computer program product comprising instructions for processing a video in accordance with any one of claims 1-32.
  35. A imaging device, comprising:
    a sensor for acquiring a sequence of image frames of a video; and
    a processor for obtaining a prediction table for a received reference frame selected from the sequence of image frames and to code one or more target frames of the video based on the prediction table.
  36. The imaging device of claim 35, wherein said prediction table is a prediction Huffman table.
  37. The imaging device of claim 35 or claim 36, wherein said one or more target frames comprise any frame of the video appearing after the reference frame.
  38. The imaging device of claim 36 or claim 37, wherein said prediction Huffman table is a Huffman table of a difference value for each reference pixel of the reference frame.
  39. The imaging device of claim 38, wherein said Huffman table is generated based upon a difference value of each of the reference pixels.
  40. The imaging device of claim 39, wherein said prediction value of the reference pixel is determined based on respective pixel values of one or more pixels adjacent to the reference pixel.
  41. The imaging device of claim 40, wherein the prediction value of the reference pixel is a constant value when the reference pixel is located at a first row and a first column of the reference frame.
  42. The imaging device of claim 41, wherein the constant value is half of a maximum value of a coding value.
  43. The imaging device of any one of claims 40-42, wherein the pixels adjacent to the reference pixel are selected from a group of pixels consisting of a first pixel preceding the reference pixel in a same row, a second pixel preceding the reference pixel in a same column, a third pixel adjacent to the first and second pixels, and, when the reference pixel is neither in the first row nor in the first column of the frame, any arithmetic combination of the first, second and third pixels.
  44. The imaging device of claim 43, wherein said prediction value of the reference pixel is determined by the first pixel when the reference pixel is a pixel of the first row.
  45. The imaging device of claim 44, wherein said prediction value of the reference pixel is determined by the second pixel when the reference pixel is a pixel of the first column.
  46. The imaging device of any one of claims 40-45, wherein the difference value is determined based upon a difference of an actual value of the reference pixel and the prediction value of the reference pixel.
  47. The imaging device of claim 46, wherein said prediction Huffman table is generated with a frequency table of an absolute value of the difference value.
  48. The imaging device of claim 47, wherein said frequency table is determined upon statistics of frequencies of the absolute values to form the frequency table.
  49. The imaging device of claim 48, wherein said prediction Huffman table is generated based on the frequency table.
  50. The imaging device of claim 35 or claim 36, wherein the processor is configured to determine an absolute difference value for each target pixel of the one or more target frames.
  51. The imaging device of claim 50, wherein the processor is configured to determine the absolute difference value based upon an absolute value of a difference between an actual value of each target pixel and a prediction value of the target pixel.
  52. The imaging device of claim 51, wherein the processor is configured to determine the prediction value for each target pixel of the one or more target frames.
  53. The imaging device of claim 52, wherein the prediction value of the target pixel is determined based on respective pixel values of one or more pixels adjacent to the target pixel.
  54. The imaging device of claim 53, wherein the prediction value of the target pixel is a constant value when the target pixel is located in a first row and a first column of one of the one or more target frames.
  55. The imaging device of claim 54, wherein the constant value is half of a maximum value of a coding value.
  56. The imaging device any one of claims 52-55, wherein the pixels adjacent to the target pixel is selected from a group of pixels consisting of a first pixel preceding the target pixel in a same row, a second pixel preceding the target pixel in a same column, a third pixel adjacent to the first and second pixels, and, when the target pixel is neither in the first row nor in the first column of the frame, any arithmetic combination of the first, second and third pixels.
  57. The imaging device of any one of claims 52-55, wherein the processor is configured to determine the prediction value of the target pixel by the first pixel when the target pixel is a pixel of the first row.
  58. The imaging device of any one of claims 52-55, wherein the processor is configured to determine the prediction value of the target pixel by the second pixel when the target pixel is a pixel of the first column.
  59. The imaging device of claim 58, wherein said one or more target frames are coded based on the prediction Huffman table of the reference frame and the absolute difference values of the one or more target frames to generate variable-length codes.
  60. The imaging device of claim 59, wherein the processor is configured to transform the variable-length codes to fixed-length codes.
  61. The imaging device of claim 60, wherein said variable-length codes are transformed from the variable-length codes to the fixed-length codes using a first-in-first-out service.
  62. The imaging device of claim 61, wherein the special character is a hexadecimal two hundred fifty-five.
  63. The imaging device of any one of claims 49-51, wherein the processor is configured to compensate the frequency table to generate a compensated frequency table.
  64. The imaging device of claim 63, wherein said compensated frequency table is compensated by widening a coding width of the frequency table.
  65. The imaging device of claim 64, wherein said coding width is widened by replacing at least one zero-value each with a non-zero value.
  66. The imaging device of claim 65, wherein said non-zero value is a one.
  67. The imaging device of any one of claims 35-66, further comprising a memory for storing the one or more frames of the video after coding.
  68. The imaging device of claim 67, wherein the memory is configured to store the prediction table of the received reference frame.
PCT/CN2015/080230 2015-05-29 2015-05-29 System and method for video processing WO2016191915A1 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
CN201580080404.9A CN107925771B (en) 2015-05-29 2015-05-29 Method, system, storage medium and imaging device for video processing
JP2017552077A JP6607956B2 (en) 2015-05-29 2015-05-29 Video processing method and video processing system
PCT/CN2015/080230 WO2016191915A1 (en) 2015-05-29 2015-05-29 System and method for video processing
EP15874401.1A EP3152907B1 (en) 2015-05-29 2015-05-29 System and method for video processing
US15/824,581 US10893300B2 (en) 2015-05-29 2017-11-28 System and method for video processing

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2015/080230 WO2016191915A1 (en) 2015-05-29 2015-05-29 System and method for video processing

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US15/824,581 Continuation US10893300B2 (en) 2015-05-29 2017-11-28 System and method for video processing

Publications (1)

Publication Number Publication Date
WO2016191915A1 true WO2016191915A1 (en) 2016-12-08

Family

ID=57439881

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/080230 WO2016191915A1 (en) 2015-05-29 2015-05-29 System and method for video processing

Country Status (5)

Country Link
US (1) US10893300B2 (en)
EP (1) EP3152907B1 (en)
JP (1) JP6607956B2 (en)
CN (1) CN107925771B (en)
WO (1) WO2016191915A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102025494B1 (en) * 2018-05-30 2019-09-25 주식회사 아이닉스 Apparatus based on visually lossless compression method for compression of bayer image and method therefor

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5329313A (en) 1992-04-01 1994-07-12 Intel Corporation Method and apparatus for real time compression and decompression of a digital motion video signal using a fixed Huffman table
US20040202251A1 (en) * 2003-04-09 2004-10-14 Savekar Santosh Faster block processing structure for MPEG decoders
JP2007221201A (en) * 2006-02-14 2007-08-30 Victor Co Of Japan Ltd Moving image coding apparatus and program
US20130188885A1 (en) 2012-01-25 2013-07-25 Kabushiki Kaisha Toshiba Apparatus and method for coding image, and non-transitory computer readable medium thereof
US20130322519A1 (en) * 2012-05-29 2013-12-05 Core Logic Inc. Video processing method using adaptive weighted prediction

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH06153180A (en) * 1992-09-16 1994-05-31 Fujitsu Ltd Picture data coding method and device
JPH07321666A (en) * 1994-05-27 1995-12-08 Fuji Photo Film Co Ltd Code length converter
EP0750428B1 (en) * 1995-06-22 2004-03-31 Canon Kabushiki Kaisha Image processing apparatus and method
US6681049B1 (en) * 1999-05-20 2004-01-20 Matsushita Electric Industrial Co., Ltd. Image encoding method and apparatus
US6882750B2 (en) * 2000-05-02 2005-04-19 Zaxel Systems, Inc. Fast loss less image compression system based on neighborhood comparisons
FI114527B (en) * 2002-01-23 2004-10-29 Nokia Corp Grouping of picture frames in video encoding
JP4090862B2 (en) * 2002-04-26 2008-05-28 松下電器産業株式会社 Variable length encoding method and variable length decoding method
CN100420308C (en) * 2002-04-26 2008-09-17 株式会社Ntt都科摩 Image encoding device, image decoding device, image encoding method, image decoding method, image encoding program, and image decoding program
JP2005012495A (en) * 2003-06-19 2005-01-13 Olympus Corp Image processing apparatus, image processing method, and image processing program
CN1784008B (en) 2004-12-02 2010-04-28 北京凯诚高清电子技术有限公司 Encoding method and decoding method for high sharpness video super strong compression
FR2879878B1 (en) * 2004-12-22 2007-05-25 Thales Sa COMPATIBLE SELECTIVE ENCRYPTION METHOD FOR VIDEO STREAM
US7650039B2 (en) * 2005-03-03 2010-01-19 Canon Kabushiki Kaisha Image encoding apparatus, image decoding apparatus, control method therefor, computer program, and computer-readable storage medium
JP4682102B2 (en) * 2005-09-02 2011-05-11 キヤノン株式会社 Image coding apparatus and image coding method
BRPI0619193A2 (en) * 2005-11-30 2011-09-20 Toshiba Kk Toshiba Corp image encoding / image decoding method, image encoding / image decoding apparatus
EP2136564A1 (en) * 2007-01-09 2009-12-23 Kabushiki Kaisha Toshiba Image encoding and decoding method and device
WO2010035733A1 (en) * 2008-09-24 2010-04-01 ソニー株式会社 Image processing device and method
JP2011199432A (en) 2010-03-17 2011-10-06 Seiko Epson Corp Image processing device and program
KR20110123651A (en) * 2010-05-07 2011-11-15 한국전자통신연구원 Apparatus and method for image coding and decoding using skip coding
MX2013000516A (en) * 2010-07-15 2013-09-02 Toshiba Kk Image encoding method and image decoding method.
US20120121018A1 (en) * 2010-11-17 2012-05-17 Lsi Corporation Generating Single-Slice Pictures Using Paralellel Processors
CN102256137B (en) 2011-07-13 2013-07-17 西安电子科技大学 Context-prediction-based polar light image lossless coding method
ITTO20120985A1 (en) 2012-11-14 2014-05-15 St Microelectronics Srl PROCEDURES FOR CODING AND DECODING OF DIGITAL VIDEO FRAME FLOWS, RELATED SYSTEMS AND IT PRODUCTS
CN104253993B (en) 2013-06-28 2018-01-12 炬芯(珠海)科技有限公司 A kind of multimedia data processing method, circuit and device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5329313A (en) 1992-04-01 1994-07-12 Intel Corporation Method and apparatus for real time compression and decompression of a digital motion video signal using a fixed Huffman table
US20040202251A1 (en) * 2003-04-09 2004-10-14 Savekar Santosh Faster block processing structure for MPEG decoders
JP2007221201A (en) * 2006-02-14 2007-08-30 Victor Co Of Japan Ltd Moving image coding apparatus and program
US20130188885A1 (en) 2012-01-25 2013-07-25 Kabushiki Kaisha Toshiba Apparatus and method for coding image, and non-transitory computer readable medium thereof
US20130322519A1 (en) * 2012-05-29 2013-12-05 Core Logic Inc. Video processing method using adaptive weighted prediction

Also Published As

Publication number Publication date
JP6607956B2 (en) 2019-11-20
CN107925771B (en) 2022-01-11
CN107925771A (en) 2018-04-17
US10893300B2 (en) 2021-01-12
JP2018513634A (en) 2018-05-24
US20180091828A1 (en) 2018-03-29
EP3152907A1 (en) 2017-04-12
EP3152907A4 (en) 2017-04-26
EP3152907B1 (en) 2021-01-06

Similar Documents

Publication Publication Date Title
US6614483B1 (en) Apparatus and method for compressing image data received from image sensor having bayer pattern
US8098941B2 (en) Method and apparatus for parallelization of image compression encoders
KR100261072B1 (en) Digital signal processing system
JP5530198B2 (en) Image encoding method, decoding method, and apparatus
WO2014163240A1 (en) Method and apparatus for processing video
WO2007010901A1 (en) Dynamic image encoding device, dynamic image decoding device, and code string format
WO2017043769A1 (en) Encoding device, decoding device, and encoding method and decoding method thereof
US20040135903A1 (en) In-stream lossless compression of digital image sensor data
US11445160B2 (en) Image processing device and method for operating image processing device
JP3940672B2 (en) Image processing apparatus and image processing method
WO2016191915A1 (en) System and method for video processing
US20040201714A1 (en) Digital camera with low memory usage
WO2019212230A1 (en) Method and apparatus for decoding image by using transform according to block size in image coding system
KR100207705B1 (en) Apparatus and method of addressing for dct block and raster scan
KR950006768B1 (en) Laster format converter circuit
WO2019151808A1 (en) Electronic device for compressing image by using compression attribute generated in image acquisition procedure using image sensor, and operating method thereof
EP4136610A1 (en) Hdr tone mapping based on creative intent metadata and ambient light
KR960016577A (en) Image signal processing method and image signal processing apparatus
JPH0496484A (en) Digital video signal reproducing device
WO2022114927A1 (en) Device and method for ai encoding and ai decoding of image
WO2021045521A1 (en) Method and device for encoding/decoding image using sub-picture, and bit stream transmission method
WO2021107651A1 (en) Parallel forensic marking device and method
WO2020116213A1 (en) Reception device and transmission device
WO2020130194A1 (en) Video transmitting device
JP2002344947A (en) Image processing system, compression coding method for moving picture, decoding method for moving picture and their program

Legal Events

Date Code Title Description
REEP Request for entry into the european phase

Ref document number: 2015874401

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2015874401

Country of ref document: EP

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15874401

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2017552077

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE