TW201540052A - Method and device for encoding a high-dynamic range image into a bitstream and/or decoding a bitstream representing a high-dynamic range image - Google Patents

Method and device for encoding a high-dynamic range image into a bitstream and/or decoding a bitstream representing a high-dynamic range image Download PDF

Info

Publication number
TW201540052A
TW201540052A TW103145096A TW103145096A TW201540052A TW 201540052 A TW201540052 A TW 201540052A TW 103145096 A TW103145096 A TW 103145096A TW 103145096 A TW103145096 A TW 103145096A TW 201540052 A TW201540052 A TW 201540052A
Authority
TW
Taiwan
Prior art keywords
image
decoded
bit stream
illumination
encoding
Prior art date
Application number
TW103145096A
Other languages
Chinese (zh)
Inventor
Yannick Olivier
Sebastien Lasserre
Fabrice Leleannec
David Touze
Original Assignee
Thomson Licensing
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Thomson Licensing filed Critical Thomson Licensing
Publication of TW201540052A publication Critical patent/TW201540052A/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/14Digital output to display device ; Cooperation and interconnection of the display device with other functional units
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09GARRANGEMENTS OR CIRCUITS FOR CONTROL OF INDICATING DEVICES USING STATIC MEANS TO PRESENT VARIABLE INFORMATION
    • G09G2320/00Control of display operating conditions
    • G09G2320/06Adjustment of display parameters
    • G09G2320/0626Adjustment of display parameters for control of overall brightness
    • G09G2320/0646Modulation of illumination source brightness and image signal correlated to each other
    • GPHYSICS
    • G09EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09GARRANGEMENTS OR CIRCUITS FOR CONTROL OF INDICATING DEVICES USING STATIC MEANS TO PRESENT VARIABLE INFORMATION
    • G09G2340/00Aspects of display data processing
    • G09G2340/02Handling of images in compressed format, e.g. JPEG, MPEG

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

The present invention generally relates to a method for encoding an image into a bitstream. The method is characterized in that it comprises: encoding (1100) into the bitstream an illumination map determined (1000) from the image; and encoding (1200) into the bitstream a signalization data indicating that the bitstream comprises the illumination map. The invention relates also a method and device for decoding a bitstream and also the bitstream itself.

Description

將高動態範圍影像編碼成位元流及/或將表示高動態範圍影 像之位元流解碼之方法及裝置 Encode high dynamic range images into bitstreams and/or will represent high dynamic range shadows Method and device for decoding bit stream

本發明一般係關於影像/視訊編碼及解碼,尤其本發明的技術領域相關一影像的編碼,該影像的像素值屬於高動態範圍,及相關將一位元流解碼,該位元流表示一高動態範圍影像。 The present invention generally relates to image/video encoding and decoding, and more particularly to encoding of an image related to the technical field of the present invention, the pixel value of the image belongs to a high dynamic range, and the associated bit stream is decoded, and the bit stream represents a high Dynamic range image.

本段落希望向讀者介紹此技藝的各種不同面向,係相關本發明以下所說明及/或主張的各種不同面向。咸信本討論有助於提供讀者背景資訊,對本發明的不同方面有較佳瞭解。因此,應瞭解到,此等陳述將依此觀點加以閱讀,且非作為先前技術的認可。 This paragraph is intended to introduce the reader to various aspects of the subject matter, which are various aspects of the invention described and/or claimed below. The discussion of this letter helps to provide readers with background information and a better understanding of the different aspects of the invention. Therefore, it should be understood that such statements will be read in this light and are not recognized as prior art.

低動態範圍影像(LDR影像)係以有限位元數(通常為8或10)表示其亮度值的影像,此有限表示不容許正確呈現小的信號變化,尤其是在暗淡及明亮的亮度範圍中。在高動態範圍影像(HDR影像)中,延伸信號表示為使該信號在其整個範圍中維持高準確度。在HDR影像中,像素值通常以浮點格式表示(用於各分量為32位元或16位元,稱為浮點或半浮點),最通用格式係開放EXR(openEXR)半浮點格式(每RGB分量16位元,即每像素48位元),或以具有長表示的整數表示,通常至少16位元。 A low dynamic range image (LDR image) is an image whose brightness value is represented by a finite number of bits (usually 8 or 10). This finite representation does not allow correct small signal variations, especially in dim and bright brightness ranges. . In high dynamic range images (HDR images), the extended signal is represented to maintain the signal with high accuracy throughout its range. In HDR images, pixel values are usually expressed in floating point format (for each component is 32 or 16 bits, called floating point or half floating point), and the most common format is open EXR (openEXR) half floating point format. (16 bits per RGB component, ie 48 bits per pixel), or expressed as an integer with a long representation, usually at least 16 bits.

將一HDR影像編碼的典型措施係縮小該影像的動態範圍,為要藉由初始配置用於LDR影像編碼的傳統編碼方案將該影像編碼。 A typical measure of encoding an HDR image is to reduce the dynamic range of the image, which is to be encoded by a conventional encoding scheme that is initially configured for LDR image encoding.

根據第一措施,將輸入HDR影像應用一色調映射運算子,接著藉由一傳統8-10位元深度編碼方案如JPEG/JPEG200或MPEG-2、H.264/AVC,將色調映射後的影像編碼以用於視訊("一般視聽服務之先進視訊編碼(Advanced video coding for generic audiovisual Services)”,系列H:視聽及多媒體系統,ITU-T H.264建議書,ITU電訊 標準化扇區,2012年1月)。接著,將解碼影像應用一反色調映射運算子,及在輸入影像與解碼且反色調映射後的影像之間計算一殘餘影像。最後,將該殘餘影像藉由一第二傳統8-10位元深度編碼器方案加以編碼。 According to the first measure, the input HDR image is applied with a tone mapping operator, and then the tone mapped image is imaged by a conventional 8-10 bit depth coding scheme such as JPEG/JPEG200 or MPEG-2, H.264/AVC. Coding for video ("Advanced video coding for generic audiovisual services"), Series H: Audiovisual and multimedia systems, ITU-T Recommendation H.264, ITU Telecommunications Standardized Sector, January 2012). Next, an inverse tone mapping operator is applied to the decoded image, and a residual image is calculated between the input image and the decoded and inverse tone mapped image. Finally, the residual image is encoded by a second conventional 8-10 bit depth encoder scheme.

就可藉由一傳統裝置將一低動態範圍影像解碼及顯示的意義來說,此第一措施係反向相容的。 This first measure is backward compatible in the sense that a low dynamic range image can be decoded and displayed by a conventional device.

然而,此第一措施使用二編碼方案,並使輸入影像的動態範圍限制到傳統編碼方案動態範圍的兩倍(16-20位元),此外,此類措施有時導致低動態範圍影像與輸入HDR影像具有較弱相關性,此導致該影像的低編碼效能。 However, this first measure uses a two-coding scheme and limits the dynamic range of the input image to twice the dynamic range of the traditional encoding scheme (16-20 bits). In addition, such measures sometimes result in low dynamic range images and inputs. HDR images have a weak correlation, which results in low coding performance of the image.

根據第二措施,從輸入HDR影像中判定出一照明圖,接著從該影像及該照明圖中得出一殘餘影像,並將該照明圖及該殘餘影像兩者直接編碼。 According to the second measure, an illumination image is determined from the input HDR image, and then a residual image is obtained from the image and the illumination image, and the illumination image and the residual image are directly encoded.

此特定用於一輸入HDR影像編碼的措施與無法將高動態範圍影像解碼及/或顯示的傳統裝置並非反向相容。 This particular measure for an input HDR image encoding is not backward compatible with conventional devices that are unable to decode and/or display high dynamic range images.

此外,此措施不能使用在一般通訊基礎結構中,其中在二遠端裝置之間傳送一HDR影像,原因是此一基礎結構的傳統傳送構件不適合用以攜帶照明圖。 Moreover, this measure cannot be used in a general communication infrastructure in which an HDR image is transmitted between two remote devices because the conventional transfer member of the infrastructure is not suitable for carrying the illumination map.

為補救先前技術的一些缺點,本發明提出一種將一影像編碼成一位元流的方法,包括以下步驟:- 將判定自該影像的一照明圖編碼到該位元流中;及- 將一信號化資料編碼到該位元流中,該信號化資料指示該位元流包括該照明圖。 To remedy some of the shortcomings of the prior art, the present invention provides a method of encoding an image into a bit stream, comprising the steps of: - encoding a lighting pattern from the image into the bit stream; and - placing a signal The data is encoded into the bit stream, the signalized data indicating that the bit stream includes the illumination map.

根據該方法的一實施例,該方法尚包括以下步驟:- 將判定自該影像及該照明圖的一殘餘影像編碼到該位元流中。 According to an embodiment of the method, the method further comprises the step of: - encoding a residual image from the image and the illumination map into the bitstream.

根據一實施例,該照明圖係編碼作為一輔助圖像,其語法符合H264/AVC或HEVC標準。 According to an embodiment, the illumination pattern is encoded as an auxiliary image whose syntax conforms to the H264/AVC or HEVC standard.

除了所謂的"主要編碼圖像”外,亦已將輔助圖像定義在H264/AVC或HEVC標準中,主要編碼圖像實際上對應到內容的主串流(主視訊)。輔助圖像通常能用以傳送額外影像資訊如alpha(α)合成、色度增強 資訊或用於3D應用的深度資訊。 In addition to the so-called "primary coded image", the auxiliary image has also been defined in the H264/AVC or HEVC standard, and the primary encoded image actually corresponds to the main stream of the content (main video). Auxiliary images can often be used to transmit additional image information such as alpha(alpha) synthesis, chroma enhancement Information or in-depth information for 3D applications.

根據一實施例,該殘餘影像係編碼作為一主要圖像,其語法符合H264/AVC或HEVC標準。 According to an embodiment, the residual image is encoded as a primary image whose syntax conforms to the H264/AVC or HEVC standard.

這容許取得表示一HDR影像的一位元流,其完全適用於H264/AVC或HEVC標準:輔助資料(即照明圖)係根據主要圖像編碼順序的相同順序來傳送。在顯示前發生的輔助資料解碼方法符合HEVC規格,因此係按其已規定格式照樣使用。 This allows a one-bit stream representing an HDR image to be obtained, which is fully applicable to the H264/AVC or HEVC standard: auxiliary material (i.e., illumination map) is transmitted in the same order of the main image encoding order. The auxiliary data decoding method that occurs before the display conforms to the HEVC specification and is therefore used in its prescribed format.

根據一實施例,該照明圖係一背光影像,及藉由將該影像除以該背光影像的一解碼版本,得到該殘餘影像。 According to an embodiment, the illumination image is a backlit image, and the residual image is obtained by dividing the image by a decoded version of the backlight image.

根據一實施例,在編碼前,將該殘餘影像進行色調映射。 According to an embodiment, the residual image is tone mapped prior to encoding.

此步驟提供一可見殘餘影像,即在作為結果的殘餘影像藝術地產生與該影像中原場景合理配合且連貫的色調映射場景的一殘餘影像。因為該可見殘餘影像係可由不能處置高動態範圍的傳統裝置來解碼及/或顯示,因此本方法係反向相容,舊有(非HDR)H264/AVC或HEVC解碼器可簡單地丟掉照明圖(其無法由此舊有解碼器識別),及只將該殘餘影像解碼。 This step provides a visible residual image, i.e., a residual image of the resulting tonal mapping scene that is reasonably coordinated with the original scene in the image, as a result of the residual image. Since the visible residual image can be decoded and/or displayed by conventional devices that cannot handle high dynamic range, the method is backward compatible, and the old (non-HDR) H264/AVC or HEVC decoder can simply drop the illumination map. (It cannot be recognized by the old decoder), and only the residual image is decoded.

此外,藉由此類方法將一高動態範圍影像編碼達到一有效率編碼方案,原因係色調映射後的殘餘影像(低動態範圍影像)及該背光影像係分開地編碼,色調映射後的殘餘影像在空間上係高度相關(及與同一序列影像的其他影像係時間上相關)。由於色調映射後的殘餘影像的高壓縮率及用於背光影像編碼的小量資料的高壓縮率,因此達到一編碼增益。 In addition, a high dynamic range image coding is achieved by such a method to achieve an efficient coding scheme, because the residual image after the tone mapping (low dynamic range image) and the backlight image are separately coded, and the residual image after tone mapping is used. Spatially highly correlated (and temporally related to other imagery of the same sequence of images). A coding gain is achieved due to the high compression ratio of the residual image after tone mapping and the high compression ratio of the small amount of data used for backlight image coding.

根據一實施例,該殘餘影像的色調映射包括,根據該殘餘影像的像素值,一伽瑪(gamma)校正或一SLog(S形對數)校正。 According to an embodiment, the tone mapping of the residual image includes a gamma correction or a SLog (S-logarithm) correction based on pixel values of the residual image.

伽瑪校正及SLog校正使暗及亮資訊無損失,結果使重建自殘餘影像及背光影像的一HDR影像具高精確度。此外,伽瑪校正及S-log校正使重建的HDR影像及可見殘餘影像兩者中避免平整剪輯區。 Gamma correction and SLog correction result in no loss of dark and bright information, resulting in high accuracy of reconstructing an HDR image from residual images and backlit images. In addition, gamma correction and S-log correction avoid flattening the clipping region in both the reconstructed HDR image and the visible residual image.

根據一實施例,該方法尚包括該殘餘影像在編碼前的定標。 According to an embodiment, the method further comprises scaling the residual image prior to encoding.

此步驟將得自該殘餘影像的一影像的中間灰階放在一適當值以用於觀看及編碼兩者。 This step places the intermediate grayscale of an image from the residual image at an appropriate value for viewing and encoding both.

根據一實施例,該方法尚包括在編碼前剪輯該殘餘影像。 According to an embodiment, the method further comprises clipping the residual image prior to encoding.

剪輯該殘餘影像確保一有限位元數,及容許使用傳統編碼/解碼方案將該殘餘影像編碼,而且,剪輯該殘餘影像確保與現存基礎建設(編解碼、顯示、分配通道等)係反向相容,因為只有殘餘影像(其具有低動態範圍,通常8-10位元)可透過此類基礎建設傳送用以顯示該影像的一低動態範圍版本。包含有背光資料的小位元流可透過專屬基礎建設攜帶在一旁側容器中,用以分配該影像的原版本(即一HDR影像)。 Clipping the residual image ensures a finite number of bits and allows the residual image to be encoded using a conventional encoding/decoding scheme, and editing the residual image ensures that it is inversely related to the existing infrastructure (codec, display, distribution channel, etc.) Capacity, because only residual images (which have a low dynamic range, typically 8-10 bits) can be transmitted through such infrastructure to display a low dynamic range version of the image. A small bit stream containing backlight data can be carried in a side container through a dedicated infrastructure for distributing the original version of the image (ie, an HDR image).

根據本發明的另一方面,本發明涉及一種將一位元流解碼的方法,該位元流表示一影像,該方法包括以下步驟:- 檢測該位元流中是否一信號化資料指示該位元流包括資料相關判定自該影像的一照明圖;- 藉由將該位元流至少部分地解碼,得到一解碼照明圖;及- 從一解碼殘餘影像及該解碼照明圖中得出一解碼影像。 According to another aspect of the invention, the invention relates to a method of decoding a bit stream, the bit stream representing an image, the method comprising the steps of: - detecting whether a signalized data in the bit stream indicates the bit The metadata stream includes a lighting pattern from which the data is determined; - by decoding the bit stream at least partially, to obtain a decoded illumination image; and - deriving a decoding from a decoded residual image and the decoded illumination image image.

根據一實施例,藉由將該位元流至少部分地解碼,得到該解碼殘餘影像。 According to an embodiment, the decoded residual image is obtained by at least partially decoding the bit stream.

根據一實施例,該信號化資料係檢測自高階語法元素,及其使用係藉由一SEI訊息完成。 According to an embodiment, the signalling data is detected from a higher-order syntax element and its use is accomplished by an SEI message.

根據一實施例,該位元流包括一主要圖像及一輔助圖像,其語法符合標準H264/AVC或HEVC,及其中該主要圖像表示殘餘影像及該輔助圖像表示照明圖。 According to an embodiment, the bit stream comprises a primary image and an auxiliary image, the syntax of which conforms to the standard H264/AVC or HEVC, and wherein the primary image represents a residual image and the auxiliary image represents an illumination map.

根據一實施例,該解碼照明圖係一背光影像,及其中藉由將該解碼殘餘影像乘以該背光影像,得到解碼影像。 According to an embodiment, the decoded illumination image is a backlit image, and wherein the decoded residual image is multiplied by the backlight image to obtain a decoded image.

根據一實施例,在將解碼殘餘影像乘以背光影像前,將解碼殘餘影像進行反色調映射。 According to an embodiment, the decoded residual image is inversely mapped before the decoded residual image is multiplied by the backlit image.

根據該方法的一實施例,該照明圖係待編碼影像的亮度分量的一低空間頻率版本,及藉由計算該影像的亮度分量與編碼低空間頻率版本的一解碼版本之間的差異,得到殘餘影像。 According to an embodiment of the method, the illumination map is a low spatial frequency version of the luminance component of the image to be encoded, and by calculating a difference between the luminance component of the image and a decoded version of the encoded low spatial frequency version, Residual image.

根據本發明的另一方面,本發明涉及一種表示一影像的位元流,其特徵在於包括一信號化資料,指示其表示判定自該影像的一照明圖。 According to another aspect of the invention, the invention relates to a bit stream representing an image, characterized by comprising a signalling material indicating that it represents a lighting pattern determined from the image.

根據本發明的另一方面,本發明涉及一種將一影像編碼的裝置,及一種將一位元流解碼的裝置,該裝置實施上述方法。 According to another aspect of the present invention, the present invention relates to an apparatus for encoding an image, and an apparatus for decoding a bit stream, the apparatus implementing the above method.

以下配合附圖說明本發明的一較佳實施例,將使本發明的特定本質以及本發明的其他目的、有利點、特點及用途更明朗化。 The specific nature of the invention, as well as other objects, advantages, features and uses of the invention will be apparent from the accompanying drawings.

1000‧‧‧照明圖判定步驟(圖1,3,8);裝置(圖10) 1000‧‧‧Lighting diagram determination steps (Fig. 1, 3, 8); device (Fig. 10)

1001‧‧‧資料及位址匯流排 1001‧‧‧ Data and address bus

1002‧‧‧微處理器(或CPU) 1002‧‧‧Microprocessor (or CPU)

1003‧‧‧ROM(或唯讀記憶體) 1003‧‧‧ROM (or read-only memory)

1004‧‧‧RAM(或隨機存取記憶體) 1004‧‧‧RAM (or random access memory)

1005‧‧‧I/O(輸入輸出)介面 1005‧‧‧I/O (input and output) interface

1006‧‧‧電池 1006‧‧‧Battery

1007‧‧‧顯示器 1007‧‧‧ display

1010,1080‧‧‧分量得到步驟 1010, 1080‧‧‧ components get steps

1020‧‧‧背光影像判定步驟 1020‧‧‧Backlight image determination step

1030‧‧‧除法步驟(計算殘餘影像) 1030‧‧‧Division procedure (calculating residual image)

1040‧‧‧殘餘影像色調映射步驟 1040‧‧‧ Residual image tone mapping steps

1050‧‧‧可見殘餘影像定標步驟 1050‧‧‧ visible residual image calibration steps

1060‧‧‧可見殘餘影像剪輯步驟 1060‧‧‧ visible residual image editing steps

1070‧‧‧背光影像得到步驟 1070‧‧‧ Backlighting image getting steps

1090‧‧‧亮度分量低空間頻率版本得到步驟 1090‧‧‧Steps for obtaining a low spatial frequency version of the luminance component

1100‧‧‧照明圖編碼步驟 1100‧‧‧ Lighting pattern coding steps

1110‧‧‧低空間頻率版本編碼步驟 1110‧‧‧ Low spatial frequency version coding step

1120‧‧‧關聯步驟 1120‧‧‧Association steps

1130‧‧‧知覺臨界值判定步驟 1130‧‧‧Perceptual threshold determination step

1140‧‧‧應用臨界值步驟 1140‧‧‧ Applying threshold steps

1200‧‧‧信號化資料步驟 1200‧‧‧ Signaling data steps

1300‧‧‧殘餘影像編碼步驟 1300‧‧‧Residual image coding steps

2000‧‧‧檢測步驟 2000‧‧‧Test steps

2100‧‧‧照明圖得到步驟 2100‧‧‧ Lighting map getting steps

2200‧‧‧解碼殘餘影像得到步驟 2200‧‧‧Steps to decode residual images

2300‧‧‧解碼影像得到步驟 2300‧‧‧Steps to decode images

2310‧‧‧逆定標步驟 2310‧‧‧Reverse calibration steps

2320‧‧‧反色調映射步驟 2320‧‧‧Anti-tone mapping steps

2340‧‧‧乘法步驟 2340‧‧‧Multiplication steps

2350‧‧‧加法步驟 2350‧‧‧Addition steps

2360‧‧‧反知覺調變步驟 2360‧‧‧Anti-perception modulation steps

A,B‧‧‧裝置 A, B‧‧‧ device

a i ,,‧‧‧加權係數 a i , , ‧‧‧weighting factor

Ba,Bal‧‧‧背光影像 Ba, Bal‧‧‧ Backlit image

‧‧‧背光影像解碼版本 ‧‧‧Backlight image decoding version

Ba灰階‧‧‧中灰階在一(mid-gray-at-one)的背光影像 Ba gray gray ‧‧‧ in a backlit images (mid-gray-at-one ) of

Ba調變‧‧‧調變後的背光影像 Ba modulation, ‧ ‧ modulated backlight image

BAG‧‧‧背光影像解碼版本產生模組 BAG‧‧‧Backlight Image Decoding Version Generation Module

BAM‧‧‧背光影像Bal判定模組 BAM‧‧‧Backlight Image Bal Determination Module

BI‧‧‧背光影像Ba判定模組 BI‧‧‧Backlight Image Ba Judging Module

BM‧‧‧調變模組 BM‧‧ ̄ modulation module

C(i)‧‧‧色彩分量 C(i) ‧ ‧ color component

CLI‧‧‧剪輯模組 CLI‧‧‧editing module

C N ‧‧‧正規化分量 C N ‧‧‧ normalized component

cst定標‧‧‧定標因數 Cst calibration ‧ ‧ calibration factor

cst調變‧‧‧調變係數 Cst modulation ‧ ‧ modulation coefficient

DEC1,DEC2‧‧‧解碼器 DEC1, DEC2‧‧‧ decoder

Diff‧‧‧微分影像 Diff ‧‧‧differential image

‧‧‧微分影像的一解碼版本 ‧‧‧Decoded version of the differential image

DIM,IM‧‧‧照明圖 DIM, IM‧‧‧ illumination map

DRI‧‧‧解碼的殘餘影像 Residual images decoded by DRI‧‧‧

ENC1,ENC2‧‧‧編碼器 ENC1, ENC2‧‧‧ encoder

E,△E 0,△E enc TH‧‧‧知覺臨界值 E , △ E 0 , △ E enc , TH‧‧‧ perception threshold

F,BF‧‧‧位元流 F, BF‧‧‧ bit flow

I‧‧‧影像 I‧‧‧ images

‧‧‧解碼影像 ‧‧‧Decode image

IC‧‧‧分量得到模組 IC‧‧‧ component acquisition module

IIC‧‧‧反知覺變換模組 IIC‧‧‧Anti-perception module

ISCA‧‧‧逆定標模組 ISCA‧‧‧Reverse Calibration Module

ITMO‧‧‧反色調映射模組 ITMO‧‧‧Anti-tone mapping module

HL‧‧‧亮度平均值計算模組 HL‧‧‧Brightness average calculation module

L‧‧‧亮度分量 L‧‧‧luminance component

L 平均價 ‧‧‧亮度平均值 L average price ‧ ‧ brightness average

L lf ‧‧‧低空間頻率版本 L lf ‧‧‧Low space frequency version

‧‧‧低空間頻率版本的解碼版本 ‧‧‧Decoded version of low spatial frequency version

L r ‧‧‧微分亮度分量 L r ‧‧‧differential brightness component

N‧‧‧正規化模組(圖4);可用位元數(步驟14) N‧‧‧ normalized module (Figure 4); number of available bits (step 14)

NET‧‧‧通訊網路 NET‧‧‧Communication Network

POP‧‧‧解碼影像得到模組 POP‧‧‧ decoding image acquisition module

PRP‧‧‧照明圖判定模組 PRP‧‧‧Lighting Chart Judgment Module

PT‧‧‧臨界值判定模組 PT‧‧‧critical value determination module

QP,γ,‧‧‧參數 QP, γ , ‧‧‧parameter

Res,RI‧‧‧殘餘影像 Res, RI‧‧‧ residual image

Resc‧‧‧定標的色調映射影像 Res c ‧‧‧calibrated tone map image

Ress‧‧‧作為結果的殘餘影像 Res s ‧‧‧ residual image as a result

Resv‧‧‧可見殘餘影像 Res v ‧‧‧ visible residual image

‧‧‧解碼的殘餘影像 ‧‧‧Remaining images decoded

SCA‧‧‧定標模組 SCA‧‧‧ calibration module

SD‧‧‧信號化資料 SD‧‧‧Signalization data

SMD‧‧‧檢測模組 SMD‧‧‧ test module

SME‧‧‧信號化資料編碼模組 SME‧‧‧ Signaling Data Encoding Module

TDR‧‧‧目標動態範圍 TDR‧‧‧ target dynamic range

TMO‧‧‧色調映射影像得到模組 TMO‧‧‧tone mapping image acquisition module

‧‧‧亮度 ‧‧‧brightness

Y n ‧‧‧最大環境亮度值 Y n ‧‧‧Maximum ambient brightness value

ψ i ,‧‧‧形狀函數 ψ i , ‧‧‧Shape function

以下將參考附圖描述本發明的實施例,圖中:圖1係根據本發明的一實施例以方塊圖顯示將一影像編碼成一位元流的方法的步驟;圖2係根據本發明的一實施例以方塊圖描繪將一位元流F解碼的方法,該位元流表示一影像;圖3係根據本發明的一實施例以方塊圖顯示步驟1000的次步驟;圖4係根據本發明的一實施例以方塊圖顯示步驟1020的次步驟;圖5係根據本發明的一實施例以方塊圖顯示步驟1020的次步驟;圖6係根據本發明的一實施例以方塊圖顯示步驟1020的次步驟;圖7係根據本發明的一實施例以方塊圖顯示步驟2300的次步驟;圖8係根據本發明的一實施例以方塊圖顯示步驟1000的次步驟;圖9係根據本發明的一實施例以方塊圖顯示步驟2300的次步驟;圖10係根據本發明的一實施例顯示一裝置的架構;及圖11係根據本發明的一實施例顯示二遠端裝置透過一通訊網路通訊。 Embodiments of the present invention will be described with reference to the accompanying drawings in which: FIG. 1 is a block diagram showing a step of a method of encoding an image into a bit stream in accordance with an embodiment of the present invention; FIG. 2 is a diagram of a method according to the present invention. The embodiment depicts, in a block diagram, a method of decoding a bit stream F, the bit stream representing an image; FIG. 3 is a block diagram showing a sub-step of step 1000 in accordance with an embodiment of the invention; FIG. 4 is in accordance with the present invention. An embodiment shows a sub-step of step 1020 in a block diagram; FIG. 5 shows a sub-step of step 1020 in a block diagram according to an embodiment of the invention; FIG. 6 shows step 1020 in a block diagram according to an embodiment of the invention. Next step; FIG. 7 shows a second step of step 2300 in a block diagram according to an embodiment of the invention; FIG. 8 shows a sub-step of step 1000 in a block diagram according to an embodiment of the invention; FIG. 9 is in accordance with the present invention. An embodiment shows a sub-step of step 2300 in a block diagram; FIG. 10 shows an architecture of a device according to an embodiment of the invention; and FIG. 11 shows a second remote device through a communication network according to an embodiment of the invention. through .

以下本文中將參考附圖更詳細說明本發明,其中顯示本發明的數個實施例,然而本發明可具體表現在許多替代形式中,不應解釋為侷限於本文中提出的實施例。因此,雖然本發明可作出各種不同的修改及替代形式,但在附圖中藉由範例顯示本發明的數個特定實施例,並將在本文中詳細說明。然而,應瞭解並無意圖將本發明侷限於揭示的特殊形式,相反地,本發明將涵蓋如後附申請專利範圍所界定,在本發明的精神及範疇內包含的所有修改、同等項及替代。整個附圖說明中,相同數字符號表示相同元件。 The invention will be described in more detail below with reference to the accompanying drawings, in which FIG. Accordingly, the present invention may be embodied in various embodiments and various modifications and However, it is to be understood that the invention is not intended to be limited to the specific forms disclosed, and the invention is intended to cover all modifications, equivalents and alternatives . Throughout the drawings, the same numerals indicate the same elements.

本文中使用的術語僅為說明特殊實施例,並不希望限制本 發明,如本文中使用單數形"一"、一個"及"該"意欲也包括複數形,除非是上下文另有清楚指明。尚應瞭解,本說明書中使用"包括"、"包括有"、"包含"及/或"包含有"等用詞時,明確指出所述特點、整數、步驟、操作、元件及/或組件的存在,但不排除一或多個其他特點、整數、步驟、操作、元件、組件及/或群組的存在或添加。此外,當表示一元件係"回應"或"連接"到另一元件時,該元件可直接回應或連接到該另一元件,或可存在插入元件。對照下,當表示一元件係"直接回應"或"直接連接"到另一元件時,就無插入元件存在。本文中使用的"及/或"一詞包含列出的一或多個關聯項的任何及所有組合,並可縮寫為"/"。 The terminology used herein is for the purpose of describing particular embodiments only and The singular forms "a", "the", "the" and "the" and "the" are intended to include the plural, unless the context clearly indicates otherwise. It should be understood that "including", "including", " The inclusion of the words "and/or" includes "the terms, integers, steps, operations, components, and / or components, but does not exclude one or more other features, integers, steps, operations, The presence or addition of elements, components, and/or groups. In addition, when a component is "responsive" or "connected" to another component, the component can directly respond to or be connected to the other component, or can be inserted. In contrast, when an element is referred to as being "directly responsive" or "directly connected" to another element, no intervening element is present. The term "and/or" as used herein includes one or more of the listed. Any and all combinations of associated items, and may be abbreviated as "/".

應瞭解,本文中使用的第一、第二等用詞雖然用以說明不同元件,但此等元件不應受限於此等用詞,此等用詞僅用以區別一元件與另一元件,例如,不背離本發明的教示,第一元件可稱作第二元件,同樣地,第二元件可稱作第一元件。 It should be understood that the terms first, second, etc. used herein are used to describe different elements, and such elements are not limited to such terms, and such terms are used only to distinguish one element from another. For example, a first element could be termed a second element, and a second element could be termed a first element, without departing from the teachings of the invention.

雖然有些附圖在通訊路徑上包含箭頭用以顯示一主要通訊方向,但應瞭解通訊可發生在所示箭頭的相反方向。 While some of the figures include arrows on the communication path to indicate a primary direction of communication, it should be understood that communication can occur in the opposite direction of the arrows shown.

有些實施例係以方塊圖及操作流程圖說明,其中各區塊表示一電路元件、模組或部分碼,其包括一或多個可執行指令用以實施指定的(數個)邏輯函數。而且應注意,在其他實作中,區塊中所示該(等)函數可不依所示順序發生。例如,取決於所涉及功能性,顯示為連續的二區塊事實上係大體上同時執行,或該等區塊有時可在相反順序中執行。 Some embodiments are illustrated in block diagrams and operational flow diagrams, wherein each block represents a circuit component, module or portion code that includes one or more executable instructions for implementing the specified (several) logic functions. Moreover, it should be noted that in other implementations, the (equal) functions shown in the blocks may not occur in the order shown. For example, depending on the functionality involved, the two blocks shown as contiguous are in fact executed substantially simultaneously, or the blocks may sometimes be executed in the reverse order.

本文中參閱"一個實施例"或"一實施例"意指配合該實施例所說明的一特殊特點、結構或特性可包含在本發明的至少一實作中。在說明書的不同地方出現"在一實施例中"或”根據一實例”的用語,未必全參閱相同實施例,分開或替代的實施例也未必與其他實施例互不相關。 References to "an embodiment" or "an embodiment" are intended to mean that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one implementation of the invention. The appearances of the "a" or "an" or "an"

在申請專利範圍中出現的參考數字符號係僅藉由繪示說明,在申請專利範圍的範疇上不應具有任何限制作用。 The reference numerals appearing in the scope of the patent application are not limited by the scope of the patent application.

雖未明確說明,但本發明的實施例及變化可運用在任何組合或子組合中。 Although not explicitly stated, embodiments and variations of the invention may be utilized in any combination or subcombination.

本發明係描述將一影像編碼/解碼,卻延伸到將一序列影像(視訊)編碼/解碼,原因是以下將說明將該序列的各影像循序地編碼/解 碼。 The present invention describes encoding/decoding an image but extending to encoding/decoding a sequence of images (video), as the following illustrates the sequential encoding/decoding of the images of the sequence. code.

圖1係根據本發明的一實施例以方塊圖顯示將一影像編碼成一位元流的方法的步驟。 1 is a block diagram showing the steps of a method of encoding an image into a bit stream in block diagrams in accordance with an embodiment of the present invention.

在步驟1000中,一模組PRP從待編碼的影像I中判定出一照明圖IM。 In step 1000, a module PRP determines an illumination map IM from the image I to be encoded.

一照明圖收集待編碼影像I的像素相關連的照明資料,例如,該照明圖可包括三個一組的照明值用於該影像的各像素,三個一組的各值係一照明值用於一像素的一色彩分量值。以下將照明圖描述成一背光影像或待編碼影像的亮度分量的一低頻率版本,但本發明未侷限於待編碼影像相關連的任何特定照明值表示。 An illumination map collects illumination data associated with pixels of the image to be encoded I. For example, the illumination map may include three sets of illumination values for each pixel of the image, and each of the three sets of values is used for an illumination value. A color component value of a pixel. The illumination pattern is described below as a low frequency version of the luminance component of a backlit image or image to be encoded, although the invention is not limited to any particular illumination value representation associated with the image to be encoded.

在步驟1100中,一編碼器ENC1將照明圖IM編碼到一位元流F中。 In step 1100, an encoder ENC1 encodes the illumination map IM into a one-bit stream F.

在步驟1200中,一模組SM將一信號化資料SD編碼到位元流F中,該信號化資料指示位元流F包括照明圖IM。 In step 1200, a module SM encodes a signalized data SD into a bitstream F, the signalized data indicating that the bitstream F includes an illumination map IM.

根據一實施例,信號化資料SD尚包括參數相關該照明圖結構,及參數相關待應用到解碼照明圖用以重建影像I的處理。 According to an embodiment, the signalized data SD further includes a parameter related to the illumination map structure, and the parameter correlation is to be applied to the decoded illumination map for reconstructing the image I.

根據一實施例,照明圖結構參數包括至少一影像空間解析度及一影像樣本位元深度。 According to an embodiment, the illumination pattern structure parameters include at least one image spatial resolution and an image sample bit depth.

根據本方法的一實施例,在步驟1300中,一編碼器ENC2將判定自影像I及照明圖IM的一殘餘影像RI編碼到位元流F中。 According to an embodiment of the method, in step 1300, an encoder ENC2 encodes a residual image RI from the image I and the illumination map IM into the bit stream F.

接著,根據本實施例,信號化資料SD亦調適成,例如為要得到一解碼影像,使殘餘影像與照明圖同步。 Then, according to the embodiment, the signalized data SD is also adapted, for example, to obtain a decoded image, and the residual image is synchronized with the illumination image.

位元流F包括信號化資料SD、照明圖IM,及殘餘影像RI(根據一實施例),位元流F可儲存在一區域記憶體或遠端記憶體上,及/或透過一通訊介面傳送(如到一匯流排或透過一通訊網路或一廣播網路)。 The bit stream F includes a signalized data SD, an illumination map IM, and a residual image RI (according to an embodiment), the bit stream F can be stored in a region memory or a remote memory, and/or through a communication interface Transfer (such as to a bus or through a communication network or a broadcast network).

圖2係根據本發明的一實施例以方塊圖顯示將一位元流F解碼的方法,該位元流表示一影像,藉由相關圖1所述一影像編碼方法可得到位元流F。 2 is a block diagram showing a method of decoding a bit stream F according to an embodiment of the present invention. The bit stream represents an image, and the bit stream F can be obtained by an image encoding method according to FIG.

在步驟2000中,一模組SMD檢測位元流F中是否一信號化資料SD指示位元流F包括資料相關判定自待解碼影像的一照明圖。 In step 2000, a module SMD detects whether a signalized data SD indicator bit stream F in the bit stream F includes a lighting pattern for determining the image to be decoded from the data correlation.

在該情形中,在步驟2100,一解碼器DEC1藉由將位元流F至少部分地解碼,得到照明圖DIM。 In this case, at step 2100, a decoder DEC1 obtains an illumination map DIM by at least partially decoding the bitstream F.

潛在地,藉由將位元流F解碼亦得到數個參數。 Potentially, several parameters are also obtained by decoding the bit stream F.

根據一實施例,此等參數係得自位元流F中的一SEI訊息。 According to an embodiment, the parameters are derived from an SEI message in the bitstream F.

根據該實施例的一變化,該信號化資料係檢測自高階語法元素,及其使用係由一SEI訊息完成。 According to a variation of this embodiment, the signalling data is detected from higher order syntax elements and its usage is accomplished by an SEI message.

在步驟2200中,一解碼器DEC2從一記憶體中,或根據一實施例藉由將位元流F至少部分地解碼,得到一解碼殘餘影像DRI。 In step 2200, a decoder DEC2 obtains a decoded residual image DRI from a memory, or according to an embodiment, by at least partially decoding the bitstream F.

在步驟2300中,一模組POP從該解碼殘餘影像DRI及解碼照明圖DIM中得到一解碼影像In step 2300, a module POP obtains a decoded image from the decoded residual image DRI and the decoded illumination image DIM. .

根據步驟1100的一實施例,編碼器ENC1係配置用以將照明圖IM編碼作為一輔助圖像,其語法係由標準H264/AVC或HEVC所定義(B.Bross,W.j.Han,G.J.Sullivan,J.R.Ohm,T.Wiegand所擬JCTVC-K1003,"高效率視訊編碼(HEVC)文字規格草案9(High Efficiency Video Coding(HEVC)text specification draft 9)”,2012年10月),及解碼器DEC1係配置用以從一輔助圖像中得到(步驟2100)一照明圖DIM,該輔助圖像的語法係由標準H264/AVC或HEVC所定義。 According to an embodiment of step 1100, the encoder ENC1 is configured to encode the illumination pattern IM as an auxiliary image, the syntax of which is defined by standard H264/AVC or HEVC (B.Bross, WjHan, GJSullivan, JR) Ohm, T. Wiegand's proposed JCTVC-K1003, "High Efficiency Video Coding (HEVC) text specification draft 9", October 2012), and decoder DEC1 configuration To obtain (step 2100) an illumination map DIM from an auxiliary image, the syntax of the auxiliary image is defined by standard H264/AVC or HEVC.

位元流F則包括一輔助圖像,其語法符合標準H264/AVC或HEVC,及該輔助圖像表示照明圖IM。 The bit stream F then includes an auxiliary image whose syntax conforms to the standard H264/AVC or HEVC, and the auxiliary image represents the illumination map IM.

實現一輔助圖像可藉由規定(數個)新視訊編碼層(VCL)NAL單元型式,如H.264/AVC中的作法(參閱ITU-T H.264建議書,系列H:視聽及多媒體系統,視聽服務的基礎建設-視訊動畫的編碼,2009年3月)。在H.264/AVC中,一輔助圖像對應到NAL單元型式19,如下表中所顯示。一輔助圖像的語法及解碼過程完全相同於一主要(非輔助)編碼圖像的語法及解碼過程。換言之,輔助圖像相關資料的解碼與主要編碼圖像解碼時使用相同的解碼語法及引擎。 Implementing an auxiliary image can be done by specifying (several) new video coding layer (VCL) NAL unit types, as in H.264/AVC (see ITU-T Rec. H.264, Series H: Audiovisual and Multimedia) System, infrastructure for audiovisual services - coding of video animation, March 2009). In H.264/AVC, an auxiliary image corresponds to NAL unit type 19, as shown in the following table. The syntax and decoding process of an auxiliary image is identical to the syntax and decoding process of a primary (non-assisted) encoded image. In other words, the decoding of the auxiliary image related material uses the same decoding syntax and engine as the decoding of the primary encoded image.

或者,一輔助編碼圖像係可實現作為可擴縮編碼中的一特定層,此係建議用於HEVC者,如文件JCTVC-O0041中所說明:REXT/MV-HEVC/SHVC HLS:輔助圖像層,及規定在HEVC可擴縮延伸的最近規範中,文件JCTVC-O1008,高效率視訊編碼(HEVC)可擴縮延伸草案4,2013年11月。在此情形中,係以信號表示在位元流的高階語法中(稱為在視訊參數集合-VPS中),使一增強可擴縮層加到包括有主要編碼圖像(在本發明中對應到殘餘圖像)的基礎層中。如下表所示,使用一語法元素可擴縮性_遮罩_旗標(對應到”可擴縮性遮罩指數”),增強可擴縮層的 可擴縮性型式指示其對應到輔助編碼圖像。 Alternatively, an auxiliary coded picture can be implemented as a specific layer in scalable coding, which is recommended for HEVC, as described in the document JCTVC-O0041: REXT/MV-HEVC/SHVC HLS: auxiliary image Layers, and provisions in the recent specification of HEVC scalable extensions, document JCTVC-O1008, High Efficiency Video Coding (HEVC) scalable extension draft 4, November 2013. In this case, signal is represented in the higher-order syntax of the bitstream (referred to in the Video Parameter Set - VPS), and an enhanced scalable layer is added to include the primary encoded image (corresponding in the present invention) Into the base layer of the residual image). As shown in the following table, using a syntax element scalability_mask_flag (corresponding to the "scalable mask index"), the scalable version of the scalable layer indicates that it corresponds to the auxiliary code. image.

此外,一參數"輔助Id"(指示輔助圖像的型式)係源自於可擴縮性Id參數,其本身係演繹自一語法元素大小_id,如第F.7.4.3.1.1節:文件JCTVC-O1008的視訊參數集合延伸語意學中所說明。 In addition, a parameter "auxiliary Id" (indicating the pattern of the auxiliary image) is derived from the scalability Id parameter, which itself is derived from a syntax element size _id, as described in Section F.7.4.3.1.1: The video parameter set of the document JCTVC-O1008 is extended in the semantics.

根據步驟1300的一實施例,編碼器ENC2係配置用以將殘餘影像RI編碼作為一主要圖像,其語法符合H264/AVC或HEVC標準,及解碼器DEC2係配置用以從一主要圖像中得到(步驟2200)一解碼殘餘影像,該主要圖像的語法係由標準H264/AVC或HEVC所定義。主要編碼圖像及輔助編碼圖像通常使用AVC或HEVC規範中所規定的語法及解碼處理。各型式圖像並無特定解碼處理,主要差別相關高階語法(如NAL單元型式或可擴縮性_遮罩_旗標,如以上說明)。 According to an embodiment of step 1300, the encoder ENC2 is configured to encode the residual image RI as a primary image, the syntax of which conforms to the H264/AVC or HEVC standard, and the decoder DEC2 is configured to be used from a primary image. A decoded residual image is obtained (step 2200) whose grammar is defined by standard H264/AVC or HEVC. The primary coded picture and the auxiliary coded picture typically use the syntax and decoding process specified in the AVC or HEVC specifications. There is no specific decoding process for each type of image, and the main difference is related to higher-order syntax (such as NAL unit type or scalability_mask_flag, as explained above).

在此配置中,根據一實施例,可定義參數輔助Id的一特定值用以指示輔助圖像的本質,尤其可定義一特定值用於照明圖,如下表所顯示。作為一範例,輔助Id等於1對應到一alpha通道,輔助Id等於2對應到一深度圖,輔助Id等於3對應到一照明圖。 In this configuration, according to an embodiment, a specific value of the parameter assistance Id can be defined to indicate the nature of the auxiliary image, and in particular a specific value can be defined for the illumination map, as shown in the following table. As an example, the auxiliary Id equals 1 to an alpha channel, the auxiliary Id equals 2 to a depth map, and the auxiliary Id equals 3 to an illumination map.

在另一實施例中,輔助Id用於照明圖的值與用於alpha通過者相同。的確,alpha通道的使用係部分類似於一背光通道的使用,由於其在於將主要輸入圖像乘以alpha圖並加以定標,利用一最終剪輯作業用以保證該信號留在最小及最大信號極限之內。因此可簡單地調適alpha通道概念來使用在背光圖,差異可來自不同定標值及剪輯值(以信號表示在伴隨的SEI訊息中)。 In another embodiment, the value of the auxiliary Id for the illumination map is the same as for the alpha passer. Indeed, the use of the alpha channel is similar to the use of a backlight channel, since it multiplies the main input image by an alpha map and scales it, using a final clipping operation to ensure that the signal remains at the minimum and maximum signal limits. within. Therefore, the alpha channel concept can be easily adapted for use in backlighting, and the differences can come from different scaling values and clip values (signaled in the accompanying SEI message).

位元流F則尚包括一主要圖像,其語法符合H264/AVC或HEVC標準,及該主要圖像表示殘餘影像RI。 The bit stream F also includes a primary image whose syntax conforms to the H264/AVC or HEVC standard, and the primary image represents the residual image RI.

根據步驟1200的一實施例,信號化資料SD係由一SEI訊息所攜帶,指示根據一語法符合JCTVC-O1008,接著,在步驟2000中,根據此實施例,從訊息SEI中檢測出信號化資料SD。 According to an embodiment of step 1200, the signalized material SD is carried by an SEI message indicating that the JCTVC-O 1008 is conformed according to a syntax. Then, in step 2000, the signalized data is detected from the message SEI according to the embodiment. SD.

SEI訊息可用以攜帶參數及/或指示,用以從輔助圖像的一解碼版本及主要圖像的一解碼版本(解碼的殘餘影像)中得到一解碼影像。 The SEI message can be used to carry parameters and/or indications for obtaining a decoded image from a decoded version of the auxiliary image and a decoded version of the primary image (decoded residual image).

作為一範例,語法對信號,一SEI訊息(sei_酬載)係基於JCTVC-O0041/JCTVC-F0031所提供的語法: As an example, the grammar pair signal, a SEI message (sei_payload) is based on the syntax provided by JCTVC-O0041/JCTVC-F0031:

例如,用以得到一解碼影像的參數及/或指示可係下表中的任一者:˙色彩格式˙樣本的位元深度(用於不同色彩分量可不同)˙圖像大小˙圖像樣本拓樸(如正規樣本拓樸或梅花陣式(quinquonce)樣本拓樸)˙定標因數cst定標˙一伽瑪-Slog曲線的參數γ˙一背光影像的參數(a i ,ψ i ),其中ψ i 例如對應到形狀函數,及a i 對應到加權參數,各關聯到該等ψ i 函數˙用以得到一解碼影像的重建模式˙重建(HDR)影像的最小及最大剪輯值。 For example, the parameters and/or indications used to obtain a decoded image can be any of the following: ̇ Color format 位 The bit depth of the sample (which can be used for different color components) ̇ Image size ̇ Image sample Topology (such as regular sample topology or quincence sample topology) ̇ scaling factor cst calibration ̇ gamma-Slog curve parameter γ ̇ a backlight image parameter ( a i , ψ i ), Where ψ i corresponds to a shape function, for example, and a i corresponds to a weighting parameter, each associated with the ψ i function 得到 to obtain a reconstructed mode of the decoded image 最小 reconstructed (HDR) image minimum and maximum clipping values.

使用HEVC標準,例如由下表提供SEI訊息中關連到此等參數的語法,: Using the HEVC standard, for example, the following table provides the syntax for the parameters associated with these parameters in the SEI message:

在表2中,參數hdr_照明_圖像_色彩_格式規定照明圖的色彩格式,4:2:0、4:2:2或4:4:4色彩格式,或該陣列,或該陣列中的單一樣本,其合成在單色度格式中的一圖像In Table 2, the parameter hdr_illumination_image_color_format specifies the color format of the illumination map, 4:2:0, 4:2:2 or 4:4:4 color format, or the array, or the array A single sample in which an image is synthesized in a monochromatic format.

在表2中,參數hdr_照明_圖像_位元_深度_減8規定照明圖的位元深度hdr_照明_圖像_位元_深度=8+hdr_照明_圖像_位元_深度_減8。 In Table 2, the parameter hdr_illumination_image_bit_depth_minus 8 specifies the bit depth hdr_illumination_image_bit_depth=8+hdr_illumination_image_bit of the illumination map _depth_minus 8.

根據一實施例,一位元深度係以信號表示用於明亮度分量,及一者用於色度分量。 According to an embodiment, one bit depth is signaled for the brightness component and one for the chrominance component.

在表2中,參數hdr_照明_圖像_寬度規定照明圖的水平尺寸,參數hdr_照明_圖像_高度規定照明圖的垂直尺寸,及參數hdr_照明_圖像_定標_型式規定照明圖的一定標處理用以取得一全解析度影像。 In Table 2, the parameter hdr_illumination_image_width specifies the horizontal size of the illumination map, the parameter hdr_illumination_image_height specifies the vertical size of the illumination map, and the parameter hdr_illumination_image_calibration_type A certain standard processing of the illumination map is specified to obtain a full resolution image.

在表3中,參數hdr_形狀-函數_大小_x規定一形狀函數(ψ i )用以判定一背光影像的寬度,參數hdr_形狀_函數_大小_y規定一形狀 函數用以判定一背光影像的高度,參數hdr_形狀_函數[cy][cx]提供一定標濾波係數在位置(cy,cx)的值,參數hdr_ldr_伽瑪_slog參數規定用於一反色調映射的參數,及參數hdr_ldr_定標_因數規定該殘餘影像的一定標處理用以取得一全解析度影像。該預設值係等於120。 In Table 3, the parameter hdr_shape-function_size_x defines a shape function ( ψ i ) for determining the width of a backlight image, and the parameter hdr_shape_function_size_y defines a shape function for determining one The height of the backlit image, the parameter hdr_shape_function [cy][cx] provides the value of the specified filter coefficient at the position (cy, cx), and the parameter hdr_ldr_gamma_slog parameter specifies the parameter for an inverse tone mapping. And the parameter hdr_ldr_scaling_factor specifies that the residual image is processed to obtain a full-resolution image. This preset value is equal to 120.

根據一實施例,形狀函數的寬度及高度取決於:˙其在該圖像中的位置˙其應用到的輔助圖像的值˙其應用到的區塊的大小˙其應用到的色彩分量。 According to an embodiment, the width and height of the shape function depend on: its position in the image, the value of the auxiliary image to which it is applied, the size of the block to which it is applied, and the color component to which it is applied.

例如,每區塊大小可定義一形狀函數。 For example, a block function can be defined for each block size.

在另一範例中,可定義一形狀函數用於明亮度分量,及一形狀函數用於色度分量。 In another example, a shape function can be defined for the brightness component and a shape function for the chrominance component.

在另一範例中,每明亮度值範圍可定義一形狀函數,例如,用於在0至127之間的明亮度,應用一第一形狀函數,具有大的寬度及高度。用於128至255之間的明亮度,應用一第二形狀函數,具有較小的寬度及高度,為要限制大的值傳播到鄰區。 In another example, each brightness value range may define a shape function, for example, for brightness between 0 and 127, applying a first shape function with a large width and height. For a brightness between 128 and 255, a second shape function is applied with a smaller width and height to limit the spread of large values to the neighbors.

以下將詳細討論此等參數。 These parameters are discussed in detail below.

可注意到,在SEI訊息中規定的參數一直維持其有效性到接收到新SEI訊息,在該情形中,新參數值將覆寫上去。 It may be noted that the parameters specified in the SEI message maintain their validity until a new SEI message is received, in which case the new parameter value will be overwritten.

圖3係根據本發明的一實施例以方塊圖顯示步驟1000的次步驟。 Figure 3 shows the sub-step of step 1000 in a block diagram in accordance with an embodiment of the present invention.

在步驟1010中,一模組IC得到待編碼影像I的亮度分量L及潛在地至少一色彩分量C(i)。 In step 1010, a module IC obtains a luminance component L of the image to be encoded I and potentially at least one color component C(i).

例如,當影像I屬於色彩空間(X,Y,Z)時,藉由分量Y的一變換f(.),得到亮度分量L,如L=f(Y)。 For example, when the image I belongs to the color space (X, Y, Z), a luminance component L such as L = f(Y) is obtained by a transformation f(.) of the component Y.

當影像I屬於色彩空間(R,G,B)時,藉由以下公式提供的一線性組合得到亮度分量L,例如在709色域中:L=0.2127.R+0.7152.G+0.0722.B When the image I belongs to the color space (R, G, B), the luminance component L is obtained by a linear combination provided by the following formula, for example, in the 709 color gamut: L = 0.2127.R + 0.7152.G + 0.0722.B

在步驟1020中,一模組BAM從影像I的亮度分量L中判定出一背光影像Bal。 In step 1020, a module BAM determines a backlight image Bal from the luminance component L of the image I.

根據步驟1000的此實施例,背光影像Bal係照明圖IM。 According to this embodiment of step 1000, the backlight image Bal is an illumination map IM.

根據步驟1020的一實施例,如圖4所繪示,一模組BI判定一背光影像Ba作為形狀函數ψ i 的一加權線性組合,由以下公式提供:Ba i a i ψ i (1)a i 係加權係數。 According to an embodiment of step 1020, as shown in FIG. 4, a module BI determines a backed image Ba as a weighted linear combination of the shape function ψ i , which is provided by the following formula: Ba = Σ i a i ψ i (1 ) a i is a weighting factor.

因此,從一亮度分量L中判定出一背光影像Ba在於找出最佳加權係數(若事前不知,及潛在地亦在於找出最佳形狀函數),為使背光影像Ba適合亮度分量L。 Therefore, determining a backlight image Ba from a luminance component L is to find an optimal weighting coefficient (if not known beforehand, and potentially also to find an optimal shape function), so that the backlight image Ba is suitable for the luminance component L.

許多熟知方法可用以找出加權係數a i ,例如,可使用最小均方方法,使背光影像Ba與亮度分量L之間的均方誤差減到最小。 Many well-known methods can be used to find the weighting factor a i . For example, the least squares method can be used to minimize the mean square error between the backlight image Ba and the luminance component L.

本發明未侷限於任一特定方法用以得到背光影像Ba。 The invention is not limited to any particular method for obtaining a backlight image Ba.

可注意到,形狀函數可係一顯示背光的真實物理回應(例如由LED製成,各形狀函數則對應到一LED的回應),或可係一純數學構造為要最好適合該亮度分量。 It may be noted that the shape function may be a display of the actual physical response of the backlight (eg, made of LEDs, each shape function corresponding to an LED response), or may be purely mathematically configured to best fit the luminance component.

根據此實施例,背光影像Bal(步驟1020來的輸出)係公式(1)所提供的背光影像Ba。 According to this embodiment, the backlight image Bal (the output from step 1020) is the backlight image Ba provided by the formula (1).

根據步驟1020的一實施例,如圖5所繪示,一模組BM利用影像I藉由一模組HL所得到的一亮度平均值L 平均值 ,將背光影像Ba(由公式(1)所提供)進行調變。 According to one embodiment of step 1020, depicted in FIG. 5, an average value of a luminance average value L BM module by a module using the HL image I obtained, the backlight image Ba (from equation (1) Provide) to make modulation.

根據此實施例,背光影像Bal(步驟1020來的輸出)係調變後的背光影像。 According to this embodiment, the backlight image Bal (the output from step 1020) is a modulated backlight image.

根據一實施例,模組HL係配置用以計算在整個亮度分量L之中的亮度平均值L 平均值 According to an embodiment, the HL coefficient module configured to calculate an average value in the luminance average value L L of the entire luminance component.

根據一實施例,模組HL係配置用以藉由以下公式計算亮度平均值L 平均值 β係一小於1的係數,及E(X)係亮度分量L的數學期望值(平均值)。 According to an embodiment, the module HL is configured to calculate a mean value of the brightness average L by the following formula: The β system is a coefficient smaller than 1, and the mathematical expectation value (average value) of the E (X) system luminance component L.

最後的這個實施例係有利的,原因是它避免具極高值的少數像素影響到亮度平均值L 平均值 ,當影像I屬於一序列影像時,該等具極 高值的像素通常導致極擾人的時間平均亮度不穩定。 This last embodiment is advantageous in that it avoids the fact that a small number of pixels with extremely high values affect the average value of the luminance mean L. When the image I belongs to a sequence of images, these extremely high-value pixels usually cause a harmonic disturbance. The average brightness of people's time is unstable.

本發明未侷限於計算亮度平均值L 平均值 的特定實施例。 The invention is not limited to the particular embodiment of calculating the average value of the luminance mean L.

根據此實施例的一變化,如圖6所繪示,一模組N使背光影像Ba(由公式(1)所提供)由其平均值E(Ba)進行正規化,以便取得一中灰階在一(mid-gray-at-one)的背光影像Ba 灰階 以用於該影像(或用於所有影像,若影像I屬於一序列影像的話): According to a variation of this embodiment, as shown in FIG. 6, a module N normalizes the backlight image Ba (provided by the formula (1)) from its average value E(Ba) to obtain a medium gray scale. In a (mid-gray-at-one) backlight image Ba grayscale for the image (or for all images, if the image I belongs to a sequence of images):

接著,模組BM係配置成,藉由使用以下關係,利用影像亮度分量L的低空間頻率版本L lf ,將中灰階在一的背光影像Ba 灰階 進行調變: cst 調變 係一調變係數,及α係小於1的另一調變係數,通常係1/3。 Next, the module BM is configured to modulate the backlight image Ba gray scale of the middle gray scale by using the low spatial frequency version L lf of the image brightness component L by using the following relationship: The cst modulation system is a modulation coefficient, and the other modulation coefficient of the α system is less than 1, usually 1/3.

根據此變化,背光影像Bal(步驟1020來的輸出)係調變後的背光影像Ba 調變 ,由公式(2)所提供。 According to this variation, Bal backlight image (output of step 1020) based backlight modulation Ba modulated image after transformation, by the formula (2) is provided.

可注意到,調變係數cst 調變 係經調整用以取得一良好觀看亮度用於殘餘影像,並高度取決於得到背光影像的處理,例如,cst調變 1.7用於最小均方所得到的一背光影像。 It can be noted that the modulation coefficient cst modulation is adjusted to obtain a good viewing brightness for the residual image, and is highly dependent on the processing of obtaining the backlight image, for example, cst modulation 1.7 A backlight image obtained for the least mean square.

實際上,藉由線性,用以調變背光影像的所有操作應用到背光係數a i 作為一校正因數,其將係數a i 變換成新係數,以便取得: In fact, by linearity, all operations for modulating the backlight image are applied to the backlight coefficient a i as a correction factor that transforms the coefficient a i into a new coefficient. In order to obtain:

根據步驟1000的此實施例,在步驟1100中,判定背光影像Bal(步驟1020來的輸出)所需的資料係藉由編碼器ENC1編碼,並加到位元流F中。 According to this embodiment of step 1000, in step 1100, the data required to determine the backlight image Bal (the output from step 1020) is encoded by the encoder ENC1 and applied to the bit stream F.

根據一實施例,如上述,此等資料係內嵌在一SEI訊息中。 According to an embodiment, as described above, such data is embedded in an SEI message.

例如,當使用習知的非適應性形狀函數時,待編碼資料係限制到加權係數a i ,但形狀函數ψ i 亦可係先驗未知及接著編碼到位元流F中,例如在一稍算最佳數學構造以用於較佳適用的情形中。因 此,將所有加權係數a i (及潛在地形狀函數ψ i )皆編碼到位元流F中。 For example, when using a conventional non-adaptive shape function, the data to be encoded is limited to the weighting factor a i or However, the shape function ψ i may also be a priori unknown and then encoded into the bit stream F, for example in a slightly better mathematical construction for better application. Therefore, all weighting coefficients a i or (and potentially the shape function ψ i ) are encoded into the bit stream F.

有利的是,為縮小位元流F的大小,在編碼前,將加權係數a i 進行量化。 Advantageously, to reduce the size of the bit stream F, the weighting factor a i or Quantify.

在步驟1030中,藉由將影像除以背光影像的一解碼版本,計算出一殘餘影像Res。 In step 1030, the image is divided by a decoded version of the backlit image. , a residual image Res is calculated.

使用背光影像的解碼版本以確保編碼器與解碼器兩端皆是一相同背光影像係有利的,藉此達到一較佳精確度的最終解碼影像It is advantageous to use a decoded version of the backlit image to ensure that both the encoder and the decoder are identical backlight images, thereby achieving a better accurate final decoded image. .

更確切地,將影像I的亮度分量L及潛在地各色彩分量C(i)(從模組IC所得到)除以背光影像的解碼版本,此除法係每像素逐一地完成。 More precisely, the luminance component L of the image I and the potential color component C(i) (obtained from the module IC) are divided by the decoded version of the backlight image. This division is done one by one per pixel.

例如,當影像I的分量R,G或B係表示在色彩空間(R,G,B)時,得到分量RRes,、GRes及BRes如下: 例如,當影像I的分量X,Y或Z係表示在色彩空間(X,Y,Z)時,得到分量XRes,YRes及ZRes如下: For example, when the component R, G or B of the image I is represented in the color space (R, G, B), the components R Res , G Res and B Res are obtained as follows: For example, when the component X, Y or Z of the image I is represented in the color space (X, Y, Z), the components X Res , Y Res and Z Res are obtained as follows:

根據步驟1030的一變化,在得到殘餘影像Res前,先處理背光影像的解碼版本According to a change of step 1030, the decoded version of the backlight image is processed before the residual image Res is obtained. .

應用到背光影像的解碼版本的處理例如可用以產生一處理過的背光影像,其解析度與其對應的殘餘影像相同,以下,背光影像的解碼版本,一詞將無差別地用以表示背光影像處理過或非處理過的解碼版本Applied to the decoded version of the backlit image The processing may be used, for example, to generate a processed backlight image having the same resolution as its corresponding residual image. Below, the decoded version of the backlit image , the term will be used indiscriminately to represent a decoded version of a backlit image processed or unprocessed .

根據一實施例,使用信號化資料SD中以信號表示的參數,從背光影像的解碼版本中得到背光影像處理過的解碼版本According to an embodiment, the decoded version of the backlit image is used from the signaled parameter in the signaled material SD. Decoded version with backlight image processing .

根據一實施例,在步驟2100中,藉由解碼器DEC1,將位元流F至少部分地解碼,得到背光影像的解碼版本According to an embodiment, in step 2100, the bit stream F is at least partially decoded by the decoder DEC1 to obtain a decoded version of the backlight image. .

如以述,為得到背光影像(步驟1020的輸出)所需的一些資料已編碼(步驟1100),並接著藉由位元流F的至少部分解碼得到該資料。 As described, some of the material needed to obtain a backlit image (the output of step 1020) is encoded (step 1100), and then the data is obtained by at least partial decoding of the bitstream F.

在以上提供的範例之後,接著得到加權係數(及潛在地 形狀函數作為步驟2100的輸出。 After the examples provided above, the weighting coefficients are then obtained. (and potentially shape function As an output of step 2100.

接著,在步驟1070中,一模組BAG藉由: 從加權係數及一些習知的非適應性形狀函數或形狀函數產生出背光影像的一解碼版本Next, in step 1070, a module BAG is: Weighting coefficient And some conventional non-adaptive shape functions or shape functions Produce a decoded version of the backlit image .

在步驟1040中,一模組TMO將殘餘影像Res進行色調映射,為要取得一可見殘餘影像ResvIn step 1040, a module TMO performs tone mapping on the residual image Res to obtain a visible residual image Res v .

因為殘餘影像Res的動態範圍太高及因為此殘餘影像Res顯示過度可見假影,會看似見不到殘餘影像Res。將殘餘影像進行色調映射補救此等缺點中的至少一者。 Since the dynamic range of the residual image Res is too high and since the residual image Res shows excessive visible artifacts, it may appear that the residual image Res is not visible. Performing tone mapping on the residual image remedies at least one of these disadvantages.

本發明未侷限於任何特定色調映射運算子,此單一條件係色調映射運算子應是可逆的。 The invention is not limited to any particular tone mapping operator, and this single condition is that the tone mapping operator should be reversible.

例如,可使用Reinhard所定義的色調映射運算子(E.Reinhard,M.Stark,P.Shirley及J.Ferwerda等人所發表”數位影像的攝影色調再製(Photographic tone reproduction for digital images)”,繪圖學之ACM會報21(2002年7月)),或R.Boitard,K.Bouatouch,R.Cozot,D.Thoreau及A.Gruson等人(2012年),視訊色調映射的時間一致性。在SPIE規程8499中,由A.M.J.van Eijk,C.C.Davis,S.M.Hammel及A.K.Majumdar等人所編輯,數位影像處理之應用(Applications of Digital Image Processing)(第84990D頁至84990D-10頁)。 For example, a tone mapping operator defined by Reinhard (Photographing tone reproduction for digital images , published by E. Reinhard, M. Stark, P. Shirley, and J. Ferwerda et al. ) can be used . ACM will report 21 (July 2002), or R. Boitard, K. Bouatouch, R. Cozot, D. Thoreau and A. Gruson et al. (2012), temporal consistency of video tone mapping. In SPIE Protocol 8499, edited by AMJvan Eijk, CCDavis, SMHammel, and AK Maajumdar, Applications of Digital Im ag e Processing (pp. 84990D to 84990D-10).

根據步驟1000的此實施例,在步驟1300,編碼器ENC2係配置成將可見殘餘影像Resv編碼到位元流F中。 According to this embodiment of step 1000, in step 1300, the encoder ENC2 system configured to image the visible residue Res v F is encoded into the symbol stream.

根據步驟1040的一實施例,將殘餘影像進行色調映射包括,根據該殘餘影像的像素值,一伽瑪(gamma)校正或一SLog校正。 According to an embodiment of step 1040, performing tone mapping on the residual image includes a gamma correction or a SLog correction according to the pixel value of the residual image.

接著例如藉由以下公式提供可見殘餘影像ResvRes v =A.Res γ A係一常數值,γ係一伽瑪曲線的係數,例如等於1/2.4。 The visible residual image Res v is then provided, for example by the following formula: Res v = A. Res γ A is a constant value, the coefficient of the γ -gamma curve, for example equal to 1/2.4.

或者,例如藉由以下公式提供可見殘餘影像ResvRes v =a.ln(Res+b)+c a,b,c係所判定SLog曲線的係數,以便0及1係不變的,及該SLog曲線的導數當由在1以下的一伽瑪曲線延長時係持續在1,因此,a,b,c係參數γ的函數。 Alternatively, for example, the visible residual image Res v is provided by the following formula: Res v = a. ln( Res + b )+ c a,b,c is the coefficient of the determined SLog curve so that 0 and 1 are unchanged, and The derivative of the SLog curve lasts at 1 when extended by a gamma curve below 1 and, therefore, a, b, c is a function of the parameter γ .

根據一實施例,將伽瑪-Slog曲線的參數γ編碼到位元流F中。 According to an embodiment, the parameter γ of the gamma-Slog curve is encoded into the bit stream F.

在殘餘影像Res上應用一伽瑪校正,將暗區域往上拉升但不使夠高光線降低,用以避免亮像素的燃燒效果。 Applying a gamma correction to the residual image Res pulls the dark areas up but does not lower the high light to avoid burning the bright pixels.

在殘餘影像Res上應用一SLog校正,使夠高光線降低但不往上拉升暗區域。 A SLog correction is applied to the residual image Res to reduce the high light but not to pull up the dark area.

接著,根據步驟1040的一實施例,模組TMO根據殘餘影像Res的像素值,應用伽瑪校正或SLog校正。 Next, according to an embodiment of step 1040, the module TMO applies gamma correction or SLog correction according to the pixel value of the residual image Res.

例如,當殘餘影像Res的像素值在一臨界值(等於1)以下時,則應用伽瑪校正,否則應用SLog校正。 For example, when the pixel value of the residual image Res is below a critical value (equal to 1), gamma correction is applied, otherwise SLog correction is applied.

藉由構造,取決於影像I的亮度,可見殘餘影像Resv通常具有一平均值或多或少接近1,使上述伽瑪-Slog結合的使用特別有效率。 By construction, depending on the brightness of the image I, the image residue visible Res v typically has a mean value close to 1 or less, so that the combination of the use of gamma -Slog particularly efficient.

根據該方法的一實施例,在步驟1050中,一模組SCA藉由將可見殘餘影像Resv的各分量乘以一定標因數cst定標,將可見殘餘影像Resv在編碼前(步驟1300)先進行定標。作為結果的殘餘影像Ress則由以下公式提供Ress=cst定標.Resv According to an embodiment of the method, in step 1050, a module will be seen by each component SCA afterimage Res v multiplied by a scaling factor of scaling cst, the residual image will be visible before encoding Res v (step 1300) Calibration is performed first. The resulting residual image Res s is provided by Res s = cst scaling by the following formula. Res v

在一特定實施例中,將定標因數cst定標定義,用以將可見殘餘影像Resv的值映射在0至最大值2N-1之間,其中N係可用位元數作為一輸入以用於編碼器ENC2的編碼。 In a particular embodiment, the scaling factor cst is scaled to define a value of the visible residual image Res v between 0 and a maximum value of 2 N -1, where N is the number of available bits as an input Encoding for encoder ENC2.

藉由將值1(其粗略係可見殘餘影像Resv的平均值)映射到中灰階值2N-1自然得到此結果,因此,用於具有標準位元數N=8的一可見殘餘影像Resv,一定標因數等於120係一極連貫值,原因是極接近在27=128的中間灰階。 This result is naturally obtained by mapping the value 1 (which is the average of the coarse visible residual image Res v ) to the medium gray scale value 2 N-1 , and thus is used for a visible residual image having the standard number of bits N=8. Res v , the certain scaling factor is equal to the 120-series one-pole coherence value, which is very close to the intermediate gray scale at 2 7 =128.

根據該方法的一實施例,在步驟1300中,編碼器ENC2係配置用以將殘餘影像Ress編碼。 According to an embodiment of the method, in step 1300, the encoder ENC2 is configured to encode the residual image Res s .

根據該方法的一實施例,在步驟1060中,一模組CLI將 可見殘餘影像Resv在編碼前進行剪輯,將其動態範圍限制到一目標動態範圍TDR,其例如係根據編碼器ENC2的能力所定義。 According to an embodiment of the method, in step 1060, a visible residue CLI module Res v for editing image before encoding, to limit the dynamic range to a target dynamic range TDR, based for example according to the capabilities of the encoder ENC2 Defined.

根據此最後實施例,作為結果的殘餘影像Resc係根據該方法的實施例,例如藉由以下公式所提供:Resc=max(2N,Resv) Resc=max(2N,Ress) According to this last embodiment, the resulting residual image Res c is according to an embodiment of the method, for example by the following formula: Res c = max(2 N , Res v ) Res c = max(2 N , Res s )

本發明未侷限於此類剪輯(max(.)),卻延伸到其他類型的剪輯。 The invention is not limited to such clips (max(.)), but extends to other types of clips.

根據該方法的此實施例,在步驟1300中,編碼器ENC2係配置用以將殘餘影像Resc編碼。 According to this embodiment of the method, in step 1300, the encoder ENC2 is configured to encode the residual image Res c .

根據該方法的實施例,結合該等定標及剪輯實施例導致一殘餘影像Ressc,由以下公式提供:Ressc=max(2N,cst定標*Resv)或藉由Ressc=max(2N,cst定標*Ress) According to an embodiment of the method, in conjunction with the scaling and clipping embodiments results in a residual image Res sc , provided by the following formula: Res sc = max (2 N , cst scaling * Res v ) or by Res sc = max (2 N , cst calibration *Res s )

根據該方法的此實施例,在步驟1300中,編碼器ENC2係配置用以將殘餘影像Ressc編碼。 According to this embodiment of the method, in step 1300, the encoder ENC2 is configured to encode the residual image Res sc .

可見殘餘影像Resv的色調映射及定標係一參數化過程,該等參數可係固定或不固定的,在後者情形中,該等參數可藉由編碼器ENC1編碼到位元流F中。 The tone mapping and scaling of the residual image Res v can be seen as a parameterization process, which can be fixed or not fixed. In the latter case, the parameters can be encoded into the bit stream F by the encoder ENC1.

根據該方法的一實施例,伽瑪校正的常數值γ、定標因數cst定標可係參數,其係編碼在位元流F中。 Example gamma] constant value, a gamma correction according to the embodiment of the method, calibration scaling factor may be based cst parameters which F-based encoded bitstream in.

可注意到,該等參數αcst 調變 cst 定標 γβ的選擇提供空間用以選擇最佳適合該內容的色調映射,以遵循專家在後製及顏色分級中的品味。 It may be noted that the selection of the parameters α , cst modulation , cst scaling , γ , β provides a space for selecting a tone map that best fits the content to follow the taste of the expert in post-production and color grading.

另一方面,可定義通用參數可接受用於所有各式各樣影像,則無參數係編碼在位元流F中。 On the other hand, the definable general parameters are acceptable for all kinds of images, and no parameters are encoded in the bit stream F.

根據一實施例,如上述,至少一參數αcst 定標 γβ係內嵌在一SEI訊息中。 According to an embodiment, as described above, at least one parameter α, cst scaling, γ, β system embedded in a SEI message.

根據步驟1000的此實施例,殘餘影像RI係可見殘餘影像Resv或Ress或RescAccording to this embodiment of step 1000, the residual image RI is visible residual image Res v or Res s or Res c .

圖7係根據本發明的一實施例以方塊圖顯示步驟2300的次步驟。 Figure 7 shows the sub-step of step 2300 in a block diagram in accordance with an embodiment of the present invention.

如以述,在步驟2100及1070中,藉由解碼器DEC1將位元流F至少部分地解碼,得到一背光影像(一解碼的照明圖)。 As described above, in steps 2100 and 1070, the bit stream F is at least partially decoded by the decoder DEC1 to obtain a backlit image. (A decoded lighting diagram).

位元流F係可已區域地儲存或接收自一通訊網路。 The bit stream F can be stored or received locally from a communication network.

在步驟2200中,藉由一解碼器DEC2將一位元流F至少部分地解碼,得到一解碼的殘餘影像In step 2200, the bit stream F is at least partially decoded by a decoder DEC2 to obtain a decoded residual image. .

如上述,藉由一傳統裝置可見到解碼的殘餘影像As described above, the decoded residual image can be seen by a conventional device. .

在步驟2340中,藉由將解碼的殘餘影像乘以背光影像,得到一解碼影像In step 2340, by decoding the residual image Multiply by backlit image , get a decoded image .

根據此實施例的一變化,在得到解碼影像前,先處理背光影像According to a variation of this embodiment, a decoded image is obtained Before processing the backlit image first .

應用到背光影像的處理例如可用以產生一處理過的背光影像,與其對應的解碼殘餘影像有相同解析度。以下,背光影像,一詞將無差別地表示處理過或未處理過的背光影像Apply to backlit image Processing may be used, for example, to generate a processed backlight image, corresponding to the decoded residual image Have the same resolution. Following, backlit image , the word will represent the processed or unprocessed backlit image indiscriminately .

根據一實施例,使用該信號化資料SD中以信號表示的參數,從背光影像中得到該處理過的背光影像According to an embodiment, the signal represented by the signalized data SD is used to image from the backlight Obtaining the processed backlight image .

根據步驟2100的一實施例,亦從一區域記憶體中或藉由解碼器DEC1將位元流BF至少部分地解碼,得到參數及/或According to an embodiment of step 2100, the bit stream BF is also at least partially decoded from a region memory or by the decoder DEC1 to obtain a parameter. And/or .

根據該方法,在步驟2310中,一模組ISCA藉由將解碼的殘餘影像除以參數,應用一逆定標到解碼的殘餘影像According to the method, in step 2310, a module ISCA uses the residual image to be decoded. Divide by parameter , applying a reverse scaling to the decoded residual image .

在步驟2320,一模組ITMO藉由參數,應用一反色調映射到解碼的殘餘影像At step 2320, a module ITMO is parameterized by , applying an inverse tone mapping to the decoded residual image .

例如,參數定義一伽瑪曲線,及反色調映射係只用以從該伽瑪曲線中找出解碼的殘餘影像的像素值對應的值。 For example, parameters Defining a gamma curve, and the inverse tone mapping system is only used to find the decoded residual image from the gamma curve The value corresponding to the pixel value.

圖8係根據本發明的一實施例以方塊圖顯示步驟1000的次步驟。 Figure 8 shows the sub-step of step 1000 in a block diagram in accordance with an embodiment of the present invention.

根據此實施例,將待編碼的影像I分割成多個影像區塊B,及考量各影像區塊B如下。 According to this embodiment, the image I to be encoded is divided into a plurality of image blocks B, and each image block B is considered as follows.

在步驟1080中,一模組IC得到待編碼影像區塊B的各分量,影像區塊B包括一亮度分量L及潛在地至少一色彩分量C(i),i係一索引,其識別影像區塊B的一色彩分量。影像區塊B的分量屬於一知覺空間,通常係一3D(立體)空間,即影像區塊B包括一亮度分量L及潛在地至少一色彩分量C(i),例如以下稱為C1及C2的二色彩分量。 In step 1080, a module IC obtains components of the image block B to be encoded, and the image block B includes a luminance component L and potentially at least one color component C ( i ), i is an index, and the image area is identified. A color component of block B. The component of image block B belongs to a perceptual space, usually a 3D (stereo) space, that is, image block B includes a luminance component L and potentially at least one color component C ( i ), such as hereinafter referred to as C1 and C2. Two color components.

但本發明未侷限於一灰階影像(無色彩分量),亦未侷限於具有一、二或多個色彩分量的一影像。當以下說明將一灰階影像進行編碼時,不用考慮到提及色彩分量的說明部分。 However, the invention is not limited to a grayscale image (no color component) nor to an image having one, two or more color components. When the following description encodes a grayscale image, the description portion of the color component is not considered.

一知覺空間具有一度量d((L,C1,C2),(L',C1',C2')),其值係該知覺空間中二點的視知覺之間的差異表示(較佳成正比)。 A perceptual space has a metric d (( L, C 1 , C 2) , ( L ' , C 1 ' , C 2 ' )) whose value is a representation of the difference between the visual perceptions of two points in the perceptual space ( It is preferably proportional.

就數學而言,如此定義度量d((L,C1,C2),(L',C1',C2')),以便存在一知覺臨界值△E 0(亦稱為JND,恰可分辨差異),低於該臨界值,人類無法察覺到該知覺空間的二顏色之間的視覺差異,即d((L,C1,C2),(L',C1',C2'))<△E 0, (3)及此知覺臨界值係與該知覺空間的二點(L,C1,C2)及(L',C1',C2')無關。 In mathematics, the metric d (( L, C 1 , C 2) , ( L ', C 1 ', C 2 ' )) is defined such that there is a perceptual threshold Δ E 0 (also known as JND, just The difference can be resolved), below which the human can not perceive the visual difference between the two colors of the perceptual space, ie d (( L, C 1 , C 2) , ( L ', C 1 ', C 2 ' )) < △ E 0 , (3) and this perceptual threshold are independent of the two points ( L, C 1 , C 2) and ( L ', C 1 ', C 2 ' ) of the perceptual space.

因此,將一影像(其分量屬於一知覺空間)編碼以使公式(3)的度量d保持在該知覺臨界值△E 0以下,確保該影像的顯示解碼版本係視覺上無損。 Thus, an image (whose component belongs to a perceptual space) is encoded such that the metric d of equation (3) remains below the perceptual threshold Δ E 0 , ensuring that the display decoded version of the image is visually lossless.

根據一實施例,該度量係可照一像素基礎來計算。 According to an embodiment, the metric is calculated on a pixel basis.

可注意到,實際上個別地控制以下三不等式較容易: It may be noted that it is easier to control the following three inequalities individually:

可注意到,若以大於△E 0的一知覺臨界值滿足公式(3),意即,以下視覺地控制編碼影像,即控制此影像在一顯示解碼版本中的視覺損失。 It can be noted that if a perceptual threshold greater than Δ E 0 satisfies equation (3), that is, the encoded image is visually controlled below, ie, the visual loss of the image in a displayed decoded version is controlled.

例如,當影像I包括數個分量係屬於非知覺空間如(R,G,B)時,將一知覺變換應用到影像I用以得到屬於知覺空間的一亮度分量L及潛在地二色彩分量C1及C2。 For example, when the image I includes several components belonging to an unconscious space such as (R, G, B), a perceptual transformation is applied to the image I to obtain a luminance component L and a potential second color component C1 belonging to the perceptual space. And C2.

此一知覺變換係定義自該顯示器的照明條件,並取決於初始色彩空間。 This perceptual transformation is defined by the lighting conditions of the display and depends on the initial color space.

例如,假定初始空間係(R,G,B)色彩空間,首先將影像I變換到熟知的線性空間(X,Y,Z)(會潛在地需要一反伽瑪(gamma)校正),接著作為結果的影像係變換自編碼影像的一解碼版本顯示的參考照明條件,其在此係該(X,Y,Z)空間中數個值(X n ,Y n ,Z n )的一3D向量。 For example, assuming the initial space system (R, G, B) color space, first transform the image I into a well-known linear space (X, Y, Z) (which potentially requires an inverse gamma correction), Visuals transform result of reference lighting conditions for self-encoded image a decoded version of the display, which in this system the (X, Y, Z) space of several values (X n, Y n, Z n) is a 3D vector.

因此,例如當選擇知覺空間LabCIE1976時,此一知覺變換定義如下:L*=116f(Y/Y n )-16 a*=500(f(X/X n )-f(Y/Y n )) b*=200(f(Y/Y n )-f(Z/Z n ))其中f係一轉換函數,例如由以下公式提供:若r>(6/29)3f(r)=r 1/3 否則 Thus, for example, when the perceptual space LabCIE 1976 is selected, this perceptual transformation is defined as follows: L *=116 f ( Y / Y n )-16 a *=500( f ( X / X n )- f ( Y / Y n ) b *=200( f ( Y / Y n )- f ( Z / Z n )) where f is a transfer function, for example provided by the following formula: if r >(6/29) 3 then f ( r )= r 1/3 otherwise

以下度量可照知覺空間LabCIE1976定義:d((L*,a*,b*),(L*',a*',b*'))2=(△L*)2+(△a*)2+(△b*)2<(△E 0)2L*係二顏色(L*,a*,b*)與(L*',a*',b*')的亮度分量之間的差異,及△a*(△b*分別)係此二顏色的色彩分量之間的差異。 The following metrics can be defined in the perceptual space LabCIE 1976: d (( L *, a *, b *) , ( L * ' , a * ', b * ' ) 2 = (Δ L *) 2 + (△ a *) 2 +(△ b *) 2 <(Δ E 0 ) 2 Δ L * is between the two colors ( L * , a * , b *) and the luminance components of ( L * ', a * ' , b * ' ) The difference, and Δ a *(Δ b * respectively) are the differences between the color components of the two colors.

根據另一範例,當選擇知覺空間Lu*v*時,一知覺變換定義如下:u*=13L(u'-u' )及v*=13L(v'-v' )其中 According to another example, when the selected appearance space Lu * v *, a perceptual transform is defined as follows: u * = 13 L (u '- u' white) and v * = 13 L (v ' - v' white) wherein and

以下歐基里德(Euclidean)度量可照知覺空間Lu*v*定義:d((L*,u*,v*),(L*',u*',v*'))2=(△L)2+(△u*)2+(△v*)2L*係二顏色(L*,u*,v*)及(L*',u*',v*')的亮度分量之間的差異,及△u*(△v*分別)係此二顏色的色彩分量之間的差異。 The following Euclidean metrics can be defined in the perceptual space Lu * v *: d (( L *, u *, v *), ( L * ' , u * ' , v * ' ) 2 = (△ L ) 2 +(Δ u *) 2 +(Δ v *) 2 Δ L * is the luminance component of two colors ( L *, u * , v *) and ( L * ' , u * ' , v * ' ) The difference between, and Δ u *(Δ v * respectively) is the difference between the color components of the two colors.

本發明未侷限於知覺空間LabCIE1976,卻可延伸到任何類型的知覺空間如LabCIE1994、LabCIE2000,其係相同Lab空間但具有不同度量用以測量該知覺距離,或例如其他任一歐基里德知覺空間。其他範例係LMS空間及IPT空間,條件是該度量應照此等知覺空間定義,為使該度量較佳與該知覺差異成正比;結果是,存在一同質最大知覺臨界值△E 0,低於該臨界值,人類無法察覺到該知覺空間的二顏色之間的視覺差異。 The invention is not limited to the perceptual space LabCIE 1976, but can be extended to any type of perceptual space such as LabCIE 1994, LabCIE 2000, which is the same Lab space but with different metrics for measuring the perceptual distance, or for example any other Euclid perceptual space. . Other examples are LMS space and IPT space, provided that the metric is defined in terms of such perceptual spaces, in order for the metric to be better proportional to the perceived difference; as a result, there is a homogenous maximum perceived threshold Δ E 0 , lower than At this critical value, humans cannot perceive the visual difference between the two colors of the perceptual space.

在步驟1090中,一模組LF得到影像I的亮度分量L的一低空間頻率版本L lf In step 1090, a module LF obtains a low spatial frequency version L lf of the luminance component L of image I.

根據步驟1000的此實施例,影像I的亮度分量L的低空間頻率版本L lf 係照明圖IM。 According to this embodiment of step 1000, the low spatial frequency version L lf of the luminance component L of image I is the illumination map IM.

根據一實施例,模組LF係配置成,藉由將該區塊的像素值求平均所算出的一平均值指定到一區塊的各像素,用以計算每區塊的低空間頻率版本L lf According to an embodiment, the module LF is configured to assign an average value calculated by averaging the pixel values of the block to each pixel of a block for calculating a low spatial frequency version L of each block. Lf .

本發明未侷限於算出影像I的低空間頻率版本的一特定實施例,並可使用影像I的亮度分量的任何低通濾波或縮減取樣。 The invention is not limited to a particular embodiment of calculating a low spatial frequency version of image I, and any low pass filtering or downsampling of the luminance component of image I can be used.

在步驟1100中,編碼器ENC1配置用以將低空間頻率版本L lf 編碼到位元流F中。 In step 1100, encoder ENC1 is configured to encode the low spatial frequency version L lf into bit stream F.

在步驟1110中,得到一微分影像Diff,微分影像Diff包括一微分亮度分量L r ,其係藉由計算亮度分量L與編碼低空間頻率版本L lf 的一解碼版本之間的差異所得到。 In step 1110, a differential image Diff obtained, comprising a differential image Diff R & lt differential luminance component L, which is a decoded version of the system by calculating a luminance component L and a low spatial frequency encoded version of L LF The difference between the two is obtained.

潛在地,在步驟1120,一模組ASSO係配置用以使影像I的各色彩分量與微分亮度分量L r 相關聯,為要取得一微分影像Diff。根據該範例,影像I包括二色彩分量C1及C2,接著使色彩分量C1及C2與微分亮度分量L r 相關聯,為要取得包括有三分量(L r ,C1,C2)的一微分影像DiffPotentially, at step 1120, a configuration module for each color component based ASSO for causing the image I L r and differential luminance component is associated, is to obtain a differential image Diff. According to this example, the image I includes two color components C1 and C2, and then the color components C1 and C2 are associated with the differential luminance component L r to obtain a differential image including three components ( L r , C 1 , C 2). Diff .

在步驟1300中,編碼器ENC2係配置用以將微分影像Diff編碼到位元流F中。 In step 1300, the encoder ENC2 is configured to encode the differential image Diff into the bitstream F.

潛在地,在步驟1100及/或1300中,編碼器ENC1及/或ENC2包括一熵編碼。 Potentially, in steps 1100 and/or 1300, encoder ENC1 and/or ENC2 includes an entropy coding.

編碼器ENC2的編碼精確度取決於該知覺空間中的一知覺臨界值△E,其定義知覺空間中度量的一上限及致能該影像的一顯示解碼版本中的視覺損失控制。 The coding accuracy of the encoder ENC2 depends on a perceptual threshold Δ E in the perceptual space, which defines an upper limit of the metric in the perceptual space and enables visual loss control in a display decoded version of the image.

在步驟1130中,根據編碼影像的一解碼版本顯示的參考照明條件及低空間頻率版本L lf 的解碼版本,判定知覺臨界值△EIn step 1130, the reference illumination condition and the decoded version of the low spatial frequency version L lf are displayed according to a decoded version of the encoded image. Determine the threshold value of △ E .

低空間頻率版本L lf 的亮度在該影像之中並非保持不變而是區域地變動,例如,若藉由將一區塊的像素值求平均所算出的一平均值指定到該區塊的各像素來計算每區塊的低空間頻率版本L lf ,則知覺臨界值△E在各區塊之中係保持不變,但該影像的二區塊的平均值可不同。因此,根據該影像的亮度值,知覺臨界值△E區域地變動。 The brightness of the low spatial frequency version L lf does not remain unchanged in the image but varies regionally, for example, if an average value calculated by averaging pixel values of a block is assigned to each of the blocks The pixel calculates the low spatial frequency version L lf of each block, and the perceptual threshold Δ E remains unchanged among the blocks, but the average value of the two blocks of the image may be different. Therefore, the perceptual threshold value Δ E varies regionally based on the luminance value of the video.

知覺臨界值△E的區域變動未侷限於區塊為基礎的變動,卻可延伸到例如基於該影像的亮度值,藉由任何運算子如一分段運算子在該影像之中所界定的任何區域。 The region variation of the perceptual threshold Δ E is not limited to block-based variations, but may extend to any region defined by the image, for example, based on the luminance value of the image, by any operator such as a segment operator. .

在步驟2100中,將步驟1100來的輸出藉由一解碼器DEC1進行解碼(即將位元流F至少部分地解碼),得到低空間頻率版本L lf 的解碼版本。相較於編碼器ENC1的操作(步驟1100),此一解碼器DEC1實施反向操作。 In step 2100, the output from step 1100 is decoded by a decoder DEC1 (ie, the bitstream F is at least partially decoded) to obtain a decoded version of the low spatial frequency version L lf . . This decoder DEC1 performs a reverse operation compared to the operation of the encoder ENC1 (step 1100).

根據步驟1130的一實施例,假定照明在該影像顯示期間潛在地一直增加到一最大環境亮度值Y n ,從低空間頻率版本L lf 的解碼版本的亮度值在最大環境亮度值Y n 中所佔比例,判定出知覺臨界值△EAccording to an embodiment of step 1130, it is assumed that illumination is potentially increased all the way to a maximum ambient luminance value Y n during the image display, from the decoded version of the low spatial frequency version L lf Brightness value The perceived threshold value Δ E is determined as a percentage of the maximum ambient luminance value Y n .

根據步驟1130的一實施例,當不許在最大環境亮度值中編碼劣化時,則由以下公式提供知覺臨界值△E (X n ,Y n ,Z n )係編碼影像的一解碼版本顯示的參考照明條件,及係一值,其表示低空間頻率版本L lf 的解碼版本的亮度,及△E enc 係一知覺編碼參數。通常選擇△E enc 接近△E 0以用於視覺無損編碼,及大於△E 0以用於一編碼在編碼影像的解碼版本中具有視覺損失控制。 According to an embodiment of step 1130, when the maximum allowed coding environment degradation in luminance value, by the following equation perceptual threshold △ E: ( X n , Y n , Z n ) is a reference illumination condition displayed by a decoded version of the encoded image, and Is a value indicating the decoded version of the low spatial frequency version L lf The brightness, and △ E enc is a perceptual coding parameter. Δ E enc is typically selected to be close to Δ E 0 for visual lossless encoding, and greater than Δ E 0 for a code to have visual loss control in the decoded version of the encoded image.

因此,使用此一知覺臨界值△E容許該編碼調適到顯示的環 境照明條件。 Thus, using this threshold value △ E perceptual coding adapted to permit the display of the ambient lighting conditions.

或者,編碼影像的一解碼版本顯示的參考照明條件(X n ,Y n ,Z n ),其具有一局部字元,可取代以編碼影像的一解碼版本顯示的總體參考照明條件,由所定義。 Alternatively, the reference lighting conditions (X n, Y n, Z n) a decoded version of the encoded video display, having a local character, with reference to the overall lighting conditions may be substituted to display a decoded version of the coded image by the Defined.

從一編碼觀點(色彩編碼),此取代係同等於知覺臨界值△E(4)的選擇,原因是在色彩空間LabCIE1976中,編碼具有一精確度等於的一色彩分量a*的△E,其係由以下公式提供: 係同等於編碼具有一精確度等於色彩分量a*'的△E enc ,其係由以下公式提供: From a coding point of view (color coding), this substitution is equivalent to the selection of the perceptual threshold Δ E (4) because, in the color space LabCIE 1976, Δ E having a color component a * equal to an accuracy is encoded. It is provided by the following formula: Is equivalent to the encoding with a precision equal to the color component a * ' , Δ E enc , which is provided by the following formula:

相同備註適用到另一分量b*,因此,不用區域地變動該知覺空間,只要將臨界值從△E enc 調適到△EThe same remark applies to the other component b *, so that the perceptual space is not changed regionally, as long as the threshold is adjusted from Δ E enc to Δ E .

根據步驟1130的一實施例,為避免該影像具有高亮度值部分的子編碼,由以下公式提供知覺臨界值△E 其中一上限係設成△E enc E max ,通常E max 係設成1。最後這公式是指絕不採用低空間頻率版本L lf 的解碼版本的亮度係大於最大環境亮度值Y n According to an embodiment of step 1130, in order to avoid sub-coding of the image having a high luminance value portion, the perceptual threshold value Δ E is provided by the following formula: One of the upper limits is set to Δ E enc E max , and usually E max is set to 1. Finally, this formula means that the decoded version of the low spatial frequency version L lf is never used. The brightness is greater than the maximum ambient brightness value Y n .

另一方面,為避免該影像中具有極低亮度值的部分過度編碼,則由以下公式提供知覺臨界值△E 其中一下限係設成△E enc E min ;通常E min 係設成大約1/5。這是由於低空間頻率版本L lf 的解碼版本的暗區域亮度由最大環境亮度值Y n 所造成的對比遮罩作用。 On the other hand, in order to avoid over-encoding of the portion of the image having a very low luminance value, the perceptual threshold Δ E is provided by the following formula: One of the lower limits is set to Δ E enc E min ; usually E min is set to about 1/5. This is due to the decoded version of the low spatial frequency version L lf The dark area brightness is caused by the contrast mask caused by the maximum ambient brightness value Y n .

由以下公式簡單地得出兩界限的組合: The combination of the two limits is simply derived by the following formula:

根據該方法的一變化,在步驟1140,將一臨界值TH應用到微分影像Diff的該(等)分量,為要使其分量的各動態範圍限制到一目標動態範圍TDR。 According to a variation of the method, at step 1140, a threshold TH is applied to the (equal) component of the differential image Diff to limit each dynamic range of its component to a target dynamic range TDR.

根據步驟1300的一實施例,將一微分影像的各分量藉由知覺臨界值△E進行正規化,接著將正規化的微分影像以不變編碼精確度進行編碼。 According to an embodiment of step 1300, the components of a differential image are normalized by a perceptual threshold Δ E , and then the normalized differential image is encoded with constant encoding accuracy.

因此,編碼精確度因此係知覺臨界值△E的一函數,其區域地變動且是最佳精確度(假定是理想知覺空間)。藉由如此做,正規化的微分影像的編碼精確度到1,確保將微分影像編碼到如所需△E的精確度。 Therefore, the coding accuracy is therefore a function of the perceptual threshold Δ E , which varies regionally and is optimally accurate (assumed to be an ideal perceptual space). By doing so, the normalized differential image coding accuracy to 1, the differential video encoding to ensure the accuracy of △ E as desired.

根據步驟1300的一實施例,一微分影像的一分量藉由知覺臨界值△E的正規化係此分量除以一值,該值係知覺臨界值△E的一函數。 According to an embodiment of step 1300, a component of a differential image is divided by a normalization of the threshold Δ E by a value that is a function of the threshold Δ E .

就數學而言,接著例如將該微分影像的一分量C(包含微分亮度分量及潛在地各色彩分量兩者)正規化如下,用以取到一正規化分量C N α係一值例如等於0.5或1。 In mathematics, for example, a component C of the differential image (including both the differential luminance component and potentially the respective color components) is normalized as follows to obtain a normalized component C N : The alpha system has a value equal to, for example, 0.5 or 1.

根據步驟1300的另一實施例,一微分影像編碼的至少一參數取決於知覺臨界值△EAccording to another embodiment of step 1300, at least one parameter of a differential image encoding is dependent on a perceptual threshold Δ E .

例如,此一編碼的一量化參數QP取決於知覺臨界值△E,實際上,此一參數QP存在於影像/視訊編碼器像h264/AVC及HEVC中,並可區域地定義以用於各編碼區塊。在此範例中,藉由選擇區域QP來執行一編碼,遍及該微分影像具有精確度係區域地(一區塊一區塊地)變化,確保知覺臨界值△E用於各區塊的編碼精確度。 For example, a quantization parameter QP of this code depends on the perceptual threshold value Δ E . In fact, this parameter QP exists in the image/video encoder images h264/AVC and HEVC, and can be defined regionally for each coding. Block. In this example, by selecting a coding region QP performed throughout the differential image with the accuracy of a regional (a block to a block) change, to ensure perceptual threshold △ E precision used to encode each block degree.

圖9係根據本發明的一實施例以方塊圖顯示步驟2300的次步驟。 Figure 9 shows the sub-step of step 2300 in a block diagram in accordance with an embodiment of the present invention.

可注意到,以下考量到一位元流F,其表示一影像I,其包括一亮度分量及潛在地包括至少一色彩分量。影像I的該(等)分量屬於上述知覺色彩空間。 It may be noted that the following considers a one-bit stream F representing an image I comprising a luminance component and potentially including at least one color component. The (equal) component of image I belongs to the perceptual color space described above.

在步驟2100中,如上述,藉由一解碼器DEC1將位元流F至少部分地解碼,得到影像I的亮度分量的一低空間頻率版本解碼版本 In step 2100, as described above, the bit stream F is at least partially decoded by a decoder DEC1 to obtain a low spatial frequency version decoded version of the luminance component of the image I. .

在步驟2200中,藉由解碼器DEC2將位元流F至少部分地解碼,得到一微分影像的解碼版本In step 2200, the bitstream F is at least partially decoded by the decoder DEC2 to obtain a decoded version of the differential image. .

因此,當位元流F所表示的影像I係一灰階影像時,一微分影像的解碼版本包括一微分亮度分量L r ,其表示影像I的一亮度分量L與影像I的亮度分量的低空間頻率版本解碼版本之間的差異。當位元流F所表示的影像I係一彩色影像,即具有一亮度分量L及至少一色彩分量的影像時,一微分影像的解碼版本包括微分亮度分量L r 及影像I的各(該至少一)色彩分量,該微分亮度分量表示影像I的一亮度分量L與影像I的亮度分量的低空間頻率版本解碼版本之間的差異。 Therefore, when the image I represented by the bit stream F is a gray scale image, a decoded version of the differential image A differential brightness component L r is included , which represents a low spatial frequency version decoded version of a luminance component L of image I and a luminance component of image I difference between. When the image I represented by the bit stream F is a color image, that is, an image having a luminance component L and at least one color component, a decoded version of the differential image The differential brightness component L r and each (the at least one) color component of the image I, the differential brightness component representing a low spatial frequency version decoding version of a luminance component L of the image I and a luminance component of the image I difference between.

在步驟2350中,將一微分影像的解碼版本與該影像的亮度分量的低空間頻率版本解碼版本加起來以取得解碼影像In step 2350, a decoded version of the differential image is Low spatial frequency version decoded version with the luminance component of the image Add up to get the decoded image .

解碼器DEC2的解碼精確度取決於一知覺臨界值△E,其定義上述知覺空間中的一度量上限及致能該影像的一顯示解碼版本中的視覺損失控制。因此解碼精確度係該知覺臨界值的一函數,其區域地變動。 The decoding accuracy of the decoder DEC2 depends on a perceptual threshold Δ E which defines a metric upper limit in the perceptual space and enables visual loss control in a display decoded version of the image. Therefore, the decoding accuracy is a function of the perceptual threshold, which varies regionally.

以上相關步驟1130的說明,根據一實施例,判定知覺臨界值△E係根據編碼影像的一解碼版本顯示的參考照明條件(與用於編碼者相同),及影像I的亮度分量的低空間頻率版本解碼版本In the above description of the related step 1130, according to an embodiment, the determination threshold threshold Δ E is a reference illumination condition (same as for the coder) and a low spatial frequency of the luminance component of the image I according to a decoded version of the encoded image. Version decoding version .

根據步驟2200的一實施例,當一微分影像的各分量已藉由知覺臨界值△E正規化時,該微分影像係以不變精確度進行解碼,及微分影像的解碼版本的各分量Diff藉由知覺臨界值△E進行再正規化。 According to an embodiment of step 2200, when the components have a differential image by normalized perceptual threshold △ E, the differential image based decoding version for each component decoder, and the differential image by a constant accuracy Diff Renormalization is performed by the perceptual threshold Δ E .

根據步驟2200的一實施例,再正規化係乘以一值,該值係知覺臨界值△E的一函數。 According to one embodiment of step 2200, and then multiplied by a normalization based value based perceptual threshold a function of △ E.

就數學而言,例如將該微分影像的解碼版本的各分量再正規化如下: α係一值例如等於0.5或1。 In mathematics, for example, the components of the decoded version of the differential image Renormalize as follows: The alpha system has a value equal to, for example, 0.5 or 1.

根據一變化,在步驟2360中,一模組IIC係配置成將一反知覺變換應用到解碼影像(步驟2350來的輸出),例如將解碼影像 的估算變換到熟知空間(X,Y,Z)。 According to a variation, in step 2360, a module IIC is configured to apply an anti-perceptual transformation to the decoded image. (output from step 2350), for example, will decode the image The estimate is transformed into the well-known space (X, Y, Z).

當選擇知覺空間LabCIE1976時,該反知覺變換係由以下公式提供: Y=Y n f -1(1/116(L*+16)) When the perceptual space LabCIE 1976 is selected, the anti-perceptual transformation is provided by the following formula: Y = Y n f -1 (1/116( L *+16))

當選擇知覺空間Luv時,反知覺變換係由以下公式提供: When the perceptual space Luv is selected, the anti-perceptual transformation is provided by the following formula:

潛在地,將空間(X,Y,Z)中的影像反變換用以取得初始空間如(R,G,B)空間中的解碼影像。 Potentially, the inverse of the image in space (X, Y, Z) is used to obtain decoded images in the initial space such as (R, G, B) space.

潛在地,在步驟2100及/或2200期間,亦將位元流F的資料至少部分地熵解碼。 Potentially, during step 2100 and/or 2200, the data of the bitstream F is also at least partially entropy decoded.

解碼器DEC1(DEC2分別)係配置成將已由編碼器ENC1(ENC2分別)編碼的資料解碼。 The decoder DEC1 (DEC2, respectively) is configured to decode the data that has been encoded by the encoder ENC1 (ENC2 respectively).

編碼器ENC1及/或ENC2(及解碼器DEC1及/或DEC2)未侷限於一特定編碼器(解碼器),但當需要一熵編碼器(解碼器)時,一熵編碼器如霍夫曼(Huffmann)編碼器、算術編碼器,或上下文適應性編碼器像h264/AVC或HEVC中使用的Cabac係有利的。 The encoders ENC1 and/or ENC2 (and decoders DEC1 and/or DEC2) are not limited to a particular encoder (decoder), but when an entropy coder (decoder) is required, an entropy coder such as Hoffman A (Huffmann) encoder, an arithmetic coder, or a context adaptive encoder is advantageous like the Cabac system used in h264/AVC or HEVC.

編碼器ENC1及ENC2(及解碼器DEC1及DEC2)未侷限於一特定編碼器,其例如可係利用損失像JPEG、JPEG2000、MPEG2、h264/AVC或HEVC的一影像/視訊編碼器。 Encoders ENC1 and ENC2 (and decoders DEC1 and DEC2) are not limited to a particular encoder, which may, for example, utilize an image/video encoder that loses JPEG, JPEG2000, MPEG2, h264/AVC or HEVC.

在圖1至9中,該等模組係功能單元,其與可區別的實體單元可相關或不相關,例如,此等模組或其中有些模組可共同在單一零件或電路中,或對一軟體功能性作出貢獻。反過來說,有些模組可潛在地由數個分開實體所組成,適合本發明的裝置係使用純硬體來實現,例如使用專屬硬體如ASIC或FPGA或VLSI(分別是«應用特定積體電路»、«現場 可程式閘陣列»、«極大型積體電路»),或由嵌入一裝置中的數個積體電子零件來實現,或由硬體與軟體零件混合來實現。 In Figures 1 to 9, the modules are functional units that may or may not be related to distinguishable physical units, for example, such modules or some of the modules may be co-located in a single part or circuit, or A software functionality contributes. Conversely, some modules may potentially consist of several separate entities, and devices suitable for the present invention are implemented using pure hardware, such as using proprietary hardware such as ASIC or FPGA or VLSI (application-specific integration, respectively) Circuit », «Site The programmable gate array», «maximum integrated circuit»), can be realized by several integrated electronic parts embedded in a device, or by mixing hardware and software parts.

圖10繪示一裝置1000的示範架構,其可配置用以執行相關圖1至9所述方法。 10 illustrates an exemplary architecture of an apparatus 1000 that is configurable to perform the methods described in relation to FIGS. 1-9.

裝置1000包括以下元件,其藉由一資料及位址匯流排1001鏈接起來:- 一微處理器1002(或CPU),其例如係一DSP(或數位信號處理器);- 一ROM(或唯讀記憶體)1003;- 一RAM(或隨機存取記憶體)1004;- 一I/O(輸入/輸出)介面1005,用以從一應用程式接收資料以用於傳送;及- 一電池1006。 The device 1000 includes the following elements linked by a data and address bus 1001: a microprocessor 1002 (or CPU), for example, a DSP (or digital signal processor); - a ROM (or only Read memory) 1003; - a RAM (or random access memory) 1004; - an I / O (input / output) interface 1005 for receiving data from an application for transmission; and - a battery 1006 .

根據一變化,電池1006係在該裝置的外部,圖10的此等元件各為熟諳此藝者所熟知,將不再贅述。在所述各記憶體中,本發明說明書中所使用的«暫存器»一詞可對應到小容量區(一些位元)或對應到極大區(如整個程式,或大量接收或解碼的資料)。ROM 1003包括至少一程式及數個參數,根據本發明方法的演算式係儲存在ROM 1003中。當開關打開時,CPU 1002將RAM中的程式上傳並執行對應指令。 According to a variation, the battery 1006 is external to the device, and such components of Figure 10 are well known to those skilled in the art and will not be described again. In each of the memories, the term "scratchpad" used in the description of the present invention may correspond to a small capacity area (some bits) or to a maximum area (such as the entire program, or a large amount of received or decoded data). ). The ROM 1003 includes at least one program and a plurality of parameters, and the calculation formula of the method according to the present invention is stored in the ROM 1003. When the switch is turned on, the CPU 1002 uploads the program in the RAM and executes the corresponding instruction.

RAM 1004包括,在一暫存器中係由CPU 1002所執行及裝置1000的開關打開後所上傳的程式,在一暫存器中係輸入資料,在一暫存器中係該方法在不同狀態中的中間資料,及在一暫存器中係執行該方法所使用的其他變數。 The RAM 1004 includes a program uploaded by the CPU 1002 and opened by the switch of the device 1000 in a register, and the data is input in a register, and the method is in a different state in a register. Intermediate data in the middle, and other variables used in the execution of the method in a register.

本文中說明的實作例如可實現在一方法或一處理、一裝置,一軟體程式,一資料流,或一信號中。即若只在單一形式實作的情境中討論(例如只討論作為一方法或一裝置),所討論特點的實作亦可實現在其他形式中(例如一程式)。一裝置例如可實現在適當硬體、軟體及韌體中,該等方法例如可實現在一裝置如一處理器中,其通常指處理裝置,例如包括電腦、微處理器、積體電路或可程式邏輯裝置。處理器亦包括通訊裝置如電腦、手機、可攜式或個人數位助理器("PDA"),及其他有助於終端用戶間資訊通訊的裝置。 The implementations described herein may be implemented, for example, in a method or a process, a device, a software program, a data stream, or a signal. That is, if it is discussed only in the context of a single-form implementation (for example, only as a method or a device), the implementation of the features discussed may be implemented in other forms (eg, a program). A device can be implemented, for example, in a suitable hardware, software, and firmware. The methods can be implemented, for example, in a device such as a processor, which is generally referred to as a processing device, for example, including a computer, a microprocessor, an integrated circuit, or a programmable Logic device. The processor also includes communication devices such as computers, cell phones, portable or personal digital assistants ("PDAs"), and other devices that facilitate communication of information between end users.

根據一實施例,該裝置尚包括構件用以得到編碼影像的一 解碼版本顯示的參考照明條件如最大環境亮度值Y n According to an embodiment, the apparatus further includes means for obtaining a reference illumination condition, such as a maximum ambient brightness value Y n , displayed by a decoded version of the encoded image.

根據一實施例,該裝置包括一顯示器1007,及得到構件用以得到該編碼影像的一解碼版本顯示的參考照明條件,該構件配置成從顯示器1007的一些特性或從顯示器1007周圍由該裝置捕捉到的照明條件中,判定出該編碼影像的一解碼版本顯示的此類參考照明條件。 According to an embodiment, the apparatus includes a display 1007, and a reference illumination condition obtained by the component for obtaining a decoded version of the encoded image, the component being configured to be captured from the display 1007 or from the display 1007 by the device In the lighting conditions that are obtained, such reference lighting conditions displayed by a decoded version of the encoded image are determined.

例如,用以得到最大環境亮度值Y n 的構件係附在顯示器上測量環境條件的一感測器,為此目的可使用光二極體或類似物。 For example, a sensor member for obtaining the maximum system environment brightness Y n is measured with the monitor environmental conditions, may be used for this purpose, or the like photo-diode.

根據編碼或編碼器的一特定實施例,影像I係得自一來源,該來源例如屬於以下各項組成的集合:- 一區域記憶體(1003或1004),如視訊記憶體或RAM(或隨機存取記憶體)、快閃記憶體、ROM(或唯讀記憶體)、硬碟;- 一儲存介面(1005),如具有大量儲存的介面、RAM、快閃記憶體、ROM、光碟或磁墊;- 一通訊介面(1005),如有線介面(例如匯流排介面、廣域網路介面、區域網路介面),或無線介面(如IEEE 802.11介面或藍牙®介面);及- 一影像捕捉電路(如一感測器如CCD(或電荷耦合元件)或CMOS(或互補金氧半導體))。 According to a particular embodiment of the encoding or encoder, the image I is derived from a source, for example, belonging to a collection of: - a regional memory (1003 or 1004), such as video memory or RAM (or random) Access memory), flash memory, ROM (or read-only memory), hard disk; - a storage interface (1005), such as a large storage interface, RAM, flash memory, ROM, CD or magnetic Pad; - a communication interface (1005), such as a wired interface (such as bus interface, WAN interface, regional network interface), or a wireless interface (such as IEEE 802.11 interface or Bluetooth® interface); and - an image capture circuit ( Such as a sensor such as CCD (or charge coupled device) or CMOS (or complementary metal oxide semiconductor).

根據該解碼或解碼器的不同實施例,解碼影像係傳送到一目的地;明確的說,該目的地屬於以下各項組成的集合:- 一區域記憶體(1003或1004),如視訊記憶體或RAM、快閃記憶體、硬碟;- 一儲存介面(1005),如具有大量儲存的介面、RAM、快閃記憶體、ROM、光碟或磁墊;- 一通訊介面(1005),如有線介面(例如匯流排介面(如USB(或通用串列匯流排))、廣域網路介面、區域網路介面、HDMI(高畫質多媒體介面)介面),或無線介面(如IEEE 802.11介面、WiFi®或藍牙®介面);及- 一顯示器。 Decoding the image according to different embodiments of the decoding or decoder Is transmitted to a destination; specifically, the destination belongs to a collection of: - a regional memory (1003 or 1004), such as video memory or RAM, flash memory, hard disk; Storage interface (1005), such as a large storage interface, RAM, flash memory, ROM, CD or magnetic pad; - a communication interface (1005), such as a wired interface (such as bus interface (such as USB (or universal string) Column bus)), WAN interface, regional network interface, HDMI (high-definition multimedia interface) interface, or wireless interface (such as IEEE 802.11 interface, WiFi® or Bluetooth® interface); and - a display.

根據編碼或編碼器的不同實施例,位元流F係傳送到一目的地,作為一範例,位元流F係儲存在一區域或遠端記憶體如視訊記憶體(1004)或RAM(1004)、硬碟(1003)中。在一變化中,該位元流係傳送到一 儲存介面(1005),如具有大量儲存的介面、快閃記憶體、ROM、光碟或磁墊,或透過一通訊介面(1005)傳送,如到點對點鏈接、通訊匯流排、單點對多點鏈接或廣播網路的介面。 According to different embodiments of the encoding or encoder, the bitstream F is transmitted to a destination. As an example, the bitstream F is stored in a region or remote memory such as video memory (1004) or RAM (1004). ), hard disk (1003). In a change, the bit stream is transmitted to a Storage interface (1005), such as with a large number of stored interfaces, flash memory, ROM, CD or magnetic pad, or transmitted through a communication interface (1005), such as to point-to-point links, communication bus, single-point to multi-point links Or the interface of the broadcast network.

根據解碼或解碼器的不同實施例,位元流BF及/或F係得自一來源,作為一範例,該位元流係讀取自一區域記憶體,如視訊記憶體(1004)、RAM(1004)、ROM(1003)、快閃記憶體(1003)或硬碟(1003)中。在一變化中,該位元流係接收自一儲存介面(1005),如具有大量儲存的介面、RAM、ROM、快閃記憶體、光碟或磁墊,及/或接收自一通訊介面(1005),如到點對點鏈接、匯流排、單點對多點鏈接或廣播網路的介面。 According to different embodiments of the decoder or decoder, the bit stream BF and/or F is derived from a source, as an example, the bit stream is read from a region memory, such as video memory (1004), RAM. (1004), ROM (1003), flash memory (1003) or hard disk (1003). In one variation, the bit stream is received from a storage interface (1005), such as a mass storage interface, RAM, ROM, flash memory, optical disk or magnetic pad, and/or received from a communication interface (1005) ), such as to a point-to-point link, bus, single-to-multipoint link or broadcast network interface.

根據不同實施例,裝置1000係配置用以實施相關圖1,3-6及8所述的編碼方法,該裝置屬於以下各項組成的集合:- 行動裝置;- 通訊裝置;- 遊戲裝置;- 機上盒;- 平板(或平板電腦);- 膝上型電腦;- 靜態影像攝影機- 視訊攝影機;- 編碼晶片;- 靜態影像伺服器;及- 視訊伺服器(如廣播伺服器,隨選視訊伺服器或一網站伺服器)。 According to various embodiments, the apparatus 1000 is configured to implement the encoding method described in relation to Figures 1,3-6 and 8, which is a collection of the following components: - a mobile device; - a communication device; - a gaming device; Set-top box; - tablet (or tablet); - laptop; - still image camera - video camera; - coded chip; - still image server; and - video server (eg broadcast server, video on demand) Server or a web server).

根據不同實施例,裝置1000係配置用以實施相關圖2,7及9所述的解碼方法,該裝置屬於以下各項組成的集合:- 行動裝置;- 通訊裝置;- 遊戲裝置;- 機上盒;- 電視機;- 平板(或平板電腦); - 膝上型電腦;- 顯示器;及- 解碼晶片。 According to various embodiments, the apparatus 1000 is configured to implement the decoding method described in relation to Figures 2, 7 and 9, which is a collection of the following components: - a mobile device; - a communication device; - a gaming device; - an onboard Box;- TV;- tablet (or tablet); - laptop; - display; and - decoder chip.

根據圖11中繪示的一實施例,透過一通訊網路NET在二遠端裝置A與B之間的一傳輸情境中,裝置A包括配置用以實施如相關圖1所述一影像編碼方法的構件,及裝置B包括配置用以實施如相關圖2所述解碼方法的構件。 According to an embodiment illustrated in FIG. 11, in a transmission scenario between two remote devices A and B through a communication network NET, the device A includes a configuration for implementing an image encoding method as described in relation to FIG. The components, and apparatus B, include components configured to implement the decoding method as described in relation to FIG.

根據本發明的一變化,該網路係一廣播網路,調適用以將靜態影像或視訊影像從裝置A播放到包括裝置B的數個解碼裝置。 According to a variant of the invention, the network is a broadcast network adapted to play still or video images from device A to a plurality of decoding devices comprising device B.

本文中所述各種不同處理及特點的實作可具體實現在各種不同的設備或應用程式中,尤其例如在設備或應用程式中。此類設備範例包括編碼器、解碼器、後處理器(用以處理解碼器來的輸出)、前處理器(用以提供輸入到編碼器)、視訊編碼器、視訊解碼器、視訊編解碼器、網站伺服器、機上盒、膝上型電腦、個人電腦、手機、PDA,及其他通訊裝置。顯然地,該設備可以是移動式,甚至安裝在汽車中。 The various processes and features described herein can be implemented in a variety of different devices or applications, such as in a device or application, for example. Examples of such devices include an encoder, a decoder, a post-processor (to process the output of the decoder), a pre-processor (to provide input to the encoder), a video encoder, a video decoder, and a video codec. , web servers, set-top boxes, laptops, personal computers, cell phones, PDAs, and other communication devices. Obviously, the device can be mobile or even installed in a car.

此外,該等方法可藉由一處理器所執行的指令來實施,及此類指令(及/或一實作所產生的資料值)可儲存在一處理器可讀媒體如積體電路、軟體載體,或其他儲存裝置如硬碟、CD光碟("CD")、光碟(如DVD,常指多樣化數位光碟或數位視訊光碟)、隨機存取記憶體("RAM")或唯讀記憶體("ROM")上。該等指令可形成一應用程式,其有形具體化在一處理器可讀媒體上,指令例如係可在硬體、韌體、軟體或一組合中,指令例如係可在作業系統、分開的應用程式或二者的組合中找到。因此一處理器的特徵例如係可作為一裝置配置用以執行一處理,同時亦是一裝置包含有一處理器可讀媒體(如一儲存裝置),具有指令用以實施一處理。此外,一處理器可讀媒體可儲存(在指令外添加或代替指令)一實作所產生的資料值。 Moreover, the methods can be implemented by instructions executed by a processor, and such instructions (and/or data values generated by an implementation) can be stored in a processor readable medium such as an integrated circuit, software. Carrier, or other storage device such as a hard disk, a compact disc ("CD"), a compact disc (such as a DVD, often referred to as a variety of digital or digital video discs), random access memory ("RAM"), or read-only memory ("ROM"). The instructions may form an application tangibly embodied on a processor readable medium, such as in hardware, firmware, software, or a combination, the instructions being, for example, in an operating system, separate applications Found in the program or a combination of both. Thus, a processor may be configured, for example, as a device for performing a process, and a device comprising a processor readable medium (such as a storage device) having instructions for performing a process. In addition, a processor readable medium can store (adding or replacing instructions in addition to instructions) a data value generated by the implementation.

如熟諳此藝者所明白,數個實作可產生格式化用以攜帶資訊的各種信號,其例如可加以儲存或傳送,該資訊例如可包括用以執行一方法的指令,或所述實作中的一者所產生的資料。例如,可將一信號格式化,用以攜帶作為資料,係用以寫入或讀取所述一實施例的語法規則,或 用以攜帶作為資料,係所述一實施例所寫成的實際語法值。例如可將此一信號格式化作為一電磁波(例如使用頻譜的一射頻部分)或作為一頻帶信號。格式化例如可包括將一資料流編碼,及利用編碼的資料流來調變一載波。信號所攜帶的資訊例如可以是類比或數位資訊,如所知,信號可透過各種不同有線或無線鏈接傳送,信號可儲存在一處理器可讀媒體上。 As will be appreciated by those skilled in the art, several implementations can generate various signals formatted for carrying information, which can be stored or transmitted, for example, which can include, for example, instructions for performing a method, or the implementation. Information produced by one of them. For example, a signal can be formatted for carrying as a material for writing or reading the grammar rules of the embodiment, or The information used to carry as the data is the actual grammatical value written in the embodiment. For example, the signal can be formatted as an electromagnetic wave (e.g., using a radio frequency portion of the spectrum) or as a frequency band signal. Formatting, for example, can include encoding a data stream and utilizing the encoded data stream to modulate a carrier. The information carried by the signal can be, for example, analog or digital information. As is known, the signals can be transmitted over a variety of different wired or wireless links, and the signals can be stored on a processor readable medium.

已說明數個實作,然而,應瞭解可作出不同修改,例如可將不同實作的元件結合、增補、修改或移除以產生其他實作。此外,一般技術人員應瞭解其他結構或處理可取代已揭示者,及作為結果的實作將如所揭示實作,依(數個)至少大體上相同的方式,執行(數個)至少大體上相同的功能,以達成(數個)至少大體上相同的結果,因此,由本發明可想出此等及其他實作。 Several implementations have been described, however, it should be understood that various modifications may be made, such as combining, adding, modifying, or removing elements of different implementations to produce other implementations. In addition, one of ordinary skill in the art will appreciate that other structures or processes may be substituted for those disclosed, and the resulting implementations will be implemented as a plurality of at least substantially the same. The same function is used to achieve (several) at least substantially the same result, and thus, such and other implementations are conceivable from the present invention.

1000‧‧‧照明圖判定步驟 1000‧‧‧Lighting chart determination steps

1100‧‧‧照明圖編碼步驟 1100‧‧‧ Lighting pattern coding steps

1200‧‧‧信號化資料步驟 1200‧‧‧ Signaling data steps

1300‧‧‧殘餘影像編碼步驟 1300‧‧‧Residual image coding steps

ENC1,ENC2‧‧‧編碼器 ENC1, ENC2‧‧‧ encoder

F‧‧‧位元流 F‧‧‧ bit flow

I‧‧‧影像 I‧‧‧ images

IM‧‧‧照明圖 IM‧‧‧Lighting diagram

PRP‧‧‧照明圖判定模組 PRP‧‧‧Lighting Chart Judgment Module

RI‧‧‧殘餘影像 RI‧‧‧ residual image

SD‧‧‧信號化資料 SD‧‧‧Signalization data

SME‧‧‧信號化資料編碼模組 SME‧‧‧ Signaling Data Encoding Module

Claims (19)

一種將一影像編碼成一位元流之方法,其特徵為包括以下步驟:- 將一判定(1000)自該影像之照明圖編碼(1100)至該位元流中;及- 將一信號化資料編碼(1200)至該位元流中,該信號化資料指示該位元流包括該照明圖。 A method of encoding an image into a bit stream, comprising the steps of: - encoding (1000) a decision from the image (1100) into the bit stream; and - signaling a data Encoding (1200) into the bitstream, the signalling material indicates that the bitstream includes the illumination map. 如申請專利範圍第1項之方法,其中該方法尚包括以下步驟:- 將一判定自該影像及該照明圖之殘餘影像編碼(1300)至該位元流中。 The method of claim 1, wherein the method further comprises the step of: encoding (1300) a residual image from the image and the illumination image into the bit stream. 如申請專利範圍第1或2項之方法,其中將該照明圖編碼作為一輔助圖像,其語法符合H264/AVC或HEVC標準。 The method of claim 1 or 2, wherein the illumination pattern is encoded as an auxiliary image, the syntax of which conforms to the H264/AVC or HEVC standard. 如申請專利範圍第2或3項之方法,其中將該殘餘影像編碼作為一主要圖像,其語法符合H264/AVC或HEVC標準。 The method of claim 2, wherein the residual image is encoded as a primary image, the syntax of which conforms to the H264/AVC or HEVC standard. 如申請專利範圍第1至4項中任一項之方法,其中該照明圖係一背光影像,及藉由將該影像除以該背光影像之一解碼版本,得到該殘餘影像。 The method of any one of claims 1 to 4, wherein the illumination image is a backlit image, and the residual image is obtained by dividing the image by a decoded version of the backlight image. 如申請專利範圍第5項之方法,其中在編碼前將該殘餘影像進行色調映射。 The method of claim 5, wherein the residual image is subjected to tone mapping before encoding. 如申請專利範圍第4項之方法,其中將該殘餘影像進行色調映射包括,根據該殘餘影像之像素值,一伽瑪校正或一SLog(S形對數)校正。 The method of claim 4, wherein the mapping of the residual image to the tone comprises, based on the pixel value of the residual image, a gamma correction or a SLog (S-logarithm) correction. 如申請專利範圍第1至7項中任一項之方法,其中該方法又包括該殘餘影像在編碼前之定標。 The method of any one of claims 1 to 7, wherein the method further comprises the calibration of the residual image prior to encoding. 如申請專利範圍第1至8項中任一項之方法,其中該方法又包括在編碼前將該殘餘影像進行剪輯。 The method of any one of claims 1 to 8, wherein the method further comprises editing the residual image prior to encoding. 一種將一位元流解碼之方法,該位元流表示一影像,該方法之特徵為包括以下步驟:- 檢測(2000)該位元流中是否一信號化資料指示該位元流包括資料相關一判定自待解碼影像之照明圖;- 藉由將該位元流至少部分地解碼,得到(2100)一解碼照明圖;及- 從一解碼殘餘影像及該解碼照明圖中得出(2300)一解碼影像。 A method of decoding a bit stream, the bit stream representing an image, the method comprising the steps of: - detecting (2000) whether a signalized data in the bit stream indicates that the bit stream includes data correlation Determining an illumination pattern of the image to be decoded; - by at least partially decoding the bit stream, obtaining (2100) a decoded illumination image; and - deriving from a decoded residual image and the decoded illumination image (2300) A decoded image. 如申請專利範圍第10項之方法,其中藉由將該位元流至少部分地 解碼,得到該解碼殘餘影像。 The method of claim 10, wherein the bit stream is at least partially Decoding to obtain the decoded residual image. 如申請專利範圍第10或11項之方法,其中該信號化資料係檢測自高階語法元素,及其使用係由一SEI訊息完成。 The method of claim 10, wherein the signalling data is detected from a higher-order syntax element and its use is performed by an SEI message. 如申請專利範圍第10至12項中任一項之方法,其中該位元流包括一主要圖像及一輔助圖像,其語法符合H264/AVC或HEVC標準,及其中主要圖像表示殘餘影像,及輔助圖像表示照明圖。 The method of any one of claims 10 to 12, wherein the bit stream comprises a primary image and an auxiliary image, the syntax of which conforms to the H264/AVC or HEVC standard, and wherein the primary image represents residual image And the auxiliary image represents the illumination map. 如申請專利範圍第10至13項中任一項之方法,其中解碼照明圖係一背光影像,及其中藉由將解碼殘餘影像乘以背光影像,得到解碼影像。 The method of any one of claims 10 to 13, wherein the decoded illumination image is a backlit image, and wherein the decoded residual image is multiplied by the backlight image to obtain a decoded image. 如申請專利範圍第10至14項中任一項之方法,其中在將解碼殘餘影像乘以背光影像前,先將解碼殘餘影像進行反色調映射。 The method of any one of claims 10 to 14, wherein the decoded residual image is inversely mapped prior to multiplying the decoded residual image by the backlit image. 如申請專利範圍第1至4項中任一項之方法,其中照明圖係待編碼影像之亮度分量之一低空間頻率版本,及藉由計算該影像之亮度分量與該編碼低空間頻率版本之一解碼版本間之差異,得到殘餘影像。 The method of any one of claims 1 to 4, wherein the illumination map is a low spatial frequency version of one of the luminance components of the image to be encoded, and by calculating a luminance component of the image and the encoded low spatial frequency version A residual image is obtained by decoding the difference between the versions. 一種將一影像編碼之裝置,包括構件用以:- 將一判定(1000)自該影像之照明圖編碼(ENC1)至該位元流中;- 將一信號化資料編碼(ENC2)至該位元流中,該信號化資料指示該位元流包括該照明圖。 A device for encoding an image, comprising: means for: - encoding a decision (1000) from the illumination pattern of the image (ENC1) into the bit stream; - encoding a signalized data (ENC2) to the bit In the metaflow, the signalling material indicates that the bitstream includes the illumination map. 一種將一位元流解碼之裝置,該位元流表示一影像,該方法之特徵為包括以下步驟:- 檢測(SMD)該位元流中是否一信號化資料指示該位元流表示一判定自該影像之照明圖;- 藉由將該位元流至少部分地解碼,得到(DEC1)一解碼照明圖;及- 從一解碼殘餘影像及該解碼照明圖中得出(DEC2)一解碼影像。 A device for decoding a bit stream, the bit stream representing an image, the method comprising the steps of: - detecting (SMD) whether a signalized data in the bit stream indicates that the bit stream represents a decision Illumination map from the image; - by decoding the bit stream at least partially, to obtain (DEC1) a decoded illumination map; and - deriving (DEC2) a decoded image from a decoded residual image and the decoded illumination image . 一種位元流,表示一影像,該位元流之特徵為包括一信號化資料,指示該位元流表示一判定自該影像之照明圖。 A bit stream representing an image, the bit stream being characterized by a signalized data indicating that the bit stream represents a lighting pattern determined from the image.
TW103145096A 2013-12-27 2014-12-24 Method and device for encoding a high-dynamic range image into a bitstream and/or decoding a bitstream representing a high-dynamic range image TW201540052A (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP13306885 2013-12-27

Publications (1)

Publication Number Publication Date
TW201540052A true TW201540052A (en) 2015-10-16

Family

ID=49955860

Family Applications (1)

Application Number Title Priority Date Filing Date
TW103145096A TW201540052A (en) 2013-12-27 2014-12-24 Method and device for encoding a high-dynamic range image into a bitstream and/or decoding a bitstream representing a high-dynamic range image

Country Status (2)

Country Link
TW (1) TW201540052A (en)
WO (1) WO2015097118A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
BR112017015790A2 (en) 2015-01-30 2018-03-27 Thomson Licensing method and apparatus for coding and decoding a color image
EP3107300A1 (en) 2015-06-15 2016-12-21 Thomson Licensing Method and device for encoding both a high-dynamic range frame and an imposed low-dynamic range frame
EP3113496A1 (en) 2015-06-30 2017-01-04 Thomson Licensing Method and device for encoding both a hdr picture and a sdr picture obtained from said hdr picture using color mapping functions
EP3185561A1 (en) 2015-12-23 2017-06-28 THOMSON Licensing Methods and devices for encoding and decoding frames with a high dynamic range, and corresponding signal and computer program
US11941789B2 (en) 2018-10-19 2024-03-26 Gopro, Inc. Tone mapping and tone control integrations for image processing

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8218625B2 (en) * 2004-04-23 2012-07-10 Dolby Laboratories Licensing Corporation Encoding, decoding and representing high dynamic range images
US9571856B2 (en) * 2008-08-25 2017-02-14 Microsoft Technology Licensing, Llc Conversion operations in scalable video encoding and decoding
EP2583272B1 (en) * 2010-06-21 2016-01-20 Dolby Laboratories Licensing Corporation Displaying images on local-dimming displays

Also Published As

Publication number Publication date
WO2015097118A1 (en) 2015-07-02

Similar Documents

Publication Publication Date Title
JP7356478B2 (en) Method and apparatus for encoding and decoding color pictures
KR102367205B1 (en) Method and device for encoding both a hdr picture and a sdr picture obtained from said hdr picture using color mapping functions
US9924178B2 (en) Method and device for encoding a high-dynamic range image and/or decoding a bitstream
JP6568078B2 (en) Method and apparatus for encoding image data and method and apparatus for decoding image data
JP2019513323A (en) Method and apparatus for encoding high dynamic range pictures, corresponding encoding method and encoding apparatus
US20170171565A1 (en) Method and apparatus for predicting image samples for encoding or decoding
TW201540052A (en) Method and device for encoding a high-dynamic range image into a bitstream and/or decoding a bitstream representing a high-dynamic range image
KR20160019060A (en) Encoding and decoding methods for adapting the average luminance of high dynamic range pictures and corresponding encoder and decoder
JP7043164B2 (en) Methods and Devices for Encoding Both High Dynamic Range Frames and Impose Low Dynamic Range Frames
TW201536028A (en) Method and device for encoding a high-dynamic range image
WO2015097135A1 (en) Method and device for encoding a high-dynamic range image
WO2015097126A1 (en) Method and device for encoding a high-dynamic range image and/or decoding a bitstream
EP3113494A1 (en) Method and device for encoding a high-dynamic range image
WO2015177132A1 (en) Method and device for encoding a high-dynamic range frame and/or decoding a bitstream
TW201705764A (en) A method for encoding/decoding a picture block
TW201531089A (en) Method and apparatus for encoding image data and method and apparatus for decoding image data
WO2015097124A1 (en) Method and device for encoding a high-dynamic range image and/or decoding a bitstream
WO2015097131A1 (en) Method and device for encoding a high-dynamic range image
WO2015097129A1 (en) Method and device for encoding a high-dynamic range image
WO2015097134A1 (en) Method and device for encoding a high-dynamic range image and/or decoding a bitstream
WO2015177136A1 (en) Method and device for encoding a high-dynamic range frame and/or decoding a bitstream
WO2015177139A1 (en) Method and device for encoding a high-dynamic range frame and/or decoding a bitstream
WO2015177133A1 (en) Method and device for encoding a high-dynamic range frame and/or decoding a bitstream