WO2010072983A2 - Method for encoding an image by segmentation (Procede d'encodage par segmentation d'une image) - Google Patents
- Publication number
- WO2010072983A2 (PCT/FR2009/052681)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- elementary
- type
- compression
- image
- elementary block
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N1/00—Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
- H04N1/41—Bandwidth or redundancy reduction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/12—Selection from among a plurality of transforms or standards, e.g. selection between discrete cosine transform [DCT] and sub-band transform or selection between H.263 and H.264
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/136—Incoming video signal characteristics or properties
- H04N19/14—Coding unit complexity, e.g. amount of activity or edge presence estimation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/186—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/48—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using compressed domain processing techniques other than decoding, e.g. modification of transform coefficients, variable length coding [VLC] data or run-length data
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/80—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/85—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
Definitions
- the present invention relates to a method of encoding by segmentation of a video image.
- the purpose of the invention is in particular to improve the rendering quality of an image that has previously been subjected to a compression operation intended to limit the data rate necessary for storing and/or transmitting the information relating to the image once encoded.
- the encoding operation according to the invention is carried out in particular so as to preserve a maximum of detail in areas of the image corresponding to so-called HMI (Human-Machine Interface) areas, relative to other areas of the image corresponding to photo or video content.
- the field of the invention is, in general, that of the compression of video images.
- the field of the invention will be more particularly detailed in a context essentially involving the so-called H264 compression standard, without this aspect limiting the scope of the invention.
- other compression standards for example of the VC1 or DivX type, even if they are less advantageous in certain contexts, can be used in the implementation of the method according to the invention.
- the different video coding standards are all based on the same basic principles. On the one hand, they exploit the temporal or spatial redundancy of the information to reduce the amount of data, without necessarily introducing losses. On the other hand, certain information, or details, are deleted, which introduces losses in the restored image, said losses generally being targeted according to the properties of human visual perception. Indeed, certain details of the images are not perceived by the eye and can be omitted. A compressed video stream is then obtained.
- the main steps of video compression are as follows:
- division of the image into blocks, which are rectangular regions that have a size between 4 * 4 and 16 * 16 pixels;
- motion estimation;
- a DCT (Discrete Cosine Transform);
- the data from the DCT is quantized by being encoded on a limited number of bits. This is where the loss of information comes in.
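The effect of the quantization step can be illustrated with a minimal sketch. It assumes a floating-point one-dimensional DCT-II applied to a single row of 8 samples and a uniform quantization step; it is not the transform or quantizer of any particular standard. The sample values and the name `QUANT_STEP` are hypothetical and chosen only for illustration.

```python
import math

QUANT_STEP = 16  # hypothetical uniform quantization step


def _scale(k, n):
    """Orthonormal scaling factor of the DCT-II basis function k."""
    return math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)


def dct_1d(samples):
    """Orthonormal DCT-II of a list of samples."""
    n = len(samples)
    return [
        _scale(k, n)
        * sum(x * math.cos(math.pi * (i + 0.5) * k / n) for i, x in enumerate(samples))
        for k in range(n)
    ]


def idct_1d(coeffs):
    """Inverse of dct_1d."""
    n = len(coeffs)
    return [
        sum(_scale(k, n) * c * math.cos(math.pi * (i + 0.5) * k / n)
            for k, c in enumerate(coeffs))
        for i in range(n)
    ]


def quantize(coeffs, step):
    """Encode each coefficient on a limited number of levels."""
    return [round(c / step) for c in coeffs]


def dequantize(levels, step):
    """Rebuild approximate coefficients from the quantized levels."""
    return [q * step for q in levels]


row = [52, 55, 61, 66, 70, 61, 64, 73]  # hypothetical pixel values
coeffs = dct_1d(row)

# Without quantization the transform is reversible (up to rounding error).
exact = idct_1d(coeffs)
exact_error = max(abs(a - b) for a, b in zip(row, exact))

# With quantization, the restored samples differ from the originals.
restored = idct_1d(dequantize(quantize(coeffs, QUANT_STEP), QUANT_STEP))
lossy_error = max(abs(a - b) for a, b in zip(row, restored))
```

In this sketch the transform alone is reversible, and the difference between `row` and `restored` comes entirely from rounding the coefficients to multiples of `QUANT_STEP`.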
- the context in which the present invention will be described is that of media center applications, in which a computer in a home is used remotely from different points of the home, so that various services can be accessed through client stations, for example a digital television decoder, distributed throughout the home.
- the H264 standard provides for two types of compression, illustrated in FIG. 1.
- a first type of compression 100, called lossless compression mode, or lossless compression, makes it possible to obtain, from an original image 103, a restored image 104 after a compression phase 105 which induces no loss in the restored image 104.
- a second type of compression 101, called lossy compression mode, or lossy compression, makes it possible to obtain, from an original image 106, a restored image 107 after a compression phase 108 which induces a loss of information in the restored image 107 relative to the original image 106; this loss of information results in a decrease in the quality of the image, especially in terms of sharpness.
- the H264 standard is preferred for the transmission of video through the network created. But this standard is, as explained above, likely to produce information losses during compression operations involved in the encoding of video data in particular. Basically, these losses are supposed not to be really discernible by the human eye; this was particularly the case when the video data to be encoded and transmitted were only of the type of photos or television program images, for example. However, in some cases, the defects introduced by these information losses can become very visible. This is particularly the case when we have images such as buttons, menus, or any other element containing text and a lot of details.
- the method according to the invention proposes a solution to the problems and disadvantages which have just been exposed.
- a solution is proposed for improving the rendering quality of the image to be rendered.
- it is sought in particular to distinguish areas of the image corresponding to HMIs, whose rendering quality must be optimized, from areas of the image corresponding to photo, video or image content, for which lossy compression can be accepted.
- the invention therefore essentially relates to a method of encoding an image, said encoding being of mixed type, with the possibility of using a first type of compression, lossless, and a second type of compression, lossy, said method comprising the step of dividing the image into a plurality of elementary blocks; characterized in that said method comprises the additional steps of:
- the method according to the invention may comprise, in addition to the main steps which have just been mentioned in the preceding paragraph, one or more additional characteristics among the following: the step of determining the elementary blocks having a high level of detail comprises the various operations consisting of, for each elementary block considered:
- the various operations of the step of determining the elementary blocks having a high level of detail are carried out for each of the color planes of the image considered, the elementary block considered being identified as an elementary block having a high level of detail if for one of the color planes considered, the rate of high frequency components is greater than a predetermined specific threshold, each color plane being associated with a specific threshold;
- the method comprises the additional steps of:
- the method comprises the additional step of applying the first type of compression to any elementary block of homogeneous elementary block type
- the encoding is of the H264 encoding type.
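The per-color-plane rule stated in the characteristics above (a block has a high level of detail as soon as one color plane exceeds its own threshold) can be sketched as follows. The plane names `Y`, `Cb`, `Cr` and the threshold values are assumptions made for the example; the text does not specify them.

```python
# Hypothetical per-plane thresholds on the rate of high frequency components.
PLANE_THRESHOLDS = {"Y": 0.20, "Cb": 0.30, "Cr": 0.30}


def is_high_detail(rates, thresholds=PLANE_THRESHOLDS):
    """Return True if, for at least one color plane, the rate of high
    frequency components is greater than the threshold of that plane.

    rates: dict mapping a plane name to its rate (between 0.0 and 1.0).
    """
    return any(rates[plane] > limit for plane, limit in thresholds.items())


# Only the Cb plane exceeds its threshold: the block is still high detail.
chroma_only = is_high_detail({"Y": 0.05, "Cb": 0.45, "Cr": 0.10})

# No plane exceeds its threshold: the block is not high detail.
smooth = is_high_detail({"Y": 0.10, "Cb": 0.10, "Cr": 0.10})
```

Using `any` reflects the "for one of the color planes" wording: a single plane above its threshold is sufficient.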
- FIG. 2 schematically illustrates the encoding method according to the invention
- FIG. 3 illustrates an exemplary implementation of the method according to the invention in which certain elementary blocks of the image to be compressed are compressed in a lossless compression mode after the application of particular criteria for determining the compression mode;
- FIG. 4 illustrates an example of an image encoded by an exemplary implementation of the method according to the invention.
- in the invention, it is sought to segment an image to be encoded by using particular criteria to determine whether each elementary block considered must be encoded in a lossless compression mode or in a lossy compression mode.
- the criteria defined are intended to allow the elements of the HMI (buttons, menus, etc.) to be distinguished from the rest of the image.
- An essential criterion lies in the massive or non-existent presence of texts in each elementary block considered.
- in the invention, it is proposed, in a first step, to analyze the spectral content of each elementary block; such an approach is justified by the fact that the text elements in an image are characterized by a large number of abrupt transitions in brightness and/or chrominance.
- FIG. 2 illustrates such a principle.
- an elementary block 201 containing a button-type graphic element 202 is shown.
- the elementary block is initially subjected to a filter 203, equivalent to an edge detection.
- This is a high pass filtering to obtain a high frequency elementary block 204.
- Such filtering amplifies the sudden variations of the image contained in the elementary block considered, and attenuates the smooth parts, without details, of the original image. We thus find, in the high frequency elementary block, a large number of very bright pixels where there is text, or a lot of detail.
- the high frequency image 204 may be obtained by a differentiating filter such as Laplacian.
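As an illustration of the high-pass filtering described above, here is a minimal sketch of a Laplacian filter applied to one elementary block. The 4-neighbor kernel, the replication of border pixels and the use of the absolute value are assumptions made for the example; the text only states that a differentiating filter such as a Laplacian may be used.

```python
def laplacian_high_pass(block):
    """Apply a 4-neighbor Laplacian to a block given as a list of rows.

    Pixels outside the block are replaced by the nearest border pixel
    (an assumption of this sketch). The absolute value of the response
    is returned, so that abrupt transitions appear as bright pixels.
    """
    height, width = len(block), len(block[0])

    def pixel(y, x):
        # Clamp coordinates to the block to replicate border pixels.
        y = min(max(y, 0), height - 1)
        x = min(max(x, 0), width - 1)
        return block[y][x]

    return [
        [
            abs(4 * pixel(y, x)
                - pixel(y - 1, x) - pixel(y + 1, x)
                - pixel(y, x - 1) - pixel(y, x + 1))
            for x in range(width)
        ]
        for y in range(height)
    ]


# A flat block produces no response; a sharp vertical edge produces
# bright pixels on both sides of the transition.
flat = laplacian_high_pass([[128] * 4 for _ in range(4)])
edge = laplacian_high_pass([[0, 0, 255, 255] for _ in range(4)])
```

Smooth areas give values close to zero, while text-like transitions give large values, which is the property the subsequent thresholding relies on.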
- a thresholding operation 205 is performed to determine whether the compression of the elementary block must be of the lossless or lossy type.
- the following steps are provided:
- a thresholding step applied to the high frequency image, where the value "1" is assigned to each pixel whose frequency value is greater than a previously determined threshold, advantageously set empirically;
- a decision step: if the number obtained in the previous step is greater than a given value, determined for example empirically, the elementary block considered is marked as lossless; otherwise, the elementary block considered is marked as lossy.
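The thresholding and decision steps can be sketched as below. The sketch assumes that the "number obtained in the previous step" is the count of pixels assigned the value "1"; the function name and both threshold values are hypothetical and would in practice be set empirically, as the text indicates.

```python
def classify_block(high_freq_block, pixel_threshold, count_threshold):
    """Mark an elementary block as "lossless" or "lossy".

    high_freq_block: list of rows of high frequency pixel values.
    pixel_threshold: a pixel above this value is assigned "1".
    count_threshold: more than this many "1" pixels marks the block lossless.
    """
    # Thresholding step: binarize the high frequency image.
    binary = [[1 if value > pixel_threshold else 0 for value in row]
              for row in high_freq_block]
    # Assumed counting step: number of pixels assigned the value "1".
    count = sum(sum(row) for row in binary)
    # Decision step.
    return "lossless" if count > count_threshold else "lossy"


detailed = classify_block([[0, 255, 255, 0]] * 4, pixel_threshold=100, count_threshold=4)
smooth = classify_block([[0, 3, 2, 0]] * 4, pixel_threshold=100, count_threshold=4)
```

Here `detailed` contains 8 pixels above the pixel threshold and is marked lossless, while `smooth` contains none and is marked lossy.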
- an image 300 composed, for example, of 16 elementary blocks is thus obtained, with a plurality of elementary blocks marked lossless, represented in hatched form, the other elementary blocks being marked lossy, represented unhatched.
- an elementary block 301 which, at the end of the thresholding operation 205, is marked lossy but is framed by a first elementary block 302 marked lossless and by a second elementary block 303 also marked lossless, is finally marked lossless. It will therefore be subject to lossless compression.
- Such an operation improves the rendering of the image that will be subsequently restored, avoiding too many transitions between the lossless compressed elementary blocks and the lossy compressed elementary blocks.
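The re-marking of framed blocks can be sketched as follows. The sketch assumes that "framed" means lying directly between two lossless blocks on the same row or on the same column, and it repeats the pass until nothing changes so that no lossy block remains directly between two lossless blocks; both choices are assumptions of this example.

```python
def promote_framed_blocks(labels):
    """Return a copy of the label grid in which every "lossy" block lying
    directly between two "lossless" blocks (left/right or top/bottom)
    is re-marked "lossless"."""
    grid = [row[:] for row in labels]
    rows, cols = len(grid), len(grid[0])

    def is_lossless(r, c):
        return 0 <= r < rows and 0 <= c < cols and grid[r][c] == "lossless"

    changed = True
    while changed:  # labels only go from lossy to lossless, so this terminates
        changed = False
        for r in range(rows):
            for c in range(cols):
                if grid[r][c] != "lossy":
                    continue
                horizontal = is_lossless(r, c - 1) and is_lossless(r, c + 1)
                vertical = is_lossless(r - 1, c) and is_lossless(r + 1, c)
                if horizontal or vertical:
                    grid[r][c] = "lossless"
                    changed = True
    return grid


before = [
    ["lossless", "lossy", "lossless"],
    ["lossy", "lossy", "lossy"],
]
after = promote_framed_blocks(before)
```

In this example only the middle block of the first row is promoted; the blocks of the second row are not framed on both sides and remain lossy.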
- homogeneous elementary blocks which, after the thresholding operation 205, have been marked lossy, are transformed into elementary blocks marked lossless.
- A homogeneous elementary block, also called a flat zone, denotes a zone of zero gradient: such zones are characterized, for the three color components considered, by a zero vector derivative along two perpendicular axes of the image considered.
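A homogeneity test consistent with this definition can be sketched as below. The derivative is approximated here by differences between adjacent pixels along the horizontal and vertical axes, and the block is given as a list of three color planes; both are assumptions of this example.

```python
def is_homogeneous(planes):
    """Return True if, in every color plane, the differences between
    adjacent pixels are zero along both the horizontal and vertical axes.

    planes: list of color planes, each a list of rows of pixel values.
    """
    for plane in planes:
        height, width = len(plane), len(plane[0])
        for y in range(height):
            for x in range(width):
                # Horizontal difference with the right-hand neighbor.
                if x + 1 < width and plane[y][x + 1] != plane[y][x]:
                    return False
                # Vertical difference with the neighbor below.
                if y + 1 < height and plane[y + 1][x] != plane[y][x]:
                    return False
    return True


flat_block = [[[10] * 4] * 4, [[128] * 4] * 4, [[200] * 4] * 4]
textured_block = [[[10] * 4] * 4, [[128, 128, 129, 128]] * 4, [[200] * 4] * 4]

flat_result = is_homogeneous(flat_block)
textured_result = is_homogeneous(textured_block)
```

A single non-zero difference in any one of the three planes is enough for the block not to be considered a flat zone.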
- FIG. 4 shows an image 402, consisting of a first window
- Lossless compression element blocks 403, shown hatched, and lossy compression element blocks 404, shown unhatched, are illustrated in this figure.
- the image 402 has undergone the encoding method according to the invention, with the application of the first advantageous mode which has just been described. Thus, no lossy elementary block is found directly between two lossless elementary blocks.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Discrete Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Compression Of Band Width Or Redundancy In Fax (AREA)
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP09806104.7A EP2380352B1 (fr) | 2008-12-23 | 2009-12-23 | Procédé d'encodage par segmentation d'une image |
BRPI0923548A BRPI0923548A2 (pt) | 2008-12-23 | 2009-12-23 | processo de codificação por segmentação de uma imagem |
CN200980156286.XA CN102308582B (zh) | 2008-12-23 | 2009-12-23 | 用于对图像进行分割编码的方法 |
US13/141,499 US8630500B2 (en) | 2008-12-23 | 2009-12-23 | Method for the encoding by segmentation of a picture |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR0859018A FR2940577B1 (fr) | 2008-12-23 | 2008-12-23 | Procede d'encodage par segmentation d'une image. |
FR0859018 | 2008-12-23 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2010072983A2 (fr) | 2010-07-01 |
WO2010072983A3 (fr) | 2010-10-07 |
Family
ID=41111033
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/FR2009/052681 WO2010072983A2 (fr) | 2008-12-23 | 2009-12-23 | Procede d'encodage par segmentation d'une image |
Country Status (6)
Country | Link |
---|---|
US (1) | US8630500B2 (fr) |
EP (1) | EP2380352B1 (fr) |
CN (1) | CN102308582B (fr) |
BR (1) | BRPI0923548A2 (fr) |
FR (1) | FR2940577B1 (fr) |
WO (1) | WO2010072983A2 (fr) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108810537A (zh) * | 2017-04-26 | 2018-11-13 | 腾讯科技(深圳)有限公司 | 一种图片转码方法、装置及图像处理设备 |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP5777398B2 (ja) * | 2011-05-13 | 2015-09-09 | キヤノン株式会社 | 画像処理装置、画像処理装置の画像処理方法及びプログラム |
EP2790408A1 (fr) * | 2013-03-18 | 2014-10-15 | VEGA Grieshaber KG | Procédé de stockage compressé de données graphiques |
JP6315911B2 (ja) | 2013-07-09 | 2018-04-25 | キヤノン株式会社 | 画像符号化装置、画像符号化方法及びプログラム、画像復号装置、画像復号方法及びプログラム |
CN107079155A (zh) * | 2014-03-25 | 2017-08-18 | 云巅控股有限公司 | 视频内容分类 |
CN105979265A (zh) * | 2016-05-05 | 2016-09-28 | 贵州白山云科技有限公司 | 图片压缩方法和装置 |
US10491963B1 (en) * | 2016-06-28 | 2019-11-26 | Amazon Technologies, Inc. | Use video codecs to deliver images |
CN113794899A (zh) * | 2021-08-31 | 2021-12-14 | 新华三大数据技术有限公司 | 云桌面图像数据传输方法、装置、设备及存储介质 |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6549666B1 (en) * | 1994-09-21 | 2003-04-15 | Ricoh Company, Ltd | Reversible embedded wavelet system implementation |
US6252994B1 (en) * | 1998-01-26 | 2001-06-26 | Xerox Corporation | Adaptive quantization compatible with the JPEG baseline sequential mode |
US6198850B1 (en) * | 1998-06-12 | 2001-03-06 | Xerox Corporation | System and method for segmentation dependent lossy and lossless compression for higher quality |
US6625323B2 (en) * | 1998-09-25 | 2003-09-23 | Eastman Kodak Company | Method for compressing and decompressing digital having text |
US7218784B1 (en) * | 2000-05-01 | 2007-05-15 | Xerox Corporation | Method and apparatus for controlling image quality and compression ratios |
US6701020B2 (en) * | 2001-03-15 | 2004-03-02 | Hewlett-Packard Development Company, L.P. | JADE-JPEG based adaptive document compression engine |
US7039247B2 (en) * | 2003-01-31 | 2006-05-02 | Sony Corporation | Graphic codec for network transmission |
US20080175475A1 (en) * | 2007-01-23 | 2008-07-24 | Chih-Ta Star Sung | Method of image frame compression |
JP4707751B2 (ja) * | 2009-05-21 | 2011-06-22 | シャープ株式会社 | 画像圧縮方法、画像圧縮装置、画像形成装置、コンピュータプログラム及び記録媒体 |
2008
- 2008-12-23 FR FR0859018A patent/FR2940577B1/fr not_active Expired - Fee Related
2009
- 2009-12-23 WO PCT/FR2009/052681 patent/WO2010072983A2/fr active Application Filing
- 2009-12-23 US US13/141,499 patent/US8630500B2/en not_active Expired - Fee Related
- 2009-12-23 CN CN200980156286.XA patent/CN102308582B/zh not_active Expired - Fee Related
- 2009-12-23 BR BRPI0923548A patent/BRPI0923548A2/pt not_active Application Discontinuation
- 2009-12-23 EP EP09806104.7A patent/EP2380352B1/fr active Active
Non-Patent Citations (1)
Title |
---|
None |
Also Published As
Publication number | Publication date |
---|---|
WO2010072983A3 (fr) | 2010-10-07 |
BRPI0923548A2 (pt) | 2016-01-26 |
US20110262051A1 (en) | 2011-10-27 |
CN102308582B (zh) | 2015-03-11 |
CN102308582A (zh) | 2012-01-04 |
EP2380352B1 (fr) | 2020-12-23 |
US8630500B2 (en) | 2014-01-14 |
EP2380352A2 (fr) | 2011-10-26 |
FR2940577A1 (fr) | 2010-06-25 |
FR2940577B1 (fr) | 2011-04-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP2380352B1 (fr) | Procédé d'encodage par segmentation d'une image | |
EP3209021B1 (fr) | Support d'enregistrement mémorisant un flux de données d'images codées | |
FR2903556A1 (fr) | Procedes et des dispositifs de codage et de decodage d'images, un systeme de telecommunications comportant de tels dispositifs et des programmes d'ordinateur mettant en oeuvre de tels procedes | |
EP2052545B1 (fr) | Dispositif et procede de codage et de decodage echelonnables de flux de donnees d'images, signal et programme d'ordinateur correspondants | |
EP3707900B1 (fr) | Procédé de formation d'une séquence d'images de sortie à partir d'une séquence d'images d'entrée, procédé de reconstruction d'une séquence d'images d'entrée à partir d'une séquence d'images de sortie, dispositifs, equipement serveur, equipement client et programmes d'ordinateurs associés | |
EP2279621B1 (fr) | Procédé de codage, de décodage, codeur et décodeur | |
FR2888424A1 (fr) | Dispositif et procede de codage et de decodage de donnees video et train de donnees | |
US20240007658A1 (en) | Scalable encoding and decoding method and apparatus | |
FR2932036A1 (fr) | Procede et systeme permettant de proteger un flux video compresse contre les erreurs survenant lors d'une transmission | |
EP2380350A1 (fr) | Systeme et procede de codage video | |
WO2010146314A1 (fr) | Codage de vecteurs mouvement par competition de predicteurs | |
EP3637765A1 (fr) | Optimisation d'un sous-échantillonnage intervenant avant le codage d'images en compression | |
US8483288B2 (en) | Methods, apparatus and system for film grain cache splitting for film grain simulation | |
EP2410749A1 (fr) | Procédé d'encodage adaptatif d'un flux vidéo numérique, notamment pour diffusion sur ligne xDSL | |
EP1574068B1 (fr) | Procede de codage d'une image par ondelettes et procede de decodage correspondant | |
FR2927758A1 (fr) | Procede et dispositif de codage-decodage d'images video successives selon un flux video principal en pleine resolution et selon un flux video secondaire en qualite reduite | |
EP2914005B1 (fr) | Générateur de flux vidéo | |
EP3991436A1 (fr) | Procédé d'encodage et de décodage vidéo par signalisation d'un sous-ensemble de candidat | |
WO2013160597A1 (fr) | Compression/décompression progressive d'un flux numérique vidéo comprenant au moins une image entrelacée | |
FR2990097A1 (fr) | Procedes de codage et de decodage spatialement echelonnables et dispositifs correspondants | |
CN116569550A (zh) | 编码方法、解码方法、编码器、解码器及存储介质 | |
WO2018167419A1 (fr) | Procédé de codage et décodage d'images, dispositif de codage et décodage et programmes d'ordinateur correspondants | |
FR2891966A1 (fr) | Dispositifs et procedes de codage et de decodage echelonnables de flux de donnees d'images, signal, programme d'ordinateur et module d'adaptation de qualite d'images correspondants | |
FR2911233A1 (fr) | Procedes et dispositifs de codage et de decodage d'un flux d de donnees scalable tenant compte d'une classe de scalabilite, produits programme d'ordinateur, signal et support de donnees correspondants. | |
FR2858096A1 (fr) | Procede de decodage d'image et dispositif de mise en oeuvre de ce procede |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 200980156286.X Country of ref document: CN |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 09806104 Country of ref document: EP Kind code of ref document: A2 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 13141499 Country of ref document: US |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2009806104 Country of ref document: EP |
|
REG | Reference to national code |
Ref country code: BR Ref legal event code: B01A Ref document number: PI0923548 Country of ref document: BR |
|
ENP | Entry into the national phase |
Ref document number: PI0923548 Country of ref document: BR Kind code of ref document: A2 Effective date: 20110622 |