CN111953989A - Image compression method and device based on combination of user interaction and semantic segmentation technology - Google Patents
Image compression method and device based on combination of user interaction and semantic segmentation technology Download PDFInfo
- Publication number
- CN111953989A CN111953989A CN202010702091.9A CN202010702091A CN111953989A CN 111953989 A CN111953989 A CN 111953989A CN 202010702091 A CN202010702091 A CN 202010702091A CN 111953989 A CN111953989 A CN 111953989A
- Authority
- CN
- China
- Prior art keywords
- image
- compression
- compressed
- semantic segmentation
- semantic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/42—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/232—Non-hierarchical techniques
- G06F18/2321—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
- G06F18/23213—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/047—Probabilistic or stochastic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/26—Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
- G06V10/267—Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion by performing operations on regions, e.g. growing, shrinking or watersheds
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/176—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- Evolutionary Computation (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- Computing Systems (AREA)
- Multimedia (AREA)
- Molecular Biology (AREA)
- Computational Linguistics (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Probability & Statistics with Applications (AREA)
- Bioinformatics & Computational Biology (AREA)
- Signal Processing (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Abstract
The invention belongs to the technical field of image processing and computer vision, and particularly relates to an image compression method and device based on combination of user interaction and semantic segmentation technology, wherein the method comprises the steps of preprocessing an image to be compressed by utilizing a semantic segmentation network to obtain semantic segmentation of the image; dividing the image into a plurality of categories of image blocks according to semantic segmentation; the user sets the compression quality of each category of image blocks; based on the set compression quality, compressing the image blocks of each category by using a BPG (Business Process group) coding tool to obtain an intermediate file; decoding the intermediate file by using a BPG decoding tool to obtain a decompressed image block; combining the decompressed image blocks; the method can meet the requirements of compressing the local areas of the image to different degrees in a specific scene, can automatically sense the interest area and automatically adjust the image compression quality of the different local areas of the image, and has huge potential commercial value and good popularization value.
Description
Technical Field
The invention belongs to the technical field of image processing and computer vision, and particularly relates to an image compression method and device based on combination of user interaction and semantic segmentation technology.
Background
On one hand, image compression is one of important support technologies in the field of information technology, and is also one of research hotspots in the field of computer vision. Conventional image compression standards, such as JPEG, JPEG2000, etc., usually have image distortion phenomena, such as blocking effect and compression artifact, when compressing at a low bit rate. BPG is a new image format based on High Efficiency Video Coding, HEVC (High Efficiency Video Coding). Compared with JPEG and JPEG2000, the BPG has higher compression quality under the same compression rate. Meanwhile, BPG supports lossless compression. However, the BPG can only support compression with the same compression ratio in the image global, and cannot compress local regions of interest with different compression ratios, which makes some specific image compression requirements unsatisfied.
Image segmentation techniques, on the other hand, are a pre-processing step of many image processing tasks. The conventional image segmentation method is divided into: threshold-based segmentation methods, region-based segmentation methods, watershed algorithms, and the like. However, the result of the conventional image segmentation method has no semantic information, that is, the algorithm itself does not know what information is about the segmented result. With the development of deep learning, the image semantic segmentation technology based on deep learning can realize the decomposition of a scene of an image into several individual entities and classify each entity with a pixel level of fineness and mark an accurate boundary. This type of algorithm can separate the foreground and background of an image and identify the class to which the pixels in the foreground belong. Further, labeling and training the interested category can adaptively identify and segment the interested region, and the result can be used for guiding other high-level image processing tasks.
Therefore, in order to solve the problem of local compression of the image, the invention guides the image compression task by using the result of semantic segmentation to realize local compression with different compression qualities on different interest areas of the same image. The method can meet the specific requirements of local image compression, and has strong application value and wide market value.
Disclosure of Invention
In order to obtain a better compressed image, the invention provides an image compression method and device based on the combination of user interaction and semantic segmentation technology, wherein the method comprises the following steps:
inputting an image to be compressed, and preprocessing the image by utilizing a semantic segmentation network to obtain semantic segmentation of the image;
dividing the image into a plurality of categories of image blocks according to semantic segmentation;
the user sets the compression quality of each category of image blocks;
based on the set compression quality, compressing the image blocks of each category by using a BPG (Business Process group) coding tool to obtain an intermediate file;
decoding the intermediate file by using a BPG decoding tool to obtain a decompressed image block;
and combining the decompressed image blocks to obtain a compression result of the original image.
Furthermore, the semantic segmentation network comprises a convolution coding end, a convolution decoding end and a softmax layer, wherein the convolution coding end is provided with three first convolution layers, and each first convolution layer comprises convolution operation, normalization operation, pooling operation and activation operation; the convolution coding end and the convolution coding end are symmetrically provided with three second convolution layers, and each second convolution layer comprises convolution operation, normalization operation, up-sampling operation and activation operation.
Further, the dividing the image block according to semantic division specifically includes: according to the semantic labels, a plurality of divided areas are aggregated into a specific number of areas by a k-means method, so that the boundary contour of the image area is continuous and smooth, and the boundary contour information is mapped to an original image, so that the image to be compressed is divided into different image blocks.
Further, aggregating the plurality of divided regions into a specific number of regions by a k-means method includes: and aggregating the semantic information of the separation result into N clusters according to the Euclidean distance between the pixels of the image block, dividing the image into N regions based on the N clusters obtained after clustering, mapping the N regions to the image to be compressed, dividing the image with compression into N regions, and storing the upper left corner of each region in the image to be compressed to obtain coordinate information.
Furthermore, according to the coordinate information of the upper left corner in the image to be compressed of each area, the decompressed image blocks are placed in the corresponding positions of the original image areas to form a compressed result image with the size consistent with that of the original image.
The invention also provides an image compression device based on the combination of user interaction and semantic segmentation technology, which comprises a semantic segmenter, an image block clustering device, a custom compression quality module, a BPG encoder, a BPG decoder and a compressed image output device, wherein:
the semantic divider is used for performing semantic division on an input image to be compressed, acquiring a boundary outline of the image and dividing the image into a plurality of image blocks according to the boundary outline;
the image block clustering device is used for further dividing the image blocks divided by the semantic divider and clustering the image blocks into N areas;
the user-defined compression quality module is used for the user to specify the compression quality of each area;
the BPG coder is used for compressing each area according to the compression quality specified by the user to produce an intermediate file;
the BPG decoder is used for decoding the intermediate file to obtain a decompressed image block;
and the compressed image output device is used for combining the decompressed image blocks to obtain a compression result of the bar original image and outputting the compression result to a user.
The invention has the beneficial effects that:
1) the invention provides an image compression method based on combination of user interaction and semantic segmentation technology, which can compress local regions of interest in an image with different compression qualities, so that the compressed image can locally adjust the compression quality, and meanwhile, the compressed image is more in line with human eye perception characteristics;
2) the method adopts a semantic segmentation technology based on deep learning, and the technology can adaptively extract boundary information of different categories, is used for segmenting the image interesting region and further guides a local region compression task.
Drawings
FIG. 1 is a schematic overall flow chart of a codec according to the present invention;
FIG. 2 is a schematic diagram of a semantic segmentation network;
fig. 3 is a schematic diagram of a process of generating image blocks of different areas.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
The invention provides an image compression method based on combination of user interaction and semantic segmentation technology, which comprises the following steps:
inputting an image to be compressed, and preprocessing the image by utilizing a semantic segmentation network to obtain semantic segmentation of the image;
dividing the image into a plurality of categories of image blocks according to semantic segmentation;
the user sets the compression quality of each category of image blocks;
based on the set compression quality, compressing the image blocks of each category by using a BPG (Business Process group) coding tool to obtain an intermediate file;
decoding the intermediate file by using a BPG decoding tool to obtain a decompressed image block;
and combining the decompressed image blocks to obtain a compression result of the original image.
Example 1
As shown in fig. 1, the processing of the picture in this embodiment includes two steps, namely, performing compression coding on the picture, and decoding the coded and compressed picture, and specifically includes:
(1) encoding process
Carrying out memorial semantic segmentation on an image to be compressed, and further carrying out region division by using k-means to obtain region contour information of the image;
dividing the image into a plurality of interest areas according to the area outline;
the user specifies a compression quality for each region;
compressing the image blocks of each category by using a BPG coding tool to obtain an intermediate file;
(2) decoding process
Decoding the intermediate file by using a BPG decoding tool to obtain a decompressed image block;
and combining the decompressed image blocks to obtain a compression result of the original image.
Example 2
This example is described in addition to example 1.
The present embodiment uses the semantic segmentation network as the preprocessing, so the semantic segmentation network needs to be pre-trained. This time, the source data set is used as the training data set of the network, and the data set is labeled with multiple categories (including background categories) such as people, cars, trees, and the like, and covers common targets of human eyes, and these targets are regarded as objects interested by human eyes in this embodiment.
In order to obtain the boundary contour information of the target of interest, the image to be compressed is input to a pre-trained semantic segmentation network, and the schematic diagram of the network structure is shown in fig. 2. The convolutional encoder comprises three first convolutional layers, each first convolutional layer comprises a convolutional operation, a normalization operation, an activation function and a pooling layer, deep features are extracted by continuously utilizing the convolutional operation at a convolutional encoding end, and important features are selected through the pooling layers to reduce the size of the features; and at the convolution decoding end, a network structure symmetrical to the convolution encoding end is adopted, three second convolution layers are arranged, each second convolution layer comprises a convolution operation, a normalization operation, an activation function and an upsampling layer, and the image is enlarged through the upsampling layer. In the embodiment, the convolution layer rich feature information is utilized, and the pooling layer is replaced by upsampling to increase the image resolution. Further, in order to obtain richer information, feature information of each layer of the convolution coding end and features of each layer of the symmetric convolution decoding end are fused to obtain a feature layer with the size of the original image, the feature layer is sent to the Softmax layer to carry out pixel classification to obtain the probability of each category of different pixels, and the category where the maximum probability value is located is selected as the category to which the pixel belongs.
A schematic diagram of a method for generating image blocks of different regions of interest is shown in fig. 3. Since the semantic segmentation result is not necessarily accurate, a lot of segmentation noise often appears, which causes the segmentation area to be too discrete, and it is difficult to propose an effective region of interest. Therefore, the embodiment proposes to use a k-means clustering algorithm to aggregate the semantic information of the segmentation result into N clusters, wherein the aggregation is based on the euclidean distance between the pixels of the segmentation result. The number N of the cluster centers is a super parameter which can be set by a user and is smaller than the number of labels of the semantic segmentation result. Then extracting boundary contour information of the N regions, mapping the boundary contour information to an original image, and dividing the original image into the N regions; preferably, the output image in fig. 2 is divided into 5 clusters, i.e. 5 areas of different colors. Meanwhile, the coordinate information of the upper left corner of each area in the original image is saved for subsequent combination.
For each region, the compression quality is specified by the user, and the compression quality can be selected from any integer of 1 to 50, and the value serves as a parameter of BPG compression encoding. The compression quality reflects the degree of compression of the image. The larger the numerical value is, the larger the compression ratio is, and the poorer the quality of the compressed image is; conversely, the smaller the value, the smaller the compression ratio, and the better the image quality after compression.
In the invention, an open-source libpg tool is used to transmit specified compression quality parameters, and an image compression intermediate file decoding end is used to decode an intermediate file by using a BPG decoding tool to obtain a decompressed image block; and combining the images according to the semantic segmentation result to obtain a compressed output picture.
The decompressed image blocks are kept consistent with the input image blocks in size, and the decompressed image blocks are placed in the corresponding positions of the original image areas according to the stored coordinate information of the upper left corner of each area, so that a compressed result image with the size consistent with that of the original image can be combined.
Example 3
The present embodiment provides an image compression apparatus based on a combination of user interaction and a semantic segmentation technology, the apparatus includes a semantic segmenter, an image block clusterer, a custom compression quality module, a BPG encoder, a BPG decoder, and a compressed image outputter, wherein:
the semantic divider is used for performing semantic division on an input image to be compressed, acquiring a boundary outline of the image and dividing the image into a plurality of image blocks according to the boundary outline;
the image block clustering device is used for further dividing the image blocks divided by the semantic divider and clustering the image blocks into N areas;
the user-defined compression quality module is used for the user to specify the compression quality of each area;
the BPG coder is used for compressing each area according to the compression quality specified by the user to produce an intermediate file;
the BPG decoder is used for decoding the intermediate file to obtain a decompressed image block;
and the compressed image output device is used for combining the decompressed image blocks to obtain a compression result of the bar original image and outputting the compression result to a user.
In this embodiment, the image block clustering device has a similar structure to the semantic segmentation network in embodiment 1, and includes a convolution encoding end, a convolution decoding end, and a softmax layer, where the convolution encoding end is provided with three first convolution layers, and each first convolution layer includes a convolution operation, a normalization operation, a pooling operation, and an activation operation; the convolution coding end and the convolution coding end are symmetrically provided with three second convolution layers, and each second convolution layer comprises convolution operation, normalization operation, up-sampling operation and activation operation.
In this embodiment, the image block clustering unit specifically operates to aggregate a plurality of partitioned areas into a specific number of areas by a k-means method according to semantic labels, so that the boundary contour of the image area is continuous and smooth, and the boundary contour information is mapped to the original image, thereby dividing the image to be compressed into different image blocks.
In this embodiment, aggregating a plurality of divided regions into a specific number of regions by a k-means method includes: and aggregating the semantic information of the separation result into N clusters according to the Euclidean distance between the pixels of the image block, dividing the image into N regions based on the N clusters obtained after clustering, mapping the N regions to the image to be compressed, dividing the image with compression into N regions, and storing the upper left corner of each region in the image to be compressed to obtain coordinate information.
In this embodiment, the compressed image output device puts the decompressed image blocks into the corresponding positions of the original image areas according to the coordinate information of the upper left corner in the image to be compressed in each area, and combines the decompressed image blocks into a compressed result image with the same size as the original image.
The invention does not carry out overall compression on the original image, but carries out compression with specified compression quality on the local part of the image, so that the compression quality of the compressed image can be locally adjusted, and a user can customize the local compression quality of the image to accord with the visual perception and the aesthetic feeling of the user, thereby having greater flexibility.
Although embodiments of the present invention have been shown and described, it will be appreciated by those skilled in the art that changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.
Claims (10)
1. The image compression method based on the combination of the user interaction and the semantic segmentation technology is characterized by comprising the following steps of:
inputting an image to be compressed, and preprocessing the image by utilizing a semantic segmentation network to obtain semantic segmentation of the image;
dividing the image into a plurality of categories of image blocks according to semantic segmentation;
the user sets the compression quality of each category of image blocks;
based on the set compression quality, compressing the image blocks of each category by using a BPG (Business Process group) coding tool to obtain an intermediate file;
decoding the intermediate file by using a BPG decoding tool to obtain a decompressed image block;
and combining the decompressed image blocks to obtain a compression result of the original image.
2. The image compression method based on the combination of the user interaction and the semantic segmentation technology as claimed in claim 1, wherein the semantic segmentation network comprises a convolutional encoding end, a convolutional decoding end and a softmax layer, the convolutional encoding end is provided with three first convolutional layers, and each first convolutional layer comprises a convolutional operation, a normalization operation, a pooling operation and an activation operation; the convolution coding end and the convolution coding end are symmetrically provided with three second convolution layers, and each second convolution layer comprises convolution operation, normalization operation, up-sampling operation and activation operation.
3. The image compression method based on the combination of user interaction and semantic segmentation technology as claimed in claim 1, wherein dividing image blocks according to semantic segmentation specifically comprises: according to the semantic labels, a plurality of divided areas are aggregated into a specific number of areas by a k-means method, so that the boundary contour of the image area is continuous and smooth, and the boundary contour information is mapped to an original image, so that the image to be compressed is divided into different image blocks.
4. The image compression method based on the combination of the user interaction and the semantic segmentation technology as claimed in claim 3, wherein the aggregating of the plurality of segmented regions into a specific number of regions by a k-means method comprises: the semantic information obtained by the separation network is aggregated into N clusters according to the Euclidean distance between image pixels, the image is divided into N semantic regions based on the N clusters obtained after clustering, the N semantic regions are mapped to the image to be compressed, the image to be compressed is divided into N image regions, and the coordinate information of the upper left corner of each region in the image to be compressed is stored.
5. The image compression method based on the combination of the user interaction and the semantic segmentation technology as claimed in claim 4, wherein the decompressed image blocks are placed in the corresponding positions of the original image areas according to the coordinate information of the upper left corner in the image to be compressed of each area to form a compressed result image with the same size as the original image.
6. The image compression apparatus based on user interaction and semantic segmentation technology combination as claimed in claim 1, comprising a semantic segmenter, an image block clustering device, a custom compression quality module, a BPG encoder, a BPG decoder and a compressed image outputter, wherein:
the semantic divider is used for performing semantic division on an input image to be compressed, acquiring a boundary outline of the image and dividing the image into a plurality of image blocks according to the boundary outline;
the image block clustering device is used for further dividing the image blocks divided by the semantic divider and clustering the image blocks into N areas;
the user-defined compression quality module is used for the user to specify the compression quality of each area;
the BPG coder is used for compressing each area according to the compression quality specified by the user to produce an intermediate file;
the BPG decoder is used for decoding the intermediate file to obtain a decompressed image block;
and the compressed image output device is used for combining the decompressed image blocks to obtain a compression result of the original image and outputting the compression result to a user.
7. The image compression device based on the combination of the user interaction and the semantic segmentation technology as claimed in claim 6, wherein the image block clustering device comprises a convolution coding end, a convolution decoding end and a softmax layer, the convolution coding end is provided with three first convolution layers, and each first convolution layer comprises a convolution operation, a normalization operation, a pooling operation and an activation operation; the convolution coding end and the convolution coding end are symmetrically provided with three second convolution layers, and each second convolution layer comprises convolution operation, normalization operation, up-sampling operation and activation operation.
8. The image compression device based on the combination of the user interaction and the semantic segmentation technology as claimed in claim 1, wherein the image block clusterer specifically operates to aggregate a plurality of segmented regions into a specific number of regions by a k-means method according to semantic tags, so that the image region boundary contour is continuous and smooth, and the boundary contour information is mapped to an original image, so as to divide the image to be compressed into different image blocks.
9. The image compression apparatus based on user interaction combined with semantic segmentation technology as claimed in claim 8, wherein aggregating a plurality of segmented regions into a certain number of regions by k-means method comprises: aggregating the semantic information of the separation result into N clusters according to the Euclidean distance between the pixels of the image block, dividing the image into N regions based on the N clusters obtained after clustering, mapping the boundary outlines of the N regions to the image to be compressed, dividing the image with compression into N regions, and storing the upper left corner of each region in the image to be compressed to obtain coordinate information.
10. The image compression apparatus based on the combination of user interaction and semantic segmentation technology as claimed in claim 9, wherein the compressed image output device puts the decompressed image blocks into the corresponding positions of the original image areas according to the coordinate information of the upper left corner in the image to be compressed of each area, and combines them into a compressed result image with the same size as the original image.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010702091.9A CN111953989A (en) | 2020-07-21 | 2020-07-21 | Image compression method and device based on combination of user interaction and semantic segmentation technology |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010702091.9A CN111953989A (en) | 2020-07-21 | 2020-07-21 | Image compression method and device based on combination of user interaction and semantic segmentation technology |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111953989A true CN111953989A (en) | 2020-11-17 |
Family
ID=73341188
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010702091.9A Pending CN111953989A (en) | 2020-07-21 | 2020-07-21 | Image compression method and device based on combination of user interaction and semantic segmentation technology |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111953989A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113660486A (en) * | 2021-07-28 | 2021-11-16 | 阿里巴巴(中国)有限公司 | Image coding, decoding, reconstructing and analyzing method, system and electronic equipment |
CN115118989A (en) * | 2022-08-29 | 2022-09-27 | 平生医疗科技(昆山)有限公司 | Image compression method and device |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110517329A (en) * | 2019-08-12 | 2019-11-29 | 北京邮电大学 | A kind of deep learning method for compressing image based on semantic analysis |
CN111127470A (en) * | 2019-12-24 | 2020-05-08 | 江西理工大学 | Image semantic segmentation method based on context and shallow space coding and decoding network |
CN111145277A (en) * | 2019-12-31 | 2020-05-12 | 山东浪潮人工智能研究院有限公司 | Image compression method of depth semantic perception and BPG compression tool |
-
2020
- 2020-07-21 CN CN202010702091.9A patent/CN111953989A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110517329A (en) * | 2019-08-12 | 2019-11-29 | 北京邮电大学 | A kind of deep learning method for compressing image based on semantic analysis |
CN111127470A (en) * | 2019-12-24 | 2020-05-08 | 江西理工大学 | Image semantic segmentation method based on context and shallow space coding and decoding network |
CN111145277A (en) * | 2019-12-31 | 2020-05-12 | 山东浪潮人工智能研究院有限公司 | Image compression method of depth semantic perception and BPG compression tool |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113660486A (en) * | 2021-07-28 | 2021-11-16 | 阿里巴巴(中国)有限公司 | Image coding, decoding, reconstructing and analyzing method, system and electronic equipment |
CN115118989A (en) * | 2022-08-29 | 2022-09-27 | 平生医疗科技(昆山)有限公司 | Image compression method and device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Hu et al. | Towards coding for human and machine vision: A scalable image coding approach | |
US7203356B2 (en) | Subject segmentation and tracking using 3D sensing technology for video compression in multimedia applications | |
Deng et al. | Content-based image compression for arbitrary-resolution display devices | |
Bradley et al. | Visual attention for region of interest coding in JPEG 2000 | |
CN110517329A (en) | A kind of deep learning method for compressing image based on semantic analysis | |
CN108632625A (en) | A kind of method for video coding, video encoding/decoding method and relevant device | |
EP2526699A1 (en) | Data pruning for video compression using example-based super-resolution | |
CN102637302B (en) | Image coding method | |
CN111586412B (en) | High-definition video processing method, master device, slave device and chip system | |
US20230082809A1 (en) | Method and data processing system for lossy image or video encoding, transmissionand decoding | |
CN111953989A (en) | Image compression method and device based on combination of user interaction and semantic segmentation technology | |
WO2023005740A1 (en) | Image encoding, decoding, reconstruction, and analysis methods, system, and electronic device | |
Simard et al. | A foreground-background separation algorithm for image compression | |
US20230316464A1 (en) | Virtual view drawing method and apparatus, rendering method and apparatus, and decoding method and apparatus, and devices and storage medium | |
CN111626178B (en) | Compressed domain video motion recognition method and system based on new spatio-temporal feature stream | |
US20230326086A1 (en) | Systems and methods for image and video compression | |
CN117560511A (en) | Spacer image compression method and system based on graph segmentation technology and electric power inspection | |
Alface et al. | V3c-based coding of dynamic meshes | |
JP2004523178A (en) | How to process video into encoded bitstream | |
WO2022141222A1 (en) | Virtual viewport generation method and apparatus, rendering and decoding methods and apparatuses, device and storage medium | |
Li et al. | Fast portrait segmentation with highly light-weight network | |
Hu et al. | Content-aware facial image compression with deep learning method | |
WO2021191406A1 (en) | Autoencoder-based segmentation mask generation in an alpha channel | |
Kumar | Wavelet Sub-band block coding based lossless High-Speed Compression of Compound Image | |
KR20050021086A (en) | Method for segmenting motion object of compressed motion pictures |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20201117 |
|
RJ01 | Rejection of invention patent application after publication |