KR102945698B1 - System and method for image coding using dual image models - Google Patents
System and method for image coding using dual image models
- Publication number
- KR102945698B1 (application KR1020230043979A)
- Authority
- KR
- South Korea
- Prior art keywords
- image
- model
- encoding
- video sequence
- image data
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/20—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0475—Generative networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/70—Determining position or orientation of objects or cameras
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T9/00—Image coding
- G06T9/002—Image coding using neural networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/25—Determination of region of interest [ROI] or a volume of interest [VOI]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/157—Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
- H04N19/159—Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/162—User input
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/167—Position within a video image, e.g. region of interest [ROI]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/172—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/46—Embedding additional information in the video signal during the compression process
- H04N19/467—Embedding additional information in the video signal during the compression process characterised by the embedded information being invisible, e.g. watermarking
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/70—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
- H04N21/2343—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
- H04N21/234345—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements the reformatting operation being performed only on part of the stream, e.g. a region of the image or a time segment
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/20—Special algorithmic details
- G06T2207/20084—Artificial neural networks [ANN]
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V2201/00—Indexing scheme relating to image or video recognition or understanding
- G06V2201/07—Target detection
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Biomedical Technology (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Life Sciences & Earth Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| EP22167664.6 | 2022-04-11 | | |
| EP22167664.6A EP4262209A1 (en) | 2022-04-11 | 2022-04-11 | System and method for image coding using dual image models |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| KR20230145920A (ko) | 2023-10-18 |
| KR102945698B1 (ko) | 2026-03-30 |
Family
ID=81306805
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| KR1020230043979A (Active), KR102945698B1 (ko) | System and method for image coding using dual image models | 2022-04-11 | 2023-04-04 |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US12401801B2 |
| EP (1) | EP4262209A1 |
| JP (1) | JP7837926B2 |
| KR (1) | KR102945698B1 |
| CN (1) | CN116896637A |
| TW (1) | TW202344056A |
Families Citing this family (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US12294640B1 (en) * | 2023-12-12 | 2025-05-06 | Atombeam Technologies Inc | System and method for distributed edge-cloud homomorphic compression using adaptive neural networks |
| US20240291993A1 (en) * | 2023-02-28 | 2024-08-29 | Ford Global Technologies, Llc | Rule-based digitized image compression |
| US20260050893A1 (en) * | 2024-08-16 | 2026-02-19 | Saudi Arabian Oil Company | Proactive equipment maintenance and control through remote edge monitoring |
| CN119583816B (zh) * | 2024-12-06 | 2025-09-09 | 中国科学技术大学先进技术研究院 | Multi-path embedded video coding method and system based on generative artificial intelligence |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20100296583A1 (en) | 2009-05-22 | 2010-11-25 | Aten International Co., Ltd. | Image processing and transmission in a kvm switch system with special handling for regions of interest |
Family Cites Families (18)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| HUP0301368A3 (en) | 2003-05-20 | 2005-09-28 | Amt Advanced Multimedia Techno | Method and equipment for compressing motion picture data |
| US9215467B2 (en) | 2008-11-17 | 2015-12-15 | Checkvideo Llc | Analytics-modulated coding of surveillance video |
| US10326978B2 (en) * | 2010-06-30 | 2019-06-18 | Warner Bros. Entertainment Inc. | Method and apparatus for generating virtual or augmented reality presentations with 3D audio positioning |
| GB201312382D0 (en) | 2013-07-10 | 2013-08-21 | Microsoft Corp | Region-of-interest aware video coding |
| US10567771B2 (en) | 2014-12-15 | 2020-02-18 | Miovision Technologies Incorporated | System and method for compressing video data |
| KR20230010060A (ko) * | 2016-10-04 | 2023-01-17 | 주식회사 비원영상기술연구소 | Method and apparatus for encoding/decoding image data |
| US10349060B2 (en) | 2017-06-30 | 2019-07-09 | Intel Corporation | Encoding video frames using generated region of interest maps |
| US10452923B2 (en) * | 2017-11-28 | 2019-10-22 | Visual Semantics, Inc. | Method and apparatus for integration of detected object identifiers and semantic scene graph networks for captured visual scene behavior estimation |
| CN108833925B (zh) | 2018-07-19 | 2020-09-11 | 哈尔滨工业大学 | Inter-frame prediction method based on a deep neural network |
| US11580395B2 (en) | 2018-11-14 | 2023-02-14 | Nvidia Corporation | Generative adversarial neural network assisted video reconstruction |
| US11689726B2 (en) | 2018-12-05 | 2023-06-27 | Google Llc | Hybrid motion-compensated neural network with side-information based video coding |
| CN110493596B (zh) | 2019-09-02 | 2021-09-17 | 西北工业大学 | Neural-network-based video coding system and method |
| CN111447449B (zh) | 2020-04-01 | 2022-05-06 | 北京奥维视讯科技有限责任公司 | ROI-based video coding method and system, and video transmission and coding system |
| US12170779B2 (en) * | 2020-04-09 | 2024-12-17 | Nokia Technologies Oy | Training a data coding system comprising a feature extractor neural network |
| CN111541900B (zh) | 2020-04-28 | 2022-05-17 | 山东浪潮科学研究院有限公司 | GAN-based security video compression method, apparatus, device, and storage medium |
| US11900662B2 (en) * | 2020-12-16 | 2024-02-13 | Here Global B.V. | Method, apparatus, and computer program product for training a signature encoding module and a query processing module to identify objects of interest within an image utilizing digital signatures |
| US12341977B2 (en) * | 2020-12-23 | 2025-06-24 | Intel Corporation | Technologies for region-of-interest video encoding |
| CN113099161A (zh) | 2021-04-13 | 2021-07-09 | 北京中科深智科技有限公司 | Conference video reconstruction method and system based on a deep neural network |
2022
- 2022-04-11 EP application EP22167664.6A (EP4262209A1), status: Pending
2023
- 2023-03-02 TW application TW112107577A (TW202344056A), status: unknown
- 2023-03-10 CN application CN202310229235.7A (CN116896637A), status: Pending
- 2023-04-04 US application US18/295,556 (US12401801B2), status: Active
- 2023-04-04 KR application KR1020230043979A (KR102945698B1), status: Active
- 2023-04-05 JP application JP2023061210A (JP7837926B2), status: Active
Non-Patent Citations (1)
| Title |
|---|
| Qi Xia et al., OBJECT-BASED IMAGE CODING: A LEARNING-DRIVEN REVISIT, ICME2020 (2020.3.18.), 1 copy. |
Also Published As
| Publication number | Publication date |
|---|---|
| US20230328260A1 (en) | 2023-10-12 |
| CN116896637A (zh) | 2023-10-17 |
| KR20230145920A (ko) | 2023-10-18 |
| EP4262209A1 (en) | 2023-10-18 |
| JP7837926B2 (ja) | 2026-03-31 |
| TW202344056A (zh) | 2023-11-01 |
| US12401801B2 (en) | 2025-08-26 |
| JP2023155898A (ja) | 2023-10-23 |
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| KR102945698B1 (ko) | System and method for image coding using dual image models | |
| US11074791B2 (en) | Automatic threat detection based on video frame delta information in compressed video streams | |
| Sitara et al. | Digital video tampering detection: An overview of passive techniques | |
| CN119031147B (zh) | Video encoding/decoding acceleration method and system based on a learnable task-aware mechanism | |
| Ding et al. | Identification of motion-compensated frame rate up-conversion based on residual signals | |
| WO2021205065A1 (en) | Training a data coding system comprising a feature extractor neural network | |
| JP2020508010A (ja) | 画像処理およびビデオ圧縮方法 | |
| US20130279598A1 (en) | Method and Apparatus For Video Compression of Stationary Scenes | |
| Xia et al. | Detecting video frame rate up-conversion based on frame-level analysis of average texture variation | |
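| CN113271469B (zh) | Secure and reversible video privacy protection system and protection method | |
| TWI870727B (zh) | Method for encoding/reconstructing at least part of an image and processing device thereof | |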
| Gorur et al. | Skip decision and reference frame selection for low-complexity H. 264/AVC surveillance video coding | |
| Youssef et al. | Adaptive video watermarking integrating a fuzzy wavelet-based human visual system perceptual model | |
| Fernández et al. | Digital video manipulation detection technique based on compression algorithms | |
| Wang et al. | Digital video steganalysis by subtractive prediction error adjacency matrix | |
| KR20230040286A (ko) | Method and system for detecting an abandoned-object event based on bitstream information of video data | |
| Hiller et al. | Recognition and pseudonymization of data privacy relevant areas in videos for compliance with GDPR | |
| Habeb et al. | Video Anomaly Detection using Residual Autoencoder: A Lightweight Framework | |
| Cossalter et al. | Privacy-enabled object tracking in video sequences using compressive sensing | |
| CN120823563B (zh) | Camera image anomaly detection method and system | |
| CN118972646B (zh) | Internet of Things privacy data security protection method and system | |
| EP4311237A1 (en) | Image encoding method, image decoding method, image processing method, image encoding device, and image decoding device | |
| Wu et al. | Generative Memorize-Then-Recall framework for low bit-rate Surveillance Video Compression | |
| Lash | Uses of motion imagery in activity-based intelligence | |
| WO2025114384A1 (en) | Media coding concept based on a variational autoencoder | |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | PA0109 | Patent application | St.27 status event code: A-0-1-A10-A12-nap-PA0109 |
| | PG1501 | Laying open of application | St.27 status event code: A-1-1-Q10-Q12-nap-PG1501 |
| | D11 | Substantive examination requested | Free format text: ST27 STATUS EVENT CODE: A-1-2-D10-D11-EXM-PA0201 (AS PROVIDED BY THE NATIONAL OFFICE) |
| | D16 | Fast track examination requested | Free format text: ST27 STATUS EVENT CODE: A-1-2-D10-D16-EXM-PA0302 (AS PROVIDED BY THE NATIONAL OFFICE) |
| | P11 | Amendment of application requested | Free format text: ST27 STATUS EVENT CODE: A-2-2-P10-P11-NAP-X000 (AS PROVIDED BY THE NATIONAL OFFICE) |
| | P11-X000 | Amendment of application requested | St.27 status event code: A-2-2-P10-P11-nap-X000 |
| | P13 | Application amended | Free format text: ST27 STATUS EVENT CODE: A-2-2-P10-P13-NAP-X000 (AS PROVIDED BY THE NATIONAL OFFICE) |
| | P13-X000 | Application amended | St.27 status event code: A-2-2-P10-P13-nap-X000 |
| | PA0201 | Request for examination | St.27 status event code: A-1-2-D10-D11-exm-PA0201 |
| | PA0302 | Request for accelerated examination | St.27 status event code: A-1-2-D10-D16-exm-PA0302 |
| | D22 | Grant of ip right intended | Free format text: ST27 STATUS EVENT CODE: A-1-2-D10-D22-EXM-PE0701 (AS PROVIDED BY THE NATIONAL OFFICE) |
| | PE0701 | Decision of registration | St.27 status event code: A-1-2-D10-D22-exm-PE0701 |
| | F11 | Ip right granted following substantive examination | Free format text: ST27 STATUS EVENT CODE: A-2-4-F10-F11-EXM-PR0701 (AS PROVIDED BY THE NATIONAL OFFICE) |
| | PR0701 | Registration of establishment | St.27 status event code: A-2-4-F10-F11-exm-PR0701 |
| | PR1002 | Payment of registration fee | St.27 status event code: A-2-2-U10-U11-oth-PR1002 Fee payment year number: 1 |
| | U11 | Full renewal or maintenance fee paid | Free format text: ST27 STATUS EVENT CODE: A-2-2-U10-U11-OTH-PR1002 (AS PROVIDED BY THE NATIONAL OFFICE) Year of fee payment: 1 |
| | PG1601 | Publication of registration | St.27 status event code: A-4-4-Q10-Q13-nap-PG1601 |
| | Q13 | Ip right document published | Free format text: ST27 STATUS EVENT CODE: A-4-4-Q10-Q13-NAP-PG1601 (AS PROVIDED BY THE NATIONAL OFFICE) |