KR102945698B1 - 듀얼 이미지 모델을 사용하는 이미지 코딩을 위한 시스템 및 방법 - Google Patents

듀얼 이미지 모델을 사용하는 이미지 코딩을 위한 시스템 및 방법

Info

Publication number
KR102945698B1
KR102945698B1 KR1020230043979A KR20230043979A KR102945698B1 KR 102945698 B1 KR102945698 B1 KR 102945698B1 KR 1020230043979 A KR1020230043979 A KR 1020230043979A KR 20230043979 A KR20230043979 A KR 20230043979A KR 102945698 B1 KR102945698 B1 KR 102945698B1
Authority
KR
South Korea
Prior art keywords
image
model
encoding
video sequence
image data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
KR1020230043979A
Other languages
English (en)
Korean (ko)
Other versions
KR20230145920A (ko
Inventor
케시캉가스 악셀
에드팜 빅토르
Original Assignee
엑시스 에이비
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 엑시스 에이비 filed Critical 엑시스 에이비
Publication of KR20230145920A publication Critical patent/KR20230145920A/ko
Application granted granted Critical
Publication of KR102945698B1 publication Critical patent/KR102945698B1/ko
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/20Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/0475Generative networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/70Determining position or orientation of objects or cameras
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T9/00Image coding
    • G06T9/002Image coding using neural networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/25Determination of region of interest [ROI] or a volume of interest [VOI]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/157Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
    • H04N19/159Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/162User input
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/167Position within a video image, e.g. region of interest [ROI]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/172Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46Embedding additional information in the video signal during the compression process
    • H04N19/467Embedding additional information in the video signal during the compression process characterised by the embedded information being invisible, e.g. watermarking
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/70Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by syntax aspects related to video coding, e.g. related to compression standards
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234345Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements the reformatting operation being performed only on part of the stream, e.g. a region of the image or a time segment
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20084Artificial neural networks [ANN]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/07Target detection

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
KR1020230043979A 2022-04-11 2023-04-04 듀얼 이미지 모델을 사용하는 이미지 코딩을 위한 시스템 및 방법 Active KR102945698B1 (ko)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP22167664.6 2022-04-11
EP22167664.6A EP4262209A1 (en) 2022-04-11 2022-04-11 System and method for image coding using dual image models

Publications (2)

Publication Number Publication Date
KR20230145920A KR20230145920A (ko) 2023-10-18
KR102945698B1 true KR102945698B1 (ko) 2026-03-30

Family

ID=81306805

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020230043979A Active KR102945698B1 (ko) 2022-04-11 2023-04-04 듀얼 이미지 모델을 사용하는 이미지 코딩을 위한 시스템 및 방법

Country Status (6)

Country Link
US (1) US12401801B2 (https=)
EP (1) EP4262209A1 (https=)
JP (1) JP7837926B2 (https=)
KR (1) KR102945698B1 (https=)
CN (1) CN116896637A (https=)
TW (1) TW202344056A (https=)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US12294640B1 (en) * 2023-12-12 2025-05-06 Atombeam Technologies Inc System and method for distributed edge-cloud homomorphic compression using adaptive neural networks
US20240291993A1 (en) * 2023-02-28 2024-08-29 Ford Global Technologies, Llc Rule-based digitized image compression
US20260050893A1 (en) * 2024-08-16 2026-02-19 Saudi Arabian Oil Company Proactive equipment maintenance and control through remote edge monitoring
CN119583816B (zh) * 2024-12-06 2025-09-09 中国科学技术大学先进技术研究院 基于生成式人工智能的多路径嵌入视频编码方法及系统

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100296583A1 (en) 2009-05-22 2010-11-25 Aten International Co., Ltd. Image processing and transmission in a kvm switch system with special handling for regions of interest

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
HUP0301368A3 (en) 2003-05-20 2005-09-28 Amt Advanced Multimedia Techno Method and equipment for compressing motion picture data
US9215467B2 (en) 2008-11-17 2015-12-15 Checkvideo Llc Analytics-modulated coding of surveillance video
US10326978B2 (en) * 2010-06-30 2019-06-18 Warner Bros. Entertainment Inc. Method and apparatus for generating virtual or augmented reality presentations with 3D audio positioning
GB201312382D0 (en) 2013-07-10 2013-08-21 Microsoft Corp Region-of-interest aware video coding
US10567771B2 (en) 2014-12-15 2020-02-18 Miovision Technologies Incorporated System and method for compressing video data
KR20230010060A (ko) * 2016-10-04 2023-01-17 주식회사 비원영상기술연구소 영상 데이터 부호화/복호화 방법 및 장치
US10349060B2 (en) 2017-06-30 2019-07-09 Intel Corporation Encoding video frames using generated region of interest maps
US10452923B2 (en) * 2017-11-28 2019-10-22 Visual Semantics, Inc. Method and apparatus for integration of detected object identifiers and semantic scene graph networks for captured visual scene behavior estimation
CN108833925B (zh) 2018-07-19 2020-09-11 哈尔滨工业大学 一种基于深度神经网络的帧间预测方法
US11580395B2 (en) 2018-11-14 2023-02-14 Nvidia Corporation Generative adversarial neural network assisted video reconstruction
US11689726B2 (en) 2018-12-05 2023-06-27 Google Llc Hybrid motion-compensated neural network with side-information based video coding
CN110493596B (zh) 2019-09-02 2021-09-17 西北工业大学 一种基于神经网络的视频编码系统及方法
CN111447449B (zh) 2020-04-01 2022-05-06 北京奥维视讯科技有限责任公司 基于roi的视频编码方法和系统以及视频传输和编码系统
US12170779B2 (en) * 2020-04-09 2024-12-17 Nokia Technologies Oy Training a data coding system comprising a feature extractor neural network
CN111541900B (zh) 2020-04-28 2022-05-17 山东浪潮科学研究院有限公司 基于gan的安防视频压缩方法、装置、设备及存储介质
US11900662B2 (en) * 2020-12-16 2024-02-13 Here Global B.V. Method, apparatus, and computer program product for training a signature encoding module and a query processing module to identify objects of interest within an image utilizing digital signatures
US12341977B2 (en) * 2020-12-23 2025-06-24 Intel Corporation Technologies for region-of-interest video encoding
CN113099161A (zh) 2021-04-13 2021-07-09 北京中科深智科技有限公司 一种基于深度神经网络的会议视频重建方法和系统

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100296583A1 (en) 2009-05-22 2010-11-25 Aten International Co., Ltd. Image processing and transmission in a kvm switch system with special handling for regions of interest

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Qi Xia et al., OBJECT-BASED IMAGE CODING: A LEARNING-DRIVEN REVISIT, ICME2020 (2020.3.18.) 1부.

Also Published As

Publication number Publication date
US20230328260A1 (en) 2023-10-12
CN116896637A (zh) 2023-10-17
KR20230145920A (ko) 2023-10-18
EP4262209A1 (en) 2023-10-18
JP7837926B2 (ja) 2026-03-31
TW202344056A (zh) 2023-11-01
US12401801B2 (en) 2025-08-26
JP2023155898A (ja) 2023-10-23

Similar Documents

Publication Publication Date Title
KR102945698B1 (ko) 듀얼 이미지 모델을 사용하는 이미지 코딩을 위한 시스템 및 방법
US11074791B2 (en) Automatic threat detection based on video frame delta information in compressed video streams
Sitara et al. Digital video tampering detection: An overview of passive techniques
CN119031147B (zh) 基于可学习任务感知机制的视频编解码加速方法及系统
Ding et al. Identification of motion-compensated frame rate up-conversion based on residual signals
WO2021205065A1 (en) Training a data coding system comprising a feature extractor neural network
JP2020508010A (ja) 画像処理およびビデオ圧縮方法
US20130279598A1 (en) Method and Apparatus For Video Compression of Stationary Scenes
Xia et al. Detecting video frame rate up-conversion based on frame-level analysis of average texture variation
CN113271469B (zh) 一种安全可逆的视频隐私安全保护系统及保护方法
TWI870727B (zh) 編碼/重建圖像的至少一部份的方法及其處理裝置
Gorur et al. Skip decision and reference frame selection for low-complexity H. 264/AVC surveillance video coding
Youssef et al. Adaptive video watermarking integrating a fuzzy wavelet-based human visual system perceptual model
Fernández et al. Digital video manipulation detection technique based on compression algorithms
Wang et al. Digital video steganalysis by subtractive prediction error adjacency matrix
KR20230040286A (ko) 영상정보의 비트스트림정보에 기반하여 물체 유기이벤트를 감지하는 방법 및 시스템
Hiller et al. Recognition and pseudonymization of data privacy relevant areas in videos for compliance with GDPR
Habeb et al. Video Anomaly Detection using Residual Autoencoder: A Lightweight Framework
Cossalter et al. Privacy-enabled object tracking in video sequences using compressive sensing
CN120823563B (zh) 摄像头图像异常检测方法及系统
CN118972646B (zh) 一种物联网隐私数据安全防护方法及系统
EP4311237A1 (en) Image encoding method, image decoding method, image processing method, image encoding device, and image decoding device
Wu et al. Generative Memorize-Then-Recall framework for low bit-rate Surveillance Video Compression
Lash Uses of motion imagery in activity-based intelligence
WO2025114384A1 (en) Media coding concept based on a variational autoencoder

Legal Events

Date Code Title Description
PA0109 Patent application

St.27 status event code: A-0-1-A10-A12-nap-PA0109

PG1501 Laying open of application

St.27 status event code: A-1-1-Q10-Q12-nap-PG1501

D11 Substantive examination requested

Free format text: ST27 STATUS EVENT CODE: A-1-2-D10-D11-EXM-PA0201 (AS PROVIDED BY THE NATIONAL OFFICE)

D16 Fast track examination requested

Free format text: ST27 STATUS EVENT CODE: A-1-2-D10-D16-EXM-PA0302 (AS PROVIDED BY THE NATIONAL OFFICE)

P11 Amendment of application requested

Free format text: ST27 STATUS EVENT CODE: A-2-2-P10-P11-NAP-X000 (AS PROVIDED BY THE NATIONAL OFFICE)

P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000

P13 Application amended

Free format text: ST27 STATUS EVENT CODE: A-2-2-P10-P13-NAP-X000 (AS PROVIDED BY THE NATIONAL OFFICE)

P13-X000 Application amended

St.27 status event code: A-2-2-P10-P13-nap-X000

PA0201 Request for examination

St.27 status event code: A-1-2-D10-D11-exm-PA0201

PA0302 Request for accelerated examination

St.27 status event code: A-1-2-D10-D16-exm-PA0302

D22 Grant of ip right intended

Free format text: ST27 STATUS EVENT CODE: A-1-2-D10-D22-EXM-PE0701 (AS PROVIDED BY THE NATIONAL OFFICE)

PE0701 Decision of registration

St.27 status event code: A-1-2-D10-D22-exm-PE0701

F11 Ip right granted following substantive examination

Free format text: ST27 STATUS EVENT CODE: A-2-4-F10-F11-EXM-PR0701 (AS PROVIDED BY THE NATIONAL OFFICE)

PR0701 Registration of establishment

St.27 status event code: A-2-4-F10-F11-exm-PR0701

PR1002 Payment of registration fee

St.27 status event code: A-2-2-U10-U11-oth-PR1002

Fee payment year number: 1

U11 Full renewal or maintenance fee paid

Free format text: ST27 STATUS EVENT CODE: A-2-2-U10-U11-OTH-PR1002 (AS PROVIDED BY THE NATIONAL OFFICE)

Year of fee payment: 1

PG1601 Publication of registration

St.27 status event code: A-4-4-Q10-Q13-nap-PG1601

Q13 Ip right document published

Free format text: ST27 STATUS EVENT CODE: A-4-4-Q10-Q13-NAP-PG1601 (AS PROVIDED BY THE NATIONAL OFFICE)