CN117413294A - 多视图视频中的深度分割 - Google Patents

多视图视频中的深度分割 Download PDF

Info

Publication number
CN117413294A
CN117413294A CN202280039534.8A CN202280039534A CN117413294A CN 117413294 A CN117413294 A CN 117413294A CN 202280039534 A CN202280039534 A CN 202280039534A CN 117413294 A CN117413294 A CN 117413294A
Authority
CN
China
Prior art keywords
tile
depth
source view
tiles
map
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202280039534.8A
Other languages
English (en)
Chinese (zh)
Inventor
C·韦雷坎普
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Koninklijke Philips NV
Original Assignee
Koninklijke Philips NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips NV filed Critical Koninklijke Philips NV
Publication of CN117413294A publication Critical patent/CN117413294A/zh
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T15/00Three-dimensional [3D] image rendering
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/13Edge detection
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T3/00Geometric image transformations in the plane of the image
    • G06T3/18Image warping, e.g. rearranging pixels individually
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/50Image enhancement or restoration using two or more images, e.g. averaging or subtraction
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/136Segmentation; Edge detection involving thresholding
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/10Segmentation; Edge detection
    • G06T7/194Segmentation; Edge detection involving foreground-background segmentation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/50Depth or shape recovery
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/50Depth or shape recovery
    • G06T7/55Depth or shape recovery from multiple images
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/25Determination of region of interest [ROI] or a volume of interest [VOI]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74Image or video pattern matching; Proximity measures in feature spaces
    • G06V10/75Organisation of the matching processes, e.g. simultaneous or sequential comparisons of image or video features; Coarse-fine approaches, e.g. multi-scale approaches; using context analysis; Selection of dictionaries
    • G06V10/751Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/10Processing, recording or transmission of stereoscopic or multi-view image signals
    • H04N13/106Processing image signals
    • H04N13/139Format conversion, e.g. of frame-rate or size
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/10Processing, recording or transmission of stereoscopic or multi-view image signals
    • H04N13/106Processing image signals
    • H04N13/161Encoding, multiplexing or demultiplexing different image signal components
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10028Range image; Depth image; 3D point clouds
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20212Image combination
    • G06T2207/20221Image fusion; Image merging
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20212Image combination
    • G06T2207/20224Image subtraction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/20Image signal generators
    • H04N13/275Image signal generators from three-dimensional [3D] object models, e.g. computer-generated stereoscopic image signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N2213/00Details of stereoscopic systems
    • H04N2213/003Aspects relating to the "2D+depth" image format

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Computer Graphics (AREA)
  • Image Processing (AREA)
  • Processing Or Creating Images (AREA)
  • Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)
  • Image Analysis (AREA)
CN202280039534.8A 2021-06-03 2022-05-25 多视图视频中的深度分割 Pending CN117413294A (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP21177608.3A EP4099270A1 (en) 2021-06-03 2021-06-03 Depth segmentation in multi-view videos
EP21177608.3 2021-06-03
PCT/EP2022/064243 WO2022253677A1 (en) 2021-06-03 2022-05-25 Depth segmentation in multi-view videos

Publications (1)

Publication Number Publication Date
CN117413294A true CN117413294A (zh) 2024-01-16

Family

ID=76269657

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202280039534.8A Pending CN117413294A (zh) 2021-06-03 2022-05-25 多视图视频中的深度分割

Country Status (9)

Country Link
US (1) US12430836B2 (https=)
EP (2) EP4099270A1 (https=)
JP (1) JP2024522463A (https=)
KR (1) KR20240016401A (https=)
CN (1) CN117413294A (https=)
BR (1) BR112023025252A2 (https=)
CA (1) CA3221973A1 (https=)
TW (1) TW202305746A (https=)
WO (1) WO2022253677A1 (https=)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20240257369A1 (en) * 2023-01-31 2024-08-01 Sony Interactive Entertainment LLC Systems and methods for video data depth determination and video modification
US12567167B2 (en) * 2023-02-06 2026-03-03 Walmart Apollo, Llc Systems and methods for analyzing depth in images obtained in product storage facilities to detect outlier items
CN116168071B (zh) * 2023-02-08 2026-03-24 杭州海康机器人股份有限公司 深度数据获取方法、装置、电子设备及机器可读存储介质
US20240391695A1 (en) * 2023-05-24 2024-11-28 United States Postal Service Vehicle mounted sensors and methods of using the same
US20250193443A1 (en) * 2023-12-07 2025-06-12 Electronics And Telecommunications Research Institute Method and apparatus for encoding multi plane image based volumetric video
TWI900021B (zh) * 2024-05-22 2025-10-01 鴻海精密工業股份有限公司 深度估計方法、裝置、電子設備及儲存介質
CN120111200A (zh) * 2025-01-15 2025-06-06 北京天马辉电子技术有限责任公司 3d显示图像生成方法及相关设备

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2225725A2 (en) * 2007-12-20 2010-09-08 Koninklijke Philips Electronics N.V. Segmentation of image data
WO2011036761A1 (ja) 2009-09-25 2011-03-31 株式会社東芝 多視点画像生成方法および装置
US20120236114A1 (en) 2011-03-18 2012-09-20 Te-Hao Chang Depth information generator for generating depth information output by only processing part of received images having different views, and related depth information generating method and depth adjusting apparatus thereof
CN104604232A (zh) 2012-04-30 2015-05-06 数码士控股有限公司 用于编码多视点图像的方法及装置,以及用于解码多视点图像的方法及装置
US10089740B2 (en) * 2014-03-07 2018-10-02 Fotonation Limited System and methods for depth regularization and semiautomatic interactive matting using RGB-D images
CN110291788B (zh) * 2017-02-20 2022-03-08 索尼公司 图像处理设备和图像处理方法
CN109147025B (zh) * 2018-07-11 2023-07-18 北京航空航天大学 一种面向rgbd三维重建的纹理生成方法
US10965932B2 (en) 2019-03-19 2021-03-30 Intel Corporation Multi-pass add-on tool for coherent and complete view synthesis
CN110148217A (zh) * 2019-05-24 2019-08-20 北京华捷艾米科技有限公司 一种实时三维重建方法、装置及设备
US11195283B2 (en) * 2019-07-15 2021-12-07 Google Llc Video background substraction using depth
US12230002B2 (en) * 2019-10-01 2025-02-18 Intel Corporation Object-based volumetric video coding
KR102696467B1 (ko) * 2022-09-16 2024-08-19 한국전자통신연구원 이동시점 동영상 생성 장치 및 방법

Also Published As

Publication number Publication date
JP2024522463A (ja) 2024-06-21
EP4348577A1 (en) 2024-04-10
CA3221973A1 (en) 2022-12-08
US20240378789A1 (en) 2024-11-14
US12430836B2 (en) 2025-09-30
BR112023025252A2 (pt) 2024-02-20
WO2022253677A1 (en) 2022-12-08
EP4099270A1 (en) 2022-12-07
KR20240016401A (ko) 2024-02-06
TW202305746A (zh) 2023-02-01

Similar Documents

Publication Publication Date Title
US12430836B2 (en) Depth segmentation in multi-view videos
US11080932B2 (en) Method and apparatus for representing a virtual object in a real environment
US10665025B2 (en) Method and apparatus for representing a virtual object in a real environment
US11348267B2 (en) Method and apparatus for generating a three-dimensional model
Zitnick et al. Stereo for image-based rendering using image over-segmentation
US9406131B2 (en) Method and system for generating a 3D representation of a dynamically changing 3D scene
KR100888537B1 (ko) 이미지의 2 계층이며, 삼차원인 표현을 생성하기 위한 컴퓨터 구현 방법, 시스템 및 컴퓨터―판독가능 매체
US6124864A (en) Adaptive modeling and segmentation of visual image streams
Mori et al. Efficient use of textured 3D model for pre-observation-based diminished reality
US20240062402A1 (en) Depth orders for multi-view frame storing and rendering
Sankoh et al. Robust billboard-based, free-viewpoint video synthesis algorithm to overcome occlusions under challenging outdoor sport scenes
Chari et al. Augmented reality using over-segmentation
US12190444B2 (en) Image-based environment reconstruction with view-dependent colour
Kilner Free-Viewpoint Video for Outdoor Sporting Events
Papadakis et al. Virtual camera synthesis for soccer game replays
Taneja et al. 3D reconstruction and video-based rendering of casually captured videos
Würmlin Stereo-based Methods (25 min)

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination