TW202305746A - Depth segmentation in multi-view videos - Google Patents

Depth segmentation in multi-view videos

Info

Publication number
TW202305746A
Authority
TW
Taiwan
Prior art keywords
patch
depth
map
source view
patches
Prior art date
Application number
TW111120607A
Other languages
English (en)
Chinese (zh)
Inventor
克莉斯汀 維爾甘
Original Assignee
荷蘭商皇家飛利浦有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2021-06-03
Filing date
2022-06-02
Publication date
2023-02-01
Application filed by 荷蘭商皇家飛利浦有限公司
Publication of TW202305746A

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/10 Segmentation; Edge detection
    • G06T7/13 Edge detection
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T15/00 Three-dimensional [3D] image rendering
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T3/00 Geometric image transformations in the plane of the image
    • G06T3/18 Image warping, e.g. rearranging pixels individually
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00 Image enhancement or restoration
    • G06T5/50 Image enhancement or restoration using two or more images, e.g. averaging or subtraction
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/10 Segmentation; Edge detection
    • G06T7/136 Segmentation; Edge detection involving thresholding
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/10 Segmentation; Edge detection
    • G06T7/194 Segmentation; Edge detection involving foreground-background segmentation
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/50 Depth or shape recovery
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/50 Depth or shape recovery
    • G06T7/55 Depth or shape recovery from multiple images
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/20 Image preprocessing
    • G06V10/25 Determination of region of interest [ROI] or a volume of interest [VOI]
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74 Image or video pattern matching; Proximity measures in feature spaces
    • G06V10/75 Organisation of the matching processes, e.g. simultaneous or sequential comparisons of image or video features; Coarse-fine approaches, e.g. multi-scale approaches; using context analysis; Selection of dictionaries
    • G06V10/751 Comparing pixel values or logical combinations thereof, or feature values having positional relevance, e.g. template matching
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00 Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/10 Processing, recording or transmission of stereoscopic or multi-view image signals
    • H04N13/106 Processing image signals
    • H04N13/139 Format conversion, e.g. of frame-rate or size
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00 Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/10 Processing, recording or transmission of stereoscopic or multi-view image signals
    • H04N13/106 Processing image signals
    • H04N13/161 Encoding, multiplexing or demultiplexing different image signal components
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/10 Image acquisition modality
    • G06T2207/10028 Range image; Depth image; 3D point clouds
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20212 Image combination
    • G06T2207/20221 Image fusion; Image merging
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20212 Image combination
    • G06T2207/20224 Image subtraction
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N13/00 Stereoscopic video systems; Multi-view video systems; Details thereof
    • H04N13/20 Image signal generators
    • H04N13/275 Image signal generators from three-dimensional [3D] object models, e.g. computer-generated stereoscopic image signals
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N2213/00 Details of stereoscopic systems
    • H04N2213/003 Aspects relating to the "2D+depth" image format

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Computer Graphics (AREA)
  • Image Processing (AREA)
  • Processing Or Creating Images (AREA)
  • Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)
  • Image Analysis (AREA)
TW111120607A 2021-06-03 2022-06-02 Depth segmentation in multi-view videos TW202305746A (zh)

Applications Claiming Priority (2)

Application Number Publication Number Priority Date Filing Date Title
EP21177608.3A EP4099270A1 (en) 2021-06-03 2021-06-03 Depth segmentation in multi-view videos
EP21177608.3 2021-06-03

Publications (1)

Publication Number Publication Date
TW202305746A (zh) 2023-02-01

Family

ID=76269657

Family Applications (1)

Application Number Publication Number Priority Date Filing Date Title
TW111120607A TW202305746A (zh) 2021-06-03 2022-06-02 Depth segmentation in multi-view videos

Country Status (9)

Country Link
US (1) US12430836B2
EP (2) EP4099270A1
JP (1) JP2024522463A
KR (1) KR20240016401A
CN (1) CN117413294A
BR (1) BR112023025252A2
CA (1) CA3221973A1
TW (1) TW202305746A
WO (1) WO2022253677A1

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI900021B (zh) * 2024-05-22 2025-10-01 鴻海精密工業股份有限公司 Depth estimation method, apparatus, electronic device, and storage medium

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20240257369A1 (en) * 2023-01-31 2024-08-01 Sony Interactive Entertainment LLC Systems and methods for video data depth determination and video modification
US12567167B2 (en) * 2023-02-06 2026-03-03 Walmart Apollo, Llc Systems and methods for analyzing depth in images obtained in product storage facilities to detect outlier items
CN116168071B (zh) * 2023-02-08 2026-03-24 杭州海康机器人股份有限公司 Depth data acquisition method and apparatus, electronic device, and machine-readable storage medium
US20240391695A1 (en) * 2023-05-24 2024-11-28 United States Postal Service Vehicle mounted sensors and methods of using the same
US20250193443A1 (en) * 2023-12-07 2025-06-12 Electronics And Telecommunications Research Institute Method and apparatus for encoding multi plane image based volumetric video
CN120111200A (zh) * 2025-01-15 2025-06-06 北京天马辉电子技术有限责任公司 3D display image generation method and related device

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2225725A2 (en) * 2007-12-20 2010-09-08 Koninklijke Philips Electronics N.V. Segmentation of image data
WO2011036761A1 (ja) 2009-09-25 2011-03-31 株式会社東芝 Multi-view image generation method and apparatus
US20120236114A1 (en) 2011-03-18 2012-09-20 Te-Hao Chang Depth information generator for generating depth information output by only processing part of received images having different views, and related depth information generating method and depth adjusting apparatus thereof
CN104604232A (zh) 2012-04-30 2015-05-06 数码士控股有限公司 Method and apparatus for encoding multi-view images, and method and apparatus for decoding multi-view images
US10089740B2 (en) * 2014-03-07 2018-10-02 Fotonation Limited System and methods for depth regularization and semiautomatic interactive matting using RGB-D images
CN110291788B (zh) * 2017-02-20 2022-03-08 索尼公司 Image processing device and image processing method
CN109147025B (zh) * 2018-07-11 2023-07-18 北京航空航天大学 Texture generation method for RGBD three-dimensional reconstruction
US10965932B2 (en) 2019-03-19 2021-03-30 Intel Corporation Multi-pass add-on tool for coherent and complete view synthesis
CN110148217A (zh) * 2019-05-24 2019-08-20 北京华捷艾米科技有限公司 Real-time three-dimensional reconstruction method, apparatus, and device
US11195283B2 (en) * 2019-07-15 2021-12-07 Google Llc Video background subtraction using depth
US12230002B2 (en) * 2019-10-01 2025-02-18 Intel Corporation Object-based volumetric video coding
KR102696467B1 (ko) * 2022-09-16 2024-08-19 한국전자통신연구원 Apparatus and method for generating moving-viewpoint video

Also Published As

Publication number Publication date
JP2024522463A (ja) 2024-06-21
EP4348577A1 (en) 2024-04-10
CN117413294A (zh) 2024-01-16
CA3221973A1 (en) 2022-12-08
US20240378789A1 (en) 2024-11-14
US12430836B2 (en) 2025-09-30
BR112023025252A2 (pt) 2024-02-20
WO2022253677A1 (en) 2022-12-08
EP4099270A1 (en) 2022-12-07
KR20240016401A (ko) 2024-02-06

Similar Documents

Publication Publication Date Title
US12430836B2 (en) Depth segmentation in multi-view videos
US11348267B2 (en) Method and apparatus for generating a three-dimensional model
US9406131B2 (en) Method and system for generating a 3D representation of a dynamically changing 3D scene
JP2009539155A5
EP4361957B1 (en) Image-tiles-based environment reconstruction
US20240062402A1 (en) Depth orders for multi-view frame storing and rendering
Sankoh et al. Robust billboard-based, free-viewpoint video synthesis algorithm to overcome occlusions under challenging outdoor sport scenes
US20240406363A1 (en) Handling blur in multi-view imaging
Chari et al. Augmented reality using over-segmentation
US12190444B2 (en) Image-based environment reconstruction with view-dependent colour
Kim et al. Compensated visual hull for defective segmentation and occlusion
Lee et al. Interactive retexturing from unordered images
Do et al. On multi-view texture mapping of indoor environments using Kinect depth sensors
Kilner Free-Viewpoint Video for Outdoor Sporting Events
CN113486803A (zh) Device for embedding images in video