JP2025514816A - ビデオの符号化と復号化のための参照ピクチャのリサンプリング - Google Patents
ビデオの符号化と復号化のための参照ピクチャのリサンプリング Download PDFInfo
- Publication number
- JP2025514816A JP2025514816A JP2024562190A JP2024562190A JP2025514816A JP 2025514816 A JP2025514816 A JP 2025514816A JP 2024562190 A JP2024562190 A JP 2024562190A JP 2024562190 A JP2024562190 A JP 2024562190A JP 2025514816 A JP2025514816 A JP 2025514816A
- Authority
- JP
- Japan
- Prior art keywords
- interpolation filters
- upsampling
- filter
- video
- frames
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/197—Probabilistic grammars, e.g. word n-grams
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
- G06N3/0455—Auto-encoder networks; Encoder-decoder networks
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/09—Supervised learning
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/04—Segmentation; Word boundary detection
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/117—Filters, e.g. for pre-processing or post-processing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/157—Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
- H04N19/159—Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/17—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
- H04N19/172—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/169—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
- H04N19/186—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a colour or a chrominance component
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/503—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
- H04N19/51—Motion estimation or motion compensation
- H04N19/523—Motion estimation or motion compensation with sub-pixel accuracy
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/50—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
- H04N19/59—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving spatial sub-sampling or interpolation, e.g. alteration of picture size or resolution
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/80—Details of filtering operations specially adapted for video compression, e.g. for pixel interpolation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/025—Phonemes, fenemes or fenones being the recognition units
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
- G10L2025/932—Decision in previous or following frames
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Signal Processing (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Artificial Intelligence (AREA)
- Theoretical Computer Science (AREA)
- Evolutionary Computation (AREA)
- Biomedical Technology (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Biophysics (AREA)
- General Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Probability & Statistics with Applications (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Color Television Systems (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US202263363386P | 2022-04-21 | 2022-04-21 | |
| US63/363,386 | 2022-04-21 | ||
| PCT/US2023/019386 WO2023205409A1 (en) | 2022-04-21 | 2023-04-21 | Reference picture resampling for video coding |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| JP2025514816A true JP2025514816A (ja) | 2025-05-09 |
| JP2025514816A5 JP2025514816A5 (https=) | 2026-04-28 |
Family
ID=88415706
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| JP2024562190A Pending JP2025514816A (ja) | 2022-04-21 | 2023-04-21 | ビデオの符号化と復号化のための参照ピクチャのリサンプリング |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US20250240413A1 (https=) |
| EP (1) | EP4512075A4 (https=) |
| JP (1) | JP2025514816A (https=) |
| CN (1) | CN119054276A (https=) |
| MX (1) | MX2024012933A (https=) |
| WO (1) | WO2023205409A1 (https=) |
Family Cites Families (29)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6898245B2 (en) * | 2001-03-26 | 2005-05-24 | Telefonaktiebolaget Lm Ericsson (Publ) | Low complexity video decoding |
| US7991236B2 (en) * | 2006-10-16 | 2011-08-02 | Nokia Corporation | Discardable lower layer adaptations in scalable video coding |
| US8199812B2 (en) * | 2007-01-09 | 2012-06-12 | Qualcomm Incorporated | Adaptive upsampling for scalable video coding |
| US8107571B2 (en) * | 2007-03-20 | 2012-01-31 | Microsoft Corporation | Parameterized filters and signaling techniques |
| US8676308B2 (en) * | 2009-11-03 | 2014-03-18 | Boston Scientific Neuromodulation Corporation | System and method for mapping arbitrary electric fields to pre-existing lead electrodes |
| US20120075436A1 (en) * | 2010-09-24 | 2012-03-29 | Qualcomm Incorporated | Coding stereo video data |
| US9591303B2 (en) * | 2012-06-28 | 2017-03-07 | Qualcomm Incorporated | Random access and signaling of long-term reference pictures in video coding |
| US9584808B2 (en) * | 2013-02-22 | 2017-02-28 | Qualcomm Incorporated | Device and method for scalable coding of video information |
| US10284842B2 (en) * | 2013-03-05 | 2019-05-07 | Qualcomm Incorporated | Inter-layer reference picture construction for spatial scalability with different aspect ratios |
| US10291827B2 (en) * | 2013-11-22 | 2019-05-14 | Futurewei Technologies, Inc. | Advanced screen content coding solution |
| US10368097B2 (en) * | 2014-01-07 | 2019-07-30 | Nokia Technologies Oy | Apparatus, a method and a computer program product for coding and decoding chroma components of texture pictures for sample prediction of depth pictures |
| EP3092806A4 (en) * | 2014-01-07 | 2017-08-23 | Nokia Technologies Oy | Method and apparatus for video coding and decoding |
| US10091512B2 (en) * | 2014-05-23 | 2018-10-02 | Futurewei Technologies, Inc. | Advanced screen content coding with improved palette table and index map coding methods |
| FI20165547A1 (fi) * | 2016-06-30 | 2018-12-31 | Nokia Technologies Oy | Laitteisto, menetelmä ja tietokoneohjelma videokoodausta ja videokoodauksen purkua varten |
| US10382781B2 (en) * | 2016-09-28 | 2019-08-13 | Qualcomm Incorporated | Interpolation filters for intra prediction in video coding |
| US10341659B2 (en) * | 2016-10-05 | 2019-07-02 | Qualcomm Incorporated | Systems and methods of switching interpolation filters |
| JP2019036821A (ja) * | 2017-08-14 | 2019-03-07 | キヤノン株式会社 | 画像処理装置、画像処理方法、及びプログラム |
| CN108833918B (zh) * | 2018-06-20 | 2021-09-17 | 腾讯科技(深圳)有限公司 | 视频编码方法、解码方法、装置、计算机设备及存储介质 |
| US11277644B2 (en) * | 2018-07-02 | 2022-03-15 | Qualcomm Incorporated | Combining mode dependent intra smoothing (MDIS) with intra interpolation filter switching |
| US11190764B2 (en) * | 2018-07-06 | 2021-11-30 | Qualcomm Incorporated | Merged mode dependent intra smoothing (MDIS) and intra interpolation filter switching with position dependent intra prediction combination (PDPC) |
| CN112868238B (zh) * | 2018-10-23 | 2023-04-21 | 北京字节跳动网络技术有限公司 | 局部照明补偿和帧间预测编解码之间的并置 |
| CN112913247B (zh) * | 2018-10-23 | 2023-04-28 | 北京字节跳动网络技术有限公司 | 使用局部照明补偿的视频处理 |
| EP3700210A1 (en) * | 2019-02-21 | 2020-08-26 | Ateme | Method and apparatus for image encoding |
| SG11202112279WA (en) * | 2019-05-11 | 2021-12-30 | Beijing Bytedance Network Technology Co Ltd | Selective use of coding tools in video processing |
| US12143631B2 (en) * | 2019-06-23 | 2024-11-12 | Sharp Kabushiki Kaisha | Systems and methods for performing an adaptive resolution change in video coding |
| FR3098072B1 (fr) * | 2019-06-26 | 2021-08-06 | Ateme | Procédé de traitement d’un ensemble d’images d’une séquence vidéo |
| US11356707B2 (en) * | 2019-09-23 | 2022-06-07 | Qualcomm Incorporated | Signaling filters for video processing |
| CN112616057B (zh) * | 2019-10-04 | 2024-08-23 | Oppo广东移动通信有限公司 | 图像预测方法、编码器、解码器以及存储介质 |
| EP3945721B1 (en) * | 2020-07-30 | 2024-08-07 | Ateme | Method for image processing and apparatus for implementing the same |
-
2023
- 2023-04-21 US US18/857,157 patent/US20250240413A1/en active Pending
- 2023-04-21 JP JP2024562190A patent/JP2025514816A/ja active Pending
- 2023-04-21 EP EP23792575.5A patent/EP4512075A4/en active Pending
- 2023-04-21 CN CN202380034176.6A patent/CN119054276A/zh active Pending
- 2023-04-21 WO PCT/US2023/019386 patent/WO2023205409A1/en not_active Ceased
-
2024
- 2024-10-18 MX MX2024012933A patent/MX2024012933A/es unknown
Also Published As
| Publication number | Publication date |
|---|---|
| EP4512075A4 (en) | 2026-04-29 |
| CN119054276A (zh) | 2024-11-29 |
| WO2023205409A1 (en) | 2023-10-26 |
| MX2024012933A (es) | 2024-12-06 |
| EP4512075A1 (en) | 2025-02-26 |
| US20250240413A1 (en) | 2025-07-24 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN113950836B (zh) | 基于矩阵的帧内预测的上下文确定 | |
| KR102707051B1 (ko) | 광학 흐름 정제를 위한 조기 종료 | |
| JP5965054B2 (ja) | データ符号化及び復号化 | |
| CN113678459A (zh) | 基于矩阵的帧内预测的适应性 | |
| KR20220013939A (ko) | 행렬 기반 인트라 예측에서의 제한된 업샘플링 프로세스 | |
| JP7712988B2 (ja) | Mpmリストを構成する方法、クロマブロックのイントラ予測モードを取得する方法、および装置 | |
| CN113748676A (zh) | 帧内编解码模式下的矩阵推导 | |
| KR102359415B1 (ko) | 비디오 코딩을 위한 인터 예측 장치 및 방법을 위한 보간 필터 | |
| JP2022529432A5 (https=) | ||
| CN113875233A (zh) | 使用上采样的基于矩阵的帧内预测 | |
| CN113728629A (zh) | 视频译码中的运动向量推导 | |
| CN119835417B (zh) | 视频图像分量预测方法及装置、计算机存储介质 | |
| JP2025111512A (ja) | イントラ予測のためのデバイスおよび方法 | |
| US20250133232A1 (en) | Method for decoding, system, and method for intra predicting | |
| JP2025508549A (ja) | ビデオ符号化における局所照明補償のためのモデル調整 | |
| JP2024157004A (ja) | 映像信号をエンコーディングするための方法、計算デバイス、非一時的なコンピュータ可読記憶媒体、及びコンピュータプログラム。 | |
| JP2024537716A (ja) | 選択可能なニューラルネットワークベースのコーディングツールによるビデオコーディング | |
| US20260106979A1 (en) | Integerization for interpolation filter design in video coding | |
| KR20220097997A (ko) | 루프 필터링을 제어하는 영상 코딩 장치 및 방법 | |
| US20250240413A1 (en) | Reference picture resampling for video coding | |
| CN118829981A (zh) | 视频编码中插值滤波器设计的整数化 | |
| KR20210154103A (ko) | 루마 매핑 크로마 스케일링을 이용하는 영상 부호화 및 복호화 | |
| JP2026507821A (ja) | 映像エンコーディング/デコーディング方法及び装置、並びにビットストリームを保存した記録媒体 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20260417 |
|
| A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20260417 |