KR20220034102A - 오디오 스트림에 있어서의 메타데이터를 코딩하고 가요성 객체간 및 객체내 비트레이트 적응화를 위한 방법 및 시스템 - Google Patents

오디오 스트림에 있어서의 메타데이터를 코딩하고 가요성 객체간 및 객체내 비트레이트 적응화를 위한 방법 및 시스템 Download PDF

Info

Publication number
KR20220034102A
KR20220034102A KR1020227000308A KR20227000308A KR20220034102A KR 20220034102 A KR20220034102 A KR 20220034102A KR 1020227000308 A KR1020227000308 A KR 1020227000308A KR 20227000308 A KR20227000308 A KR 20227000308A KR 20220034102 A KR20220034102 A KR 20220034102A
Authority
KR
South Korea
Prior art keywords
metadata
coding
audio
bit
parameter
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
KR1020227000308A
Other languages
English (en)
Korean (ko)
Inventor
바츨라브 엑슬러
Original Assignee
보이세지 코포레이션
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 보이세지 코포레이션 filed Critical 보이세지 코포레이션
Publication of KR20220034102A publication Critical patent/KR20220034102A/ko
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/002Dynamic bit allocation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
KR1020227000308A 2019-07-08 2020-07-07 오디오 스트림에 있어서의 메타데이터를 코딩하고 가요성 객체간 및 객체내 비트레이트 적응화를 위한 방법 및 시스템 Pending KR20220034102A (ko)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201962871253P 2019-07-08 2019-07-08
US62/871,253 2019-07-08
PCT/CA2020/050943 WO2021003569A1 (en) 2019-07-08 2020-07-07 Method and system for coding metadata in audio streams and for flexible intra-object and inter-object bitrate adaptation

Publications (1)

Publication Number Publication Date
KR20220034102A true KR20220034102A (ko) 2022-03-17

Family

ID=74113835

Family Applications (2)

Application Number Title Priority Date Filing Date
KR1020227000308A Pending KR20220034102A (ko) 2019-07-08 2020-07-07 오디오 스트림에 있어서의 메타데이터를 코딩하고 가요성 객체간 및 객체내 비트레이트 적응화를 위한 방법 및 시스템
KR1020227000309A Pending KR20220034103A (ko) 2019-07-08 2020-07-07 오디오 스트림에 있어서의 메타데이터를 코딩하고, 오디오 스트림 코딩에 효율적인 비트레이트 할당을 위한 방법 및 시스템

Family Applications After (1)

Application Number Title Priority Date Filing Date
KR1020227000309A Pending KR20220034103A (ko) 2019-07-08 2020-07-07 오디오 스트림에 있어서의 메타데이터를 코딩하고, 오디오 스트림 코딩에 효율적인 비트레이트 할당을 위한 방법 및 시스템

Country Status (10)

Country Link
US (2) US12154582B2 (https=)
EP (2) EP3997697B1 (https=)
JP (3) JP7739255B2 (https=)
KR (2) KR20220034102A (https=)
CN (2) CN114097028B (https=)
AU (2) AU2020310952A1 (https=)
BR (2) BR112021026678A2 (https=)
ES (1) ES3035091T3 (https=)
MX (2) MX2021015476A (https=)
WO (2) WO2021003570A1 (https=)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2020310952A1 (en) * 2019-07-08 2022-01-20 Voiceage Corporation Method and system for coding metadata in audio streams and for efficient bitrate allocation to audio streams coding
GB2614482A (en) * 2020-09-25 2023-07-05 Apple Inc Seamless scalable decoding of channels, objects, and hoa audio content
JP7663418B2 (ja) * 2021-06-09 2025-04-16 日本放送協会 音響メタデータ処理装置及びプログラム
US20250225988A1 (en) * 2021-10-12 2025-07-10 Nokia Technologies Oy Delayed orientation signalling for immersive communications
EP4421804A4 (en) * 2021-10-21 2024-10-30 Beijing Xiaomi Mobile Software Co., Ltd. METHOD AND DEVICE FOR SIGNAL CODING AND DECODING AS WELL AS CODING DEVICE, DECODING DEVICE AND STORAGE MEDIUM
EP4428857A4 (en) * 2021-11-02 2024-10-30 Beijing Xiaomi Mobile Software Co., Ltd. METHOD AND DEVICE FOR SIGNAL CODING AND DECODING AS WELL AS USER DEVICE, NETWORK-SIDE DEVICE AND STORAGE MEDIUM
GB2628410B (en) * 2023-03-24 2025-09-17 Nokia Technologies Oy Low coding rate parametric spatial audio encoding
US12518772B2 (en) 2023-08-01 2026-01-06 Samsung Electronics Co., Ltd. Codec bitrate selection in audio object coding
CN120435737A (zh) * 2024-01-04 2025-08-05 北京小米移动软件有限公司 编码和解码方法、设备及存储介质

Family Cites Families (40)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5630011A (en) * 1990-12-05 1997-05-13 Digital Voice Systems, Inc. Quantization of harmonic amplitudes representing speech
US5311520A (en) * 1991-08-29 1994-05-10 At&T Bell Laboratories Method and apparatus for programmable memory control with error regulation and test functions
US7657427B2 (en) * 2002-10-11 2010-02-02 Nokia Corporation Methods and devices for source controlled variable bit-rate wideband speech coding
US9626973B2 (en) 2005-02-23 2017-04-18 Telefonaktiebolaget L M Ericsson (Publ) Adaptive bit allocation for multi-channel audio encoding
EP1866913B1 (en) 2005-03-30 2008-08-27 Koninklijke Philips Electronics N.V. Audio encoding and decoding
US8798776B2 (en) * 2008-09-30 2014-08-05 Dolby International Ab Transcoding of audio metadata
EP2375409A1 (en) 2010-04-09 2011-10-12 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder, audio decoder and related methods for processing multi-channel audio signals using complex prediction
KR102185941B1 (ko) * 2011-07-01 2020-12-03 돌비 레버러토리즈 라이쎈싱 코오포레이션 적응형 오디오 신호 생성, 코딩 및 렌더링을 위한 시스템 및 방법
EP2873074A4 (en) * 2012-07-12 2016-04-13 Nokia Technologies Oy VECTORIAL QUANTIFICATION
MY178710A (en) * 2012-12-21 2020-10-20 Fraunhofer Ges Forschung Comfort noise addition for modeling background noise at low bit-rates
US9715880B2 (en) 2013-02-21 2017-07-25 Dolby International Ab Methods for parametric multi-channel encoding
US9852735B2 (en) 2013-05-24 2017-12-26 Dolby International Ab Efficient coding of audio scenes comprising audio objects
TWI615834B (zh) 2013-05-31 2018-02-21 Sony Corp 編碼裝置及方法、解碼裝置及方法、以及程式
EP2830047A1 (en) * 2013-07-22 2015-01-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for low delay object metadata coding
WO2015056383A1 (ja) * 2013-10-17 2015-04-23 パナソニック株式会社 オーディオエンコード装置及びオーディオデコード装置
US9564136B2 (en) * 2014-03-06 2017-02-07 Dts, Inc. Post-encoding bitrate reduction of multiple object audio
WO2015150480A1 (en) 2014-04-02 2015-10-08 Dolby International Ab Exploiting metadata redundancy in immersive audio metadata
FR3020732A1 (fr) 2014-04-30 2015-11-06 Orange Correction de perte de trame perfectionnee avec information de voisement
EP2963949A1 (en) 2014-07-02 2016-01-06 Thomson Licensing Method and apparatus for decoding a compressed HOA representation, and method and apparatus for encoding a compressed HOA representation
WO2016013164A1 (ja) 2014-07-25 2016-01-28 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカ 音響信号符号化装置、音響信号復号装置、音響信号符号化方法および音響信号復号方法
WO2016138502A1 (en) * 2015-02-27 2016-09-01 Arris Enterprises, Inc. Adaptive joint bitrate allocation
WO2016162283A1 (en) * 2015-04-07 2016-10-13 Dolby International Ab Audio coding with range extension
US9866596B2 (en) * 2015-05-04 2018-01-09 Qualcomm Incorporated Methods and systems for virtual conference system using personal communication devices
US10395664B2 (en) * 2016-01-26 2019-08-27 Dolby Laboratories Licensing Corporation Adaptive Quantization
US10573324B2 (en) * 2016-02-24 2020-02-25 Dolby International Ab Method and system for bit reservoir control in case of varying metadata
FR3048808A1 (fr) * 2016-03-10 2017-09-15 Orange Codage et decodage optimise d'informations de spatialisation pour le codage et le decodage parametrique d'un signal audio multicanal
EP3605531B1 (en) 2017-03-28 2024-08-21 Sony Group Corporation Information processing device, information processing method, and program
US10354660B2 (en) 2017-04-28 2019-07-16 Cisco Technology, Inc. Audio frame labeling to achieve unequal error protection for audio frames of unequal importance
JP7045266B2 (ja) 2017-06-09 2022-03-31 日本放送協会 音響信号補助情報変換伝送装置及びプログラム
US10885921B2 (en) * 2017-07-07 2021-01-05 Qualcomm Incorporated Multi-stream audio coding
WO2019023488A1 (en) 2017-07-28 2019-01-31 Dolby Laboratories Licensing Corporation METHOD AND SYSTEM FOR PROVIDING MULTIMEDIA CONTENT TO A CUSTOMER
KR20250016479A (ko) 2017-09-20 2025-02-03 보이세지 코포레이션 씨이엘피 코덱에 있어서 비트-예산을 효율적으로 분배하는 방법 및 디바이스
US10854209B2 (en) 2017-10-03 2020-12-01 Qualcomm Incorporated Multi-stream audio coding
CN111164679B (zh) 2017-10-05 2024-04-09 索尼公司 编码装置和方法、解码装置和方法以及程序
US10999693B2 (en) * 2018-06-25 2021-05-04 Qualcomm Incorporated Rendering different portions of audio data using different renderers
GB2575305A (en) * 2018-07-05 2020-01-08 Nokia Technologies Oy Determination of spatial audio parameter encoding and associated decoding
US10359827B1 (en) * 2018-08-15 2019-07-23 Qualcomm Incorporated Systems and methods for power conservation in an audio bus
US11683487B2 (en) 2019-03-26 2023-06-20 Qualcomm Incorporated Block-based adaptive loop filter (ALF) with adaptive parameter set (APS) in video coding
KR102717379B1 (ko) * 2019-03-29 2024-10-15 텔레폰악티에볼라겟엘엠에릭슨(펍) 멀티 채널 오디오 프레임에서 예측적인 코딩에서 에러 복구를 위한 방법 및 장치
AU2020310952A1 (en) 2019-07-08 2022-01-20 Voiceage Corporation Method and system for coding metadata in audio streams and for efficient bitrate allocation to audio streams coding

Also Published As

Publication number Publication date
BR112021026678A2 (pt) 2022-02-15
MX2021015660A (es) 2022-02-03
JP2022539884A (ja) 2022-09-13
CA3145047A1 (en) 2021-01-14
BR112021025420A2 (pt) 2022-02-01
MX2021015476A (es) 2022-01-24
US20220319524A1 (en) 2022-10-06
JP2022539608A (ja) 2022-09-12
EP3997697A1 (en) 2022-05-18
WO2021003570A1 (en) 2021-01-14
WO2021003569A1 (en) 2021-01-14
JP7739255B2 (ja) 2025-09-16
AU2020310084A1 (en) 2022-01-20
US12154582B2 (en) 2024-11-26
CN114097028A (zh) 2022-02-25
CA3145045A1 (en) 2021-01-14
EP3997697B1 (en) 2025-05-28
CN114072874B (zh) 2025-10-17
EP3997698A1 (en) 2022-05-18
US12387734B2 (en) 2025-08-12
JP7699095B2 (ja) 2025-06-26
AU2020310084B2 (en) 2025-12-04
JP2025133926A (ja) 2025-09-11
EP3997697A4 (en) 2023-09-06
CN114097028B (zh) 2025-10-17
AU2020310952A1 (en) 2022-01-20
CN114072874A (zh) 2022-02-18
US20220238127A1 (en) 2022-07-28
ES3035091T3 (en) 2025-08-28
EP3997698A4 (en) 2023-07-19
KR20220034103A (ko) 2022-03-17

Similar Documents

Publication Publication Date Title
JP7124170B2 (ja) セカンダリチャンネルを符号化するためにプライマリチャンネルのコーディングパラメータを使用するステレオ音声信号を符号化するための方法およびシステム
US12387734B2 (en) Method and system for coding metadata in audio streams and for flexible intra-object and inter-object bitrate adaptation
JP7285830B2 (ja) Celpコーデックにおいてサブフレーム間にビット配分を割り振るための方法およびデバイス
CA3145045C (en) Method and system for coding metadata in audio streams and for flexible intra-object and inter-object bitrate adaptation
CA3145047C (en) Method and system for coding metadata in audio streams and for efficient bitrate allocation to audio streams coding
HK40069813A (en) Method and system for coding metadata in audio streams and for flexible intra-object and inter-object bitrate adaptation
HK40069013A (en) Method and system for coding metadata in audio streams and for efficient bitrate allocation to audio streams coding
KR20250110811A (ko) 객체 기반 오디오 코텍에 있어서 불연속적 전송을 위한 방법 및 장치
HK40069813B (zh) 用於编解码音频流中的元数据及用於灵活对象内和对象间比特率适配的方法和系统
HK40069013B (zh) 用於编解码音频流中的元数据和用於对音频流编解码的有效比特率分配的方法和系统
KR20250137598A (ko) 오디오 코덱에 있어서 가요성 결합 포맷 비트-레이트 적응화를 위한 방법 및 디바이스
Eksler et al. Object-Based Audio Coding in Immersive Mobile Communications
HK40126637A (zh) 基於对象的音频编解码器中不连续传输的方法和设备
KR20250065890A (ko) 메타데이터가 있는 매개 변수적으로 코딩된 독립 스트림의 불연속 전송을 위한 디코더 및 디코딩 방법
KR20250067870A (ko) 메타데이터가 있는 매개 변수적으로 코딩된 독립 스트림의 불연속 전송을 위한 인코더 및 인코딩 방법
KR20230088409A (ko) 오디오 코덱에 있어서 오디오 대역폭 검출 및 오디오 대역폭 스위칭을 위한 방법 및 디바이스

Legal Events

Date Code Title Description
PA0105 International application

St.27 status event code: A-0-1-A10-A15-nap-PA0105

PG1501 Laying open of application

St.27 status event code: A-1-1-Q10-Q12-nap-PG1501

P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000

P13-X000 Application amended

St.27 status event code: A-2-2-P10-P13-nap-X000

PA0201 Request for examination

St.27 status event code: A-1-2-D10-D11-exm-PA0201

D21 Rejection of application intended

Free format text: ST27 STATUS EVENT CODE: A-1-2-D10-D21-EXM-PE0902 (AS PROVIDED BY THE NATIONAL OFFICE)

PE0902 Notice of grounds for rejection

St.27 status event code: A-1-2-D10-D21-exm-PE0902