ES3035091T3 - Method and system for coding metadata in audio streams and for efficient bitrate allocation to audio streams coding - Google Patents

Method and system for coding metadata in audio streams and for efficient bitrate allocation to audio streams coding

Info

Publication number
ES3035091T3
Authority
ES
Spain
Prior art keywords
audio
bit
encoding
metadata
audio streams
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
ES20836269T
Other languages
English (en)
Spanish (es)
Inventor
Vaclav Eksler
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
VoiceAge Corp
Original Assignee
VoiceAge Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2019-07-08
Filing date
2020-07-07
Publication date
2025-08-28
Application filed by VoiceAge Corp
Application granted
Publication of ES3035091T3
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/002 Dynamic bit allocation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/167 Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78 Detection of presence or absence of voice signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
ES20836269T 2019-07-08 2020-07-07 Method and system for coding metadata in audio streams and for efficient bitrate allocation to audio streams coding Active ES3035091T3 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201962871253P 2019-07-08 2019-07-08
PCT/CA2020/050944 WO2021003570A1 (en) 2019-07-08 2020-07-07 Method and system for coding metadata in audio streams and for efficient bitrate allocation to audio streams coding

Publications (1)

Publication Number Publication Date
ES3035091T3 (en) 2025-08-28

Family

ID=74113835

Family Applications (1)

Application Number Status Publication Number Priority Date Filing Date Title
ES20836269T Active ES3035091T3 (en) 2019-07-08 2020-07-07 Method and system for coding metadata in audio streams and for efficient bitrate allocation to audio streams coding

Country Status (10)

Country Link
US (2) US12154582B2
EP (2) EP3997697B1
JP (3) JP7739255B2
KR (2) KR20220034102A
CN (2) CN114097028B
AU (2) AU2020310952A1
BR (2) BR112021026678A2
ES (1) ES3035091T3
MX (2) MX2021015476A
WO (2) WO2021003570A1

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2020310952A1 (en) * 2019-07-08 2022-01-20 Voiceage Corporation Method and system for coding metadata in audio streams and for efficient bitrate allocation to audio streams coding
GB2614482A (en) * 2020-09-25 2023-07-05 Apple Inc Seamless scalable decoding of channels, objects, and hoa audio content
JP7663418B2 (ja) * 2021-06-09 2025-04-16 日本放送協会 Audio metadata processing device and program
US20250225988A1 (en) * 2021-10-12 2025-07-10 Nokia Technologies Oy Delayed orientation signalling for immersive communications
EP4421804A4 (en) * 2021-10-21 2024-10-30 Beijing Xiaomi Mobile Software Co., Ltd. METHOD AND DEVICE FOR SIGNAL CODING AND DECODING AS WELL AS CODING DEVICE, DECODING DEVICE AND STORAGE MEDIUM
EP4428857A4 (en) * 2021-11-02 2024-10-30 Beijing Xiaomi Mobile Software Co., Ltd. METHOD AND DEVICE FOR SIGNAL CODING AND DECODING AS WELL AS USER DEVICE, NETWORK-SIDE DEVICE AND STORAGE MEDIUM
GB2628410B (en) * 2023-03-24 2025-09-17 Nokia Technologies Oy Low coding rate parametric spatial audio encoding
US12518772B2 (en) 2023-08-01 2026-01-06 Samsung Electronics Co., Ltd. Codec bitrate selection in audio object coding
CN120435737A (zh) * 2024-01-04 2025-08-05 北京小米移动软件有限公司 Encoding and decoding method, device, and storage medium

Family Cites Families (40)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5630011A (en) * 1990-12-05 1997-05-13 Digital Voice Systems, Inc. Quantization of harmonic amplitudes representing speech
US5311520A (en) * 1991-08-29 1994-05-10 At&T Bell Laboratories Method and apparatus for programmable memory control with error regulation and test functions
US7657427B2 (en) * 2002-10-11 2010-02-02 Nokia Corporation Methods and devices for source controlled variable bit-rate wideband speech coding
US9626973B2 (en) 2005-02-23 2017-04-18 Telefonaktiebolaget L M Ericsson (Publ) Adaptive bit allocation for multi-channel audio encoding
EP1866913B1 (en) 2005-03-30 2008-08-27 Koninklijke Philips Electronics N.V. Audio encoding and decoding
US8798776B2 (en) * 2008-09-30 2014-08-05 Dolby International Ab Transcoding of audio metadata
EP2375409A1 (en) 2010-04-09 2011-10-12 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder, audio decoder and related methods for processing multi-channel audio signals using complex prediction
KR102185941B1 (ko) * 2011-07-01 2020-12-03 돌비 레버러토리즈 라이쎈싱 코오포레이션 System and method for adaptive audio signal generation, coding and rendering
EP2873074A4 (en) * 2012-07-12 2016-04-13 Nokia Technologies Oy VECTORIAL QUANTIFICATION
MY178710A (en) * 2012-12-21 2020-10-20 Fraunhofer Ges Forschung Comfort noise addition for modeling background noise at low bit-rates
US9715880B2 (en) 2013-02-21 2017-07-25 Dolby International Ab Methods for parametric multi-channel encoding
US9852735B2 (en) 2013-05-24 2017-12-26 Dolby International Ab Efficient coding of audio scenes comprising audio objects
TWI615834B (zh) 2013-05-31 2018-02-21 Sony Corp Encoding device and method, decoding device and method, and program
EP2830047A1 (en) * 2013-07-22 2015-01-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for low delay object metadata coding
WO2015056383A1 (ja) * 2013-10-17 2015-04-23 パナソニック株式会社 Audio encoding device and audio decoding device
US9564136B2 (en) * 2014-03-06 2017-02-07 Dts, Inc. Post-encoding bitrate reduction of multiple object audio
WO2015150480A1 (en) 2014-04-02 2015-10-08 Dolby International Ab Exploiting metadata redundancy in immersive audio metadata
FR3020732A1 (fr) 2014-04-30 2015-11-06 Orange Improved frame loss correction with voicing information
EP2963949A1 (en) 2014-07-02 2016-01-06 Thomson Licensing Method and apparatus for decoding a compressed HOA representation, and method and apparatus for encoding a compressed HOA representation
WO2016013164A1 (ja) 2014-07-25 2016-01-28 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカ Acoustic signal encoding device, acoustic signal decoding device, acoustic signal encoding method, and acoustic signal decoding method
WO2016138502A1 (en) * 2015-02-27 2016-09-01 Arris Enterprises, Inc. Adaptive joint bitrate allocation
WO2016162283A1 (en) * 2015-04-07 2016-10-13 Dolby International Ab Audio coding with range extension
US9866596B2 (en) * 2015-05-04 2018-01-09 Qualcomm Incorporated Methods and systems for virtual conference system using personal communication devices
US10395664B2 (en) * 2016-01-26 2019-08-27 Dolby Laboratories Licensing Corporation Adaptive Quantization
US10573324B2 (en) * 2016-02-24 2020-02-25 Dolby International Ab Method and system for bit reservoir control in case of varying metadata
FR3048808A1 (fr) * 2016-03-10 2017-09-15 Orange Optimized coding and decoding of spatialization information for parametric coding and decoding of a multichannel audio signal
EP3605531B1 (en) 2017-03-28 2024-08-21 Sony Group Corporation Information processing device, information processing method, and program
US10354660B2 (en) 2017-04-28 2019-07-16 Cisco Technology, Inc. Audio frame labeling to achieve unequal error protection for audio frames of unequal importance
JP7045266B2 (ja) 2017-06-09 2022-03-31 日本放送協会 Acoustic signal auxiliary information conversion and transmission device and program
US10885921B2 (en) * 2017-07-07 2021-01-05 Qualcomm Incorporated Multi-stream audio coding
WO2019023488A1 (en) 2017-07-28 2019-01-31 Dolby Laboratories Licensing Corporation METHOD AND SYSTEM FOR PROVIDING MULTIMEDIA CONTENT TO A CUSTOMER
KR20250016479A (ko) 2017-09-20 2025-02-03 보이세지 코포레이션 Method and device for efficiently distributing a bit-budget in a CELP codec
US10854209B2 (en) 2017-10-03 2020-12-01 Qualcomm Incorporated Multi-stream audio coding
CN111164679B (zh) 2017-10-05 2024-04-09 索尼公司 Encoding device and method, decoding device and method, and program
US10999693B2 (en) * 2018-06-25 2021-05-04 Qualcomm Incorporated Rendering different portions of audio data using different renderers
GB2575305A (en) * 2018-07-05 2020-01-08 Nokia Technologies Oy Determination of spatial audio parameter encoding and associated decoding
US10359827B1 (en) * 2018-08-15 2019-07-23 Qualcomm Incorporated Systems and methods for power conservation in an audio bus
US11683487B2 (en) 2019-03-26 2023-06-20 Qualcomm Incorporated Block-based adaptive loop filter (ALF) with adaptive parameter set (APS) in video coding
KR102717379B1 (ko) * 2019-03-29 2024-10-15 텔레폰악티에볼라겟엘엠에릭슨(펍) Method and apparatus for error recovery in predictive coding in multichannel audio frames
AU2020310952A1 (en) 2019-07-08 2022-01-20 Voiceage Corporation Method and system for coding metadata in audio streams and for efficient bitrate allocation to audio streams coding

Also Published As

Publication number Publication date
BR112021026678A2 (pt) 2022-02-15
MX2021015660A (es) 2022-02-03
JP2022539884A (ja) 2022-09-13
CA3145047A1 (en) 2021-01-14
BR112021025420A2 (pt) 2022-02-01
KR20220034102A (ko) 2022-03-17
MX2021015476A (es) 2022-01-24
US20220319524A1 (en) 2022-10-06
JP2022539608A (ja) 2022-09-12
EP3997697A1 (en) 2022-05-18
WO2021003570A1 (en) 2021-01-14
WO2021003569A1 (en) 2021-01-14
JP7739255B2 (ja) 2025-09-16
AU2020310084A1 (en) 2022-01-20
US12154582B2 (en) 2024-11-26
CN114097028A (zh) 2022-02-25
CA3145045A1 (en) 2021-01-14
EP3997697B1 (en) 2025-05-28
CN114072874B (zh) 2025-10-17
EP3997698A1 (en) 2022-05-18
US12387734B2 (en) 2025-08-12
JP7699095B2 (ja) 2025-06-26
AU2020310084B2 (en) 2025-12-04
JP2025133926A (ja) 2025-09-11
EP3997697A4 (en) 2023-09-06
CN114097028B (zh) 2025-10-17
AU2020310952A1 (en) 2022-01-20
CN114072874A (zh) 2022-02-18
US20220238127A1 (en) 2022-07-28
EP3997698A4 (en) 2023-07-19
KR20220034103A (ko) 2022-03-17

Similar Documents

Publication Publication Date Title
ES3035091T3 (en) Method and system for coding metadata in audio streams and for efficient bitrate allocation to audio streams coding
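KR20200091880A (ko) Apparatus and method for encoding or decoding directional audio coding parameters using quantization and entropy coding
JP2017507365A (ja) Post-encoding bitrate reduction of multiple object audio
KR20250002792A (ko) Encoding device and encoding method, decoding device and decoding method, and program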
CA3145045C (en) Method and system for coding metadata in audio streams and for flexible intra-object and inter-object bitrate adaptation
CA3145047C (en) Method and system for coding metadata in audio streams and for efficient bitrate allocation to audio streams coding
US20070198256A1 (en) Method for middle/side stereo encoding and audio encoder using the same
HK40069813B (zh) Method and system for coding metadata in audio streams and for flexible intra-object and inter-object bitrate adaptation
HK40069013A (en) Method and system for coding metadata in audio streams and for efficient bitrate allocation to audio streams coding
HK40069013B (zh) Method and system for coding metadata in audio streams and for efficient bitrate allocation to audio streams coding
HK40069813A (en) Method and system for coding metadata in audio streams and for flexible intra-object and inter-object bitrate adaptation
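JP2025536102A (ja) Method and device for discontinuous transmission in an object-based audio codec
KR20250137598A (ko) Method and device for flexible combined-format bit-rate adaptation in an audio codec
KR20250065890A (ko) Decoder and decoding method for discontinuous transmission of parametrically coded independent streams with metadata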
WO2024052450A1 (en) Encoder and encoding method for discontinuous transmission of parametrically coded independent streams with metadata