MX2022005146A - Bitrate distribution in immersive voice and audio services

Bitrate distribution in immersive voice and audio services.

Info

Publication number
MX2022005146A
Authority
MX
Mexico
Prior art keywords
metadata
bitstream
downmix
bitrate distribution
bitrates
Prior art date
Application number
MX2022005146A
Other languages
English (en)
Inventor
Juan Felix Torres
Stefanie Brown
Rishabh Tyagi
Original Assignee
Dolby Laboratories Licensing Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dolby Laboratories Licensing Corp
Publication of MX2022005146A

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032 Quantisation or dequantisation of spectral components
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/167 Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/002 Dynamic bit allocation

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Stereophonic System (AREA)

Abstract

Embodiments are disclosed for bitrate distribution in immersive voice and audio services (IVAS). In one embodiment, a method of encoding an IVAS bitstream comprises: receiving an input audio signal; downmixing the input audio signal into one or more downmix channels and spatial metadata; reading a set of one or more bitrates for the downmix channels and a set of quantization levels for the spatial metadata from a bitrate distribution control table; determining a combination of the one or more bitrates for the downmix channels; determining a metadata quantization level from the set of metadata quantization levels using a bitrate distribution process; quantizing and coding the spatial metadata using the metadata quantization level; generating, using the combination of one or more bitrates, a downmix bitstream for the one or more downmix channels; and combining the downmix bitstream, the quantized and coded spatial metadata, and the set of quantization levels into the IVAS bitstream.
MX2022005146A 2019-10-30 2020-10-28 Bitrate distribution in immersive voice and audio services. MX2022005146A (es)
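
The abstract names a bitrate distribution process but does not spell it out. The following Python sketch shows one way such a process could work, purely as an illustration: the `TableRow` fields, the `estimate_metadata_bps` cost model, the 50 frames-per-second rate, and the equal sharing of surplus bits are all assumptions and are not taken from the patent text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TableRow:
    """One hypothetical row of a bitrate distribution control table."""
    total_bps: int          # total bitrate budget for this operating mode
    downmix_min_bps: tuple  # assumed minimum bitrate for each downmix channel
    quant_levels: tuple     # candidate metadata quantization levels, finest first


def estimate_metadata_bps(level: int, num_bands: int, frames_per_second: int = 50) -> int:
    """Placeholder cost model (assumption): `level` bits per band per frame."""
    return level * num_bands * frames_per_second


def distribute(row: TableRow, num_bands: int):
    """Pick the finest quantization level whose metadata cost still leaves
    every downmix channel at least its assumed minimum bitrate."""
    floor = sum(row.downmix_min_bps)
    n = len(row.downmix_min_bps)
    for level in row.quant_levels:
        md_bps = estimate_metadata_bps(level, num_bands)
        remaining = row.total_bps - md_bps
        if remaining >= floor:
            # Share the surplus equally; the first channels absorb any remainder.
            share, extra = divmod(remaining - floor, n)
            downmix = [
                m + share + (1 if i < extra else 0)
                for i, m in enumerate(row.downmix_min_bps)
            ]
            return level, md_bps, downmix
    raise ValueError("no quantization level fits the bitrate budget")


if __name__ == "__main__":
    row = TableRow(total_bps=32000, downmix_min_bps=(13200, 9600), quant_levels=(5, 4, 3, 2))
    print(distribute(row, num_bands=12))
```

In this sketch the metadata is served first and the downmix channels receive whatever remains, so the chosen level and the per-channel bitrates always add up to the row's total budget. A real encoder would measure the coded metadata size rather than estimate it.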

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201962927772P 2019-10-30 2019-10-30
US202063092830P 2020-10-16 2020-10-16
PCT/US2020/057737 WO2021086965A1 (en) 2019-10-30 2020-10-28 Bitrate distribution in immersive voice and audio services

Publications (1)

Publication Number Publication Date
MX2022005146A (es) 2022-05-30

Family

ID=73476272

Family Applications (1)

Application Number Title Priority Date Filing Date
MX2022005146A MX2022005146A (es) 2019-10-30 2020-10-28 Bitrate distribution in immersive voice and audio services.

Country Status (12)

Country Link
US (1) US20220406318A1 (es)
EP (1) EP4052256A1 (es)
JP (1) JP2023500632A (es)
KR (1) KR20220088864A (es)
CN (1) CN114616621A (es)
AU (1) AU2020372899A1 (es)
BR (1) BR112022007735A2 (es)
CA (1) CA3156634A1 (es)
IL (1) IL291655A (es)
MX (1) MX2022005146A (es)
TW (3) TW202410024A (es)
WO (1) WO2021086965A1 (es)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2023533665A (ja) * 2020-06-11 2023-08-04 Dolby Laboratories Licensing Corporation Quantization and entropy coding of parameters for a low latency audio codec
WO2023141034A1 (en) * 2022-01-20 2023-07-27 Dolby Laboratories Licensing Corporation Spatial coding of higher order ambisonics for a low latency immersive audio codec
WO2024012666A1 (en) * 2022-07-12 2024-01-18 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for encoding or decoding ar/vr metadata with generic codebooks
GB2623516A (en) * 2022-10-17 2024-04-24 Nokia Technologies Oy Parametric spatial audio encoding
WO2024097485A1 (en) 2022-10-31 2024-05-10 Dolby Laboratories Licensing Corporation Low bitrate scene-based audio coding

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI396188B (zh) * 2005-08-02 2013-05-11 Dolby Lab Licensing Corp Technique for controlling spatial audio coding parameters as a function of auditory events
AR077680A1 (es) * 2009-08-07 2011-09-14 Dolby Int Ab Authentication of data streams
EP2862166B1 (en) * 2012-06-14 2018-03-07 Dolby International AB Error concealment strategy in a decoding system
EP2838086A1 (en) * 2013-07-22 2015-02-18 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Reduction of comb filter artifacts in multi-channel downmix with adaptive phase alignment
US10885921B2 (en) * 2017-07-07 2021-01-05 Qualcomm Incorporated Multi-stream audio coding
CN110945494B (zh) * 2017-07-28 2024-06-21 Dolby Laboratories Licensing Corporation Method and system for providing media content to a client
US10854209B2 (en) * 2017-10-03 2020-12-01 Qualcomm Incorporated Multi-stream audio coding
CA3219540A1 (en) * 2017-10-04 2019-04-11 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Apparatus, method and computer program for encoding, decoding, scene processing and other procedures related to dirac based spatial audio coding
WO2019106221A1 (en) * 2017-11-28 2019-06-06 Nokia Technologies Oy Processing of spatial audio parameters
WO2020008112A1 (en) * 2018-07-03 2020-01-09 Nokia Technologies Oy Energy-ratio signalling and synthesis
GB2586214A (en) * 2019-07-31 2021-02-17 Nokia Technologies Oy Quantization of spatial audio direction parameters
GB2595891A (en) * 2020-06-10 2021-12-15 Nokia Technologies Oy Adapting multi-source inputs for constant rate encoding

Also Published As

Publication number Publication date
TW202410024A (zh) 2024-03-01
IL291655A (en) 2022-05-01
AU2020372899A1 (en) 2022-04-21
KR20220088864A (ko) 2022-06-28
WO2021086965A1 (en) 2021-05-06
CA3156634A1 (en) 2021-05-06
CN114616621A (zh) 2022-06-10
TW202230332A (zh) 2022-08-01
TWI762008B (zh) 2022-04-21
BR112022007735A2 (pt) 2022-07-12
JP2023500632A (ja) 2023-01-10
TW202135046A (zh) 2021-09-16
TWI821966B (zh) 2023-11-11
EP4052256A1 (en) 2022-09-07
US20220406318A1 (en) 2022-12-22

Similar Documents

Publication Publication Date Title
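MX2022005146A (es) Bitrate distribution in immersive voice and audio services.
KR101418661B1 (ko) Apparatus for providing an upmix signal representation on the basis of a downmix signal representation, apparatus for providing a bitstream representing a multichannel audio signal, methods, computer program and bitstream using distortion control signaling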
US11594235B2 (en) Noise filling in multichannel audio coding
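KR101449434B1 (ko) Method and apparatus for encoding/decoding multi-channel audio using a plurality of variable-length code tables
TWI521502B (zh) Hybrid encoding of higher frequency and downmixed low frequency content of multichannel audio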
US9378748B2 (en) Reduced complexity converter SNR calculation
KR20170113667A (ko) Decoding of audio bitstreams using enhanced spectral band replication metadata in at least one fill element
MX2020009581A (es) Methods and devices for encoding and/or decoding immersive audio signals.
US10176812B2 (en) Decoder and method for multi-instance spatial-audio-object-coding employing a parametric concept for multichannel downmix/upmix cases
KR101837686B1 (ko) Apparatus and method for adapting audio information in spatial audio object coding
MX2022001152A (es) Encoding and decoding IVAS bitstreams.
CN109074812A (zh) Apparatus and method for MDCT M/S stereo with global ILD and improved mid/side decision
CL2023001573A1 (es) Immersive voice and audio services (IVAS) with adaptive downmix strategies.
EP3997697A1 (en) Method and system for coding metadata in audio streams and for efficient bitrate allocation to audio streams coding
TWI501220B (zh) Embedding and extracting ancillary data
JP2016530789A (ja) Apparatus and method for decoding an encoded audio signal to obtain a modified output signal
US20240153512A1 (en) Audio codec with adaptive gain control of downmixed signals
AR120361A1 (es) Bitrate distribution in immersive voice and audio services
KR20080035448A (ko) Method and apparatus for encoding/decoding a multi-channel audio signal
Kim et al. Mastering signal processing in mpeg saoc
KR20070041336A (ko) Method of encoding and decoding an audio signal, and apparatus for implementing the same