CL2023001573A1 - Immersive voice and audio services (IVAS) with adaptive downmix strategies

Immersive voice and audio services (IVAS) with adaptive downmix strategies

Info

Publication number
CL2023001573A1
Authority
CL
Chile
Prior art keywords
downmix
gains
channel
channels
primary
Prior art date
Application number
CL2023001573A
Other languages
English (en)
Inventor
David S Mcgrath
Rishabh Tyagi
Harald Mundt
Original Assignee
Dolby Laboratories Licensing Corp
Dolby Int Ab
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
2023-06-01
Publication date
2023-11-03
Application filed by Dolby Laboratories Licensing Corp, Dolby Int Ab
Publication of CL2023001573A1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/083 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being an excitation gain
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S7/00 Indicating arrangements; Control arrangements, e.g. balance control
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/18 Vocoders using multiple modes
    • G10L19/24 Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/03 Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Quality & Reliability (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Stereophonic System (AREA)

Abstract

An audio signal encoding/decoding method is disclosed that uses an encoding downmix strategy, applied at an encoder, that differs from the decoding re-mix/upmix strategy applied at a decoder. Based on the type of downmix coding scheme, the method comprises: computing input downmix gains to be applied to the input audio signal to construct a primary downmix channel; determining downmix scaling gains to scale the primary downmix channel; generating prediction gains based on the input audio signal, the input downmix gains, and the downmix scaling gains; determining residual channels from the side channels by using the primary downmix channel and the prediction gains to generate side channel predictions and subtracting the side channel predictions from the side channels; determining decorrelation gains based on the energy in the residual channels; encoding the primary downmix channel, the residual channels, the prediction gains, and the decorrelation gains; and sending the bitstream to a decoder.
CL2023001573A 2020-12-02 2023-06-01 Immersive voice and audio services (IVAS) with adaptive downmix strategies. CL2023001573A1 (es)
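
The encoder-side steps listed in the abstract can be illustrated with a minimal Python sketch. It is not the claimed method: the abstract does not say how any of the gains are computed, so the sketch assumes that the first input channel's companions are the side channels, that the prediction gains are least-squares projections onto the primary downmix channel, and that each decorrelation gain is the square root of the residual-to-primary energy ratio. The names `encode_frame`, `dot`, `downmix_gains`, and `scale_gain` are hypothetical, and quantization and bitstream packing are omitted.

```python
import math


def dot(a, b):
    """Inner product of two equal-length sample lists."""
    return sum(x * y for x, y in zip(a, b))


def encode_frame(channels, downmix_gains, scale_gain=1.0):
    """Sketch of one frame of the downmix / prediction / residual steps.

    channels: list of equal-length sample lists; channels[1:] are treated
              as the side channels (an assumption of this sketch).
    downmix_gains: one input downmix gain per input channel.
    scale_gain: downmix scaling gain applied to the primary channel.
    """
    n = len(channels[0])

    # Primary downmix channel: scaled, gain-weighted sum of the inputs.
    primary = [
        scale_gain * sum(g * ch[i] for g, ch in zip(downmix_gains, channels))
        for i in range(n)
    ]
    energy = dot(primary, primary)
    sides = channels[1:]

    # Prediction gains (assumed least-squares): project each side channel
    # onto the primary channel. They depend on the input signal and, through
    # the primary channel, on the downmix and scaling gains.
    pred_gains = [dot(s, primary) / energy if energy > 0 else 0.0 for s in sides]

    # Residual channels: side channel minus its prediction from the primary.
    residuals = [
        [s[i] - p * primary[i] for i in range(n)]
        for s, p in zip(sides, pred_gains)
    ]

    # Decorrelation gains (assumed form): residual energy relative to the
    # primary channel energy, expressed as an amplitude ratio.
    decorr_gains = [
        math.sqrt(dot(r, r) / energy) if energy > 0 else 0.0 for r in residuals
    ]
    return primary, residuals, pred_gains, decorr_gains


# Example frame: the first side channel is an exact multiple of the first
# input channel, the second side channel is only weakly related to it.
frame = [[1.0, 2.0, 3.0, 4.0], [2.0, 4.0, 6.0, 8.0], [1.0, -1.0, 1.0, -1.0]]
primary, residuals, pred_gains, decorr_gains = encode_frame(frame, [1.0, 0.0, 0.0])
```

With a least-squares prediction gain, each residual is orthogonal to the primary channel, so a side channel that is fully predictable from the primary leaves a zero residual and a zero decorrelation gain, while an unpredictable one keeps most of its energy in the residual.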

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US202063120365P 2020-12-02 2020-12-02
US202163171404P 2021-04-06 2021-04-06
US202163228732P 2021-08-03 2021-08-03

Publications (1)

Publication Number Publication Date
CL2023001573A1 (es) 2023-11-03

Family

ID=79259444

Family Applications (1)

Application Number Publication Priority Date Filing Date Title
CL2023001573A CL2023001573A1 (es) 2020-12-02 2023-06-01 Immersive voice and audio services (IVAS) with adaptive downmix strategies.

Country Status (10)

Country Link
US (1) US20240135937A1 (es)
EP (1) EP4256555A1 (es)
JP (1) JP2023551732A (es)
KR (1) KR20230116895A (es)
AU (1) AU2021393468A1 (es)
CA (1) CA3203960A1 (es)
CL (1) CL2023001573A1 (es)
IL (1) IL303377A (es)
MX (1) MX2023006501A (es)
WO (1) WO2022120093A1 (es)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
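TW202334938A (zh) 2021-12-20 2023-09-01 Dolby International AB (Sweden) Spatial reconstruction filter bank for immersive audio and video services in the quadrature mirror filter domain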
WO2023141034A1 (en) * 2022-01-20 2023-07-27 Dolby Laboratories Licensing Corporation Spatial coding of higher order ambisonics for a low latency immersive audio codec
WO2024097485A1 (en) 2022-10-31 2024-05-10 Dolby Laboratories Licensing Corporation Low bitrate scene-based audio coding

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102160254B1 (ko) * 2014-01-10 2020-09-25 Samsung Electronics Co., Ltd. Method and apparatus for reproducing three-dimensional sound using an active downmix scheme
US10986456B2 (en) * 2017-10-05 2021-04-20 Qualcomm Incorporated Spatial relation coding using virtual higher order ambisonic coefficients

Also Published As

Publication number Publication date
WO2022120093A1 (en) 2022-06-09
MX2023006501A (es) 2023-06-21
AU2021393468A1 (en) 2023-07-20
CA3203960A1 (en) 2022-06-09
KR20230116895A (ko) 2023-08-04
EP4256555A1 (en) 2023-10-11
JP2023551732A (ja) 2023-12-12
US20240135937A1 (en) 2024-04-25
IL303377A (en) 2023-08-01

Similar Documents

Publication Publication Date Title
CL2023001573A1 (es) Immersive voice and audio services (IVAS) with adaptive downmix strategies.
JP5922684B2 (ja) Multi-channel decoding device
KR102241915B1 (ko) Apparatus and method for stereo filling in multichannel coding
JP5418930B2 (ja) Audio decoding method and audio decoder
RU2017108988A (ru) Improved stereo coding based on a combination of adaptively selectable left/right or mid/side stereo coding and parametric stereo coding
KR101253699B1 (ko) Temporal envelope shaping for spatial audio coding using frequency-domain Wiener filtering
US11594235B2 (en) Noise filling in multichannel audio coding
MX2022005146A (es) Bitrate distribution in immersive voice and audio services.
SE0402652D0 (sv) Methods for improved performance of prediction based multi-channel reconstruction
KR20210122897A (ko) MDCT-based complex prediction stereo coding
TWI521502B (zh) Hybrid coding of higher-frequency and downmixed low-frequency content of multichannel audio
KR102230668B1 (ko) Apparatus and method for MDCT M/S stereo with global ILD with improved mid/side decision
US9454972B2 (en) Audio and speech coding device, audio and speech decoding device, method for coding audio and speech, and method for decoding audio and speech
MY181486A (en) Audio encoder, audio decoder, method for encoding an audio signal and method for decoding an encoded audio signal
ATE537537T1 (de) Signal compression method and apparatus
CA2880412C (en) Apparatus and methods for adapting audio information in spatial audio object coding
AU2013301831A1 (en) Encoder, decoder, system and method employing a residual concept for parametric audio object coding
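KR20070110111A (ko) Lossless encoding of information with guaranteed maximum bit rate
MX2019011955A (es) Encoding and decoding of spectral peak positions.
KR20120038311A (ko) Apparatus and method for encoding spatial parameters, and apparatus and method for decoding spatial parameters
KR20070044352A (ko) Method for encoding and decoding an audio signal, and apparatus for implementing the same
KR101735619B1 (ko) Apparatus and method for encoding/decoding a multi-channel signal
KR101635099B1 (ko) Apparatus and method for encoding/decoding a multi-channel signal
AR120361A1 (es) Bitrate distribution in immersive voice and audio services
KR20170054363A (ko) Apparatus and method for encoding/decoding a multi-channel signal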