CL2023001573A1 - Immersive voice and audio services (IVAS) with adaptive downmix strategies - Google Patents
Immersive voice and audio services (IVAS) with adaptive downmix strategies
- Publication number
- CL2023001573A1
- Authority
- CL
- Chile
- Prior art keywords
- downmix
- gains
- channel
- channels
- primary
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/083—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being an excitation gain
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/24—Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/03—Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Quality & Reliability (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Stereophonic System (AREA)
Abstract
An audio signal encoding/decoding method is disclosed that uses an encoding downmix strategy applied at an encoder that differs from the decoding remix/upmix strategy applied at a decoder. Based on the type of downmix coding scheme, the method comprises: computing input downmix gains to be applied to the input audio signal to construct a primary downmix channel; determining downmix scaling gains for scaling the primary downmix channel; generating prediction gains based on the input audio signal, the input downmix gains, and the downmix scaling gains; determining residual channels from the side channels by using the primary downmix channel and the prediction gains to generate side channel predictions and subtracting the side channel predictions from the side channels; determining decorrelation gains based on the energy in the residual channels; encoding the primary downmix channel, the residual channels, the prediction gains, and the decorrelation gains; and sending the bitstream to a decoder.
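The encoder-side steps listed in the abstract can be illustrated with a minimal Python sketch. The abstract does not give formulas, so everything concrete here is an assumption made for illustration: the function name `encode_frame`, the treatment of the first input channel as the reference with the remaining channels as side channels, the least-squares predictor, and the energy-ratio form of the decorrelation gains. Quantization, bitstream packing, and the adaptive selection of gains are omitted.

```python
import math


def dot(a, b):
    """Inner product of two equal-length sample sequences."""
    return sum(x * y for x, y in zip(a, b))


def encode_frame(channels, downmix_gains, scale_gain):
    """Hypothetical one-frame encoder sketch (not the patented method).

    channels:      list of equal-length sample lists; channels[1:] are
                   treated as side channels (assumption).
    downmix_gains: one input downmix gain per input channel.
    scale_gain:    downmix scaling gain applied to the primary channel.
    """
    n = len(channels[0])
    # Primary downmix channel: scaled, weighted sum of the input channels.
    primary = [
        scale_gain * sum(g * ch[i] for g, ch in zip(downmix_gains, channels))
        for i in range(n)
    ]
    energy = dot(primary, primary)

    pred_gains, residuals, decorr_gains = [], [], []
    for side in channels[1:]:
        # Assumed least-squares prediction gain of the side channel
        # from the primary downmix channel.
        p = dot(side, primary) / energy if energy > 0 else 0.0
        # Residual = side channel minus its prediction.
        res = [s - p * m for s, m in zip(side, primary)]
        pred_gains.append(p)
        residuals.append(res)
        # Assumed decorrelation gain: residual energy relative to primary.
        d = math.sqrt(dot(res, res) / energy) if energy > 0 else 0.0
        decorr_gains.append(d)
    return primary, pred_gains, residuals, decorr_gains


if __name__ == "__main__":
    frame = [
        [1.0, 2.0, 3.0, 4.0],    # reference channel
        [0.5, 1.0, 1.5, 2.0],    # fully predictable side channel
        [1.0, -1.0, 1.0, -1.0],  # weakly correlated side channel
    ]
    primary, p, r, d = encode_frame(frame, [1.0, 0.0, 0.0], 1.0)
    print(p, d)
```

With a least-squares predictor, each residual is orthogonal to the primary downmix channel, so a side channel that is a scaled copy of the primary leaves a zero residual and a zero decorrelation gain under these assumptions.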
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US202063120365P | 2020-12-02 | 2020-12-02 | |
US202163171404P | 2021-04-06 | 2021-04-06 | |
US202163228732P | 2021-08-03 | 2021-08-03 |
Publications (1)
Publication Number | Publication Date |
---|---|
CL2023001573A1 (es) | 2023-11-03 |
Family
ID=79259444
Family Applications (1)
Application Number | Publication Number | Priority Date | Filing Date | Title |
---|---|---|---|---|
CL2023001573A | CL2023001573A1 (es) | 2020-12-02 | 2023-06-01 | Immersive voice and audio services (IVAS) with adaptive downmix strategies |
Country Status (10)
Country | Link |
---|---|
US (1) | US20240135937A1 (es) |
EP (1) | EP4256555A1 (es) |
JP (1) | JP2023551732A (es) |
KR (1) | KR20230116895A (es) |
AU (1) | AU2021393468A1 (es) |
CA (1) | CA3203960A1 (es) |
CL (1) | CL2023001573A1 (es) |
IL (1) | IL303377A (es) |
MX (1) | MX2023006501A (es) |
WO (1) | WO2022120093A1 (es) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
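TW202334938A (zh) | 2021-12-20 | 2023-09-01 | Dolby International AB (Sweden) | Spatial reconstruction filter bank for immersive audio and video services in the quadrature mirror filter domain |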
WO2023141034A1 (en) * | 2022-01-20 | 2023-07-27 | Dolby Laboratories Licensing Corporation | Spatial coding of higher order ambisonics for a low latency immersive audio codec |
WO2024097485A1 (en) | 2022-10-31 | 2024-05-10 | Dolby Laboratories Licensing Corporation | Low bitrate scene-based audio coding |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102160254B1 (ko) * | 2014-01-10 | 2020-09-25 | Samsung Electronics Co., Ltd. | Method and apparatus for reproducing stereophonic sound using an active downmix scheme |
US10986456B2 (en) * | 2017-10-05 | 2021-04-20 | Qualcomm Incorporated | Spatial relation coding using virtual higher order ambisonic coefficients |
-
2021
- 2021-12-02 JP JP2023533783A patent/JP2023551732A/ja active Pending
- 2021-12-02 AU AU2021393468A patent/AU2021393468A1/en active Pending
- 2021-12-02 KR KR1020237022333A patent/KR20230116895A/ko unknown
- 2021-12-02 MX MX2023006501A patent/MX2023006501A/es unknown
- 2021-12-02 IL IL303377A patent/IL303377A/en unknown
- 2021-12-02 WO PCT/US2021/061671 patent/WO2022120093A1/en active Application Filing
- 2021-12-02 EP EP21836685.4A patent/EP4256555A1/en active Pending
- 2021-12-02 CA CA3203960A patent/CA3203960A1/en active Pending
- 2021-12-02 US US18/327,623 patent/US20240135937A1/en active Pending
-
2023
- 2023-06-01 CL CL2023001573A patent/CL2023001573A1/es unknown
Also Published As
Publication number | Publication date |
---|---|
WO2022120093A1 (en) | 2022-06-09 |
MX2023006501A (es) | 2023-06-21 |
AU2021393468A1 (en) | 2023-07-20 |
CA3203960A1 (en) | 2022-06-09 |
KR20230116895A (ko) | 2023-08-04 |
EP4256555A1 (en) | 2023-10-11 |
JP2023551732A (ja) | 2023-12-12 |
US20240135937A1 (en) | 2024-04-25 |
IL303377A (en) | 2023-08-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CL2023001573A1 (es) | | Immersive voice and audio services (IVAS) with adaptive downmix strategies |
JP5922684B2 (ja) | | Multichannel decoding device |
KR102241915B1 (ko) | | Apparatus and method for stereo filling in multichannel coding |
JP5418930B2 (ja) | | Audio decoding method and audio decoder |
RU2017108988A (ru) | | Improved stereo coding based on a combination of adaptively selectable left/right or mid/side stereo coding and parametric stereo coding |
KR101253699B1 (ko) | | Temporal envelope shaping for spatial audio coding using frequency-domain Wiener filtering |
US11594235B2 (en) | | Noise filling in multichannel audio coding |
MX2022005146A (es) | | Bitrate distribution in immersive voice and audio services |
SE0402652D0 (sv) | | Methods for improved performance of prediction based multi-channel reconstruction |
KR20210122897A (ko) | | MDCT-based complex prediction stereo coding |
TWI521502B (zh) | | Hybrid encoding of higher-frequency and downmixed low-frequency content of multichannel audio |
KR102230668B1 (ko) | | Apparatus and method for MDCT M/S stereo with global ILD with improved mid/side decision |
US9454972B2 (en) | | Audio and speech coding device, audio and speech decoding device, method for coding audio and speech, and method for decoding audio and speech |
MY181486A (en) | | Audio encoder, audio decoder, method for encoding an audio signal and method for decoding an encoded audio signal |
ATE537537T1 (de) | | Signal compression method and apparatus |
CA2880412C (en) | | Apparatus and methods for adapting audio information in spatial audio object coding |
AU2013301831A1 (en) | | Encoder, decoder, system and method employing a residual concept for parametric audio object coding |
KR20070110111A (ko) | | Lossless encoding of information with guaranteed maximum bit rate |
MX2019011955A (es) | | Encoding and decoding of spectral peak positions |
KR20120038311A (ko) | | Apparatus and method for encoding spatial parameters, and apparatus and method for decoding spatial parameters |
KR20070044352A (ko) | | Method for encoding and decoding an audio signal, and apparatus for implementing the same |
KR101735619B1 (ko) | | Apparatus and method for encoding/decoding a multichannel signal |
KR101635099B1 (ko) | | Apparatus and method for encoding/decoding a multichannel signal |
AR120361A1 (es) | | Bitrate distribution in immersive voice and audio services |
KR20170054363A (ko) | | Apparatus and method for encoding/decoding a multichannel signal |