BR112023001616A2 - Aparelho, método e programa de computador para codificar um sinal de áudio ou para decodificar uma cena de áudio codificada - Google Patents

Aparelho, método e programa de computador para codificar um sinal de áudio ou para decodificar uma cena de áudio codificada

Info

Publication number
BR112023001616A2
BR112023001616A2 BR112023001616A BR112023001616A BR112023001616A2 BR 112023001616 A2 BR112023001616 A2 BR 112023001616A2 BR 112023001616 A BR112023001616 A BR 112023001616A BR 112023001616 A BR112023001616 A BR 112023001616A BR 112023001616 A2 BR112023001616 A2 BR 112023001616A2
Authority
BR
Brazil
Prior art keywords
frame
signal
encoded audio
decoding
audio signal
Prior art date
Application number
BR112023001616A
Other languages
English (en)
Inventor
Fuchs Guillaume
Tamarapu Archit
EICHENSEER Andrea
Korse Srikanth
Döhla Stefan
Multrus Markus
Original Assignee
Fraunhofer Ges Forschung
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fraunhofer Ges Forschung filed Critical Fraunhofer Ges Forschung
Publication of BR112023001616A2 publication Critical patent/BR112023001616A2/pt

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032Quantisation or dequantisation of spectral components
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/012Comfort noise or silence coding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/167Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/173Transcoding, i.e. converting between two coded representations avoiding cascaded coding-decoding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93Discriminating between voiced and unvoiced parts of speech signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Mathematical Physics (AREA)
  • Stereophonic System (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

APARELHO, MÉTODO E PROGRAMA DE COMPUTADOR PARA CODIFICAR UM SINAL DE ÁUDIO OU PARA DECODIFICAR UMA CENA DE ÁUDIO CODIFICADA. A presente invenção refere-se a aparelho (200) para gerar cena de áudio codificada (304) e decodificá-la e/ou processá-la; e aos métodos relacionados e unidades de armazenamento não transitório cujas instruções, quando executadas por processador, acarretam executar tal método. Pode compreender, em primeiro quadro (346), primeira representação de parâmetro de campo do som (316) e sinal de áudio codificado (346), em que segundo quadro (348) é inativo; e compreende: detector de atividade (2200) segundo quadro (348) inativo; sintetizador de sinal sintético (210) para sinal de áudio sintético (228) para o segundo quadro (308) ao usar a descrição paramétrica (348) para o segundo quadro (308); decodificador de áudio (230) para sinal de áudio codificado (346) para o primeiro quadro (306); e renderizador espacial (240) para o sinal (202) para o primeiro quadro (306) ao usar a primeira representação de parâmetro de campo do som (316) e ao usar o sinal (228) para o segundo quadro (308), ou transcodificador para gerar formato de saída assistido por metadados que compreende o sinal (346) para o primeiro quadro (306), a primeira representação de parâmetro (316) para o primeiro quadro (306), o sinal (228) para o segundo quadro (308) e segunda representação de parâmetro (318) para o segundo quadro (308).
BR112023001616A 2020-07-30 2021-05-31 Aparelho, método e programa de computador para codificar um sinal de áudio ou para decodificar uma cena de áudio codificada BR112023001616A2 (pt)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP20188707 2020-07-30
PCT/EP2021/064576 WO2022022876A1 (en) 2020-07-30 2021-05-31 Apparatus, method and computer program for encoding an audio signal or for decoding an encoded audio scene

Publications (1)

Publication Number Publication Date
BR112023001616A2 true BR112023001616A2 (pt) 2023-02-23

Family

ID=71894727

Family Applications (1)

Application Number Title Priority Date Filing Date
BR112023001616A BR112023001616A2 (pt) 2020-07-30 2021-05-31 Aparelho, método e programa de computador para codificar um sinal de áudio ou para decodificar uma cena de áudio codificada

Country Status (12)

Country Link
US (1) US20230306975A1 (pt)
EP (1) EP4189674A1 (pt)
JP (1) JP2023536156A (pt)
KR (1) KR20230049660A (pt)
CN (1) CN116348951A (pt)
AU (2) AU2021317755B2 (pt)
BR (1) BR112023001616A2 (pt)
CA (1) CA3187342A1 (pt)
MX (1) MX2023001152A (pt)
TW (2) TW202347316A (pt)
WO (1) WO2022022876A1 (pt)
ZA (1) ZA202301024B (pt)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3719799A1 (en) * 2019-04-04 2020-10-07 FRAUNHOFER-GESELLSCHAFT zur Förderung der angewandten Forschung e.V. A multi-channel audio encoder, decoder, methods and computer program for switching between a parametric multi-channel operation and an individual channel operation
WO2024051955A1 (en) 2022-09-09 2024-03-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Decoder and decoding method for discontinuous transmission of parametrically coded independent streams with metadata
WO2024051954A1 (en) 2022-09-09 2024-03-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Encoder and encoding method for discontinuous transmission of parametrically coded independent streams with metadata
WO2024056702A1 (en) * 2022-09-13 2024-03-21 Telefonaktiebolaget Lm Ericsson (Publ) Adaptive inter-channel time difference estimation
CN116368460A (zh) * 2023-02-14 2023-06-30 北京小米移动软件有限公司 音频处理方法、装置

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SE0004187D0 (sv) * 2000-11-15 2000-11-15 Coding Technologies Sweden Ab Enhancing the performance of coding systems that use high frequency reconstruction methods
JP5753540B2 (ja) * 2010-11-17 2015-07-22 パナソニック インテレクチュアル プロパティ コーポレーション オブアメリカPanasonic Intellectual Property Corporation of America ステレオ信号符号化装置、ステレオ信号復号装置、ステレオ信号符号化方法及びステレオ信号復号方法
TWI543642B (zh) * 2011-07-01 2016-07-21 杜比實驗室特許公司 用於適應性音頻信號的產生、譯碼與呈現之系統與方法
PL2823479T3 (pl) * 2012-09-11 2015-10-30 Ericsson Telefon Ab L M Generowanie szumu komfortowego
US9489955B2 (en) * 2014-01-30 2016-11-08 Qualcomm Incorporated Indicating frame parameter reusability for coding vectors
CN110459229B (zh) * 2014-06-27 2023-01-10 杜比国际公司 用于解码声音或声场的高阶高保真度立体声响复制(hoa)表示的方法
CA2987808C (en) * 2016-01-22 2020-03-10 Guillaume Fuchs Apparatus and method for encoding or decoding an audio multi-channel signal using spectral-domain resampling
CN107742521B (zh) * 2016-08-10 2021-08-13 华为技术有限公司 多声道信号的编码方法和编码器
CN117392988A (zh) * 2016-09-28 2024-01-12 华为技术有限公司 一种处理多声道音频信号的方法、装置和系统
BR112020026793A2 (pt) * 2018-06-28 2021-03-30 Telefonaktiebolaget Lm Ericsson (Publ) Determinação de parâmetro de ruído de conforto adaptativo
CN109448741B (zh) * 2018-11-22 2021-05-11 广州广晟数码技术有限公司 一种3d音频编码、解码方法及装置

Also Published As

Publication number Publication date
WO2022022876A1 (en) 2022-02-03
TW202230333A (zh) 2022-08-01
TW202347316A (zh) 2023-12-01
MX2023001152A (es) 2023-04-05
EP4189674A1 (en) 2023-06-07
AU2021317755A1 (en) 2023-03-02
KR20230049660A (ko) 2023-04-13
TWI794911B (zh) 2023-03-01
AU2021317755B2 (en) 2023-11-09
US20230306975A1 (en) 2023-09-28
AU2023286009A1 (en) 2024-01-25
CA3187342A1 (en) 2022-02-03
ZA202301024B (en) 2024-04-24
CN116348951A (zh) 2023-06-27
JP2023536156A (ja) 2023-08-23

Similar Documents

Publication Publication Date Title
BR112023001616A2 (pt) Aparelho, método e programa de computador para codificar um sinal de áudio ou para decodificar uma cena de áudio codificada
BRPI0515343A8 (pt) Codificador e decodificador de áudio, métodos de codificar um sinal de áudio e de decodificar um sinal de áudio codificado, sinal de áudio codificado, meio de armazenamento, dispositivo, e, código de programa legível por computador
BR112015025080A2 (pt) codificador e decodificador de áudio estereofônico
MX354657B (es) Codificador de audio, decodificador de audio y métodos relacionados que usan procesamiento de dos canales dentro de un marco de relleno inteligente de espacios.
DE602007012730D1 (de) Kodierung und dekodierung von audio-objekten
MX349398B (es) Metodo, aparato y programa de computadora para evitar artefactos de recorte.
US20060215754A1 (en) Method and apparatus for performing video decoding in a multi-thread environment
JP2015527610A5 (pt)
IL292856A (en) Adaptive processing with multiple media processor nodes
BR112013017067A2 (pt) dispositivo de codificação de vídeo para codificar dados de vídeo, método de codificação de vídeo, dispositivo de decodificação de vídeo, sistema de decodificação de vídeo e programa de computador
MX2016005542A (es) Decodificador de audio y metodo para proveer una informacion de audio decodificada usando un ocultamiento de error que modifica una señal de excitacion de dominio de tiempo.
BR112015007723A2 (pt) codificador e decodificador de áudio com sonoridade de programa e metadados de limite
BR112015025393A2 (pt) Sistema, método e meio legível por computador
ATE456261T1 (de) Audiokodierung und audiodekodierung
BR112015031180A2 (pt) aparelho e método para desvanecimento de sinal aperfeiçoado para sistemas de codificação de áudio comutação durante ocultação de erros
JP2015516758A5 (pt)
WO2012138819A3 (en) Audio encoding method and system for generating a unified bitstream decodable by decoders implementing different decoding protocols
BR112013020878A2 (pt) dispositivos e métodos de codificação e decodificação de imagem
BR112013032727A2 (pt) processador de sinais de áudio e método de processamento de sinais de áudio
BR112015007532A8 (pt) codificador, decodificador e métodos para codificação de objeto de áudio espacial multirresolução compatível regressivo
BR122022004784B8 (pt) Método de decodificação em um sistema de processamento de áudio de múltiplos canais e decodificador para um sistema de processamento de áudio de múltiplos canais
MX351193B (es) Codificador, decodificador, sistema y metodo que emplean un concepto residual para codificar objetos de audio parametricos.
EP2881942B1 (en) Watermark insertion in frequency domain for audio decoding
US9905232B2 (en) Device and method for encoding and decoding of an audio signal
RU2015104055A (ru) Устройство и способы для адаптации аудиоинформации при пространственном кодировании аудиообъектов