BR112023023662A2 - Método e aparelho de codificação de sinal de áudio tridimensional e codificador - Google Patents
Método e aparelho de codificação de sinal de áudio tridimensional e codificadorInfo
- Publication number
- BR112023023662A2 BR112023023662A2 BR112023023662A BR112023023662A BR112023023662A2 BR 112023023662 A2 BR112023023662 A2 BR 112023023662A2 BR 112023023662 A BR112023023662 A BR 112023023662A BR 112023023662 A BR112023023662 A BR 112023023662A BR 112023023662 A2 BR112023023662 A2 BR 112023023662A2
- Authority
- BR
- Brazil
- Prior art keywords
- coefficients
- encoder
- audio signal
- representative
- dimensional audio
- Prior art date
Links
- 230000005236 sound signal Effects 0.000 title abstract 5
- 238000000034 method Methods 0.000 title abstract 4
- 230000006835 compression Effects 0.000 abstract 1
- 238000007906 compression Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/002—Dynamic bit allocation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/167—Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/032—Quantisation or dequantisation of spectral components
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/11—Application of ambisonics in stereophonic audio systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Stereophonic System (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
método e aparelho de codificação de sinal de áudio tridimensional e codificador. a presente invenção refere-se a um método e aparelho de codificação de sinal de áudio tridimensional e um codificador e refere-se ao campo de multimídia. o método inclui: após obter uma quarta quantidade de coeficientes para um quadro corrente de um sinal de áudio tridimensional e valores de características de domínio de frequência da quarta quantidade de coeficientes, um codificador seleciona uma terceira quantidade de coeficientes representativos da quarta quantidade de coeficientes com base nos valores de características de domínio de frequência da quarta quantidade de coeficientes e seleciona uma segunda quantidade de alto-falantes virtuais representativos para o quadro corrente de um conjunto de alto-falantes virtuais candidatos com base na terceira quantidade de coeficientes representativos e então codifica o quadro corrente com base na segunda quantidade de alto-falantes virtuais representativos para o quadro corrente para obter um fluxo de bits. o codificador seleciona os alto-falantes virtuais representativos do conjunto de alto-falantes virtuais candidatos usando uma pequena quantidade de coeficientes representativos para representar todos os coeficientes. isto efetivamente reduz a complexidade de cálculo executado pelo codificador para pesquisar por um alto-falante virtual e a complexidade de cálculo de executar codificação de compressão no sinal de áudio tridimensional e, portanto, reduz a carga de cálculo do codificador.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110535832.3A CN115376527A (zh) | 2021-05-17 | 2021-05-17 | 三维音频信号编码方法、装置和编码器 |
PCT/CN2022/091558 WO2022242480A1 (zh) | 2021-05-17 | 2022-05-07 | 三维音频信号编码方法、装置和编码器 |
Publications (1)
Publication Number | Publication Date |
---|---|
BR112023023662A2 true BR112023023662A2 (pt) | 2024-01-30 |
Family
ID=84059746
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
BR112023023662A BR112023023662A2 (pt) | 2021-05-17 | 2022-05-07 | Método e aparelho de codificação de sinal de áudio tridimensional e codificador |
Country Status (9)
Country | Link |
---|---|
US (1) | US20240087580A1 (pt) |
EP (1) | EP4322158A1 (pt) |
JP (1) | JP2024520944A (pt) |
KR (1) | KR20240001226A (pt) |
CN (1) | CN115376527A (pt) |
BR (1) | BR112023023662A2 (pt) |
CA (1) | CA3220588A1 (pt) |
TW (1) | TWI834163B (pt) |
WO (1) | WO2022242480A1 (pt) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN118136027A (zh) * | 2022-12-02 | 2024-06-04 | 华为技术有限公司 | 场景音频编码方法及电子设备 |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2469741A1 (en) * | 2010-12-21 | 2012-06-27 | Thomson Licensing | Method and apparatus for encoding and decoding successive frames of an ambisonics representation of a 2- or 3-dimensional sound field |
US20140358565A1 (en) * | 2013-05-29 | 2014-12-04 | Qualcomm Incorporated | Compression of decomposed representations of a sound field |
EP3591649B8 (en) * | 2014-03-21 | 2022-06-08 | Dolby International AB | Method and apparatus for decompressing a compressed hoa signal |
EP2934025A1 (en) * | 2014-04-15 | 2015-10-21 | Thomson Licensing | Method and device for applying dynamic range compression to a higher order ambisonics signal |
EP2963948A1 (en) * | 2014-07-02 | 2016-01-06 | Thomson Licensing | Method and apparatus for encoding/decoding of directions of dominant directional signals within subbands of a HOA signal representation |
EP2963949A1 (en) * | 2014-07-02 | 2016-01-06 | Thomson Licensing | Method and apparatus for decoding a compressed HOA representation, and method and apparatus for encoding a compressed HOA representation |
US9747910B2 (en) * | 2014-09-26 | 2017-08-29 | Qualcomm Incorporated | Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework |
EP3312833A1 (en) * | 2016-10-19 | 2018-04-25 | Holosbase GmbH | Decoding and encoding apparatus and corresponding methods |
IN201627036613A (pt) * | 2016-10-26 | 2016-11-18 | Qualcomm Inc | |
US11395083B2 (en) * | 2018-02-01 | 2022-07-19 | Qualcomm Incorporated | Scalable unified audio renderer |
CN114582356A (zh) * | 2020-11-30 | 2022-06-03 | 华为技术有限公司 | 一种音频编解码方法和装置 |
-
2021
- 2021-05-17 CN CN202110535832.3A patent/CN115376527A/zh active Pending
-
2022
- 2022-05-07 CA CA3220588A patent/CA3220588A1/en active Pending
- 2022-05-07 BR BR112023023662A patent/BR112023023662A2/pt unknown
- 2022-05-07 WO PCT/CN2022/091558 patent/WO2022242480A1/zh active Application Filing
- 2022-05-07 JP JP2023571383A patent/JP2024520944A/ja active Pending
- 2022-05-07 EP EP22803804.8A patent/EP4322158A1/en active Pending
- 2022-05-07 KR KR1020237040819A patent/KR20240001226A/ko unknown
- 2022-05-10 TW TW111117469A patent/TWI834163B/zh active
-
2023
- 2023-11-16 US US18/511,191 patent/US20240087580A1/en active Pending
Also Published As
Publication number | Publication date |
---|---|
JP2024520944A (ja) | 2024-05-27 |
KR20240001226A (ko) | 2024-01-03 |
WO2022242480A1 (zh) | 2022-11-24 |
US20240087580A1 (en) | 2024-03-14 |
CN115376527A (zh) | 2022-11-22 |
CA3220588A1 (en) | 2022-11-24 |
EP4322158A1 (en) | 2024-02-14 |
TW202247148A (zh) | 2022-12-01 |
TWI834163B (zh) | 2024-03-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP5269914B2 (ja) | ステレオ音響信号符号化装置、ステレオ音響信号復号装置およびそれらの方法 | |
KR101100213B1 (ko) | 오디오 신호 처리 방법 및 장치 | |
KR102144389B1 (ko) | 고차 앰비소닉스(hoa) 신호를 압축하는 방법, 압축된 hoa 신호를 압축 해제하는 방법, hoa 신호를 압축하기 위한 장치, 및 압축된 hoa 신호를 압축 해제하기 위한 장치 | |
KR102144976B1 (ko) | 고차 앰비소닉스(hoa) 신호를 압축하는 방법, 압축된 hoa 신호를 압축 해제하는 방법, hoa 신호를 압축하기 위한 장치, 및 압축된 hoa 신호를 압축 해제하기 위한 장치 | |
KR101343898B1 (ko) | 오디오 디코딩 방법 및 오디오 디코더 | |
JP2014518407A (ja) | 多チャンネル・オーディオ信号を処理する方法および装置 | |
US20220180881A1 (en) | Speech signal encoding and decoding methods and apparatuses, electronic device, and storage medium | |
JPWO2014199632A1 (ja) | 音響信号の帯域幅拡張を行う装置及び方法 | |
BR112023023662A2 (pt) | Método e aparelho de codificação de sinal de áudio tridimensional e codificador | |
KR102143037B1 (ko) | 고차 앰비소닉스(hoa) 신호를 압축하는 방법, 압축된 hoa 신호를 압축 해제하는 방법, hoa 신호를 압축하기 위한 장치, 및 압축된 hoa 신호를 압축 해제하기 위한 장치 | |
BRPI0507124A (pt) | aparelho e método para escalonamento de tempo de um sinal, programa de computador, e, portador de gravação | |
Diener et al. | Interspeech 2022 audio deep packet loss concealment challenge | |
MX2020007820A (es) | Codificador de escena de audio, decodificador de escena de audio y metodos relacionados que utilizan el analisis espacial hibrido de codificador / decodificador. | |
RU2017117896A (ru) | Кодирование и декодирование аудиосигналов | |
JP2017111230A5 (pt) | ||
BR112023000850A2 (pt) | Método e aparelho de estimativa de atraso de sinal de áudio estéreo, aparelho de codificação de áudio e meio de armazenamento legível por computador | |
RU2009116279A (ru) | Способы и устройства кодирования и декодирования объектно-ориентированных аудиосигналов | |
KR102425236B1 (ko) | 채널-간 위상 차이 파라미터 코딩 방법 및 디바이스 | |
Wu et al. | Parametric stereo coding scheme with a new downmix method and whole band inter channel time/phase differences | |
MX368973B (es) | Corrección de pérdida de trama mejorada con información de voz. | |
BR112022000623A2 (pt) | Método, aparelho e sistema para codificar e decodificar um bloco de amostras de vídeo | |
BR112023023916A2 (pt) | Método e aparelho de codificação de sinal de áudio tridimensional, e codificador | |
Kawahara et al. | Evaluation of the low-delay coding of applause and hand-clapping sounds caused by music appreciation | |
BR112023025071A2 (pt) | Método e aparelho de processamento de sinal de áudio tridimensional | |
CN115881139A (zh) | 编解码方法、装置、设备、存储介质及计算机程序 |