EP4723108A2 - Parametrische räumliche audiokodierung - Google Patents

Parametrische räumliche audiokodierung

Info

Publication number
EP4723108A2
EP4723108A2 EP26159862.7A EP26159862A EP4723108A2 EP 4723108 A2 EP4723108 A2 EP 4723108A2 EP 26159862 A EP26159862 A EP 26159862A EP 4723108 A2 EP4723108 A2 EP 4723108A2
Authority
EP
European Patent Office
Prior art keywords
vector
audio
ratio
generating
quantized
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
EP26159862.7A
Other languages
English (en)
French (fr)
Inventor
Adriana Vasilache
Mikko-Ville Laitinen
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nokia Technologies Oy
Original Assignee
Nokia Technologies Oy
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nokia Technologies Oy filed Critical Nokia Technologies Oy
Publication of EP4723108A2 publication Critical patent/EP4723108A2/de
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032Quantisation or dequantisation of spectral components
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/20Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0004Design or structure of the codebook
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0007Codebook element generation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/15Aspects of sound capture and related signal processing for recording or reproduction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/03Application of parametric coding in stereophonic audio systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Mathematical Physics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
EP26159862.7A 2022-11-29 2023-11-07 Parametrische räumliche audiokodierung Pending EP4723108A2 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
GB2217884.2A GB2624869A (en) 2022-11-29 2022-11-29 Parametric spatial audio encoding
PCT/EP2023/080907 WO2024115052A1 (en) 2022-11-29 2023-11-07 Parametric spatial audio encoding
EP23805488.6A EP4627572B1 (de) 2022-11-29 2023-11-07 Parametrische räumliche audiokodierung

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
EP23805488.6A Division EP4627572B1 (de) 2022-11-29 2023-11-07 Parametrische räumliche audiokodierung

Publications (1)

Publication Number Publication Date
EP4723108A2 true EP4723108A2 (de) 2026-04-08

Family

ID=84889624

Family Applications (2)

Application Number Title Priority Date Filing Date
EP23805488.6A Active EP4627572B1 (de) 2022-11-29 2023-11-07 Parametrische räumliche audiokodierung
EP26159862.7A Pending EP4723108A2 (de) 2022-11-29 2023-11-07 Parametrische räumliche audiokodierung

Family Applications Before (1)

Application Number Title Priority Date Filing Date
EP23805488.6A Active EP4627572B1 (de) 2022-11-29 2023-11-07 Parametrische räumliche audiokodierung

Country Status (9)

Country Link
EP (2) EP4627572B1 (de)
JP (1) JP2025540763A (de)
KR (1) KR20250088634A (de)
CN (1) CN120226075B (de)
AU (1) AU2023405234B2 (de)
CO (1) CO2025006784A2 (de)
GB (1) GB2624869A (de)
MX (1) MX2025006029A (de)
WO (1) WO2024115052A1 (de)

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2814028B1 (de) * 2012-02-10 2016-08-17 Panasonic Intellectual Property Corporation of America Audio- und sprachcodierungsvorrichtung, audio- und sprachdecodierungsvorrichtung, audio- und sprachcodierungsverfahren sowie audio- und sprachdecodierungsverfahren
WO2014108738A1 (en) * 2013-01-08 2014-07-17 Nokia Corporation Audio signal multi-channel parameter encoder
CN106030703B (zh) * 2013-12-17 2020-02-04 诺基亚技术有限公司 音频信号编码器
GB2578603A (en) * 2018-10-31 2020-05-20 Nokia Technologies Oy Determination of spatial audio parameter encoding and associated decoding
CA3193359A1 (en) * 2019-06-14 2020-12-17 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Parameter encoding and decoding
GB2585187A (en) * 2019-06-25 2021-01-06 Nokia Technologies Oy Determination of spatial audio parameter encoding and associated decoding
GB2586214A (en) * 2019-07-31 2021-02-17 Nokia Technologies Oy Quantization of spatial audio direction parameters
KR20220062621A (ko) * 2019-09-17 2022-05-17 노키아 테크놀로지스 오와이 공간적 오디오 파라미터 인코딩 및 관련 디코딩
GB2592896A (en) * 2020-01-13 2021-09-15 Nokia Technologies Oy Spatial audio parameter encoding and associated decoding
WO2022200666A1 (en) * 2021-03-22 2022-09-29 Nokia Technologies Oy Combining spatial audio streams
WO2022223133A1 (en) * 2021-04-23 2022-10-27 Nokia Technologies Oy Spatial audio parameter encoding and associated decoding

Also Published As

Publication number Publication date
JP2025540763A (ja) 2025-12-16
CO2025006784A2 (es) 2025-06-06
CN120226075B (zh) 2026-03-13
EP4627572B1 (de) 2026-03-04
EP4627572C0 (de) 2026-03-04
KR20250088634A (ko) 2025-06-17
MX2025006029A (es) 2025-06-02
GB2624869A (en) 2024-06-05
AU2023405234A1 (en) 2025-05-29
EP4627572A1 (de) 2025-10-08
WO2024115052A1 (en) 2024-06-06
GB202217884D0 (en) 2023-01-11
AU2023405234B2 (en) 2025-09-25
CN120226075A (zh) 2025-06-27

Similar Documents

Publication Publication Date Title
EP4082009B1 (de) Zusammenführen von räumlichen audioparametern
US20240185869A1 (en) Combining spatial audio streams
CN116762127A (zh) 量化空间音频参数
WO2022223133A1 (en) Spatial audio parameter encoding and associated decoding
WO2024115050A1 (en) Parametric spatial audio encoding
EP4690186A1 (de) Parametrische räumliche audiokodierung mit niedriger kodierrate
EP4627572B1 (de) Parametrische räumliche audiokodierung
US20230335143A1 (en) Quantizing spatial audio parameters
EP4278347B1 (de) Transformation räumlicher audioparameter
WO2024175320A1 (en) Priority values for parametric spatial audio encoding
WO2024175319A1 (en) Combined input format spatial audio encoding
EP4627574A1 (de) Parametrische räumliche audiokodierung
WO2025078226A1 (en) Parametric spatial audio decoding with pass-through mode

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

17P Request for examination filed

Effective date: 20260220

AC Divisional application: reference to earlier application

Ref document number: 4627572

Country of ref document: EP

Kind code of ref document: P

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC ME MK MT NL NO PL PT RO RS SE SI SK SM TR