SG11201806256SA - Apparatus and method for mdct m/s stereo with global ild with improved mid/side decision - Google Patents

Apparatus and method for mdct m/s stereo with global ild with improved mid/side decision

Info

Publication number
SG11201806256SA
SG11201806256SA SG11201806256SA SG11201806256SA SG11201806256SA SG 11201806256S A SG11201806256S A SG 11201806256SA SG 11201806256S A SG11201806256S A SG 11201806256SA SG 11201806256S A SG11201806256S A SG 11201806256SA SG 11201806256S A SG11201806256S A SG 11201806256SA
Authority
SG
Singapore
Prior art keywords
channel
audio signal
signal
normalized
spectral band
Prior art date
Application number
SG11201806256SA
Inventor
Emmanuel Ravelli
Markus Schnell
Stefan Döhla
Wolfgang Jägers
Martin Dietz
Christian Helmrich
Goran Markovic
Eleni Fotopoulou
Markus Multrus
Stefan Bayer
Guillaume Fuchs
Jürgen Herre
Original Assignee
Fraunhofer Ges Forschung
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fraunhofer Ges Forschung filed Critical Fraunhofer Ges Forschung
Publication of SG11201806256SA publication Critical patent/SG11201806256SA/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/03Spectral prediction for preventing pre-echo; Temporary noise shaping [TNS], e.g. in MPEG2 or MPEG4
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/032Quantisation or dequantisation of spectral components
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0212Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/22Mode decision, i.e. based on audio signal content versus external parameters

Abstract

120 first channel and second channel of audio input signal 11. encoding unit normalizer normalized audio signal encoded audio signal Fig. - 11 W O 20 17 / 1255 44 Al (12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) (19) World Intellectual Property Organization International Bureau (10) International Publication Number (43) International Publication Date WO 2017/125544 Al 27 July 2017 (27.07.2017) WIPO I PCT 111111111111110111011111111111010111110111011101111111111111111111111111111111110111111 (51) International Patent Classification: GlOL 19/008 (2013.01) G1 0L 19/02 (2013.01) G1 0L 19/22 (2013.01) (21) International Application Number: PCT/EP2017/051177 (22) International Filing Date: 20 January 2017 (20.01.2017) (25) Filing Language: English (26) Publication Language: English (30) Priority Data: 16152457.4 22 January 2016 (22.01.2016) EP 16152454.1 22 January 2016 (22.01.2016) EP 16199895.0 21 November 2016 (21.11.2016) EP (71) Applicants: FRAUNHOFER-GESELLSCHAFT ZUR FORDERUNG DER ANGEWANDTEN FORSCHUNG E.V. [DE/DE]; HansastraBe 27c, 80686 Miinchen (DE). FRIEDRICH-ALEXANDER-UNIVERSITAET ER- LANGEN-NUERNBERG [DE/DE] ; Schlossplatz 4, 91054 Erlangen (DE). (72) Inventors: RAVELLI, Emmanuel; Donato-Polli-Str. 58, 91056 Erlangen (DE). SCHNELL, Markus; Labenwolfstr. 15, 90409 Niimberg (DE). DOHLA, Stefan; Saidelsteig 61, 91058 Erlangen (DE). JAGERS, Wolfgang; Kul- mbacher StraBe 47, 91056 Erlangen (DE). DIETZ, Mar- tin; DeutschherrnstraBe 37, 90429 Niimberg (DE). HELMRICH, Christian; Fraunhoferstr. 21, 10587 Berlin (DE). MARKOVIC, Goran; Aachener StraBe 19, 90425 Niimberg (DE). FOTOPOULOU, Eleni; Berckhauserstr. 33, 90409 Niimberg (DE). MULTRUS, Markus; Etzlaub- weg 7, 90469 Niirnberg (DE). BAYER, Stefan; Dortmun- der Strasse 14, 90425 Niimberg (DE). FUCHS, Guil- laume; Joseph-Otto-Kolb-Str. 31, 91088 Bubenreuth (DE). HERRE, Jurgen; Rathsberger Str. 10a, 91054 Erlangen (DE). (74) Agents: SCHAIRER, Oliver et al.; Schoppe, Zimmer- maim, Stockeler, Zinkler, Schenk & Partner mbB, Radlkoferstr.2, 81373 Miinchen (DE). (81) Designated States (unless otherwise indicated, for every kind of national protection available): AE, AG, AL, AM, AO, AT, AU, AZ, BA, BB, BG, BH, BN, BR, BW, BY, BZ, CA, CH, CL, CN, CO, CR, CU, CZ, DE, DJ, DK, DM, DO, DZ, EC, EE, EG, ES, FI, GB, GD, GE, GH, GM, GT, HN, HR, HU, ID, IL, IN, IR, IS, JP, KE, KG, KH, KN, KP, KR, KW, KZ, LA, LC, LK, LR, LS, LU, LY, MA, MD, ME, MG, MK, MN, MW, MX, MY, MZ, NA, NG, NI, NO, NZ, OM, PA, PE, PG, PH, PL, PT, QA, RO, RS, RU, RW, SA, SC, SD, SE, SG, SK, SL, SM, ST, SV, SY, TH, TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, VC, VN, ZA, ZM, ZW. (84) Designated States (unless otherwise indicated, for every kind of regional protection available): ARIPO (BW, GH, GM, KE, LR, LS, MW, MZ, NA, RW, SD, SL, ST, SZ, TZ, UG, ZM, ZW), Eurasian (AM, AZ, BY, KG, KZ, RU, TJ, TM), European (AL, AT, BE, BG, CH, CY, CZ, DE, DK, EE, ES, FI, FR, GB, GR, HR, HU, IE, IS, IT, LT, LU, LV, MC, MK, MT, NL, NO, PL, PT, RO, RS, SE, SI, SK, SM, TR), OAPI (BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, KM, ML, MR, NE, SN, TD, TG). [Continued on next page] (54) Title: APPARATUS AND METHOD FOR MDCT M/S STEREO WITH GLOBAL ILD WITH IMPROVED MID/SIDE DE- CISION (57) : Fig. illustrates an apparatus for encoding a first channel and a second channel of an audio input signal comprising two or more channels to obtain an encoded audio signal according to an embodiment. The apparatus comprises a normalizer (110) configured to determine a normalization value for the audio input signal depending on the first channel of the audio input signal and depending on the second channel of the audio input signal, wherein the normalizer (110) is configured to determine a first channel and a second channel of a normalized audio signal by modifying, depending on the normalization value, at least one of the first channel and the second channel of the audio input signal. Moreover, the apparatus comprises an encoding unit (120) being con- figured to generate a processed audio signal having a first channel and a second channel, such that one or more spectral bands of the first channel of the processed audio signal are one or more spectral bands of the first channel of the normalized audio signal, such that one or more spectral bands of the second channel of the processed audio signal are one or more spectral bands of the second channel of the normalized audio signal, such that at least one spectral band of the first channel of the processed audio signal is a spectral band of a mid signal depending on a spectral band of the first channel of the normalized audio signal and depending on a spectral band of the second channel of the normalized audio signal, and such that at least one spectral band of the second channel of the processed audio signal is a spectral band of a side signal depending on a spectral band of the first channel of the normalized au - dio signal and depending on a spectral band of the second channel of the normalized audio signal. The encoding unit (120) is con- figured to encode the processed audio signal to obtain the encoded audio signal. WO 2017/125544 Al 1#11101M011010112010EMOMOHOVEM10011111111111110110111111 Published: — before the expiration of the time limit for amending the — with international search report (Art. 21(3)) claims and to be republished in the event of receipt of amendments (Rule 48.2(h))
SG11201806256SA 2016-01-22 2017-01-20 Apparatus and method for mdct m/s stereo with global ild with improved mid/side decision SG11201806256SA (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
EP16152457 2016-01-22
EP16152454 2016-01-22
EP16199895 2016-11-21
PCT/EP2017/051177 WO2017125544A1 (en) 2016-01-22 2017-01-20 Apparatus and method for mdct m/s stereo with global ild with improved mid/side decision

Publications (1)

Publication Number Publication Date
SG11201806256SA true SG11201806256SA (en) 2018-08-30

Family

ID=57860879

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11201806256SA SG11201806256SA (en) 2016-01-22 2017-01-20 Apparatus and method for mdct m/s stereo with global ild with improved mid/side decision

Country Status (18)

Country Link
US (2) US11842742B2 (en)
EP (2) EP3405950B1 (en)
JP (3) JP6864378B2 (en)
KR (1) KR102230668B1 (en)
CN (2) CN109074812B (en)
AU (1) AU2017208561B2 (en)
BR (1) BR112018014813A2 (en)
CA (1) CA3011883C (en)
ES (1) ES2932053T3 (en)
FI (1) FI3405950T3 (en)
MX (1) MX2018008886A (en)
MY (1) MY188905A (en)
PL (1) PL3405950T3 (en)
RU (1) RU2713613C1 (en)
SG (1) SG11201806256SA (en)
TW (1) TWI669704B (en)
WO (1) WO2017125544A1 (en)
ZA (1) ZA201804866B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10734001B2 (en) * 2017-10-05 2020-08-04 Qualcomm Incorporated Encoding or decoding of audio signals
CN110556116B (en) 2018-05-31 2021-10-22 华为技术有限公司 Method and apparatus for calculating downmix signal and residual signal
CN115132214A (en) 2018-06-29 2022-09-30 华为技术有限公司 Coding method, decoding method, coding device and decoding device for stereo signal
WO2020007719A1 (en) * 2018-07-04 2020-01-09 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Multisignal audio coding using signal whitening as preprocessing
JP7130878B2 (en) 2019-01-13 2022-09-05 華為技術有限公司 High resolution audio coding
DE102020210917B4 (en) 2019-08-30 2023-10-19 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung eingetragener Verein Improved M/S stereo encoder and decoder
WO2023153228A1 (en) * 2022-02-08 2023-08-17 パナソニック インテレクチュアル プロパティ コーポレーション オブ アメリカ Encoding device and encoding method

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3435674B2 (en) * 1994-05-06 2003-08-11 日本電信電話株式会社 Signal encoding and decoding methods, and encoder and decoder using the same
DE19628293C1 (en) * 1996-07-12 1997-12-11 Fraunhofer Ges Forschung Encoding and decoding audio signals using intensity stereo and prediction
US6370502B1 (en) * 1999-05-27 2002-04-09 America Online, Inc. Method and system for reduction of quantization-induced block-discontinuities and general purpose audio codec
DE19959156C2 (en) * 1999-12-08 2002-01-31 Fraunhofer Ges Forschung Method and device for processing a stereo audio signal to be encoded
ATE378677T1 (en) * 2004-03-12 2007-11-15 Nokia Corp SYNTHESIS OF A MONO AUDIO SIGNAL FROM A MULTI-CHANNEL AUDIO SIGNAL
US8041042B2 (en) * 2006-11-30 2011-10-18 Nokia Corporation Method, system, apparatus and computer program product for stereo coding
US8706480B2 (en) 2007-06-11 2014-04-22 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio encoder for encoding an audio signal having an impulse-like portion and stationary portion, encoding methods, decoder, decoding method, and encoding audio signal
BRPI0906079B1 (en) * 2008-03-04 2020-12-29 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. mixing input data streams and generating an output data stream from them
EP2144231A1 (en) * 2008-07-11 2010-01-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Low bitrate audio encoding/decoding scheme with common preprocessing
RU2520329C2 (en) * 2009-03-17 2014-06-20 Долби Интернешнл Аб Advanced stereo coding based on combination of adaptively selectable left/right or mid/side stereo coding and parametric stereo coding
DE102010014599A1 (en) 2010-04-09 2010-11-18 Continental Automotive Gmbh Air-flow meter for measuring mass flow rate of fluid in air intake manifold of e.g. diesel engine, has transfer element transferring signals processed by linearization element, filter element and conversion element
EP2375409A1 (en) * 2010-04-09 2011-10-12 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder, audio decoder and related methods for processing multi-channel audio signals using complex prediction
CA3076786C (en) 2010-04-09 2021-04-13 Dolby International Ab Mdct-based complex prediction stereo coding
AU2012217156B2 (en) * 2011-02-14 2015-03-19 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Linear prediction based coding scheme using spectral domain noise shaping
CN105225669B (en) * 2011-03-04 2018-12-21 瑞典爱立信有限公司 Rear quantization gain calibration in audio coding
US8654984B2 (en) * 2011-04-26 2014-02-18 Skype Processing stereophonic audio signals
CN104050969A (en) 2013-03-14 2014-09-17 杜比实验室特许公司 Space comfortable noise
EP2830059A1 (en) * 2013-07-22 2015-01-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Noise filling energy adjustment
CN110970041B (en) * 2014-07-01 2023-10-20 韩国电子通信研究院 Method and apparatus for processing multi-channel audio signal
US10152977B2 (en) * 2015-11-20 2018-12-11 Qualcomm Incorporated Encoding of multiple audio signals
US10115403B2 (en) * 2015-12-18 2018-10-30 Qualcomm Incorporated Encoding of multiple audio signals

Also Published As

Publication number Publication date
CN117542365A (en) 2024-02-09
EP4123645A1 (en) 2023-01-25
TWI669704B (en) 2019-08-21
JP6864378B2 (en) 2021-04-28
MX2018008886A (en) 2018-11-09
JP2019506633A (en) 2019-03-07
CN109074812B (en) 2023-11-17
US20240071395A1 (en) 2024-02-29
MY188905A (en) 2022-01-13
US20180330740A1 (en) 2018-11-15
TW201732780A (en) 2017-09-16
KR102230668B1 (en) 2021-03-22
KR20180103102A (en) 2018-09-18
RU2713613C1 (en) 2020-02-05
AU2017208561B2 (en) 2020-04-16
JP2023109851A (en) 2023-08-08
AU2017208561A1 (en) 2018-08-09
JP2021119383A (en) 2021-08-12
US11842742B2 (en) 2023-12-12
JP7280306B2 (en) 2023-05-23
EP3405950B1 (en) 2022-09-28
ES2932053T3 (en) 2023-01-09
CN109074812A (en) 2018-12-21
CA3011883C (en) 2020-10-27
ZA201804866B (en) 2019-04-24
FI3405950T3 (en) 2022-12-15
CA3011883A1 (en) 2017-07-27
BR112018014813A2 (en) 2018-12-18
EP3405950A1 (en) 2018-11-28
WO2017125544A1 (en) 2017-07-27
PL3405950T3 (en) 2023-01-30

Similar Documents

Publication Publication Date Title
SG11201806256SA (en) Apparatus and method for mdct m/s stereo with global ild with improved mid/side decision
SG11201806955QA (en) Apparatus and method for stereo filling in multichannel coding
SG11201900399WA (en) Method and apparatus for performing access barring check
SG11201902667UA (en) Methods and systems for chromatography data analysis
SG11201806216YA (en) Apparatus and method for encoding or decoding a multi-channel signal using a broadband alignment parameter and a plurality of narrowband alignment parameters
SG11201908787WA (en) Novel bispecific antigen binding molecules capable of specific binding to cd40 and to fap
SG11201901506VA (en) Distributed systems for the efficient production and use of microbe-based compositions
SG11201903857UA (en) Antibodies to pd-1 and uses thereof
SG11201806738PA (en) Using luma information for chroma prediction with separate luma-chroma framework in video coding
SG11201909955XA (en) Formulations of anti-lag3 antibodies and co-formulations of anti-lag3 antibodies and anti-pd-1 antibodies
SG11201807573VA (en) Methods for providing single-stranded rna
SG11201804361YA (en) Method for managing a trusted identity
CA3015496A1 (en) Voice control of a media playback system
SG11201806737RA (en) Decoding video data using a two-level multi-type-tree framework
SG11201900109SA (en) Intra video coding using a decoupled tree structure
SG11201804190YA (en) Method and system for blockchain variant using digital signatures
SG11201901168UA (en) Apparatuses and methods including ferroelectric memory and for operating ferroelectric memory
SG11201807827VA (en) Process for the preparation of pegylated drug-linkers and intermediates thereof
SG11201806853VA (en) Abstracted graphs from social relationship graph
SG11201901645SA (en) Apparatus and method for encoding an audio signal using a compensation value
SG11201808640VA (en) Systems and methods for quick start-up of playback
SG11201808929PA (en) Systems and methods for secure storage of user information in a user profile
SG11201804892PA (en) Synthesis of signals for immersive audio playback
SG11201809311SA (en) Radial-path filter elements, systems and methods of using same
SG11201906367PA (en) Audio decoder, audio encoder, method for providing a decoded audio signal, method for providing an encoded audio signal, audio stream, audio stream provider and computer program using a stream identifier