SG11201907670UA - Inter-channel bandwidth extension - Google Patents

Inter-channel bandwidth extension

Info

Publication number
SG11201907670UA
Authority
SG
Singapore
Prior art keywords
channel
bitstream
band
band mid
international
Prior art date
Application number
Inventor
Venkata Subrahmanyam Chandra Sekhar Chebiyyam
Venkatraman Atti
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc
Publication of SG11201907670UA

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06 Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/083 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being an excitation gain
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00 Circuits for transducers, loudspeakers or microphones
    • H04R3/005 Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00 Circuits for transducers, loudspeakers or microphones
    • H04R3/12 Circuits for transducers, loudspeakers or microphones for distributing signals to two or more loudspeakers
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S1/00 Two-channel systems
    • H04S1/007 Two-channel systems in which the audio signals are in digital form
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038 Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/03 Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/13 Aspects of volume control, not necessarily automatic, in stereophonic sound systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/15 Aspects of sound capture and related signal processing for recording or reproduction
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S3/00 Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008 Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Otolaryngology (AREA)
  • General Health & Medical Sciences (AREA)
  • Mathematical Physics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Quality & Reliability (AREA)
  • Stereophonic System (AREA)
  • Display Devices Of Pinball Game Machines (AREA)

Abstract

(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT)
(19) World Intellectual Property Organization, International Bureau
(43) International Publication Date: 11 October 2018 (11.10.2018)
(10) International Publication Number: WO 2018/187082 A1
(51) International Patent Classification: G10L 19/008 (2013.01), G10L 21/038 (2013.01)
(21) International Application Number: PCT/US2018/024500
(22) International Filing Date: 27 March 2018 (27.03.2018)
(25) Filing Language: English
(26) Publication Language: English
(30) Priority Data: 62/482,150, 05 April 2017 (05.04.2017), US; 15/935,952, 26 March 2018 (26.03.2018), US
(71) Applicant: QUALCOMM INCORPORATED [US/US]; ATTN: International IP Administration, 5775 Morehouse Drive, San Diego, California 92121-1714 (US).
(72) Inventors: CHEBIYYAM, Venkata Subrahmanyam Chandra Sekhar; 590 Mill Creek Lane, #205, Santa Clara, California 95054 (US). ATTI, Venkatraman; 5775 Morehouse Drive, San Diego, California 92121-1714 (US).
(74) Agent: TOLER, Jeffrey G. et al.; Toler Law Group, PC, 8500 Bluffstone Cove, Suite A201, Austin, Texas 78759 (US).
(81) Designated States (unless otherwise indicated, for every kind of national protection available): AE, AG, AL, AM, AO, AT, AU, AZ, BA, BB, BG, BH, BN, BR, BW, BY, BZ, CA, CH, CL, CN, CO, CR, CU, CZ, DE, DJ, DK, DM, DO, DZ, EC, EE, EG, ES, FI, GB, GD, GE, GH, GM, GT, HN, HR, HU, ID, IL, IN, IR, IS, JO, JP, KE, KG, KH, KN, KP, KR, KW, KZ, LA, LC, LK, LR, LS, LU, LY, MA, MD, ME, MG, MK, MN, MW, MX, MY, MZ, NA, NG, NI, NO, NZ, OM, PA, PE, PG, PH, PL, PT, QA, RO, RS, RU, RW, SA, SC, SD, SE, SG, SK, SL, SM, ST, SV, SY, TH, TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, VC, VN, ZA, ZM, ZW.
(84) Designated States (unless otherwise indicated, for every kind of regional protection available): ARIPO (BW, GH, GM, KE, LR, LS, MW, MZ, NA, RW, SD, SL, ST, SZ, TZ, UG, ZM, ZW), Eurasian (AM, AZ, BY, KG, KZ, RU, TJ, TM), European (AL, AT, BE, BG, CH, CY, CZ, DE, DK, EE, ES, FI, FR, GB, GR, HR, HU, IE, IS, IT, LT, LU, LV, MC, MK, MT, NL, NO, PL, PT, RO, RS, SE, SI, SK, SM, TR), OAPI (BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, KM, ML, MR, NE, SN, TD, TG).
Published: with international search report (Art. 21(3))
(54) Title: INTER-CHANNEL BANDWIDTH EXTENSION
[FIG. 1: System 100. Labels: First Device 104, Memory 153, Analysis Data, Final Shift Value 116, Transmitter 110, Input Interface(s) 112, Encoder 114, First Audio Channel 130, First Microphone 146, Second Audio Channel 132, Second Microphone 148, Sound Source 152, Second Device 106, Decoder 118, ICBWE Spatial Gain Mapping Parameter Generator, First Output Channel 126, First Loudspeaker 142, Second Output Channel 128, Second Loudspeaker 144, Stereo Downmix/Upmix Parameter Bitstream 290, Low-Band Mid Channel Bitstream 292, High-Band Mid Channel BWE Bitstream 294]
(57) Abstract: A method includes decoding a low-band mid channel bitstream to generate a low-band mid signal and a low-band mid excitation signal. The method further includes decoding a high-band mid channel bandwidth extension bitstream to generate a synthesized high-band mid signal. The method also includes determining an inter-channel bandwidth extension (ICBWE) gain mapping parameter corresponding to the synthesized high-band mid signal. The ICBWE gain mapping parameter is based on a selected frequency-domain gain parameter that is extracted from a stereo downmix/upmix parameter bitstream. The method further includes performing a gain scaling operation on the synthesized high-band mid signal based on the ICBWE gain mapping parameter to generate a reference high-band channel and a target high-band channel. The method includes outputting a first audio channel and a second audio channel. The first audio channel is based on the reference high-band channel, and the second audio channel is based on the target high-band channel.
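The gain steps named in the abstract (select a frequency-domain gain parameter, derive an ICBWE gain mapping parameter, gain-scale the synthesized high-band mid signal into reference and target channels) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the function names, the choice of the last band's gain as the "selected" parameter, the mapping `1 - side_gain`, and the energy-preserving reference gain `sqrt(2 - gs^2)`.

```python
import math

def select_frequency_domain_gain(side_gains, band_index=-1):
    """Pick one per-band gain parameter from the stereo downmix/upmix parameters.

    Assumption: the last band is the one that overlaps the high band.
    """
    return side_gains[band_index]

def icbwe_gain_mapping(side_gain):
    """Map the selected gain to a gain mapping parameter.

    Hypothetical mapping (1 - side_gain), clamped to [0, sqrt(2)] so the
    reference gain computed below stays real-valued.
    """
    return min(max(1.0 - side_gain, 0.0), math.sqrt(2.0))

def gain_scale(synth_high_band_mid, gs):
    """Scale the synthesized high-band mid signal into two channels.

    Assumption: the target channel uses gs directly and the reference channel
    uses sqrt(2 - gs^2), so the summed energy of both channels is twice the
    energy of the mid signal.
    """
    ref_gain = math.sqrt(max(0.0, 2.0 - gs * gs))
    reference = [ref_gain * x for x in synth_high_band_mid]
    target = [gs * x for x in synth_high_band_mid]
    return reference, target

# Usage with made-up values standing in for decoded data.
side_gains = [0.1, 0.2, 0.4]            # hypothetical per-band gains
synth_hb_mid = [1.0, -0.5, 0.25, 0.0]   # hypothetical synthesized samples
gs = icbwe_gain_mapping(select_frequency_domain_gain(side_gains))
reference_hb, target_hb = gain_scale(synth_hb_mid, gs)
```

In this sketch the reference and target high-band channels would then be combined with their low-band counterparts to form the first and second output audio channels; that step is omitted because the abstract does not detail it.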
SG11201907670U, priority date 2017-04-05, filing date 2018-03-27: Inter-channel bandwidth extension, SG11201907670UA (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201762482150P 2017-04-05 2017-04-05
US15/935,952 US10573326B2 (en) 2017-04-05 2018-03-26 Inter-channel bandwidth extension
PCT/US2018/024500 WO2018187082A1 (en) 2017-04-05 2018-03-27 Inter-channel bandwidth extension

Publications (1)

Publication Number Publication Date
SG11201907670UA (en) 2019-10-30

Family

ID=63711139

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11201907670U SG11201907670UA (en) 2017-04-05 2018-03-27 Inter-channel bandwidth extension

Country Status (8)

Country Link
US (1) US10573326B2 (en)
EP (1) EP3607549B1 (en)
KR (1) KR102208602B1 (en)
CN (1) CN110447072B (en)
BR (1) BR112019020643A2 (en)
SG (1) SG11201907670UA (en)
TW (1) TWI724290B (en)
WO (1) WO2018187082A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10535357B2 (en) * 2017-10-05 2020-01-14 Qualcomm Incorporated Encoding or decoding of audio signals
WO2020216459A1 (en) * 2019-04-23 2020-10-29 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus, method or computer program for generating an output downmix representation

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2252170A1 (en) * 1998-10-27 2000-04-27 Bruno Bessette A method and device for high quality coding of wideband speech and audio signals
CA2327041A1 (en) * 2000-11-22 2002-05-22 Voiceage Corporation A method for indexing pulse positions and signs in algebraic codebooks for efficient coding of wideband signals
US8605911B2 (en) * 2001-07-10 2013-12-10 Dolby International Ab Efficient and scalable parametric stereo coding for low bitrate audio coding applications
US8260609B2 (en) * 2006-07-31 2012-09-04 Qualcomm Incorporated Systems, methods, and apparatus for wideband encoding and decoding of inactive frames
US8015368B2 (en) * 2007-04-20 2011-09-06 Siport, Inc. Processor extensions for accelerating spectral band replication
ATE500588T1 (en) * 2008-01-04 2011-03-15 Dolby Sweden Ab AUDIO ENCODERS AND DECODERS
US8060042B2 (en) * 2008-05-23 2011-11-15 Lg Electronics Inc. Method and an apparatus for processing an audio signal
US8355921B2 (en) * 2008-06-13 2013-01-15 Nokia Corporation Method, apparatus and computer program product for providing improved audio processing
ES2592416T3 (en) * 2008-07-17 2016-11-30 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio coding / decoding scheme that has a switchable bypass
CA3162807C (en) * 2009-01-16 2024-04-23 Dolby International Ab Cross product enhanced harmonic transposition
CA3076203C (en) 2009-01-28 2021-03-16 Dolby International Ab Improved harmonic transposition
US9070361B2 (en) * 2011-06-10 2015-06-30 Google Technology Holdings LLC Method and apparatus for encoding a wideband speech signal utilizing downmixing of a highband component
WO2014005327A1 (en) * 2012-07-06 2014-01-09 深圳广晟信源技术有限公司 Method for encoding multichannel digital audio
EP2830051A3 (en) * 2013-07-22 2015-03-04 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder, audio decoder, methods and computer program using jointly encoded residual signals
US9984699B2 (en) * 2014-06-26 2018-05-29 Qualcomm Incorporated High-band signal coding using mismatched frequency ranges
EP3067887A1 (en) * 2015-03-09 2016-09-14 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal
TW202242853A (en) * 2015-03-13 2022-11-01 瑞典商杜比國際公司 Decoding audio bitstreams with enhanced spectral band replication metadata in at least one fill element
US10109284B2 (en) * 2016-02-12 2018-10-23 Qualcomm Incorporated Inter-channel encoding and decoding of multiple high-band audio signals
US10157621B2 (en) * 2016-03-18 2018-12-18 Qualcomm Incorporated Audio signal decoding
US10249307B2 (en) 2016-06-27 2019-04-02 Qualcomm Incorporated Audio decoding using intermediate sampling rate
US10431231B2 (en) * 2017-06-29 2019-10-01 Qualcomm Incorporated High-band residual prediction with time-domain inter-channel bandwidth extension

Also Published As

Publication number Publication date
CN110447072A (en) 2019-11-12
BR112019020643A2 (en) 2020-04-28
EP3607549A1 (en) 2020-02-12
KR102208602B1 (en) 2021-01-27
US20180293992A1 (en) 2018-10-11
TW201903754A (en) 2019-01-16
EP3607549B1 (en) 2022-09-28
WO2018187082A1 (en) 2018-10-11
KR20190134641A (en) 2019-12-04
US10573326B2 (en) 2020-02-25
CN110447072B (en) 2020-11-06
TWI724290B (en) 2021-04-11

Similar Documents

Publication Publication Date Title
SG11201908390UA (en) Non-harmonic speech detection and bandwidth extension in a multi-source environment
SG11201810874YA (en) Virtual, augmented, and mixed reality
SG11201808968WA (en) Time division multiplexing of synchronization channels
SG11201907090WA (en) Affine motion information derivation
SG11201909348QA (en) Stereo parameters for stereo decoding
SG11201803050PA (en) Electronic device generating notification based on context data in response to speech phrase from user
SG11201807334SA (en) Methods, compositions, and devices for information storage
KR101424752B1 (en) An Apparatus for Determining a Spatial Output Multi-Channel Audio Signal
SG11201804892PA (en) Synthesis of signals for immersive audio playback
SG11201903738QA (en) Offshore gnss reference station apparatus, offshore gnss positioning system, and method of generating positioning reference data offshore
SG11201903130WA (en) Sequence to sequence transformations for speech synthesis via recurrent neural networks
SG11201907531XA (en) Constraining motion vector information derived by decoder-side motion vector derivation
SG11202000287RA (en) Concept for generating an enhanced sound-field description or a modified sound field description using a depth-extended dirac technique or other techniques
SG11201909450WA (en) 360-degree video coding using face continuities
SG11201807369UA (en) System and method for beam adjustment request
SG11201907753TA (en) Bispecific binding molecules that are capable of binding cd137 and tumor antigens, and uses thereof
SG11201811674WA (en) Systems and methods for path determination
SG11201901632QA (en) Motion vector prediction for affine motion models in video coding
SG11201806739UA (en) Determining prediction parameters for non-square blocks in video coding
SG11201907116UA (en) Target sample generation
SG11201908549RA (en) Automated quality control and spectral error correction for sample analysis instruments
SG11201806955QA (en) Apparatus and method for stereo filling in multichannel coding
SG11201906584WA (en) Inter-channel bandwidth extension spectral mapping and adjustment
SG11201906484YA (en) Extracting mcts sub-bitstreams for video coding
CN101911733A (en) The method and apparatus that is used for audio signal