Concept for generating an enhanced sound-field description or a modified sound field description using a depth-extended dirac technique or other techniques

Info

Publication number
SG11202000287RA
Authority
SG
Singapore
Prior art keywords
sound field
field description
generating
international
meta data
Prior art date
Application number
SG11202000287RA
Inventor
Jürgen Herre
Emanuel Habets
Axel Plinge
Oliver Thiergart
Fabian Küch
Original Assignee
Fraunhofer Ges Forschung
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fraunhofer Ges Forschung
Publication of SG11202000287RA

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S7/00 Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30 Control circuits for electronic adaptation of the sound field
    • H04S7/302 Electronic adaptation of stereophonic sound system to listener position or orientation
    • H04S7/303 Tracking of listener position or orientation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/011 Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/011 Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
    • G06F3/012 Head tracking input arrangements
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S3/00 Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008 Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S7/00 Indicating arrangements; Control arrangements, e.g. balance control
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S7/00 Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30 Control circuits for electronic adaptation of the sound field
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S7/00 Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30 Control circuits for electronic adaptation of the sound field
    • H04S7/302 Electronic adaptation of stereophonic sound system to listener position or orientation
    • H04S7/303 Tracking of listener position or orientation
    • H04S7/304 For headphones
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272 Voice signal separating
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/01 Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/03 Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/11 Positioning of individual sound objects, e.g. moving airplane, within a sound field
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2420/00 Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/01 Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2420/00 Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/11 Application of ambisonics in stereophonic audio systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Quality & Reliability (AREA)
  • Stereophonic System (AREA)
  • Electrophonic Musical Instruments (AREA)

Abstract

(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT)
(19) World Intellectual Property Organization, International Bureau
(10) International Publication Number: WO 2019/012135 A1
(43) International Publication Date: 17 January 2019 (17.01.2019)
International Patent Classification: G10L 19/008 (2013.01), G06F 3/01 (2006.01), H04S 7/00 (2006.01), G10L 21/0272 (2013.01)
(21) International Application Number: PCT/EP2018/069147
(22) International Filing Date: 13 July 2018 (13.07.2018)
(25) Filing Language: English
(26) Publication Language: English
(30) Priority Data: 17181481.7, 14 July 2017 (14.07.2017), EP
(71) Applicants: FRAUNHOFER-GESELLSCHAFT ZUR FÖRDERUNG DER ANGEWANDTEN FORSCHUNG E.V. [DE/DE]; Hansastraße 27c, 80686 München (DE). FRIEDRICH-ALEXANDER-UNIVERSITAET ERLANGEN-NUERNBERG [DE/DE]; Schlossplatz 4, 91054 Erlangen (DE).
(72) Inventors: HERRE, Jürgen; Rathsberger Str. 10a, 91054 Erlangen (DE). HABETS, Emanuel; Schwedenstraße 13, 91080 Spardorf (DE).
(74) Agent: ZINKLER, Franz et al.; Schoppe, Zimmermann, Stöckeler, Zinkler, Schenk & Partner mbB, Radlkoferstr. 2, 81373 München (DE).
(81) Designated States (unless otherwise indicated, for every kind of national protection available): AE, AG, AL, AM, AO, AT, AU, AZ, BA, BB, BG, BH, BN, BR, BW, BY, BZ, CA, CH, CL, CN, CO, CR, CU, CZ, DE, DJ, DK, DM, DO, DZ, EC, EE, EG, ES, FI, GB, GD, GE, GH, GM, GT, HN, HR, HU, ID, IL, IN, IR, IS, JO, JP, KE, KG, KH, KN, KP, KR, KW, KZ, LA, LC, LK, LR, LS, LU, LY, MA, MD, ME, MG, MK, MN, MW, MX, MY, MZ, NA, NG, NI, NO, NZ, OM, PA, PE, PG, PH, PL, PT, QA, RO, RS, RU, RW, SA, SC, SD, SE, SG, SK, SL, SM, ST, SV, SY, TH, TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, VC, VN, ZA, ZM, ZW.
(84) Designated States (unless otherwise indicated, for every kind of regional protection available): ARIPO (BW, GH, GM, KE, LR, LS, MW, MZ, NA, RW, SD, SL, ST, SZ, TZ, UG, ZM, ZW), Eurasian (AM, AZ, BY, KG, KZ, RU, TJ, TM), European (AL, AT, BE, BG, CH, CY, CZ, DE, DK, EE, ES, FI, FR, GB, GR, HR, HU, IE, IS, IT, LT, LU, LV, MC, MK, MT, NL, NO, PL, PT, RO, RS, SE, SI, SK, SM, TR), OAPI (BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, KM, ML, MR, NE, SN, TD, TG).
Published: with international search report (Art. 21(3))
(54) Title: CONCEPT FOR GENERATING AN ENHANCED SOUND-FIELD DESCRIPTION OR A MODIFIED SOUND FIELD DESCRIPTION USING A DEPTH-EXTENDED DIRAC TECHNIQUE OR OTHER TECHNIQUES
[Fig. 2: block diagram (reference numerals 200, 210, 110, 120). Labels: real or virtual microphone; sound field (DirAC) analyzer/downmixer with signals w, x, y, z; downmix mono/stereo, DoA, Diff; meta data generator; depth map; combiner (output interface); enhanced sound field description. Annotation: if diff > thresh. then distance = N.A. or distance = predet. value (such as HIGH, INF., ...).]
(57) Abstract: An apparatus for generating an enhanced sound field description comprises: a sound field generator (100) for generating at least one sound field description indicating a sound field with respect to at least one reference location; and a meta data generator (110) for generating meta data relating to spatial information of the sound field, wherein the at least one sound field description and the meta data constitute the enhanced sound field description. The meta data can be a depth map associating a distance information to a direction in a full band or a subband, i.e., a time frequency bin.
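The depth-map rule annotated in Fig. 2 (a distance per direction and time-frequency bin, replaced by N.A. or a predetermined value when the diffuseness exceeds a threshold) can be illustrated with a minimal Python sketch. This is not the applicant's implementation: the names `BinMetadata`, `make_bin_metadata`, and `build_depth_map`, the field layout, the `(frame, band)` key, and the default threshold of 0.5 are all assumptions made for illustration.

```python
import math
from dataclasses import dataclass
from typing import Dict, Optional, Tuple


@dataclass
class BinMetadata:
    """Metadata for one time-frequency bin (hypothetical field names)."""
    azimuth_deg: float            # direction of arrival (DoA), azimuth
    elevation_deg: float          # direction of arrival (DoA), elevation
    diffuseness: float            # 0.0 = fully directional, 1.0 = fully diffuse
    distance_m: Optional[float]   # None stands for "N.A."


def make_bin_metadata(azimuth_deg: float, elevation_deg: float,
                      diffuseness: float, estimated_distance_m: float,
                      threshold: float = 0.5,
                      fallback: Optional[float] = None) -> BinMetadata:
    """Apply the Fig. 2 rule: a bin that is too diffuse gets no usable distance."""
    if diffuseness > threshold:
        # Assumed fallback: None ("N.A.") or a predetermined value such as math.inf.
        distance = fallback
    else:
        distance = estimated_distance_m
    return BinMetadata(azimuth_deg, elevation_deg, diffuseness, distance)


def build_depth_map(bins: Dict[Tuple[int, int], Tuple[float, float, float, float]],
                    threshold: float = 0.5,
                    fallback: Optional[float] = None
                    ) -> Dict[Tuple[int, int], BinMetadata]:
    """Map each (frame, band) key to its direction, diffuseness, and distance."""
    return {
        key: make_bin_metadata(*values, threshold=threshold, fallback=fallback)
        for key, values in bins.items()
    }


# Usage: one directional bin and one diffuse bin, with INF as the predetermined value.
depth_map = build_depth_map(
    {(0, 0): (30.0, 0.0, 0.1, 2.5), (0, 1): (-90.0, 10.0, 0.9, 4.0)},
    threshold=0.5,
    fallback=math.inf,
)
```

In this sketch the directional bin keeps its estimated distance, while the diffuse bin carries the predetermined value, so a renderer can skip distance-dependent processing for it.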
SG11202000287RA 2017-07-14 2018-07-13 Concept for generating an enhanced sound-field description or a modified sound field description using a depth-extended dirac technique or other techniques SG11202000287RA (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP17181481 2017-07-14
PCT/EP2018/069147 WO2019012135A1 (en) 2017-07-14 2018-07-13 Concept for generating an enhanced sound-field description or a modified sound field description using a depth-extended dirac technique or other techniques

Publications (1)

Publication Number Publication Date
SG11202000287RA (en) 2020-02-27

Family

ID=59631528

Family Applications (1)

Application Number Publication Number Priority Date Filing Date Title
SG11202000287RA SG11202000287RA (en) 2017-07-14 2018-07-13 Concept for generating an enhanced sound-field description or a modified sound field description using a depth-extended dirac technique or other techniques

Country Status (14)

Country Link
US (1) US11477594B2 (en)
EP (1) EP3652737A1 (en)
JP (1) JP7122793B2 (en)
KR (2) KR102448736B1 (en)
CN (1) CN111108555B (en)
AR (1) AR112556A1 (en)
AU (2) AU2018298878A1 (en)
BR (1) BR112020000779A2 (en)
CA (1) CA3069772C (en)
RU (1) RU2736274C1 (en)
SG (1) SG11202000287RA (en)
TW (1) TWI684978B (en)
WO (1) WO2019012135A1 (en)
ZA (1) ZA202000169B (en)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114125690A (en) 2017-12-18 2022-03-01 杜比国际公司 Method and system for rendering audio signals in a virtual reality environment
PL3891736T3 (en) * 2018-12-07 2023-06-26 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus, method and computer program for encoding, decoding, scene processing and other procedures related to dirac based spatial audio coding using low-order, mid-order and high-order components generators
WO2021006871A1 (en) * 2019-07-08 2021-01-14 Dts, Inc. Non-coincident audio-visual capture system
JP2023504990A (en) * 2019-10-10 2023-02-08 ディーティーエス・インコーポレイテッド Spatial audio capture by depth
CN113284504A (en) 2020-02-20 2021-08-20 北京三星通信技术研究有限公司 Attitude detection method and apparatus, electronic device, and computer-readable storage medium
US11164550B1 (en) * 2020-04-23 2021-11-02 Hisep Technology Ltd. System and method for creating and outputting music
WO2021261085A1 (en) * 2020-06-26 2021-12-30 ソニーグループ株式会社 Information processing device, information processing method, program, and recording reproduction system
US11830471B1 (en) * 2020-08-31 2023-11-28 Amazon Technologies, Inc. Surface augmented ray-based acoustic modeling
GB2602148A (en) * 2020-12-21 2022-06-22 Nokia Technologies Oy Audio rendering with spatial metadata interpolation and source position information
US11758348B1 (en) * 2021-01-07 2023-09-12 Apple Inc. Auditory origin synthesis
EP4174637A1 (en) * 2021-10-26 2023-05-03 Koninklijke Philips N.V. Bitstream representing audio in an environment
GB2614253A (en) * 2021-12-22 2023-07-05 Nokia Technologies Oy Apparatus, methods and computer programs for providing spatial audio
WO2023147864A1 (en) * 2022-02-03 2023-08-10 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method to transform an audio stream
WO2024014711A1 (en) * 2022-07-11 2024-01-18 한국전자통신연구원 Audio rendering method based on recording distance parameter and apparatus for performing same
WO2024069796A1 (en) * 2022-09-28 2024-04-04 三菱電機株式会社 Sound space construction device, sound space construction system, program, and sound space construction method

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB1512514A (en) * 1974-07-12 1978-06-01 Nat Res Dev Microphone assemblies
JPH08107600A (en) * 1994-10-04 1996-04-23 Yamaha Corp Sound image localization device
US5970152A (en) 1996-04-30 1999-10-19 Srs Labs, Inc. Audio enhancement system for use in a surround sound environment
JP2006074589A (en) 2004-09-03 2006-03-16 Matsushita Electric Ind Co Ltd Acoustic processing device
EP2346028A1 (en) * 2009-12-17 2011-07-20 Fraunhofer-Gesellschaft zur Förderung der Angewandten Forschung e.V. An apparatus and a method for converting a first parametric spatial audio signal into a second parametric spatial audio signal
EP2450880A1 (en) * 2010-11-05 2012-05-09 Thomson Licensing Data structure for Higher Order Ambisonics audio data
AU2011334851B2 (en) 2010-12-03 2015-01-22 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Sound acquisition via the extraction of geometrical information from direction of arrival estimates
EP2600637A1 (en) 2011-12-02 2013-06-05 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for microphone positioning based on a spatial power density
EP2600343A1 (en) 2011-12-02 2013-06-05 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for merging geometry - based spatial audio coding streams
BR112014017457A8 (en) 2012-01-19 2017-07-04 Koninklijke Philips Nv spatial audio transmission apparatus; space audio coding apparatus; method of generating spatial audio output signals; and spatial audio coding method
US9516446B2 (en) 2012-07-20 2016-12-06 Qualcomm Incorporated Scalable downmix design for object-based surround codec with cluster analysis by synthesis
JP6085029B2 (en) 2012-08-31 2017-02-22 ドルビー ラボラトリーズ ライセンシング コーポレイション System for rendering and playing back audio based on objects in various listening environments
US9769588B2 (en) * 2012-11-20 2017-09-19 Nokia Technologies Oy Spatial audio enhancement apparatus
US9685163B2 (en) 2013-03-01 2017-06-20 Qualcomm Incorporated Transforming spherical harmonic coefficients
US9466305B2 (en) 2013-05-29 2016-10-11 Qualcomm Incorporated Performing positional analysis to code spherical harmonic coefficients
US9384741B2 (en) 2013-05-29 2016-07-05 Qualcomm Incorporated Binauralization of rotated higher order ambisonics
US9769586B2 (en) 2013-05-29 2017-09-19 Qualcomm Incorporated Performing order reduction with respect to higher order ambisonic coefficients
US20150127354A1 (en) 2013-10-03 2015-05-07 Qualcomm Incorporated Near field compensation for decomposed representations of a sound field
JP6586885B2 (en) * 2014-01-16 2019-10-09 ソニー株式会社 Audio processing apparatus and method, and program
EP3007167A1 (en) 2014-10-10 2016-04-13 Thomson Licensing Method and apparatus for low bit rate compression of a Higher Order Ambisonics HOA signal representation of a sound field
CN105635635A (en) * 2014-11-19 2016-06-01 杜比实验室特许公司 Adjustment for space consistency in video conference system
US10524075B2 (en) 2015-12-10 2019-12-31 Sony Corporation Sound processing apparatus, method, and program
US10659906B2 (en) 2017-01-13 2020-05-19 Qualcomm Incorporated Audio parallax for virtual reality, augmented reality, and mixed reality

Also Published As

Publication number Publication date
TW201919043A (en) 2019-05-16
US11477594B2 (en) 2022-10-18
RU2736274C1 (en) 2020-11-13
TWI684978B (en) 2020-02-11
AU2021218089A1 (en) 2021-09-09
US20200154229A1 (en) 2020-05-14
BR112020000779A2 (en) 2020-07-14
ZA202000169B (en) 2022-07-27
KR102568365B1 (en) 2023-08-18
KR102448736B1 (en) 2022-09-30
CN111108555B (en) 2023-12-15
CN111108555A (en) 2020-05-05
WO2019012135A1 (en) 2019-01-17
CA3069772C (en) 2024-01-02
JP7122793B2 (en) 2022-08-22
KR20220044393A (en) 2022-04-07
JP2020527887A (en) 2020-09-10
KR20200041307A (en) 2020-04-21
AU2018298878A1 (en) 2020-01-30
AR112556A1 (en) 2019-11-13
EP3652737A1 (en) 2020-05-20
CA3069772A1 (en) 2019-01-17

Similar Documents

Publication Publication Date Title
SG11202000287RA (en) Concept for generating an enhanced sound-field description or a modified sound field description using a depth-extended dirac technique or other techniques
SG11202000330XA (en) Concept for generating an enhanced sound field description or a modified sound field description using a multi-point sound field description
SG11202000285QA (en) Concept for generating an enhanced sound-field description or a modified sound field description using a multi-layer description
SG11201910019PA (en) Microorganisms and methods for producing cannabinoids and cannabinoid derivatives
SG11201806650VA (en) Systems and methods for providing a personal distributed ledger
SG11201903787YA (en) Exploiting input data sparsity in neural network compute units
SG11201908451UA (en) Method, apparatus, and system for blockchain consensus
SG11201808968WA (en) Time division multiplexing of synchronization channels
SG11201907753TA (en) Bispecific binding molecules that are capable of binding cd137 and tumor antigens, and uses thereof
SG11201807401RA (en) Therapeutic membrane vesicles
SG11201809123UA (en) Informing base station regarding user equipment's reception of beam change instruction
SG11201909160WA (en) Multispecific polypeptide constructs having constrained cd3 binding and methods of using the same
SG11201809617RA (en) Intracellular delivery of biomolecules to induce tolerance
SG11201901210UA (en) Ferroelectric memory cells
SG11201804771WA (en) Systems and methods for providing financial data to financial instruments in a distributed ledger system
SG11201803050PA (en) Electronic device generating notification based on context data in response to speech phrase from user
SG11201808528VA (en) Modified wound dressings
SG11201811095UA (en) Multi-level storage in ferroelectric memory
SG11201906468TA (en) Chimeric antigen receptors against axl or ror2 and methods of use thereof
SG11201901783UA (en) A device to manipulate ions of same or different polarities
SG11201903958SA (en) Intuitive occluded object indicator
SG11201806823YA (en) Extended synchronization signal for symbol index detection
SG11201804892PA (en) Synthesis of signals for immersive audio playback
SG11201906413XA (en) Exposure apparatus
SG11201901211XA (en) Apparatuses and methods including ferroelectric memory and for accessing ferroelectric memory