SG11202000287RA - Concept for generating an enhanced sound-field description or a modified sound field description using a depth-extended dirac technique or other techniques - Google Patents
Concept for generating an enhanced sound-field description or a modified sound field description using a depth-extended dirac technique or other techniquesInfo
- Publication number
- SG11202000287RA SG11202000287RA SG11202000287RA SG11202000287RA SG11202000287RA SG 11202000287R A SG11202000287R A SG 11202000287RA SG 11202000287R A SG11202000287R A SG 11202000287RA SG 11202000287R A SG11202000287R A SG 11202000287RA SG 11202000287R A SG11202000287R A SG 11202000287RA
- Authority
- SG
- Singapore
- Prior art keywords
- sound field
- field description
- generating
- international
- meta data
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/302—Electronic adaptation of stereophonic sound system to listener position or orientation
- H04S7/303—Tracking of listener position or orientation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/011—Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/011—Arrangements for interaction with the human body, e.g. for user immersion in virtual reality
- G06F3/012—Head tracking input arrangements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S3/00—Systems employing more than two channels, e.g. quadraphonic
- H04S3/008—Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/302—Electronic adaptation of stereophonic sound system to listener position or orientation
- H04S7/303—Tracking of listener position or orientation
- H04S7/304—For headphones
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/01—Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/03—Aspects of down-mixing multi-channel audio to configurations with lower numbers of playback channels, e.g. 7.1 -> 5.1
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/11—Positioning of individual sound objects, e.g. moving airplane, within a sound field
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/01—Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/11—Application of ambisonics in stereophonic audio systems
Abstract
International Patent Classification: G10L 19/008 (2013.01) G06F 3/01 (2006.01) H04S 7/00 (2006.01) GlOL 21/0272 (2013.01) (21) International Application Number: PCT/EP2018/069147 (22) International Filing Date: (25) Filing Language: (26) Publication Language: (30) Priority Data: 17181481.7 14 July 2017 (14.07.2017) EP (71) Applicants: FRAUNHOFER-GESELLSCHAFT ZUR FORDERUNG DER ANGEWANDTEN FORSCHUNG E.V. [DE/DE]; HansastraBe 27c, 80686 Miinchen (DE). FRIEDRICH-ALEXANDER-UNIVERSITAET 13 July 2018 (13.07.2018) English English ERLANGEN-NUERNBERG [DE/DE]; Schlossplatz 4, 91054 Erlangen (DE). (72) Inventors: HERRE, Jurgen; Rathsberger Str. 10a, 91054 Erlangen (DE). HABETS, Emanuel; SchwedenstraBe 13, 91080 Spardorf (DE). (74) Agent: ZINKLER, Franz et al.; Schoppe, Zimmermann, Stockeler, Zinkler, Schenk & Partner mbB, Radlkoferstr. 2, 81373 Miinchen (DE). (81) Designated States (unless otherwise indicated, for every kind of national protection available): AE, AG, AL, AM, AO, AT, AU, AZ, BA, BB, BG, BH, BN, BR, BW, BY, BZ, CA, CH, CL, CN, CO, CR, CU, CZ, DE, DJ, DK, DM, DO, DZ, EC, EE, EG, ES, FI, GB, GD, GE, GH, GM, GT, HN, HR, HU, ID, IL, IN, IR, IS, JO, JP, KE, KG, KH, KN, KP, KR, KW, KZ, LA, LC, LK, LR, LS, LU, LY, MA, MD, ME, MG, MK, MN, MW, MX, MY, MZ, NA, NG, NI, NO, NZ, OM, PA, PE, PG, PH, PL, PT, QA, RO, RS, RU, RW, SA, (54) Title: CONCEPT FOR GENERATING AN ENHANCED SOUND-FIELD DESCRIPTION OR A MODIFIED SOUND FIELD DESCRIPTION USING A DEPTH-EXTENDED DIRAC TECHNIQUE OR OTHER TECHNIQUES 200 w 210 DiFF real or sound field (DirAC) 1.• x y DoA virtual microphone z analyzer/ downmixer 120 110 downmix mono/ stereo DoA DiFF • meta data generator combiner (output interface) ► depth map if diff > thresh. then distance = N.A. or distance= predet. value (such as HIGH, INF., ...) enhanced sound field description Fig. 2 (57) : An apparatus for generating an enhanced sound field description comprises: a sound field generator (100) for generating at least one sound field description indicating a sound field with respect to at least one reference location; and a meta data generator (110) for generating meta data relating to spatial information of the sound field, wherein the at least one sound field description and the meta data constitute the enhanced sound field description. The meta data can be a depth map associating a distance information to a direction in a full band or a subband, i.e., a time frequency bin. (12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) vitivio VIII 00 los Oil OH om nava Hs (19) World Intellectual Property Organization International Bureau (10) International Publication Number WO 2019/012135 Al (43) International Publication Date 17 January 2019 (17.01.2019) WIPO I PCT [Continued on next page] WO 2019/012135 Al MIDEDIMOMOIDEIROMOIONEEHOHOMMODEVOIMIE SC, SD, SE, SG, SK, SL, SM, ST, SV, SY, TH, TJ, TM, TN, TR, TT, TZ, UA, UG, US, UZ, VC, VN, ZA, ZM, ZW. (84) Designated States (unless otherwise indicated, for every kind of regional protection available): ARIPO (BW, GH, GM, KE, LR, LS, MW, MZ, NA, RW, SD, SL, ST, SZ, TZ, UG, ZM, ZW), Eurasian (AM, AZ, BY, KG, KZ, RU, TJ, TM), European (AL, AT, BE, BG, CH, CY, CZ, DE, DK, EE, ES, FI, FR, GB, GR, HR, HU, IE, IS, IT, LT, LU, LV, MC, MK, MT, NL, NO, PL, PT, RO, RS, SE, SI, SK, SM, TR), OAPI (BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, KM, ML, MR, NE, SN, TD, TG). Published: — with international search report (Art. 21(3))
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP17181481 | 2017-07-14 | ||
PCT/EP2018/069147 WO2019012135A1 (en) | 2017-07-14 | 2018-07-13 | Concept for generating an enhanced sound-field description or a modified sound field description using a depth-extended dirac technique or other techniques |
Publications (1)
Publication Number | Publication Date |
---|---|
SG11202000287RA true SG11202000287RA (en) | 2020-02-27 |
Family
ID=59631528
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
SG11202000287RA SG11202000287RA (en) | 2017-07-14 | 2018-07-13 | Concept for generating an enhanced sound-field description or a modified sound field description using a depth-extended dirac technique or other techniques |
Country Status (14)
Country | Link |
---|---|
US (1) | US11477594B2 (en) |
EP (1) | EP3652737A1 (en) |
JP (1) | JP7122793B2 (en) |
KR (2) | KR102448736B1 (en) |
CN (1) | CN111108555B (en) |
AR (1) | AR112556A1 (en) |
AU (2) | AU2018298878A1 (en) |
BR (1) | BR112020000779A2 (en) |
CA (1) | CA3069772C (en) |
RU (1) | RU2736274C1 (en) |
SG (1) | SG11202000287RA (en) |
TW (1) | TWI684978B (en) |
WO (1) | WO2019012135A1 (en) |
ZA (1) | ZA202000169B (en) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11109178B2 (en) * | 2017-12-18 | 2021-08-31 | Dolby International Ab | Method and system for handling local transitions between listening positions in a virtual reality environment |
BR112021010964A2 (en) * | 2018-12-07 | 2021-08-31 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | DEVICE AND METHOD TO GENERATE A SOUND FIELD DESCRIPTION |
WO2021006871A1 (en) * | 2019-07-08 | 2021-01-14 | Dts, Inc. | Non-coincident audio-visual capture system |
KR20220079931A (en) * | 2019-10-10 | 2022-06-14 | 디티에스, 인코포레이티드 | Spatial Audio Capture with Depth |
CN113284504A (en) | 2020-02-20 | 2021-08-20 | 北京三星通信技术研究有限公司 | Attitude detection method and apparatus, electronic device, and computer-readable storage medium |
US11164550B1 (en) * | 2020-04-23 | 2021-11-02 | Hisep Technology Ltd. | System and method for creating and outputting music |
WO2021261085A1 (en) * | 2020-06-26 | 2021-12-30 | ソニーグループ株式会社 | Information processing device, information processing method, program, and recording reproduction system |
US11830471B1 (en) * | 2020-08-31 | 2023-11-28 | Amazon Technologies, Inc. | Surface augmented ray-based acoustic modeling |
GB2602148A (en) * | 2020-12-21 | 2022-06-22 | Nokia Technologies Oy | Audio rendering with spatial metadata interpolation and source position information |
US11758348B1 (en) * | 2021-01-07 | 2023-09-12 | Apple Inc. | Auditory origin synthesis |
EP4174637A1 (en) * | 2021-10-26 | 2023-05-03 | Koninklijke Philips N.V. | Bitstream representing audio in an environment |
GB2614253A (en) * | 2021-12-22 | 2023-07-05 | Nokia Technologies Oy | Apparatus, methods and computer programs for providing spatial audio |
WO2023147864A1 (en) * | 2022-02-03 | 2023-08-10 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method to transform an audio stream |
WO2024014711A1 (en) * | 2022-07-11 | 2024-01-18 | 한국전자통신연구원 | Audio rendering method based on recording distance parameter and apparatus for performing same |
WO2024069796A1 (en) * | 2022-09-28 | 2024-04-04 | 三菱電機株式会社 | Sound space construction device, sound space construction system, program, and sound space construction method |
Family Cites Families (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB1512514A (en) * | 1974-07-12 | 1978-06-01 | Nat Res Dev | Microphone assemblies |
JPH08107600A (en) * | 1994-10-04 | 1996-04-23 | Yamaha Corp | Sound image localization device |
US5970152A (en) | 1996-04-30 | 1999-10-19 | Srs Labs, Inc. | Audio enhancement system for use in a surround sound environment |
JP2006074589A (en) * | 2004-09-03 | 2006-03-16 | Matsushita Electric Ind Co Ltd | Acoustic processing device |
EP2346028A1 (en) * | 2009-12-17 | 2011-07-20 | Fraunhofer-Gesellschaft zur Förderung der Angewandten Forschung e.V. | An apparatus and a method for converting a first parametric spatial audio signal into a second parametric spatial audio signal |
EP2450880A1 (en) * | 2010-11-05 | 2012-05-09 | Thomson Licensing | Data structure for Higher Order Ambisonics audio data |
PL2647222T3 (en) | 2010-12-03 | 2015-04-30 | Fraunhofer Ges Forschung | Sound acquisition via the extraction of geometrical information from direction of arrival estimates |
EP2600343A1 (en) | 2011-12-02 | 2013-06-05 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for merging geometry - based spatial audio coding streams |
EP2600637A1 (en) | 2011-12-02 | 2013-06-05 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for microphone positioning based on a spatial power density |
CN104054126B (en) | 2012-01-19 | 2017-03-29 | 皇家飞利浦有限公司 | Space audio is rendered and is encoded |
US9479886B2 (en) | 2012-07-20 | 2016-10-25 | Qualcomm Incorporated | Scalable downmix design with feedback for object-based surround codec |
JP6085029B2 (en) | 2012-08-31 | 2017-02-22 | ドルビー ラボラトリーズ ライセンシング コーポレイション | System for rendering and playing back audio based on objects in various listening environments |
EP2923502A4 (en) * | 2012-11-20 | 2016-06-15 | Nokia Technologies Oy | Spatial audio enhancement apparatus |
US9685163B2 (en) | 2013-03-01 | 2017-06-20 | Qualcomm Incorporated | Transforming spherical harmonic coefficients |
US9854377B2 (en) | 2013-05-29 | 2017-12-26 | Qualcomm Incorporated | Interpolation for decomposed representations of a sound field |
US9466305B2 (en) | 2013-05-29 | 2016-10-11 | Qualcomm Incorporated | Performing positional analysis to code spherical harmonic coefficients |
US9384741B2 (en) | 2013-05-29 | 2016-07-05 | Qualcomm Incorporated | Binauralization of rotated higher order ambisonics |
US20150127354A1 (en) | 2013-10-03 | 2015-05-07 | Qualcomm Incorporated | Near field compensation for decomposed representations of a sound field |
BR112016015971B1 (en) * | 2014-01-16 | 2022-11-16 | Sony Corporation | AUDIO PROCESSING DEVICE AND METHOD, AND COMPUTER READABLE STORAGE MEDIA |
EP3007167A1 (en) | 2014-10-10 | 2016-04-13 | Thomson Licensing | Method and apparatus for low bit rate compression of a Higher Order Ambisonics HOA signal representation of a sound field |
CN105635635A (en) * | 2014-11-19 | 2016-06-01 | 杜比实验室特许公司 | Adjustment for space consistency in video conference system |
JP6841229B2 (en) | 2015-12-10 | 2021-03-10 | ソニー株式会社 | Speech processing equipment and methods, as well as programs |
US10659906B2 (en) | 2017-01-13 | 2020-05-19 | Qualcomm Incorporated | Audio parallax for virtual reality, augmented reality, and mixed reality |
-
2018
- 2018-07-13 CN CN201880060085.9A patent/CN111108555B/en active Active
- 2018-07-13 AR ARP180101956 patent/AR112556A1/en active IP Right Grant
- 2018-07-13 CA CA3069772A patent/CA3069772C/en active Active
- 2018-07-13 KR KR1020207001256A patent/KR102448736B1/en active IP Right Grant
- 2018-07-13 BR BR112020000779-0A patent/BR112020000779A2/en unknown
- 2018-07-13 EP EP18737642.1A patent/EP3652737A1/en active Pending
- 2018-07-13 SG SG11202000287RA patent/SG11202000287RA/en unknown
- 2018-07-13 WO PCT/EP2018/069147 patent/WO2019012135A1/en active Search and Examination
- 2018-07-13 AU AU2018298878A patent/AU2018298878A1/en not_active Abandoned
- 2018-07-13 RU RU2020106726A patent/RU2736274C1/en active
- 2018-07-13 KR KR1020227010356A patent/KR102568365B1/en active IP Right Grant
- 2018-07-13 JP JP2020500707A patent/JP7122793B2/en active Active
- 2018-07-16 TW TW107124518A patent/TWI684978B/en active
-
2020
- 2020-01-10 ZA ZA2020/00169A patent/ZA202000169B/en unknown
- 2020-01-10 US US16/740,290 patent/US11477594B2/en active Active
-
2021
- 2021-08-18 AU AU2021218089A patent/AU2021218089A1/en not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
BR112020000779A2 (en) | 2020-07-14 |
US11477594B2 (en) | 2022-10-18 |
EP3652737A1 (en) | 2020-05-20 |
KR20220044393A (en) | 2022-04-07 |
AU2018298878A1 (en) | 2020-01-30 |
KR102568365B1 (en) | 2023-08-18 |
RU2736274C1 (en) | 2020-11-13 |
TWI684978B (en) | 2020-02-11 |
AR112556A1 (en) | 2019-11-13 |
CA3069772C (en) | 2024-01-02 |
AU2021218089A1 (en) | 2021-09-09 |
US20200154229A1 (en) | 2020-05-14 |
JP7122793B2 (en) | 2022-08-22 |
KR102448736B1 (en) | 2022-09-30 |
CA3069772A1 (en) | 2019-01-17 |
WO2019012135A1 (en) | 2019-01-17 |
ZA202000169B (en) | 2022-07-27 |
KR20200041307A (en) | 2020-04-21 |
TW201919043A (en) | 2019-05-16 |
JP2020527887A (en) | 2020-09-10 |
CN111108555B (en) | 2023-12-15 |
CN111108555A (en) | 2020-05-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
SG11202000287RA (en) | Concept for generating an enhanced sound-field description or a modified sound field description using a depth-extended dirac technique or other techniques | |
SG11202000330XA (en) | Concept for generating an enhanced sound field description or a modified sound field description using a multi-point sound field description | |
SG11202000285QA (en) | Concept for generating an enhanced sound-field description or a modified sound field description using a multi-layer description | |
SG11201910019PA (en) | Microorganisms and methods for producing cannabinoids and cannabinoid derivatives | |
SG11201806650VA (en) | Systems and methods for providing a personal distributed ledger | |
SG11201908451UA (en) | Method, apparatus, and system for blockchain consensus | |
SG11201808968WA (en) | Time division multiplexing of synchronization channels | |
SG11201907753TA (en) | Bispecific binding molecules that are capable of binding cd137 and tumor antigens, and uses thereof | |
SG11201807401RA (en) | Therapeutic membrane vesicles | |
SG11201901550WA (en) | Method and apparatus for data processing | |
SG11201809123UA (en) | Informing base station regarding user equipment's reception of beam change instruction | |
SG11201909160WA (en) | Multispecific polypeptide constructs having constrained cd3 binding and methods of using the same | |
SG11201809617RA (en) | Intracellular delivery of biomolecules to induce tolerance | |
SG11201901210UA (en) | Ferroelectric memory cells | |
SG11201811095UA (en) | Multi-level storage in ferroelectric memory | |
SG11201808528VA (en) | Modified wound dressings | |
SG11201906468TA (en) | Chimeric antigen receptors against axl or ror2 and methods of use thereof | |
SG11201901783UA (en) | A device to manipulate ions of same or different polarities | |
SG11201903958SA (en) | Intuitive occluded object indicator | |
SG11201806823YA (en) | Extended synchronization signal for symbol index detection | |
SG11201906413XA (en) | Exposure apparatus | |
SG11201900548WA (en) | Method and system for private communication with multiple parties | |
SG11201901211XA (en) | Apparatuses and methods including ferroelectric memory and for accessing ferroelectric memory | |
SG11201804892PA (en) | Synthesis of signals for immersive audio playback | |
SG11201908743SA (en) | Methods and compositions for modulation of immune cells |