EP4292084A4 - Multi-channel speech compression system and method - Google Patents

Multi-channel speech compression system and method

Info

Publication number
EP4292084A4
Authority
EP
European Patent Office
Prior art keywords
compression system
speech compression
channel speech
channel
speech
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
EP22753376.7A
Other languages
English (en)
French (fr)
Other versions
EP4292084A1 (de
Inventor
Dushyant Sharma
Patrick A. NAYLOR
Uwe Helmut Jost
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Technology Licensing LLC
Original Assignee
Microsoft Technology Licensing LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Technology Licensing LLC
Publication of EP4292084A1
Publication of EP4292084A4
Legal status: Pending

Classifications

    • G PHYSICS
    • G01 MEASURING; TESTING
    • G01S RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S11/00 Systems for determining distance or velocity not using reflection or reradiation
    • G01S11/14 Systems for determining distance or velocity not using reflection or reradiation using ultrasonic, sonic, or infrasonic waves
    • G PHYSICS
    • G01 MEASURING; TESTING
    • G01S RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S3/00 Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received
    • G01S3/80 Direction-finders for determining the direction from which infrasonic, sonic, ultrasonic, or electromagnetic waves, or particle emission, not having a directional significance, are being received using ultrasonic, sonic or infrasonic waves
    • G01S3/802 Systems for determining direction or deviation from predetermined direction
    • G PHYSICS
    • G01 MEASURING; TESTING
    • G01S RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S5/00 Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations
    • G01S5/18 Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations using ultrasonic, sonic, or infrasonic waves
    • G PHYSICS
    • G01 MEASURING; TESTING
    • G01S RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S5/00 Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations
    • G01S5/18 Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations using ultrasonic, sonic, or infrasonic waves
    • G01S5/30 Determining absolute distances from a plurality of spaced points of known location
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/70 Determining position or orientation of objects or cameras
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063 Training
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008 Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16 Vocoder architecture
    • G10L19/167 Audio streaming, i.e. formatting and decoding of an encoded audio signal representation into a data stream for transmission or storage purposes
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00 Details of transducers, loudspeakers or microphones
    • H04R1/20 Arrangements for obtaining desired frequency or directional characteristics
    • H04R1/32 Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
    • H04R1/40 Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
    • H04R1/406 Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers microphones
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00 Circuits for transducers, loudspeakers or microphones
    • H04R3/005 Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R5/00 Stereophonic arrangements
    • H04R5/027 Spatial or constructional arrangements of microphones, e.g. in dummy heads
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S3/00 Systems employing more than two channels, e.g. quadraphonic
    • H04S3/008 Systems employing more than two channels, e.g. quadraphonic in which the audio signals are in digital form, i.e. employing more than two discrete digital channels
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S7/00 Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30 Control circuits for electronic adaptation of the sound field
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S7/00 Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30 Control circuits for electronic adaptation of the sound field
    • H04S7/305 Electronic adaptation of stereophonic audio signals to reverberation of the listening space
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001 Codebooks
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001 Codebooks
    • G10L2019/0002 Codebook adaptations
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L2021/02082 Noise filtering the noise being echo, reverberation of the speech
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161 Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166 Microphone arrays; Beamforming
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2201/00 Details of transducers, loudspeakers or microphones covered by H04R1/00 but not provided for in any of its subgroups
    • H04R2201/40 Details of arrangements for obtaining desired directional characteristic by combining a number of identical transducers covered by H04R1/40 but not provided for in any of its subgroups
    • H04R2201/401 2D or 3D arrays of transducers
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2430/00 Signal processing covered by H04R, not provided for in its groups
    • H04R2430/20 Processing of the output signals of the acoustic transducers of an array for obtaining a desired directivity characteristic
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/01 Multi-channel, i.e. more than two input channels, sound reproduction with two speakers wherein the multi-channel information is substantially preserved
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/11 Positioning of individual sound objects, e.g. moving airplane, within a sound field
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/15 Aspects of sound capture and related signal processing for recording or reproduction

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Otolaryngology (AREA)
  • General Physics & Mathematics (AREA)
  • Remote Sensing (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Mathematical Physics (AREA)
  • General Health & Medical Sciences (AREA)
  • Quality & Reliability (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Theoretical Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Stereophonic System (AREA)
EP22753376.7A 2021-02-11 2022-02-10 Multi-channel speech compression system and method Pending EP4292084A4 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US202163148427P 2021-02-11 2021-02-11
US202163183848P 2021-05-04 2021-05-04
PCT/US2022/016034 WO2022173990A1 (en) 2021-02-11 2022-02-10 Multi-channel speech compression system and method

Publications (2)

Publication Number Publication Date
EP4292084A1 (de) 2023-12-20
EP4292084A4 (de) 2025-01-01

Family

ID=82837295

Family Applications (7)

Application Number Title Priority Date Filing Date
EP22753370.0A Pending EP4292295A4 (de) 2021-02-11 2022-02-10 Multi-channel speech compression system and method
EP22753376.7A Pending EP4292084A4 (de) 2021-02-11 2022-02-10 Multi-channel speech compression system and method
EP22753366.8A Pending EP4292086A4 (de) 2021-02-11 2022-02-10 Multi-channel speech compression system and method
EP22753368.4A Pending EP4292091A4 (de) 2021-02-11 2022-02-10 Comparison of acoustic relative transfer functions from at least one pair of time frames
EP22753372.6A Pending EP4292079A4 (de) 2021-02-11 2022-02-10 Multi-channel speech compression system and method
EP22753374.2A Withdrawn EP4292087A1 (de) 2021-02-11 2022-02-10 First and second embedding of acoustic relative transfer functions
EP22753375.9A Withdrawn EP4292296A1 (de) 2021-02-11 2022-02-10 Multi-channel speech compression system and method

Family Applications Before (1)

Application Number Title Priority Date Filing Date
EP22753370.0A Pending EP4292295A4 (de) 2021-02-11 2022-02-10 Multi-channel speech compression system and method

Family Applications After (5)

Application Number Title Priority Date Filing Date
EP22753366.8A Pending EP4292086A4 (de) 2021-02-11 2022-02-10 Multi-channel speech compression system and method
EP22753368.4A Pending EP4292091A4 (de) 2021-02-11 2022-02-10 Comparison of acoustic relative transfer functions from at least one pair of time frames
EP22753372.6A Pending EP4292079A4 (de) 2021-02-11 2022-02-10 Multi-channel speech compression system and method
EP22753374.2A Withdrawn EP4292087A1 (de) 2021-02-11 2022-02-10 First and second embedding of acoustic relative transfer functions
EP22753375.9A Withdrawn EP4292296A1 (de) 2021-02-11 2022-02-10 Multi-channel speech compression system and method

Country Status (3)

Country Link
US (10) US11924624B2 (de)
EP (7) EP4292295A4 (de)
WO (7) WO2022173986A1 (de)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP4292295A4 (de) 2021-02-11 2025-02-26 Microsoft Technology Licensing, LLC Multi-channel speech compression system and method
CN115482828A (zh) * 2021-06-15 2022-12-16 华为技术有限公司 Sound signal processing method and apparatus, and computer-readable storage medium
TWI866361B (zh) * 2023-07-28 2024-12-11 英屬開曼群島商意騰科技股份有限公司 Audio device, audio system, and audio processing method with interference suppression function
US12387736B2 (en) 2023-07-29 2025-08-12 Zon Global Ip Inc. Audio compression with generative adversarial networks
US12437213B2 (en) 2023-07-29 2025-10-07 Zon Global Ip Inc. Bayesian graph-based retrieval-augmented generation with synthetic feedback loop (BG-RAG-SFL)
US12382051B2 (en) 2023-07-29 2025-08-05 Zon Global Ip Inc. Advanced maximal entropy media compression processing
US12482446B2 (en) * 2023-08-11 2025-11-25 British Cayman Islands Intelligo Technology Inc. Audio device with distractor suppression

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120163606A1 (en) * 2009-06-23 2012-06-28 Nokia Corporation Method and Apparatus for Processing Audio Signals
WO2020231883A1 (en) * 2019-05-15 2020-11-19 Ocelot Laboratories Llc Separating and rendering voice and ambience signals
US20220345813A1 (en) * 2019-10-10 2022-10-27 Dts, Inc. Spatial audio capture and analysis with depth

Family Cites Families (59)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA2010830C (en) * 1990-02-23 1996-06-25 Jean-Pierre Adoul Dynamic codebook for efficient speech coding based on algebraic codes
GB2352152B (en) 1998-03-31 2003-03-26 Lake Technology Ltd Formulation of complex room impulse responses from 3-D audio information
US6483532B1 (en) * 1998-07-13 2002-11-19 Netergy Microelectronics, Inc. Video-assisted audio signal processing system and method
FR2883656B1 (fr) 2005-03-25 2008-09-19 Imra Europ Sas Soc Par Actions Continuous speech processing using a heterogeneous and adapted transfer function
WO2007011157A1 (en) * 2005-07-19 2007-01-25 Electronics And Telecommunications Research Institute Virtual source location information based channel level difference quantization and dequantization method
JP5092974B2 (ja) * 2008-07-30 2012-12-05 富士通株式会社 Transfer characteristic estimation device, noise suppression device, transfer characteristic estimation method, and computer program
US8204198B2 (en) 2009-06-19 2012-06-19 Magor Communications Corporation Method and apparatus for selecting an audio stream
US8335689B2 (en) 2009-10-14 2012-12-18 Cogi, Inc. Method and system for efficient management of speech transcribers
US9191738B2 (en) * 2010-12-21 2015-11-17 Nippon Telegraph and Telephone Corporation Sound enhancement method, device, program and recording medium
US20130022189A1 (en) * 2011-07-21 2013-01-24 Nuance Communications, Inc. Systems and methods for receiving and processing audio signals captured using multiple devices
JP5685177B2 (ja) * 2011-12-12 2015-03-18 本田技研工業株式会社 Information transmission system
KR101794733B1 (ko) 2011-12-26 2017-11-09 한국전자통신연구원 Security system using sound field variation pattern analysis and method thereof
US20130253923A1 (en) 2012-03-21 2013-09-26 Her Majesty The Queen In Right Of Canada, As Represented By The Minister Of Industry Multichannel enhancement system for preserving spatial cues
JP5931661B2 (ja) * 2012-09-14 2016-06-08 本田技研工業株式会社 Sound source direction estimation device, sound source direction estimation method, and sound source direction estimation program
US20150189455A1 (en) * 2013-12-30 2015-07-02 Aliphcom Transformation of multiple sound fields to generate a transformed reproduced sound field including modified reproductions of the multiple sound fields
US9516413B1 (en) 2014-09-30 2016-12-06 Apple Inc. Location based storage and upload of acoustic environment related information
FR3040807B1 (fr) 2015-09-07 2022-10-14 3D Sound Labs Method and system for developing a head-related transfer function adapted to an individual
KR102586089B1 (ko) 2015-11-17 2023-10-10 돌비 레버러토리즈 라이쎈싱 코오포레이션 Head tracking for parametric binaural output system and method
US10373612B2 (en) * 2016-03-21 2019-08-06 Amazon Technologies, Inc. Anchored speech detection and speech recognition
US9955279B2 (en) * 2016-05-11 2018-04-24 Ossic Corporation Systems and methods of calibrating earphones
US10063965B2 (en) 2016-06-01 2018-08-28 Google Llc Sound source estimation using neural networks
US20180007488A1 (en) * 2016-07-01 2018-01-04 Ronald Jeffrey Horowitz Sound source rendering in virtual environment
EP3285500B1 (de) * 2016-08-05 2021-03-10 Oticon A/s Binaural hearing system configured to localize a sound source
US10848899B2 (en) * 2016-10-13 2020-11-24 Philip Scott Lyren Binaural sound in visual entertainment media
US10219098B2 (en) * 2017-03-03 2019-02-26 GM Global Technology Operations LLC Location estimation of active speaker
EP3373602A1 (de) * 2017-03-09 2018-09-12 Oticon A/s Method of localizing a sound source, hearing device, and hearing system
EP3413589B1 (de) 2017-06-09 2022-11-16 Oticon A/s Microphone system and hearing device comprising a microphone system
US10939222B2 (en) 2017-08-10 2021-03-02 Lg Electronics Inc. Three-dimensional audio playing method and playing apparatus
US10535361B2 (en) 2017-10-19 2020-01-14 Kardome Technology Ltd. Speech enhancement using clustering of cues
US10598543B1 (en) * 2017-12-04 2020-03-24 Amazon Technologies, Inc. Multi microphone wall detection and location estimation
US10679617B2 (en) * 2017-12-06 2020-06-09 Synaptics Incorporated Voice enhancement in audio signals through modified generalized eigenvalue beamformer
US10390171B2 (en) * 2018-01-07 2019-08-20 Creative Technology Ltd Method for generating customized spatial audio with head tracking
US10717197B2 (en) 2018-01-08 2020-07-21 Digital Dream Labs, Llc Spatial acoustic filtering by a mobile robot
US11495244B2 (en) * 2018-04-04 2022-11-08 Pindrop Security, Inc. Voice modification detection using physical models of speech production
WO2019197002A1 (en) 2018-04-13 2019-10-17 Aalborg Universitet Generating sound zones using variable span filters
US10529356B2 (en) * 2018-05-15 2020-01-07 Cirrus Logic, Inc. Detecting unwanted audio signal components by comparing signals processed with differing linearity
CN111373769B (zh) 2018-05-24 2022-11-01 索尼公司 Information processing device and information processing method
WO2019233588A1 (en) 2018-06-07 2019-12-12 Sonova Ag Microphone device to provide audio with spatial context
US10728657B2 (en) * 2018-06-22 2020-07-28 Facebook Technologies, Llc Acoustic transfer function personalization using simulation
US11070912B2 (en) * 2018-06-22 2021-07-20 Facebook Technologies, Llc Audio system for dynamic determination of personalized acoustic transfer functions
US11568864B2 (en) * 2018-08-13 2023-01-31 Carnegie Mellon University Processing speech signals of a user to generate a visual representation of the user
US11205435B2 (en) * 2018-08-17 2021-12-21 Dts, Inc. Spatial audio signal encoder
JP7027283B2 (ja) 2018-08-31 2022-03-01 本田技研工業株式会社 Transfer function generation device, transfer function generation method, and program
EP3655947B1 (de) * 2018-09-25 2022-03-09 Google LLC Speaker diarization using speaker embedding(s) and trained generative model
US10880669B2 (en) 2018-09-28 2020-12-29 EmbodyVR, Inc. Binaural sound source localization
US10672382B2 (en) * 2018-10-15 2020-06-02 Tencent America LLC Input-feeding architecture for attention based end-to-end speech recognition
GB2578625A (en) 2018-11-01 2020-05-20 Nokia Technologies Oy Apparatus, methods and computer programs for encoding spatial metadata
US20200304933A1 (en) 2019-03-19 2020-09-24 Htc Corporation Sound processing system of ambisonic format and sound processing method of ambisonic format
US11435429B2 (en) * 2019-03-20 2022-09-06 Intel Corporation Method and system of acoustic angle of arrival detection
US10638252B1 (en) * 2019-05-20 2020-04-28 Facebook Technologies, Llc Dynamic adjustment of signal enhancement filters for a microphone array
JP7108147B2 (ja) * 2019-05-23 2022-07-27 グーグル エルエルシー Variational embedding capacity in expressive end-to-end speech synthesis
US11043207B2 (en) * 2019-06-14 2021-06-22 Nuance Communications, Inc. System and method for array data simulation and customized acoustic modeling for ambient ASR
US11531807B2 (en) * 2019-06-28 2022-12-20 Nuance Communications, Inc. System and method for customized text macros
US11234073B1 (en) 2019-07-05 2022-01-25 Facebook Technologies, Llc Selective active noise cancellation
US10999690B2 (en) * 2019-09-05 2021-05-04 Facebook Technologies, Llc Selecting spatial locations for audio personalization
US11501102B2 (en) 2019-11-21 2022-11-15 Adobe Inc. Automated sound matching within an audio recording
CN112346012A (zh) * 2020-11-13 2021-02-09 南京地平线机器人技术有限公司 Sound source position determination method and apparatus, readable storage medium, and electronic device
EP4292295A4 (de) 2021-02-11 2025-02-26 Microsoft Technology Licensing, LLC Multi-channel speech compression system and method
EP4138418A1 (de) 2021-08-20 2023-02-22 Oticon A/s Hearing system comprising a database of acoustic transfer functions

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of WO2022173990A1 *

Also Published As

Publication number Publication date
US20220254356A1 (en) 2022-08-11
US20250220378A1 (en) 2025-07-03
WO2022173989A1 (en) 2022-08-18
WO2022173986A1 (en) 2022-08-18
US12289595B2 (en) 2025-04-29
US20220254358A1 (en) 2022-08-11
US12143798B2 (en) 2024-11-12
US12114147B2 (en) 2024-10-08
WO2022173982A1 (en) 2022-08-18
US20220256303A1 (en) 2022-08-11
WO2022173984A1 (en) 2022-08-18
EP4292296A1 (de) 2023-12-20
US20220254357A1 (en) 2022-08-11
US12452620B2 (en) 2025-10-21
EP4292295A4 (de) 2025-02-26
EP4292084A1 (de) 2023-12-20
US20220254359A1 (en) 2022-08-11
EP4292091A1 (de) 2023-12-20
US11997469B2 (en) 2024-05-28
WO2022173988A1 (en) 2022-08-18
US20220254360A1 (en) 2022-08-11
US11950081B2 (en) 2024-04-02
EP4292087A1 (de) 2023-12-20
US20240323630A1 (en) 2024-09-26
WO2022173980A1 (en) 2022-08-18
EP4292079A4 (de) 2025-01-01
EP4292295A1 (de) 2023-12-20
US20250373997A1 (en) 2025-12-04
US12149914B2 (en) 2024-11-19
EP4292079A1 (de) 2023-12-20
US20220254361A1 (en) 2022-08-11
WO2022173990A1 (en) 2022-08-18
EP4292086A1 (de) 2023-12-20
US11924624B2 (en) 2024-03-05
EP4292091A4 (de) 2024-12-25
EP4292086A4 (de) 2025-01-08

Similar Documents

Publication Publication Date Title
EP4292084A4 (de) Multi-channel speech compression system and method
EP3867900C0 (de) System and method for multi-spoken language detection
EP3920178C0 (de) Audio recognition method and system, and device
EP4099316A4 (de) Speech synthesis method and system
EP4026121A4 (de) Systems and methods for speech recognition
EP4138677A4 (de) Teledermatology system and method
EP4297023A4 (de) Voice control method and device
EP4228285A4 (de) Audio control method and device
EP4147637C0 (de) Gait analysis system and method
EP4159853A4 (de) Genome editing system and method
EP4320877C0 (de) Audio device and method therefor
EP4374541A4 (de) System and method for quantum-secure microgrids
EP4004674C0 (de) System and method for biometric protocol standards
EP4002354C0 (de) Method and system for automatic speech recognition in resource-constrained devices
EP4115309A4 (de) System and method for telephone privacy
EP4020467C0 (de) Voice coaching system and related methods
EP4460985A4 (de) Multi-channel loudspeaker system and method therefor
EP4275360A4 (de) Wireless microphone system and method
EP4437268A4 (de) System and method for portable safety lighting
EP4145791A4 (de) Verification method and device
EP4024395C0 (de) Speech analyzer and associated method
EP4458013A4 (de) Methods and devices for decoder-side intra mode derivation
EP4546011A4 (de) Positioning system and method
EP4307298A4 (de) Method and system for voice activity detection, and method and system for speech enhancement
EP3535752A4 (de) System and method for parameterization of speech recognition grammar specification (SRGS) grammars

Legal Events

Date Code Title Description
STAA Information on the status of an EP patent application or granted EP patent

Free format text: STATUS: THE INTERNATIONAL PUBLICATION HAS BEEN MADE

PUAI Public reference made under Article 153(3) EPC to a published international application that has entered the European phase

Free format text: ORIGINAL CODE: 0009012

STAA Information on the status of an EP patent application or granted EP patent

Free format text: STATUS: REQUEST FOR EXAMINATION WAS MADE

17P Request for examination filed

Effective date: 20230725

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: MICROSOFT TECHNOLOGY LICENSING, LLC

DAV Request for validation of the European patent (deleted)
DAX Request for extension of the European patent (deleted)
REG Reference to a national code

Ref country code: DE

Ref legal event code: R079

Free format text: PREVIOUS MAIN CLASS: G10L0015200000

Ipc: G10L0021020000

A4 Supplementary search report drawn up and despatched

Effective date: 20241204

RIC1 Information provided on ipc code assigned before grant

Ipc: H04S 7/00 20060101ALN20241128BHEP

Ipc: H04S 3/00 20060101ALN20241128BHEP

Ipc: G10L 21/0216 20130101ALN20241128BHEP

Ipc: G10L 19/00 20130101ALN20241128BHEP

Ipc: G10L 21/0208 20130101ALN20241128BHEP

Ipc: G10L 17/06 20130101ALN20241128BHEP

Ipc: G10L 15/00 20130101ALN20241128BHEP

Ipc: H04R 5/027 20060101ALI20241128BHEP

Ipc: H04R 3/00 20060101ALI20241128BHEP

Ipc: H04R 1/40 20060101ALI20241128BHEP

Ipc: G01S 11/14 20060101ALI20241128BHEP

Ipc: G01S 5/30 20060101ALI20241128BHEP

Ipc: G01S 5/18 20060101ALI20241128BHEP

Ipc: G01S 3/802 20060101ALI20241128BHEP

Ipc: G10L 25/51 20130101ALI20241128BHEP

Ipc: G10L 19/008 20130101ALI20241128BHEP

Ipc: G10L 21/02 20130101AFI20241128BHEP