HK1204134A1 - Talker collisions in an auditory scene - Google Patents

Talker collisions in an auditory scene

Info

Publication number
HK1204134A1
Authority
HK
Hong Kong
Prior art keywords
talker
collisions
auditory scene
auditory
scene
Prior art date
Application number
HK15104256.4A
Other languages
English (en)
Inventor
Gary Spittle
Michael Hollier
Original Assignee
Dolby Lab Licensing Corp
Priority date
Filing date
Publication date
Application filed by Dolby Lab Licensing Corp
Publication of HK1204134A1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G10L21/0216 Noise filtering characterised by the method used for estimating noise
    • G10L21/0232 Processing in the frequency domain
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/003 Changing voice quality, e.g. pitch or formants
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272 Voice signal separating
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316 Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364 Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M3/00 Automatic or semi-automatic exchanges
    • H04M3/42 Systems providing special services or facilities to subscribers
    • H04M3/56 Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities
    • H04M3/568 Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities audio processing specific to telephonic conferencing, e.g. spatial distribution, mixing of participants
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Quality & Reliability (AREA)
  • Telephone Function (AREA)
  • Telephonic Communication Services (AREA)
  • Stereophonic System (AREA)
HK15104256.4A 2012-03-23 2015-05-05 Talker collisions in an auditory scene HK1204134A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201261614577P 2012-03-23 2012-03-23
PCT/US2013/033366 WO2013142727A1 (en) 2012-03-23 2013-03-21 Talker collisions in an auditory scene

Publications (1)

Publication Number Publication Date
HK1204134A1 (zh) 2015-11-06

Family

ID=48096233

Family Applications (1)

Application Number Title Priority Date Filing Date
HK15104256.4A HK1204134A1 (zh) 2012-03-23 2015-05-05 Talker collisions in an auditory scene

Country Status (6)

Country Link
US (1) US9502047B2 (zh)
EP (1) EP2828849B1 (zh)
JP (1) JP6023823B2 (zh)
CN (1) CN104205212B (zh)
HK (1) HK1204134A1 (zh)
WO (1) WO2013142727A1 (zh)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9237238B2 (en) * 2013-07-26 2016-01-12 Polycom, Inc. Speech-selective audio mixing for conference
CN104767652B (zh) * 2014-01-08 2020-01-17 Dolby Laboratories Licensing Corporation Method for monitoring the performance of a digital transmission environment
US10079941B2 (en) 2014-07-07 2018-09-18 Dolby Laboratories Licensing Corporation Audio capture and render device having a visual display and user interface for use for audio conferencing
CN106878533B (zh) * 2015-12-10 2021-03-19 Beijing Qihoo Technology Co., Ltd. Communication method and device for a mobile terminal
EP3291226B1 (en) * 2016-09-05 2020-11-04 Unify Patente GmbH & Co. KG A method of treating speech data, a device for handling telephone calls and a hearing device
US11017790B2 (en) * 2018-11-30 2021-05-25 International Business Machines Corporation Avoiding speech collisions among participants during teleconferences
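CN111354356B (zh) * 2018-12-24 2024-04-30 Beijing Sogou Technology Development Co., Ltd. Voice data processing method and device
CN117461323A (zh) * 2021-06-08 2024-01-26 Sony Group Corporation Information processing device, information processing method, information processing program, and information processing system
CN114915690B (zh) * 2022-05-05 2024-08-20 Guangzhou Meilu Electronics Co., Ltd. Audio signal processing method, apparatus, device, and medium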

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7391877B1 (en) 2003-03-31 2008-06-24 United States Of America As Represented By The Secretary Of The Air Force Spatial processor for enhanced performance in multi-talker speech displays
JP2005267667A (ja) 2004-03-16 2005-09-29 Denon Ltd Audio recording and reproducing device
WO2006050353A2 (en) 2004-10-28 2006-05-11 Verax Technologies Inc. A system and method for generating sound events
US7970115B1 (en) * 2005-10-05 2011-06-28 Avaya Inc. Assisted discrimination of similar sounding speakers
US20100235169A1 (en) 2006-06-02 2010-09-16 Koninklijke Philips Electronics N.V. Speech differentiation
US7853649B2 (en) 2006-09-21 2010-12-14 Apple Inc. Audio processing for improved user experience
US8559646B2 (en) 2006-12-14 2013-10-15 William G. Gardner Spatial audio teleconferencing
US20080298610A1 (en) 2007-05-30 2008-12-04 Nokia Corporation Parameter Space Re-Panning for Spatial Audio
ATE504010T1 (de) 2007-06-01 2011-04-15 Univ Graz Tech Joint position-pitch estimation of acoustic sources for their tracking and separation
GB0712099D0 (en) 2007-06-22 2007-08-01 Wivenhoe Technology Ltd Transmission Of Audio Information
US8180029B2 (en) 2007-06-28 2012-05-15 Voxer Ip Llc Telecommunication and multimedia management method and apparatus
JP2009139592A (ja) * 2007-12-05 2009-06-25 Sony Corp Audio processing device, audio processing system, and audio processing program
JP5195652B2 (ja) 2008-06-11 2013-05-08 Sony Corporation Signal processing device, signal processing method, and program
US20110109798A1 (en) 2008-07-09 2011-05-12 Mcreynolds Alan R Method and system for simultaneous rendering of multiple multi-media presentations
WO2010092914A1 (ja) * 2009-02-13 2010-08-19 NEC Corporation Multi-channel acoustic signal processing method, system therefor, and program
US8417703B2 (en) 2009-11-03 2013-04-09 Qualcomm Incorporated Data searching using spatial auditory cues

Also Published As

Publication number Publication date
CN104205212A (zh) 2014-12-10
JP6023823B2 (ja) 2016-11-09
EP2828849A1 (en) 2015-01-28
CN104205212B (zh) 2016-09-07
US20150012266A1 (en) 2015-01-08
EP2828849B1 (en) 2016-07-20
JP2015511029A (ja) 2015-04-13
WO2013142727A1 (en) 2013-09-26
US9502047B2 (en) 2016-11-22

Similar Documents

Publication Publication Date Title
HK1209256A1 (zh) Stereo earphone
EP2846531A4 (en) STEREO CAMERA AND STEREO CAMERA SYSTEM
EP2875315A4 (en) STEREOSCOPIC CAMERA
EP2779684A4 (en) BONE CONDUCTION SPEAKER UNIT
AU351366S (en) Subwoofer
HK1204134A1 (zh) Talker collisions in an auditory scene
EP2813091A4 (en) FORMBLE HEARING SYSTEM
GB201217019D0 (en) Communications system
GB2502274B (en) Telecommunications systems and methods
HK1230823A1 (zh) Bone conduction speaker
GB2502275B (en) Telecommunications systems and methods
EP2859242A4 (en) EJECTORS
EP2832127A4 (en) COMMUNICATION SYSTEM
EP2717374A4 (en) lamination system
GB2506152C (en) Telecommunications systems and methods
GB2501516B (en) 3D Camera system
GB201216938D0 (en) Telecommunications systems and methods
HK1204153A1 (zh) Lamination system
GB201318717D0 (en) Earpiece
ZA201309051B (en) Methods and systems for providing efficient telecommunications services
GB201216927D0 (en) Communications system
EP2852950A4 (en) ACOUSTIC PLATE
HK1203661A1 (zh) Electronic conference system
EP2885887A4 (en) SYSTEMS AND METHOD FOR PERFORMANCE-OPTIMIZED FRAMING
GB201222843D0 (en) Laser System