JP2021007216A5 - - Google Patents

Download PDF

Info

Publication number
JP2021007216A5
JP2021007216A5 JP2020096190A JP2020096190A JP2021007216A5 JP 2021007216 A5 JP2021007216 A5 JP 2021007216A5 JP 2020096190 A JP2020096190 A JP 2020096190A JP 2020096190 A JP2020096190 A JP 2020096190A JP 2021007216 A5 JP2021007216 A5 JP 2021007216A5
Authority
JP
Japan
Prior art keywords
signal
sound source
image
audio
channel audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
JP2020096190A
Other languages
English (en)
Japanese (ja)
Other versions
JP7525304B2 (ja
JP2021007216A (ja
Filing date
Publication date
Priority claimed from US16/455,668 external-priority patent/US11082460B2/en
Application filed filed Critical
Publication of JP2021007216A publication Critical patent/JP2021007216A/ja
Publication of JP2021007216A5 publication Critical patent/JP2021007216A5/ja
Application granted granted Critical
Publication of JP7525304B2 publication Critical patent/JP7525304B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

JP2020096190A 2019-06-27 2020-06-02 映像データを用いて容易化された音源強調 Active JP7525304B2 (ja)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US16/455,668 US11082460B2 (en) 2019-06-27 2019-06-27 Audio source enhancement facilitated using video data
US16/455,668 2019-06-27

Publications (3)

Publication Number Publication Date
JP2021007216A JP2021007216A (ja) 2021-01-21
JP2021007216A5 true JP2021007216A5 (https=) 2023-05-31
JP7525304B2 JP7525304B2 (ja) 2024-07-30

Family

ID=73887691

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2020096190A Active JP7525304B2 (ja) 2019-06-27 2020-06-02 映像データを用いて容易化された音源強調

Country Status (3)

Country Link
US (1) US11082460B2 (https=)
JP (1) JP7525304B2 (https=)
CN (1) CN112151063B (https=)

Families Citing this family (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2565315B (en) * 2017-08-09 2022-05-04 Emotech Ltd Robots, methods, computer programs, computer-readable media, arrays of microphones and controllers
CN110364161A (zh) * 2019-08-22 2019-10-22 北京小米智能科技有限公司 响应语音信号的方法、电子设备、介质及系统
FR3103955A1 (fr) * 2019-11-29 2021-06-04 Orange Dispositif et procédé d’analyse environnementale, et dispositif et procédé d’assistance vocale les implémentant
CN114830233B (zh) 2019-12-09 2025-07-01 杜比实验室特许公司 基于噪声指标和语音可懂度指标来调整音频和非音频特征
TWI740339B (zh) * 2019-12-31 2021-09-21 宏碁股份有限公司 自動調整特定聲源的方法及應用其之電子裝置
US11234090B2 (en) * 2020-01-06 2022-01-25 Facebook Technologies, Llc Using audio visual correspondence for sound source identification
US11087777B1 (en) 2020-02-11 2021-08-10 Facebook Technologies, Llc Audio visual correspondence based signal augmentation
US11460927B2 (en) * 2020-03-19 2022-10-04 DTEN, Inc. Auto-framing through speech and video localizations
US11250869B2 (en) * 2020-04-16 2022-02-15 Lg Electronics Inc. Audio zoom based on speaker detection using lip reading
US11190735B1 (en) * 2020-07-16 2021-11-30 International Business Machines Corporation Video modifying conferencing system
US11915716B2 (en) 2020-07-16 2024-02-27 International Business Machines Corporation Audio modifying conferencing system
US11303465B2 (en) 2020-07-16 2022-04-12 International Business Machines Corporation Contextually aware conferencing system
CN111885414B (zh) * 2020-07-24 2023-03-21 腾讯科技(深圳)有限公司 一种数据处理方法、装置、设备及可读存储介质
US11082465B1 (en) * 2020-08-20 2021-08-03 Avaya Management L.P. Intelligent detection and automatic correction of erroneous audio settings in a video conference
WO2022146169A1 (en) * 2020-12-30 2022-07-07 Ringcentral, Inc., (A Delaware Corporation) System and method for noise cancellation
US11748845B2 (en) 2021-01-27 2023-09-05 Nvidia Corporation Machine learning techniques for enhancing video conferencing applications
US20250088795A1 (en) * 2021-08-14 2025-03-13 Clearone, Inc. Muting Specific Talkers Using a Beamforming Microphone Array
CN113676687A (zh) * 2021-08-30 2021-11-19 联想(北京)有限公司 一种信息处理方法及电子设备
WO2023234939A1 (en) * 2022-06-02 2023-12-07 Innopeak Technology, Inc. Methods and systems for audio processing using visual information
US12581038B2 (en) * 2023-12-18 2026-03-17 Gn Hearing A/S Audio processing in video conferencing system using multimodal features
CN118865995B (zh) * 2024-09-04 2025-08-12 美的集团(上海)有限公司 多通道语音的降噪方法及系统、电子设备及存储介质
CN119766954A (zh) * 2024-12-27 2025-04-04 成都维海德科技有限公司 数据处理方法、装置、设备以及存储介质

Family Cites Families (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3714706B2 (ja) * 1995-02-17 2005-11-09 株式会社竹中工務店 音抽出装置
US7590941B2 (en) * 2003-10-09 2009-09-15 Hewlett-Packard Development Company, L.P. Communication and collaboration system using rich media environments
KR100754385B1 (ko) * 2004-09-30 2007-08-31 삼성전자주식회사 오디오/비디오 센서를 이용한 위치 파악, 추적 및 분리장치와 그 방법
US20110099017A1 (en) * 2009-10-26 2011-04-28 Ure Michael J System and method for interactive communication with a media device user such as a television viewer
US20120013620A1 (en) * 2010-07-13 2012-01-19 International Business Machines Corporation Animating Speech Of An Avatar Representing A Participant In A Mobile Communications With Background Media
US8839358B2 (en) * 2011-08-31 2014-09-16 Microsoft Corporation Progressive authentication
KR101971697B1 (ko) * 2012-02-24 2019-04-23 삼성전자주식회사 사용자 디바이스에서 복합 생체인식 정보를 이용한 사용자 인증 방법 및 장치
IL229370A (en) * 2013-11-11 2015-01-29 Mera Software Services Inc Interface system and method for providing user interaction with network entities
US9609273B2 (en) * 2013-11-20 2017-03-28 Avaya Inc. System and method for not displaying duplicate images in a video conference
KR102217191B1 (ko) * 2014-11-05 2021-02-18 삼성전자주식회사 단말 장치 및 그 정보 제공 방법
US9445050B2 (en) * 2014-11-17 2016-09-13 Freescale Semiconductor, Inc. Teleconferencing environment having auditory and visual cues
WO2016081624A1 (en) * 2014-11-18 2016-05-26 Branch Media Labs, Inc. Automatic identification and mapping of consumer electronic devices to ports on an hdmi switch
US9426139B1 (en) * 2015-03-30 2016-08-23 Amazon Technologies, Inc. Triggering a request for an authentication
EP3101838A1 (en) * 2015-06-03 2016-12-07 Thomson Licensing Method and apparatus for isolating an active participant in a group of participants
ITUB20153347A1 (it) * 2015-09-02 2017-03-02 Stefano Spattini Apparato per la videocomunicazione
CN105957521B (zh) * 2016-02-29 2020-07-10 青岛克路德机器人有限公司 一种用于机器人的语音和图像复合交互执行方法及系统
US10250848B2 (en) * 2016-06-03 2019-04-02 Avaya Inc. Positional controlled muting
CN109478400B (zh) * 2016-07-22 2023-07-07 杜比实验室特许公司 现场音乐表演的多媒体内容的基于网络的处理及分布
JP6410769B2 (ja) * 2016-07-28 2018-10-24 キヤノン株式会社 情報処理システム及びその制御方法、コンピュータプログラム
CN106328156B (zh) * 2016-08-22 2020-02-18 华南理工大学 一种音视频信息融合的麦克风阵列语音增强系统及方法
US10754608B2 (en) * 2016-11-29 2020-08-25 Nokia Technologies Oy Augmented reality mixing for distributed audio capture
CN106782584B (zh) * 2016-12-28 2023-11-07 北京地平线信息技术有限公司 音频信号处理设备、方法和电子设备
CN106653041B (zh) * 2017-01-17 2020-02-14 北京地平线信息技术有限公司 音频信号处理设备、方法和电子设备
CN107993671A (zh) * 2017-12-04 2018-05-04 南京地平线机器人技术有限公司 声音处理方法、装置和电子设备
US10867610B2 (en) * 2018-05-04 2020-12-15 Microsoft Technology Licensing, Llc Computerized intelligent assistant for conferences

Similar Documents

Publication Publication Date Title
JP2021007216A5 (https=)
CN110808048B (zh) 语音处理方法、装置、系统及存储介质
JP5565552B2 (ja) 映像音響処理装置、映像音響処理方法及びプログラム
WO2016183791A1 (zh) 一种语音信号处理方法及装置
JP2016146547A5 (https=)
EP3499900A3 (en) Video processing method, apparatus and device
JP2018189924A5 (ja) 信号処理装置、信号処理方法、およびプログラム
JP2017067666A5 (https=)
EP3177040A3 (en) Information processing apparatus, information processing method, and program
CN111933174B (zh) 语音处理方法、装置、设备和系统
US20150281839A1 (en) Background noise cancellation using depth
JP2018019294A5 (https=)
JP2013115751A5 (https=)
US9165182B2 (en) Method and apparatus for using face detection information to improve speaker segmentation
CN111863005A (zh) 声音信号获取方法和装置、存储介质、电子设备
JP2018006826A5 (ja) 音声処理装置および音声処理方法
JP6016277B2 (ja) 映像音響処理システム、映像音響処理方法及びプログラム
JP2018074251A5 (ja) 音響処理システム、音響処理方法、プログラム
US10812898B2 (en) Sound collection apparatus, method of controlling sound collection apparatus, and non-transitory computer-readable storage medium
JP2011069948A (ja) 音源信号分離装置、音源信号分離方法及びプログラム
CN114586374A (zh) 拾音装置以及拾音方法
US11363374B2 (en) Signal processing apparatus, method of controlling signal processing apparatus, and non-transitory computer-readable storage medium
JP2023084843A5 (https=)
JP6966165B2 (ja) 映像音声信号処理装置、その方法とプログラム
EP3706432A1 (en) Processing multiple spatial audio signals which have a spatial overlap