CA2889706C - Video and audio tagging for active speaker detection - Google Patents
Video and audio tagging for active speaker detection Download PDFInfo
- Publication number
- CA2889706C CA2889706C CA2889706A CA2889706A CA2889706C CA 2889706 C CA2889706 C CA 2889706C CA 2889706 A CA2889706 A CA 2889706A CA 2889706 A CA2889706 A CA 2889706A CA 2889706 C CA2889706 C CA 2889706C
- Authority
- CA
- Canada
- Prior art keywords
- audio
- tag
- audio signal
- signal
- video
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000001514 detection method Methods 0.000 title description 5
- 230000005236 sound signal Effects 0.000 claims abstract description 111
- 238000000034 method Methods 0.000 claims description 20
- 230000005540 biological transmission Effects 0.000 description 5
- 238000005516 engineering process Methods 0.000 description 5
- 238000004891 communication Methods 0.000 description 4
- 230000001419 dependent effect Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 241000282412 Homo Species 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 230000000593 degrading effect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000002592 echocardiography Methods 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000007723 transport mechanism Effects 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/15—Conference systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/141—Systems for two-way working between two video terminals, e.g. videophone
- H04N7/147—Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/15—Conference systems
- H04N7/155—Conference systems involving storage of or access to video conference sessions
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Closed-Circuit Television Systems (AREA)
- Burglar Alarm Systems (AREA)
- Telephonic Communication Services (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US13/719,314 | 2012-12-19 | ||
| US13/719,314 US9065971B2 (en) | 2012-12-19 | 2012-12-19 | Video and audio tagging for active speaker detection |
| PCT/US2013/076671 WO2014100466A2 (en) | 2012-12-19 | 2013-12-19 | Video and audio tagging for active speaker detection |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CA2889706A1 CA2889706A1 (en) | 2014-06-26 |
| CA2889706C true CA2889706C (en) | 2020-04-28 |
Family
ID=49943568
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CA2889706A Active CA2889706C (en) | 2012-12-19 | 2013-12-19 | Video and audio tagging for active speaker detection |
Country Status (11)
| Country | Link |
|---|---|
| US (1) | US9065971B2 (enExample) |
| EP (1) | EP2912841B1 (enExample) |
| JP (1) | JP6321033B2 (enExample) |
| KR (1) | KR102110632B1 (enExample) |
| CN (1) | CN104937926B (enExample) |
| AU (1) | AU2013361258B2 (enExample) |
| BR (1) | BR112015011758B1 (enExample) |
| CA (1) | CA2889706C (enExample) |
| MX (1) | MX352445B (enExample) |
| RU (1) | RU2632469C2 (enExample) |
| WO (1) | WO2014100466A2 (enExample) |
Families Citing this family (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9065971B2 (en) * | 2012-12-19 | 2015-06-23 | Microsoft Technology Licensing, Llc | Video and audio tagging for active speaker detection |
| US20150281832A1 (en) * | 2014-03-28 | 2015-10-01 | Panasonic Intellectual Property Management Co., Ltd. | Sound processing apparatus, sound processing system and sound processing method |
| US9681097B1 (en) | 2016-01-20 | 2017-06-13 | Global Tel*Link Corporation | Secure video visitation system |
| US10296994B2 (en) | 2016-02-11 | 2019-05-21 | Global Tel*Link Corporation | System and method for visitation management in a controlled environment |
| US9558523B1 (en) | 2016-03-23 | 2017-01-31 | Global Tel* Link Corp. | Secure nonscheduled video visitation system |
| US10311219B2 (en) * | 2016-06-07 | 2019-06-04 | Vocalzoom Systems Ltd. | Device, system, and method of user authentication utilizing an optical microphone |
| JP6520878B2 (ja) * | 2016-09-21 | 2019-05-29 | トヨタ自動車株式会社 | 音声取得システムおよび音声取得方法 |
| KR102717784B1 (ko) | 2017-02-14 | 2024-10-16 | 한국전자통신연구원 | 스테레오 오디오 신호에 대한 태그 삽입 장치 및 태그 삽입 방법, 그리고, 태그 추출 장치 및 태그 추출 방법 |
| US11282537B2 (en) | 2017-06-09 | 2022-03-22 | International Business Machines Corporation | Active speaker detection in electronic meetings for providing video from one device to plurality of other devices |
| KR102827290B1 (ko) * | 2022-01-13 | 2025-06-27 | 최종성 | 사용자 추적이 가능한 ai 거치대 |
Family Cites Families (41)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US5099319A (en) * | 1989-10-23 | 1992-03-24 | Esch Arthur G | Video information delivery method and apparatus |
| US5689641A (en) | 1993-10-01 | 1997-11-18 | Vicor, Inc. | Multimedia collaboration system arrangement for routing compressed AV signal through a participant site without decompressing the AV signal |
| AUPP392498A0 (en) * | 1998-06-04 | 1998-07-02 | Innes Corporation Pty Ltd | Traffic verification system |
| US7081915B1 (en) | 1998-06-17 | 2006-07-25 | Intel Corporation | Control of video conferencing using activity detection |
| US7062039B1 (en) * | 1999-05-27 | 2006-06-13 | Telefonaktiebolaget Lm Ericsson | Methods and apparatus for improving adaptive filter performance by inclusion of inaudible information |
| US6594629B1 (en) * | 1999-08-06 | 2003-07-15 | International Business Machines Corporation | Methods and apparatus for audio-visual speech detection and recognition |
| JP2002223422A (ja) * | 2001-01-29 | 2002-08-09 | Nec Corp | 多地点テレビ会議制御装置およびビデオパケット送信方法 |
| US7161939B2 (en) * | 2001-06-29 | 2007-01-09 | Ip Unity | Method and system for switching among independent packetized audio streams |
| KR100552468B1 (ko) * | 2001-07-19 | 2006-02-15 | 삼성전자주식회사 | 음성인식에 따른 오동작을 방지 및 음성인식율을 향상 할수 있는 전자기기 및 방법 |
| US6749512B2 (en) * | 2002-03-15 | 2004-06-15 | Macgregor Brian | Computer network implemented gaming system and method of using same |
| EP1443498B1 (en) * | 2003-01-24 | 2008-03-19 | Sony Ericsson Mobile Communications AB | Noise reduction and audio-visual speech activity detection |
| GB2404297B (en) * | 2003-07-24 | 2007-12-05 | Hewlett Packard Development Co | Editing multiple camera outputs |
| JP4414708B2 (ja) * | 2003-09-19 | 2010-02-10 | 株式会社リコー | 動画表示用パーソナルコンピュータ、データ表示システム、動画表示方法、動画表示プログラムおよび記録媒体 |
| US7379875B2 (en) * | 2003-10-24 | 2008-05-27 | Microsoft Corporation | Systems and methods for generating audio thumbnails |
| US20050138674A1 (en) * | 2003-12-17 | 2005-06-23 | Quadrock Communications, Inc | System and method for integration and synchronization of interactive content with television content |
| US7563168B2 (en) * | 2004-02-13 | 2009-07-21 | Texas Instruments Incorporated | Audio effect rendering based on graphic polygons |
| GB2415639B (en) * | 2004-06-29 | 2008-09-17 | Sony Comp Entertainment Europe | Control of data processing |
| US7304585B2 (en) * | 2004-07-02 | 2007-12-04 | Nokia Corporation | Initiation of actions with compressed action language representations |
| US20060147063A1 (en) | 2004-12-22 | 2006-07-06 | Broadcom Corporation | Echo cancellation in telephones with multiple microphones |
| US7450752B2 (en) * | 2005-04-07 | 2008-11-11 | Hewlett-Packard Development Company, L.P. | System and method for automatic detection of the end of a video stream |
| US9300790B2 (en) * | 2005-06-24 | 2016-03-29 | Securus Technologies, Inc. | Multi-party conversation analyzer and logger |
| CN100596061C (zh) * | 2006-01-12 | 2010-03-24 | 大连理工大学 | 一种基于盲源分离的小波域数字音频多目的水印方法 |
| CA2544459A1 (en) * | 2006-04-21 | 2007-10-21 | Evertz Microsystems Ltd. | Systems and methods for synchronizing audio and video data signals |
| US8087044B2 (en) * | 2006-09-18 | 2011-12-27 | Rgb Networks, Inc. | Methods, apparatus, and systems for managing the insertion of overlay content into a video signal |
| US7688889B2 (en) * | 2006-09-18 | 2010-03-30 | Rgb Networks, Inc. | Methods, apparatus, and systems for insertion of overlay content into a video signal with transrating capabilities |
| US20080136623A1 (en) * | 2006-12-06 | 2008-06-12 | Russell Calvarese | Audio trigger for mobile devices |
| US8633960B2 (en) * | 2007-02-20 | 2014-01-21 | St-Ericsson Sa | Communication device for processing person associated pictures and video streams |
| US8385233B2 (en) * | 2007-06-12 | 2013-02-26 | Microsoft Corporation | Active speaker identification |
| US8300080B2 (en) * | 2007-06-29 | 2012-10-30 | Microsoft Corporation | Techniques for detecting a display device |
| US20090210789A1 (en) * | 2008-02-14 | 2009-08-20 | Microsoft Corporation | Techniques to generate a visual composition for a multimedia conference event |
| FR2952263B1 (fr) * | 2009-10-29 | 2012-01-06 | Univ Paris Descartes | Procede et dispositif d'annulation d'echo acoustique par tatouage audio |
| US8713593B2 (en) * | 2010-03-01 | 2014-04-29 | Zazum, Inc. | Detection system and method for mobile device application |
| US20110214143A1 (en) * | 2010-03-01 | 2011-09-01 | Rits Susan K | Mobile device application |
| US8635066B2 (en) * | 2010-04-14 | 2014-01-21 | T-Mobile Usa, Inc. | Camera-assisted noise cancellation and speech recognition |
| US8468012B2 (en) * | 2010-05-26 | 2013-06-18 | Google Inc. | Acoustic model adaptation using geographic information |
| US8723914B2 (en) | 2010-11-19 | 2014-05-13 | Cisco Technology, Inc. | System and method for providing enhanced video processing in a network environment |
| US8589167B2 (en) * | 2011-05-11 | 2013-11-19 | Nuance Communications, Inc. | Speaker liveness detection |
| US20120321062A1 (en) * | 2011-06-17 | 2012-12-20 | Fitzsimmons Jeffrey E | Telephonic Conference Access System |
| CN102368816A (zh) * | 2011-12-01 | 2012-03-07 | 中科芯集成电路股份有限公司 | 一种视频会议智能前端系统 |
| US8886011B2 (en) * | 2012-12-07 | 2014-11-11 | Cisco Technology, Inc. | System and method for question detection based video segmentation, search and collaboration in a video processing environment |
| US9065971B2 (en) * | 2012-12-19 | 2015-06-23 | Microsoft Technology Licensing, Llc | Video and audio tagging for active speaker detection |
-
2012
- 2012-12-19 US US13/719,314 patent/US9065971B2/en active Active
-
2013
- 2013-12-19 CA CA2889706A patent/CA2889706C/en active Active
- 2013-12-19 KR KR1020157016315A patent/KR102110632B1/ko active Active
- 2013-12-19 EP EP13818933.7A patent/EP2912841B1/en active Active
- 2013-12-19 WO PCT/US2013/076671 patent/WO2014100466A2/en not_active Ceased
- 2013-12-19 MX MX2015008119A patent/MX352445B/es active IP Right Grant
- 2013-12-19 BR BR112015011758-9A patent/BR112015011758B1/pt active IP Right Grant
- 2013-12-19 CN CN201380066894.8A patent/CN104937926B/zh active Active
- 2013-12-19 AU AU2013361258A patent/AU2013361258B2/en not_active Ceased
- 2013-12-19 JP JP2015549731A patent/JP6321033B2/ja active Active
- 2013-12-19 RU RU2015123696A patent/RU2632469C2/ru active
Also Published As
| Publication number | Publication date |
|---|---|
| BR112015011758A2 (pt) | 2017-07-11 |
| MX2015008119A (es) | 2016-04-25 |
| JP6321033B2 (ja) | 2018-05-09 |
| WO2014100466A3 (en) | 2014-08-07 |
| WO2014100466A2 (en) | 2014-06-26 |
| EP2912841A2 (en) | 2015-09-02 |
| CN104937926A (zh) | 2015-09-23 |
| BR112015011758B1 (pt) | 2023-04-18 |
| EP2912841B1 (en) | 2020-10-28 |
| MX352445B (es) | 2017-11-24 |
| KR102110632B1 (ko) | 2020-05-13 |
| KR20150096419A (ko) | 2015-08-24 |
| CN104937926B (zh) | 2018-05-25 |
| CA2889706A1 (en) | 2014-06-26 |
| RU2015123696A (ru) | 2017-01-10 |
| US9065971B2 (en) | 2015-06-23 |
| US20140168352A1 (en) | 2014-06-19 |
| AU2013361258A1 (en) | 2015-05-14 |
| JP2016506670A (ja) | 2016-03-03 |
| RU2632469C2 (ru) | 2017-10-05 |
| AU2013361258B2 (en) | 2017-03-09 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CA2889706C (en) | Video and audio tagging for active speaker detection | |
| US11356488B2 (en) | Frame synchronous rendering of remote participant identities | |
| US9497412B1 (en) | Video conference audio/video verification | |
| US20110052136A1 (en) | Pattern-based monitoring of media synchronization | |
| US20100199187A1 (en) | Instant data sharing system and machine readable medium thereof | |
| US20150029301A1 (en) | Teleconference system and teleconference terminal | |
| KR20100028060A (ko) | 디스플레이 장치 검출 기법 | |
| US7808521B2 (en) | Multimedia conference recording and manipulation interface | |
| US10762913B2 (en) | Image-based techniques for audio content | |
| US10142583B1 (en) | Computing system with external speaker detection feature | |
| WO2021204139A1 (zh) | 视频显示方法、装置、设备和存储介质 | |
| US20130100152A1 (en) | Method and apparatus for processing image display | |
| CN108337535B (zh) | 客户端视频的转发方法、装置、设备和存储介质 | |
| US20190222898A1 (en) | Video playing method, device and storage | |
| CN113630650B (zh) | 基于音视频切换的数字电视播放方法、装置和计算机设备 | |
| JP2020058014A (ja) | 映像処理装置、ビデオ会議システム、映像処理方法、およびプログラム | |
| CN110662082A (zh) | 数据处理方法、装置、系统、移动终端及存储介质 | |
| CN109076251B (zh) | 远程会议传输 | |
| US9456180B2 (en) | Image processing apparatus, communication system, and computer program | |
| CN113141480A (zh) | 录屏方法、装置、设备及存储介质 | |
| US8943247B1 (en) | Media sink device input identification | |
| CN109831703B (zh) | 一种针对hdmi信号的运动补偿方法及装置 | |
| US20130007351A1 (en) | Information processor, information processing method, and computer program product | |
| TW202236845A (zh) | 視頻顯示方法、裝置、設備和儲存媒體 | |
| CN113489921A (zh) | 视频图像显示控制方法、设备及系统 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| EEER | Examination request |
Effective date: 20181128 |