CN111477244A - User-oriented method for enhancing customized sports event commentary - Google Patents
- Publication number
- CN111477244A (application number CN202010284204.8A)
- Authority
- CN
- China
- Prior art keywords
- video
- time frame
- user
- commentary
- sports event
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
- G10L21/028—Voice signal separating using properties of sound source
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
- G10L17/06—Decision making techniques; Pattern matching strategies
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
- G10L25/30—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
- G10L25/57—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for processing of video signals
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Human Computer Interaction (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Business, Economics & Management (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Game Theory and Decision Science (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Image Analysis (AREA)
Abstract
Description
Commentator ID | Commentator identity vector | Block this commentator? |
---|---|---|
001 | <1,1,1> | 0 |
002 | <2,5,6> | 1 |
003 | <3,5,1> | 0 |
… | … | … |
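
The table above can be read as a small lookup structure: each commentator has an ID, an identity vector, and a block flag. The following is a minimal sketch of how such a table might be queried, not the patented method. It assumes that a flag of 1 means "blocked", that a voice segment is matched to the nearest identity vector by Euclidean distance, and that a match farther than a threshold is treated as an unknown speaker. The names `Commentator`, `nearest_commentator`, `should_mute`, and `max_distance` are hypothetical.

```python
from dataclasses import dataclass
from math import dist
from typing import Optional, Sequence, Tuple


@dataclass(frozen=True)
class Commentator:
    commentator_id: str
    identity_vector: Tuple[float, ...]
    blocked: bool  # assumption: flag 1 in the table means "blocked"


# The three explicit rows of the table; the elided rows are not filled in.
COMMENTATORS = [
    Commentator("001", (1, 1, 1), False),
    Commentator("002", (2, 5, 6), True),
    Commentator("003", (3, 5, 1), False),
]


def nearest_commentator(
    vector: Sequence[float],
    table: Sequence[Commentator] = COMMENTATORS,
    max_distance: float = 1.0,  # hypothetical threshold, not from the source
) -> Optional[Commentator]:
    """Return the closest table entry, or None if none is within max_distance."""
    best = min(table, key=lambda c: dist(c.identity_vector, vector))
    return best if dist(best.identity_vector, vector) <= max_distance else None


def should_mute(vector: Sequence[float]) -> bool:
    """Mute a segment only when it matches a commentator flagged as blocked."""
    match = nearest_commentator(vector)
    return match is not None and match.blocked
```

In this sketch, a segment whose vector lies close to `<2,5,6>` would be muted, while a segment that matches an unblocked commentator, or matches nobody within the threshold, is left untouched.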
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010284204.8A CN111477244B (zh) | 2020-04-13 | 2020-04-13 | User-oriented method for enhancing customized sports event commentary |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010284204.8A CN111477244B (zh) | 2020-04-13 | 2020-04-13 | User-oriented method for enhancing customized sports event commentary |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111477244A (zh) | 2020-07-31 |
CN111477244B (zh) | 2023-09-22 |
Family
ID=71752182
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010284204.8A Active CN111477244B (zh) | User-oriented method for enhancing customized sports event commentary | 2020-04-13 | 2020-04-13 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111477244B (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
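CN112216306A (zh) * | 2020-09-25 | 2021-01-12 | Foshan Power Supply Bureau of Guangdong Power Grid Co., Ltd. | Voiceprint-based call management method and apparatus, electronic device, and storage medium |
CN114491143A (zh) * | 2022-02-12 | 2022-05-13 | Beijing Honeycomb Century Technology Co., Ltd. | Audio commentary search method, apparatus, device, and medium for live events |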
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090119729A1 (en) * | 2002-12-10 | 2009-05-07 | Onlive, Inc. | Method for multicasting views of real-time streaming interactive video |
US20090118017A1 (en) * | 2002-12-10 | 2009-05-07 | Onlive, Inc. | Hosting and broadcasting virtual events using streaming interactive video |
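CN102163397A (zh) * | 2011-05-27 | 2011-08-24 | Dalian Jiaotong University | Self-service multimedia intelligent commentary system |
CN104135667A (zh) * | 2014-06-10 | 2014-11-05 | Tencent Technology (Shenzhen) Co., Ltd. | Method, terminal device, and system for synchronizing remote video commentary |
CN105898605A (zh) * | 2016-04-29 | 2016-08-24 | Leshi Holding (Beijing) Co., Ltd. | Method and apparatus for enabling commentary by ordinary users |
CN107423274A (zh) * | 2017-06-07 | 2017-12-01 | Beijing Baidu Netcom Science and Technology Co., Ltd. | Artificial-intelligence-based method, apparatus, and storage medium for generating match commentary content |
CN110971964A (zh) * | 2019-12-12 | 2020-04-07 | Tencent Technology (Shenzhen) Co., Ltd. | Intelligent commentary generation and playback method, apparatus, device, and storage medium |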
Also Published As
Publication number | Publication date |
---|---|
CN111477244B (zh) | 2023-09-22 |
Similar Documents
Publication | Title |
---|---|
Chaudhuri et al. | Ava-speech: A densely labeled dataset of speech activity in movies |
CN111128214B (zh) | Audio noise reduction method and apparatus, electronic device, and medium |
US20100005485A1 (en) | Annotation of video footage and personalised video generation |
Vinciarelli | Speakers role recognition in multiparty audio recordings using social network analysis and duration distribution modeling |
JP2004258659A (ja) | Method and system for extracting highlights from audio signals of sports events |
CN106991163A (zh) | Song recommendation method based on the singer's vocal characteristics |
CN108962229B (zh) | Single-channel, unsupervised method for extracting a target speaker's speech |
JP5843401B2 (ja) | Content information providing apparatus, content information providing system, content information providing method, and content information providing program |
CN102073635A (zh) | Program endpoint time detection apparatus and method, and program information retrieval system |
CN111477244B (zh) | User-oriented method for enhancing customized sports event commentary |
CN109525865B (zh) | Blockchain-based audience rating monitoring method and computer-readable storage medium |
CN109271550A (zh) | Deep-learning-based personalized music classification and recommendation method |
Brown et al. | Playing a part: Speaker verification at the movies |
CN102073636A (zh) | Program climax retrieval method and system |
WO2019233361A1 (zh) | Method and device for adjusting the volume of music |
CN110580914A (zh) | Audio processing method, device, and apparatus with a storage function |
CN108629047B (zh) | Song list generation method and terminal device |
CN113707183B (zh) | Method and apparatus for processing audio in video |
CN112632318A (zh) | Audio recommendation method, apparatus, system, and storage medium |
Schaffer et al. | Music separation enhancement with generative modeling |
KR100863122B1 (ko) | Multimedia video indexing method using audio signal characteristics |
Pham et al. | An audio-based deep learning framework for BBC television programme classification |
Nwe et al. | Broadcast news segmentation by audio type analysis |
Jani et al. | Experimental investigation of transitions for mixed speech and music playlist generation |
CN111986696B (zh) | Method for efficiently processing song volume equalization |
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
CB03 | Change of inventor or designer information | Inventors after: Chen Xingguo, Qiao Yiming, Liu Wei, Zhu Jie, Zhang Peng. Inventors before: Chen Xingguo, Zhang Peng, Liu Wei, Zhu Jie. |
CB02 | Change of applicant information | Address after: 210000, 66 new model street, Gulou District, Jiangsu, Nanjing. Applicant after: Nanjing University of Posts and Telecommunications. Address before: Yuen Road Ya Dong Qixia District of Nanjing City, Jiangsu province 210000 New District No. 9. Applicant before: Nanjing University of Posts and Telecommunications. |
GR01 | Patent grant | |