CN112016406B - 一种基于全卷积网络的视频关键帧提取方法 - Google Patents
一种基于全卷积网络的视频关键帧提取方法 Download PDFInfo
- Publication number
- CN112016406B CN112016406B CN202010789819.6A CN202010789819A CN112016406B CN 112016406 B CN112016406 B CN 112016406B CN 202010789819 A CN202010789819 A CN 202010789819A CN 112016406 B CN112016406 B CN 112016406B
- Authority
- CN
- China
- Prior art keywords
- network
- feature vector
- video
- convolution
- feature
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000605 extraction Methods 0.000 title claims abstract description 28
- 239000013598 vector Substances 0.000 claims abstract description 131
- 238000000034 method Methods 0.000 claims abstract description 44
- 230000002776 aggregation Effects 0.000 claims abstract description 11
- 238000004220 aggregation Methods 0.000 claims abstract description 11
- 230000006870 function Effects 0.000 claims description 44
- 230000009467 reduction Effects 0.000 claims description 14
- 238000012549 training Methods 0.000 claims description 14
- 238000004590 computer program Methods 0.000 claims description 12
- 230000015654 memory Effects 0.000 claims description 9
- 230000004913 activation Effects 0.000 claims description 5
- 238000011176 pooling Methods 0.000 claims description 5
- 230000008569 process Effects 0.000 description 13
- 238000010586 diagram Methods 0.000 description 9
- 230000007246 mechanism Effects 0.000 description 9
- 230000000007 visual effect Effects 0.000 description 8
- 238000012545 processing Methods 0.000 description 6
- 238000004422 calculation algorithm Methods 0.000 description 5
- 239000000284 extract Substances 0.000 description 5
- 238000012360 testing method Methods 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 2
- 230000007547 defect Effects 0.000 description 2
- 238000006073 displacement reaction Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 239000011800 void material Substances 0.000 description 2
- 238000010420 art technique Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000008092 positive effect Effects 0.000 description 1
- 230000006403 short-term memory Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/46—Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2415—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/047—Probabilistic or stochastic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/44—Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- Evolutionary Computation (AREA)
- Biomedical Technology (AREA)
- Mathematical Physics (AREA)
- General Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Biophysics (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Software Systems (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Probability & Statistics with Applications (AREA)
- Evolutionary Biology (AREA)
- Multimedia (AREA)
- Image Analysis (AREA)
Abstract
Description
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010789819.6A CN112016406B (zh) | 2020-08-07 | 2020-08-07 | 一种基于全卷积网络的视频关键帧提取方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010789819.6A CN112016406B (zh) | 2020-08-07 | 2020-08-07 | 一种基于全卷积网络的视频关键帧提取方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112016406A CN112016406A (zh) | 2020-12-01 |
CN112016406B true CN112016406B (zh) | 2022-12-02 |
Family
ID=73499297
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010789819.6A Active CN112016406B (zh) | 2020-08-07 | 2020-08-07 | 一种基于全卷积网络的视频关键帧提取方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112016406B (zh) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112906609B (zh) * | 2021-03-05 | 2022-06-07 | 清华大学 | 基于双路交叉注意力网络的视频重要区域预测方法和装置 |
CN113076849A (zh) * | 2021-03-29 | 2021-07-06 | 宁波方太厨具有限公司 | 基于动作识别的油烟机控制方法、系统、设备及存储介质 |
CN113221951B (zh) * | 2021-04-13 | 2023-02-17 | 天津大学 | 一种基于时域注意力池化网络的动图分类方法及装置 |
CN113627285A (zh) * | 2021-07-26 | 2021-11-09 | 长沙理工大学 | 视频取证方法、系统和介质 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107808389A (zh) * | 2017-10-24 | 2018-03-16 | 上海交通大学 | 基于深度学习的无监督视频分割方法 |
CN110933518A (zh) * | 2019-12-11 | 2020-03-27 | 浙江大学 | 一种利用卷积多层注意力网络机制生成面向查询的视频摘要的方法 |
CN111460979A (zh) * | 2020-03-30 | 2020-07-28 | 上海大学 | 一种基于多层时空框架的关键镜头视频摘要方法 |
-
2020
- 2020-08-07 CN CN202010789819.6A patent/CN112016406B/zh active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107808389A (zh) * | 2017-10-24 | 2018-03-16 | 上海交通大学 | 基于深度学习的无监督视频分割方法 |
CN110933518A (zh) * | 2019-12-11 | 2020-03-27 | 浙江大学 | 一种利用卷积多层注意力网络机制生成面向查询的视频摘要的方法 |
CN111460979A (zh) * | 2020-03-30 | 2020-07-28 | 上海大学 | 一种基于多层时空框架的关键镜头视频摘要方法 |
Non-Patent Citations (4)
Title |
---|
CBAM: Convolutional Block Attention Module;Sanghyun Woo et al.;《arXiv》;20180718;第1-13页 * |
CCNet: Criss-Cross Attention for Semantic Segmentation;Zilong Huang et al.;《arXiv》;20200709;第1-17页 * |
Deep Interest Evolution Network for Click-Through Rate Prediction;Xiaoqiang Zhu et al.;《 Proceedings of the AAAI Conference on Artificial Intelligence》;20190731;第5941- 5948页 * |
Video Summarization Using Fully Convolutional Sequence Networks;Mrigank Rochan et al.;《European Conference on Computer Vision》;20181206;第358–374页 * |
Also Published As
Publication number | Publication date |
---|---|
CN112016406A (zh) | 2020-12-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112016406B (zh) | 一种基于全卷积网络的视频关键帧提取方法 | |
CN111581510B (zh) | 分享内容处理方法、装置、计算机设备和存储介质 | |
CN115002559B (zh) | 基于门控多头位置注意力机制的视频摘要算法及系统 | |
CN113392288A (zh) | 视觉问答及其模型训练的方法、装置、设备及存储介质 | |
CN111783712A (zh) | 一种视频处理方法、装置、设备及介质 | |
CN111062395A (zh) | 一种实时的视频语义分割方法 | |
CN113066089B (zh) | 一种基于注意力引导机制的实时图像语义分割方法 | |
CN112070040A (zh) | 一种用于视频字幕的文本行检测方法 | |
CN114332094A (zh) | 基于轻量级多尺度信息融合网络的语义分割方法及装置 | |
CN116958324A (zh) | 图像生成模型的训练方法、装置、设备及存储介质 | |
CN113420179B (zh) | 基于时序高斯混合空洞卷积的语义重构视频描述方法 | |
CN113011320B (zh) | 视频处理方法、装置、电子设备及存储介质 | |
CN113079420A (zh) | 视频生成方法、装置、电子设备及计算机可读存储介质 | |
CN113761282B (zh) | 视频查重方法、装置、电子设备以及存储介质 | |
CN110942463B (zh) | 一种基于生成对抗网络的视频目标分割方法 | |
CN116758449A (zh) | 一种基于深度学习的视频显著目标检测方法及系统 | |
CN116363361A (zh) | 基于实时语义分割网络的自动驾驶方法 | |
CN113962332B (zh) | 基于自优化融合反馈的显著目标识别方法 | |
CN113222016B (zh) | 一种基于高层和低层特征交叉增强的变化检测方法及装置 | |
CN110969187B (zh) | 一种图谱迁移的语义分析方法 | |
CN116992947A (zh) | 模型训练方法、视频查询方法和装置 | |
CN114979801A (zh) | 基于双向卷积长短期记忆网络的动态视频摘要算法及系统 | |
Li et al. | MVVA-Net: a Video Aesthetic Quality Assessment Network with Cognitive Fusion of Multi-type Feature–Based Strong Generalization | |
CN112364690A (zh) | 基于多尺度密集时序池化的视频时空信息表征方法 | |
CN117710777B (zh) | 模型训练方法、关键帧抽取方法及装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20231129 Address after: Room 0601, 6th Floor, Building 4, Section 8, Jinci Road, Xiayuan Street, Wanbailin District, Taiyuan City, Shanxi Province, 030000 Patentee after: Judian Shanxi Technology Co.,Ltd. Address before: 509 Kangrui Times Square, Keyuan Business Building, 39 Huarong Road, Gaofeng Community, Dalang Street, Longhua District, Shenzhen, Guangdong Province, 518000 Patentee before: Shenzhen lizhuan Technology Transfer Center Co.,Ltd. Effective date of registration: 20231129 Address after: 509 Kangrui Times Square, Keyuan Business Building, 39 Huarong Road, Gaofeng Community, Dalang Street, Longhua District, Shenzhen, Guangdong Province, 518000 Patentee after: Shenzhen lizhuan Technology Transfer Center Co.,Ltd. Address before: 266061 Songling Road, Laoshan District, Qingdao, Shandong Province, No. 99 Patentee before: QINGDAO University OF SCIENCE AND TECHNOLOGY |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20240409 Address after: 101300 room 818-030, building 1, yard 3, Jinhang East Road, Shunyi District, Beijing (Tianzhu comprehensive free trade zone) Patentee after: Interstellar Digital Technology Co.,Ltd. Country or region after: China Address before: Room 0601, 6th Floor, Building 4, Section 8, Jinci Road, Xiayuan Street, Wanbailin District, Taiyuan City, Shanxi Province, 030000 Patentee before: Judian Shanxi Technology Co.,Ltd. Country or region before: China |
|
TR01 | Transfer of patent right |