CN113590874A - 一种视频定位方法及装置、模型训练方法及设备 - Google Patents
一种视频定位方法及装置、模型训练方法及设备 Download PDFInfo
- Publication number
- CN113590874A CN113590874A CN202111139903.4A CN202111139903A CN113590874A CN 113590874 A CN113590874 A CN 113590874A CN 202111139903 A CN202111139903 A CN 202111139903A CN 113590874 A CN113590874 A CN 113590874A
- Authority
- CN
- China
- Prior art keywords
- video
- modality
- attention
- word
- segment
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 74
- 238000012549 training Methods 0.000 title claims abstract description 43
- 238000011176 pooling Methods 0.000 claims description 25
- 238000000605 extraction Methods 0.000 claims description 24
- 239000000126 substance Substances 0.000 claims description 24
- 230000004927 fusion Effects 0.000 claims description 23
- 238000012545 processing Methods 0.000 claims description 19
- 230000006870 function Effects 0.000 claims description 17
- 238000000354 decomposition reaction Methods 0.000 claims description 12
- 238000013527 convolutional neural network Methods 0.000 claims description 8
- 239000012634 fragment Substances 0.000 claims description 7
- 238000004364 calculation method Methods 0.000 claims description 6
- 230000004807 localization Effects 0.000 claims description 6
- 238000003058 natural language processing Methods 0.000 claims description 5
- 239000011541 reaction mixture Substances 0.000 claims description 5
- NAWXUBYGYWOOIX-SFHVURJKSA-N (2s)-2-[[4-[2-(2,4-diaminoquinazolin-6-yl)ethyl]benzoyl]amino]-4-methylidenepentanedioic acid Chemical compound C1=CC2=NC(N)=NC(N)=C2C=C1CCC1=CC=C(C(=O)N[C@@H](CC(=C)C(O)=O)C(O)=O)C=C1 NAWXUBYGYWOOIX-SFHVURJKSA-N 0.000 claims description 4
- 230000004913 activation Effects 0.000 claims description 4
- 238000004590 computer program Methods 0.000 claims description 4
- 238000010606 normalization Methods 0.000 claims description 4
- 230000003993 interaction Effects 0.000 abstract description 6
- 230000000007 visual effect Effects 0.000 description 17
- 230000008569 process Effects 0.000 description 13
- 238000010586 diagram Methods 0.000 description 6
- 230000004931 aggregating effect Effects 0.000 description 4
- 230000009286 beneficial effect Effects 0.000 description 4
- 230000005540 biological transmission Effects 0.000 description 4
- 238000002372 labelling Methods 0.000 description 4
- 230000002776 aggregation Effects 0.000 description 2
- 238000004220 aggregation Methods 0.000 description 2
- 230000002457 bidirectional effect Effects 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- 238000011840 criminal investigation Methods 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000003780 insertion Methods 0.000 description 1
- 230000037431 insertion Effects 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 239000013598 vector Substances 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/73—Querying
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/73—Querying
- G06F16/732—Query formulation
- G06F16/7343—Query language or query format
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/70—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F16/78—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/7867—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, title and artist information, manually generated time, location and usage information, user ratings
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Multimedia (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Mathematical Physics (AREA)
- Library & Information Science (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (9)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111139903.4A CN113590874B (zh) | 2021-09-28 | 2021-09-28 | 一种视频定位方法及装置、模型训练方法及设备 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202111139903.4A CN113590874B (zh) | 2021-09-28 | 2021-09-28 | 一种视频定位方法及装置、模型训练方法及设备 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113590874A true CN113590874A (zh) | 2021-11-02 |
CN113590874B CN113590874B (zh) | 2022-02-11 |
Family
ID=78242204
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202111139903.4A Active CN113590874B (zh) | 2021-09-28 | 2021-09-28 | 一种视频定位方法及装置、模型训练方法及设备 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113590874B (zh) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116226443A (zh) * | 2023-05-11 | 2023-06-06 | 山东建筑大学 | 基于大规模视频语料库的弱监督视频片段定位方法及系统 |
CN116385946A (zh) * | 2023-06-06 | 2023-07-04 | 山东大学 | 面向视频的目标片段定位方法、系统、存储介质及设备 |
CN116843727A (zh) * | 2023-09-01 | 2023-10-03 | 广东师大维智信息科技有限公司 | 一种跨视频源的目标交接定位方法及系统 |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108932304A (zh) * | 2018-06-12 | 2018-12-04 | 山东大学 | 基于跨模态的视频时刻定位方法、系统及存储介质 |
CN109344288A (zh) * | 2018-09-19 | 2019-02-15 | 电子科技大学 | 一种基于多模态特征结合多层注意力机制的结合视频描述方法 |
CN109905772A (zh) * | 2019-03-12 | 2019-06-18 | 腾讯科技(深圳)有限公司 | 视频片段查询方法、装置、计算机设备及存储介质 |
CN110019849A (zh) * | 2018-05-23 | 2019-07-16 | 山东大学 | 一种基于注意力机制的视频关注时刻检索方法及装置 |
US20200302294A1 (en) * | 2019-03-22 | 2020-09-24 | Nec Laboratories America, Inc. | Efficient and fine-grained video retrieval |
CN111930999A (zh) * | 2020-07-21 | 2020-11-13 | 山东省人工智能研究院 | 逐帧跨模态相似度关联实施文本查询定位视频片段方法 |
CN112650886A (zh) * | 2020-12-28 | 2021-04-13 | 电子科技大学 | 基于跨模态动态卷积网络的跨模态视频时刻检索方法 |
CN112685597A (zh) * | 2021-03-12 | 2021-04-20 | 杭州一知智能科技有限公司 | 一种基于擦除机制的弱监督视频片段检索方法和系统 |
CN112989120A (zh) * | 2021-05-13 | 2021-06-18 | 广东众聚人工智能科技有限公司 | 一种视频片段查询系统和视频片段查询方法 |
-
2021
- 2021-09-28 CN CN202111139903.4A patent/CN113590874B/zh active Active
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110019849A (zh) * | 2018-05-23 | 2019-07-16 | 山东大学 | 一种基于注意力机制的视频关注时刻检索方法及装置 |
CN108932304A (zh) * | 2018-06-12 | 2018-12-04 | 山东大学 | 基于跨模态的视频时刻定位方法、系统及存储介质 |
CN109344288A (zh) * | 2018-09-19 | 2019-02-15 | 电子科技大学 | 一种基于多模态特征结合多层注意力机制的结合视频描述方法 |
CN109905772A (zh) * | 2019-03-12 | 2019-06-18 | 腾讯科技(深圳)有限公司 | 视频片段查询方法、装置、计算机设备及存储介质 |
US20200302294A1 (en) * | 2019-03-22 | 2020-09-24 | Nec Laboratories America, Inc. | Efficient and fine-grained video retrieval |
CN111930999A (zh) * | 2020-07-21 | 2020-11-13 | 山东省人工智能研究院 | 逐帧跨模态相似度关联实施文本查询定位视频片段方法 |
CN112650886A (zh) * | 2020-12-28 | 2021-04-13 | 电子科技大学 | 基于跨模态动态卷积网络的跨模态视频时刻检索方法 |
CN112685597A (zh) * | 2021-03-12 | 2021-04-20 | 杭州一知智能科技有限公司 | 一种基于擦除机制的弱监督视频片段检索方法和系统 |
CN112989120A (zh) * | 2021-05-13 | 2021-06-18 | 广东众聚人工智能科技有限公司 | 一种视频片段查询系统和视频片段查询方法 |
Non-Patent Citations (2)
Title |
---|
YUPENG HU ET AL.: "Video Moment Localization via Deep Cross-Modal Hashing", 《IEEE TRANSACTIONS ON IMAGE PROCESSING》 * |
王迎新: "基于注意力机制的视频哈希检索方法研究", 《万方数据库》 * |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116226443A (zh) * | 2023-05-11 | 2023-06-06 | 山东建筑大学 | 基于大规模视频语料库的弱监督视频片段定位方法及系统 |
CN116226443B (zh) * | 2023-05-11 | 2023-07-21 | 山东建筑大学 | 基于大规模视频语料库的弱监督视频片段定位方法及系统 |
CN116385946A (zh) * | 2023-06-06 | 2023-07-04 | 山东大学 | 面向视频的目标片段定位方法、系统、存储介质及设备 |
CN116385946B (zh) * | 2023-06-06 | 2023-08-29 | 山东大学 | 面向视频的目标片段定位方法、系统、存储介质及设备 |
CN116843727A (zh) * | 2023-09-01 | 2023-10-03 | 广东师大维智信息科技有限公司 | 一种跨视频源的目标交接定位方法及系统 |
CN116843727B (zh) * | 2023-09-01 | 2023-11-24 | 广东师大维智信息科技有限公司 | 一种跨视频源的目标交接定位方法及系统 |
Also Published As
Publication number | Publication date |
---|---|
CN113590874B (zh) | 2022-02-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110162593B (zh) | 一种搜索结果处理、相似度模型训练方法及装置 | |
CN113590874B (zh) | 一种视频定位方法及装置、模型训练方法及设备 | |
CN107526799B (zh) | 一种基于深度学习的知识图谱构建方法 | |
CN111797893B (zh) | 一种神经网络的训练方法、图像分类系统及相关设备 | |
CN112182166B (zh) | 一种文本匹配方法、装置、电子设备及存储介质 | |
EP3968179A1 (en) | Place recognition method and apparatus, model training method and apparatus for place recognition, and electronic device | |
CN110852368A (zh) | 全局与局部特征嵌入及图文融合的情感分析方法与系统 | |
CN110851641B (zh) | 跨模态检索方法、装置和可读存储介质 | |
CN109271539B (zh) | 一种基于深度学习的图像自动标注方法及装置 | |
CN111159485B (zh) | 尾实体链接方法、装置、服务器及存储介质 | |
US20150178321A1 (en) | Image-based 3d model search and retrieval | |
WO2020238353A1 (zh) | 数据处理方法和装置、存储介质及电子装置 | |
CN113344206A (zh) | 融合通道与关系特征学习的知识蒸馏方法、装置及设备 | |
CN113836992B (zh) | 识别标签的方法、训练标签识别模型的方法、装置及设备 | |
CN112819023A (zh) | 样本集的获取方法、装置、计算机设备和存储介质 | |
CN112069884A (zh) | 一种暴力视频分类方法、系统和存储介质 | |
CN112925904B (zh) | 一种基于Tucker分解的轻量级文本分类方法 | |
CN114550053A (zh) | 一种交通事故定责方法、装置、计算机设备及存储介质 | |
CN111783903A (zh) | 文本处理方法、文本模型的处理方法及装置、计算机设备 | |
CN113569118B (zh) | 自媒体推送方法、装置、计算机设备及存储介质 | |
CN114782752A (zh) | 基于自训练的小样本图像集成分类方法及装置 | |
CN117033609B (zh) | 文本视觉问答方法、装置、计算机设备和存储介质 | |
CN112749556B (zh) | 多语言模型的训练方法和装置、存储介质和电子设备 | |
CN112613451A (zh) | 一种跨模态文本图片检索模型的建模方法 | |
CN116977701A (zh) | 视频分类模型训练的方法、视频分类的方法和装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CP02 | Change in the address of a patent holder | ||
CP02 | Change in the address of a patent holder |
Address after: Room 1409, Floor 14, Building 1, High tech Zone Entrepreneurship Center, No. 177, Gaoxin 6th Road, Rizhao, Shandong 276801 Patentee after: Shandong Liju Robot Technology Co.,Ltd. Address before: 276808 No.99, Yuquan 2nd Road, antonwei street, Lanshan District, Rizhao City, Shandong Province Patentee before: Shandong Liju Robot Technology Co.,Ltd. |
|
CB03 | Change of inventor or designer information | ||
CB03 | Change of inventor or designer information |
Inventor after: Xie Chihao Inventor after: Fang Tipin Inventor after: Teng Juanya Inventor after: Lu Xiankai Inventor after: Yang Guangyuan Inventor before: Fang Tipin Inventor before: Teng Juanya Inventor before: Lu Xiankai Inventor before: Yang Guangyuan |