CN112801068B - 一种视频多目标跟踪与分割系统和方法 - Google Patents
一种视频多目标跟踪与分割系统和方法 Download PDFInfo
- Publication number
- CN112801068B CN112801068B CN202110397488.6A CN202110397488A CN112801068B CN 112801068 B CN112801068 B CN 112801068B CN 202110397488 A CN202110397488 A CN 202110397488A CN 112801068 B CN112801068 B CN 112801068B
- Authority
- CN
- China
- Prior art keywords
- video
- target
- frame
- feature
- segmentation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 33
- 230000011218 segmentation Effects 0.000 claims abstract description 71
- 238000001514 detection method Methods 0.000 claims abstract description 33
- 238000009826 distribution Methods 0.000 claims abstract description 24
- 238000000605 extraction Methods 0.000 claims abstract description 22
- 238000012549 training Methods 0.000 claims description 36
- 230000010354 integration Effects 0.000 claims description 8
- 238000002372 labelling Methods 0.000 claims description 4
- 230000007246 mechanism Effects 0.000 claims description 4
- 230000006835 compression Effects 0.000 claims description 3
- 238000007906 compression Methods 0.000 claims description 3
- 238000005096 rolling process Methods 0.000 claims description 3
- 238000005070 sampling Methods 0.000 claims description 3
- 230000006837 decompression Effects 0.000 claims description 2
- 230000002441 reversible effect Effects 0.000 claims description 2
- 230000001131 transforming effect Effects 0.000 claims description 2
- 230000006870 function Effects 0.000 description 10
- 230000008569 process Effects 0.000 description 10
- 238000003860 storage Methods 0.000 description 10
- 238000004422 calculation algorithm Methods 0.000 description 5
- 230000009286 beneficial effect Effects 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 238000004590 computer program Methods 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 238000007476 Maximum Likelihood Methods 0.000 description 2
- 238000004364 calculation method Methods 0.000 description 2
- 239000002775 capsule Substances 0.000 description 2
- 238000013527 convolutional neural network Methods 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 230000004807 localization Effects 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000007704 transition Effects 0.000 description 2
- 230000004913 activation Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000003709 image segmentation Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000013139 quantization Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/20—Movements or behaviour, e.g. gesture recognition
- G06V40/23—Recognition of whole body movements, e.g. for sport training
- G06V40/25—Recognition of walking or running movements, e.g. gait recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/20—Analysis of motion
- G06T7/246—Analysis of motion using feature-based methods, e.g. the tracking of corners or segments
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/25—Determination of region of interest [ROI] or a volume of interest [VOI]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/26—Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
- G06V10/267—Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion by performing operations on regions, e.g. growing, shrinking or watersheds
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2207/00—Indexing scheme for image analysis or image enhancement
- G06T2207/10—Image acquisition modality
- G06T2207/10016—Video; Image sequence
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Data Mining & Analysis (AREA)
- General Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Evolutionary Computation (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Biophysics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Biology (AREA)
- Psychiatry (AREA)
- Social Psychology (AREA)
- Image Analysis (AREA)
Abstract
Description
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110397488.6A CN112801068B (zh) | 2021-04-14 | 2021-04-14 | 一种视频多目标跟踪与分割系统和方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110397488.6A CN112801068B (zh) | 2021-04-14 | 2021-04-14 | 一种视频多目标跟踪与分割系统和方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112801068A CN112801068A (zh) | 2021-05-14 |
CN112801068B true CN112801068B (zh) | 2021-07-16 |
Family
ID=75816994
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110397488.6A Active CN112801068B (zh) | 2021-04-14 | 2021-04-14 | 一种视频多目标跟踪与分割系统和方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112801068B (zh) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114494297B (zh) * | 2022-01-28 | 2022-12-06 | 杭州电子科技大学 | 处理多种先验知识的自适应视频目标分割方法 |
CN115063453B (zh) * | 2022-06-24 | 2023-08-29 | 南京农业大学 | 植物叶片气孔个体行为检测分析方法、系统及存储介质 |
CN115719368B (zh) * | 2022-11-29 | 2024-05-17 | 上海船舶运输科学研究所有限公司 | 一种多目标船舶跟踪方法及系统 |
CN117494921B (zh) * | 2023-12-29 | 2024-04-12 | 湖南工商大学 | 一种多目标类型的路径模型求解方法及装置 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9476983B2 (en) * | 2012-03-15 | 2016-10-25 | GM Global Technology Operations LLC | System and method for fusing radar/camera object data and LiDAR scan points |
CN109003267A (zh) * | 2017-08-09 | 2018-12-14 | 深圳科亚医疗科技有限公司 | 从3d图像自动检测目标对象的计算机实现方法和系统 |
CN110660080A (zh) * | 2019-09-11 | 2020-01-07 | 昆明理工大学 | 一种基于学习率调整融合多层卷积特征的多尺度目标跟踪方法 |
CN111985464A (zh) * | 2020-08-13 | 2020-11-24 | 山东大学 | 面向法院判决文书的多尺度学习的文字识别方法及系统 |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6621914B1 (en) * | 1999-10-22 | 2003-09-16 | Lockheed Martin Corporation | Method and software-implemented apparatus for detecting objects in multi-dimensional data |
CN103077534B (zh) * | 2012-12-31 | 2015-08-19 | 南京华图信息技术有限公司 | 时空多尺度运动目标检测方法 |
WO2016016033A1 (en) * | 2014-07-31 | 2016-02-04 | Thomson Licensing | Method and apparatus for interactive video segmentation |
CN108182388A (zh) * | 2017-12-14 | 2018-06-19 | 哈尔滨工业大学(威海) | 一种基于图像的运动目标跟踪方法 |
CN109886090B (zh) * | 2019-01-07 | 2020-12-04 | 北京大学 | 一种基于多时间尺度卷积神经网络的视频行人再识别方法 |
CN110705431B (zh) * | 2019-09-26 | 2022-03-15 | 中国人民解放军陆军炮兵防空兵学院 | 基于深度c3d特征的视频显著性区域检测方法及系统 |
-
2021
- 2021-04-14 CN CN202110397488.6A patent/CN112801068B/zh active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9476983B2 (en) * | 2012-03-15 | 2016-10-25 | GM Global Technology Operations LLC | System and method for fusing radar/camera object data and LiDAR scan points |
CN109003267A (zh) * | 2017-08-09 | 2018-12-14 | 深圳科亚医疗科技有限公司 | 从3d图像自动检测目标对象的计算机实现方法和系统 |
CN110660080A (zh) * | 2019-09-11 | 2020-01-07 | 昆明理工大学 | 一种基于学习率调整融合多层卷积特征的多尺度目标跟踪方法 |
CN111985464A (zh) * | 2020-08-13 | 2020-11-24 | 山东大学 | 面向法院判决文书的多尺度学习的文字识别方法及系统 |
Non-Patent Citations (1)
Title |
---|
基于深度学习的视频多目标跟踪算法研究;储琪;《中国博士学位论文全文数据库 信息科技辑》;20190815;第I138-43页 * |
Also Published As
Publication number | Publication date |
---|---|
CN112801068A (zh) | 2021-05-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112801068B (zh) | 一种视频多目标跟踪与分割系统和方法 | |
Oh et al. | Fast video object segmentation by reference-guided mask propagation | |
Abbas et al. | A comprehensive review of recent advances on deep vision systems | |
Tu et al. | Action-stage emphasized spatiotemporal VLAD for video action recognition | |
Shou et al. | Cdc: Convolutional-de-convolutional networks for precise temporal action localization in untrimmed videos | |
Ge et al. | An attention mechanism based convolutional LSTM network for video action recognition | |
Khan et al. | Automatic shadow detection and removal from a single image | |
Vahdani et al. | Deep learning-based action detection in untrimmed videos: A survey | |
Li et al. | Spatio-temporal unity networking for video anomaly detection | |
Lorre et al. | Temporal contrastive pretraining for video action recognition | |
Li et al. | Transformer-based visual segmentation: A survey | |
CN110889375B (zh) | 用于行为识别的隐双流协作学习网络及方法 | |
Khan et al. | Revisiting crowd counting: State-of-the-art, trends, and future perspectives | |
Yu et al. | Joint spatial-temporal attention for action recognition | |
CN113822368A (zh) | 一种基于无锚的增量式目标检测方法 | |
Naeem et al. | T-VLAD: Temporal vector of locally aggregated descriptor for multiview human action recognition | |
Zhou et al. | Transformer-based multi-scale feature integration network for video saliency prediction | |
US20230154139A1 (en) | Systems and methods for contrastive pretraining with video tracking supervision | |
Patel et al. | A study on video semantics; overview, challenges, and applications | |
Katircioglu et al. | Self-supervised human detection and segmentation via background inpainting | |
Taghinezhad et al. | A new unsupervised video anomaly detection using multi-scale feature memorization and multipath temporal information prediction | |
Roy et al. | Sparsity-inducing dictionaries for effective action classification | |
Wang et al. | Non-local attention association scheme for online multi-object tracking | |
Singh et al. | Attention-guided generator with dual discriminator GAN for real-time video anomaly detection | |
Sun et al. | Weak supervised learning based abnormal behavior detection |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CB03 | Change of inventor or designer information |
Inventor after: Jiang Zhifang Inventor after: Fang Tipin Inventor after: Qin Zheyun Inventor after: Lu Xiankai Inventor after: Ding Dongrui Inventor before: Fang Tipin Inventor before: Qin Zheyun Inventor before: Lu Xiankai Inventor before: Ding Dongrui |
|
CB03 | Change of inventor or designer information | ||
TR01 | Transfer of patent right |
Effective date of registration: 20240207 Address after: Room 1609, 16th Floor, Building 2, Xinsheng Building, Northwest Corner of Xinluo Street and Yingxiu Road Intersection, Shunhua Road Street, Jinan Area, China (Shandong) Pilot Free Trade Zone, Jinan City, Shandong Province, 250014 Patentee after: Lingxin Huizhi (Shandong) Intelligent Technology Co.,Ltd. Country or region after: China Address before: Room 156-8, No.5 Lingbin Road, Dangan Town, Xiangzhou District, Zhuhai City, Guangdong Province 519000 Patentee before: Guangdong Zhongju Artificial Intelligence Technology Co.,Ltd. Country or region before: China |
|
TR01 | Transfer of patent right |