CN116708725B - Low-bandwidth crowd scene security monitoring method and system based on semantic encoding and decoding - Google Patents
- Publication number
- CN116708725B (application number CN202310980716.1A)
- Authority
- CN
- China
- Prior art keywords
- target object
- frame
- monitoring video
- sketch
- semantic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Concepts
ID | Concept | Category | Score | Sections | Count |
---|---|---|---|---|---|
238000012544 | monitoring process | Methods | 0.000 | title, claims, abstract, description | 227 |
238000000034 | method | Methods | 0.000 | title, claims, abstract, description | 56 |
238000012545 | processing | Methods | 0.000 | claims, abstract, description | 76 |
230000006399 | behavior | Effects | 0.000 | claims, description | 86 |
230000011218 | segmentation | Effects | 0.000 | claims, description | 30 |
230000004927 | fusion | Effects | 0.000 | claims, description | 23 |
238000000605 | extraction | Methods | 0.000 | claims, description | 19 |
238000013136 | deep learning model | Methods | 0.000 | claims, description | 10 |
230000000877 | morphologic effect | Effects | 0.000 | claims, description | 6 |
230000004660 | morphological change | Effects | 0.000 | claims, description | 6 |
230000005540 | biological transmission | Effects | 0.000 | abstract, description | 9 |
230000008569 | process | Effects | 0.000 | abstract, description | 7 |
238000010586 | diagram | Methods | 0.000 | description | 18 |
238000004590 | computer program | Methods | 0.000 | description | 7 |
230000009471 | action | Effects | 0.000 | description | 6 |
238000005516 | engineering process | Methods | 0.000 | description | 6 |
230000006870 | function | Effects | 0.000 | description | 6 |
238000004458 | analytical method | Methods | 0.000 | description | 4 |
238000001514 | detection method | Methods | 0.000 | description | 4 |
239000000284 | extract | Substances | 0.000 | description | 4 |
230000007246 | mechanism | Effects | 0.000 | description | 4 |
238000013473 | artificial intelligence | Methods | 0.000 | description | 2 |
238000004422 | calculation algorithm | Methods | 0.000 | description | 2 |
238000013507 | mapping | Methods | 0.000 | description | 2 |
238000012986 | modification | Methods | 0.000 | description | 2 |
230000004048 | modification | Effects | 0.000 | description | 2 |
239000013598 | vector | Substances | 0.000 | description | 2 |
206010000117 | Abnormal behaviour | Diseases | 0.000 | description | 1 |
230000002159 | abnormal effect | Effects | 0.000 | description | 1 |
230000004075 | alteration | Effects | 0.000 | description | 1 |
238000013528 | artificial neural network | Methods | 0.000 | description | 1 |
238000004364 | calculation method | Methods | 0.000 | description | 1 |
238000006243 | chemical reaction | Methods | 0.000 | description | 1 |
239000003086 | colorant | Substances | 0.000 | description | 1 |
230000006835 | compression | Effects | 0.000 | description | 1 |
238000007906 | compression | Methods | 0.000 | description | 1 |
238000013527 | convolutional neural network | Methods | 0.000 | description | 1 |
125000004122 | cyclic group | Chemical group | 0.000 | description | 1 |
238000013500 | data storage | Methods | 0.000 | description | 1 |
238000013135 | deep learning | Methods | 0.000 | description | 1 |
238000011161 | development | Methods | 0.000 | description | 1 |
230000018109 | developmental process | Effects | 0.000 | description | 1 |
238000003708 | edge detection | Methods | 0.000 | description | 1 |
230000000694 | effects | Effects | 0.000 | description | 1 |
230000007774 | longterm | Effects | 0.000 | description | 1 |
238000004519 | manufacturing process | Methods | 0.000 | description | 1 |
230000003287 | optical effect | Effects | 0.000 | description | 1 |
230000000750 | progressive effect | Effects | 0.000 | description | 1 |
230000000306 | recurrent effect | Effects | 0.000 | description | 1 |
230000002441 | reversible effect | Effects | 0.000 | description | 1 |
230000002123 | temporal effect | Effects | 0.000 | description | 1 |
238000012549 | training | Methods | 0.000 | description | 1 |
238000013519 | translation | Methods | 0.000 | description | 1 |
230000000007 | visual effect | Effects | 0.000 | description | 1 |
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/18—Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
- G06N3/0455—Auto-encoder networks; Encoder-decoder networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/0475—Generative networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/80—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
- G06V10/806—Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of extracted features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/82—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/41—Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/49—Segmenting video sequences, i.e. computational techniques such as parsing or cutting the sequence, low-level clustering or determining units such as shots or scenes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/50—Context or environment of the image
- G06V20/52—Surveillance or monitoring of activities, e.g. for recognising suspicious objects
- G06V20/53—Recognition of crowd images, e.g. recognition of crowd congestion
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/42—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D30/00—Reducing energy consumption in communication networks
- Y02D30/70—Reducing energy consumption in communication networks in wireless communication networks
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Evolutionary Computation (AREA)
- Software Systems (AREA)
- Computing Systems (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Biophysics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Life Sciences & Earth Sciences (AREA)
- Biomedical Technology (AREA)
- Data Mining & Analysis (AREA)
- Molecular Biology (AREA)
- General Engineering & Computer Science (AREA)
- Mathematical Physics (AREA)
- Databases & Information Systems (AREA)
- Medical Informatics (AREA)
- Signal Processing (AREA)
- Image Analysis (AREA)
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310980716.1A (CN116708725B, zh) | 2023-08-07 | 2023-08-07 | Low-bandwidth crowd scene security monitoring method and system based on semantic encoding and decoding |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310980716.1A (CN116708725B, zh) | 2023-08-07 | 2023-08-07 | Low-bandwidth crowd scene security monitoring method and system based on semantic encoding and decoding |
Publications (2)
Publication Number | Publication Date |
---|---|
CN116708725A (zh) | 2023-09-05 |
CN116708725B (zh) | 2023-10-31 |
Family
ID=87837864
Family Applications (1)
Application Number | Status | Priority Date | Filing Date | Title |
---|---|---|---|---|
CN202310980716.1A (CN116708725B, zh) | Active | 2023-08-07 | 2023-08-07 | Low-bandwidth crowd scene security monitoring method and system based on semantic encoding and decoding |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116708725B (zh) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
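CN106210612A (zh) * | 2015-04-30 | 2016-12-07 | Hangzhou Hikvision Digital Technology Co., Ltd. | Video encoding method, decoding method, and apparatus thereof |
CN108509457A (zh) * | 2017-02-28 | 2018-09-07 | Alibaba Group Holding Ltd. | Method and apparatus for recommending video data |
US10163227B1 (en) * | 2016-12-28 | 2018-12-25 | Shutterstock, Inc. | Image file compression using dummy data for non-salient portions of images |
CN111581436A (zh) * | 2020-03-30 | 2020-08-25 | Xi'an Tianhe Defense Technology Co., Ltd. | Target recognition method, apparatus, computer device, and storage medium |
CN111918071A (zh) * | 2020-06-29 | 2020-11-10 | Peking University | Data compression method, apparatus, device, and storage medium |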
Also Published As
Publication number | Publication date |
---|---|
CN116708725A (zh) | 2023-09-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Xiong et al. | | Learning to generate time-lapse videos using multi-stage dynamic generative adversarial networks |
Skorokhodov et al. | | Stylegan-v: A continuous video generator with the price, image quality and perks of stylegan2 |
Duan et al. | | Compact descriptors for video analysis: The emerging MPEG standard |
CN111709408B (zh) | | Image authenticity detection method and apparatus |
CN111489287A (zh) | | Image conversion method, apparatus, computer device, and storage medium |
CN113538480A (zh) | | Image segmentation processing method, apparatus, computer device, and storage medium |
CN113392270A (zh) | | Video processing method, apparatus, computer device, and storage medium |
AU2022215283B2 (en) | | A method of training a machine learning algorithm to identify objects or activities in video surveillance data |
CN111914676A (zh) | | Human fall detection method, apparatus, electronic device, and storage medium |
CN113572976A (zh) | | Video processing method, apparatus, electronic device, and readable storage medium |
WO2023279799A1 (zh) | | Object recognition method, apparatus, and electronic system |
CN116665083A (zh) | | Video classification method, apparatus, electronic device, and storage medium |
Ehsan et al. | | An accurate violence detection framework using unsupervised spatial–temporal action translation network |
CN112804558B (zh) | | Video splitting method, apparatus, and device |
CN114529785A (zh) | | Model training method, video generation method and apparatus, device, and medium |
Badale et al. | | Deep fake detection using neural networks |
CN116708725B (zh) | | Low-bandwidth crowd scene security monitoring method and system based on semantic encoding and decoding |
CN111246176A (zh) | | Bandwidth-saving video transmission method |
Supangkat et al. | | Moving Image Interpretation Models to Support City Analysis |
EP4164221A1 (en) | | Processing image data |
CN113822117B (zh) | | Data processing method, device, and computer-readable storage medium |
CN114694065A (zh) | | Video processing method, apparatus, computer device, and storage medium |
He et al. | | MTRFN: Multiscale temporal receptive field network for compressed video action recognition at edge servers |
CN114120076A (zh) | | Cross-view video gait recognition method based on gait motion estimation |
Hui-bin et al. | | Recognition of individual object in focus people group based on deep learning |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | PB01 | Publication | |
| | SE01 | Entry into force of request for substantive examination | |
| | GR01 | Patent grant | |
| | CB03 | Change of inventor or designer information | Inventors after: Cheng Baoping; Tao Xiaoming; Shang Ziqin; Huang Yan; Xie Xiaoyan; Ge Ning; Duan Yiping. Inventors before: Cheng Baoping; Tao Xiaoming; Shang Ziqin; Huang Yan; Xie Xiaoyan. |