KR20210093875A - 비디오 분석 방법 및 연관된 모델 훈련 방법, 기기, 장치 - Google Patents
비디오 분석 방법 및 연관된 모델 훈련 방법, 기기, 장치 Download PDFInfo
- Publication number
- KR20210093875A KR20210093875A KR1020217013635A KR20217013635A KR20210093875A KR 20210093875 A KR20210093875 A KR 20210093875A KR 1020217013635 A KR1020217013635 A KR 1020217013635A KR 20217013635 A KR20217013635 A KR 20217013635A KR 20210093875 A KR20210093875 A KR 20210093875A
- Authority
- KR
- South Korea
- Prior art keywords
- offset
- information
- feature map
- video
- dimensional feature
- Prior art date
Links
- 238000004458 analytical method Methods 0.000 title claims abstract description 141
- 238000000034 method Methods 0.000 title claims abstract description 69
- 238000012549 training Methods 0.000 title claims abstract description 44
- 238000000605 extraction Methods 0.000 claims abstract description 111
- 238000012545 processing Methods 0.000 claims description 168
- 238000005070 sampling Methods 0.000 claims description 50
- 230000004913 activation Effects 0.000 claims description 35
- 238000003012 network analysis Methods 0.000 claims description 13
- 238000004364 calculation method Methods 0.000 claims description 10
- 238000004590 computer program Methods 0.000 claims description 5
- 238000010586 diagram Methods 0.000 description 15
- 230000008569 process Effects 0.000 description 15
- 230000006870 function Effects 0.000 description 5
- 238000003062 neural network model Methods 0.000 description 5
- 208000037170 Delayed Emergence from Anesthesia Diseases 0.000 description 4
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 238000011478 gradient descent method Methods 0.000 description 3
- 230000002123 temporal effect Effects 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 238000013473 artificial intelligence Methods 0.000 description 2
- 230000006399 behavior Effects 0.000 description 2
- 230000003542 behavioural effect Effects 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 210000002569 neuron Anatomy 0.000 description 2
- 238000011176 pooling Methods 0.000 description 2
- 230000003068 static effect Effects 0.000 description 2
- BQCADISMDOOEFD-UHFFFAOYSA-N Silver Chemical compound [Ag] BQCADISMDOOEFD-UHFFFAOYSA-N 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 229910052709 silver Inorganic materials 0.000 description 1
- 239000004332 silver Substances 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/41—Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/29—Graphical models, e.g. Bayesian networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/049—Temporal neural networks, e.g. delay elements, oscillating neurons or pulsed inputs
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/46—Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/44—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
- H04N21/44008—Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- Software Systems (AREA)
- Computational Linguistics (AREA)
- Evolutionary Computation (AREA)
- General Engineering & Computer Science (AREA)
- Computing Systems (AREA)
- General Health & Medical Sciences (AREA)
- Biophysics (AREA)
- Mathematical Physics (AREA)
- Biomedical Technology (AREA)
- Health & Medical Sciences (AREA)
- Molecular Biology (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Image Analysis (AREA)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010053048.4A CN111291631B (zh) | 2020-01-17 | 2020-01-17 | 视频分析方法及其相关的模型训练方法、设备、装置 |
CN202010053048.4 | 2020-01-17 | ||
PCT/CN2020/078656 WO2021142904A1 (zh) | 2020-01-17 | 2020-03-10 | 视频分析方法及其相关的模型训练方法、设备、装置 |
Publications (1)
Publication Number | Publication Date |
---|---|
KR20210093875A true KR20210093875A (ko) | 2021-07-28 |
Family
ID=71025430
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020217013635A KR20210093875A (ko) | 2020-01-17 | 2020-03-10 | 비디오 분석 방법 및 연관된 모델 훈련 방법, 기기, 장치 |
Country Status (5)
Country | Link |
---|---|
JP (1) | JP7096431B2 (zh) |
KR (1) | KR20210093875A (zh) |
CN (1) | CN111291631B (zh) |
TW (1) | TWI761813B (zh) |
WO (1) | WO2021142904A1 (zh) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111695519B (zh) * | 2020-06-12 | 2023-08-08 | 北京百度网讯科技有限公司 | 关键点定位方法、装置、设备以及存储介质 |
CN112417952B (zh) * | 2020-10-10 | 2022-11-11 | 北京理工大学 | 一种车辆碰撞防控系统的环境视频信息可用性测评方法 |
CN112464898A (zh) * | 2020-12-15 | 2021-03-09 | 北京市商汤科技开发有限公司 | 事件检测方法及装置、电子设备和存储介质 |
CN112949449B (zh) * | 2021-02-25 | 2024-04-19 | 北京达佳互联信息技术有限公司 | 交错判断模型训练方法及装置和交错图像确定方法及装置 |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104199902A (zh) | 2014-08-27 | 2014-12-10 | 中国科学院自动化研究所 | 一种线性动态系统的相似性度量计算方法 |
US10223582B2 (en) * | 2014-10-28 | 2019-03-05 | Watrix Technology | Gait recognition method based on deep learning |
US9626803B2 (en) * | 2014-12-12 | 2017-04-18 | Qualcomm Incorporated | Method and apparatus for image processing in augmented reality systems |
CN108229522B (zh) | 2017-03-07 | 2020-07-17 | 北京市商汤科技开发有限公司 | 神经网络的训练方法、属性检测方法、装置及电子设备 |
CN108229280B (zh) * | 2017-04-20 | 2020-11-13 | 北京市商汤科技开发有限公司 | 时域动作检测方法和系统、电子设备、计算机存储介质 |
US10707837B2 (en) | 2017-07-06 | 2020-07-07 | Analog Photonics LLC | Laser frequency chirping structures, methods, and applications |
WO2019035854A1 (en) * | 2017-08-16 | 2019-02-21 | Kla-Tencor Corporation | MACHINE LEARNING IN RELATION TO METROLOGY MEASUREMENTS |
US10430654B1 (en) * | 2018-04-20 | 2019-10-01 | Surfline\Wavetrak, Inc. | Automated detection of environmental measures within an ocean environment using image data |
CN109919025A (zh) * | 2019-01-30 | 2019-06-21 | 华南理工大学 | 基于深度学习的视频场景文本检测方法、系统、设备及介质 |
CN110084742B (zh) * | 2019-05-08 | 2024-01-26 | 北京奇艺世纪科技有限公司 | 一种视差图预测方法、装置及电子设备 |
CN110660082B (zh) * | 2019-09-25 | 2022-03-08 | 西南交通大学 | 一种基于图卷积与轨迹卷积网络学习的目标跟踪方法 |
-
2020
- 2020-01-17 CN CN202010053048.4A patent/CN111291631B/zh active Active
- 2020-03-10 KR KR1020217013635A patent/KR20210093875A/ko unknown
- 2020-03-10 JP JP2021521512A patent/JP7096431B2/ja active Active
- 2020-03-10 WO PCT/CN2020/078656 patent/WO2021142904A1/zh active Application Filing
- 2020-04-21 TW TW109113378A patent/TWI761813B/zh active
Also Published As
Publication number | Publication date |
---|---|
TWI761813B (zh) | 2022-04-21 |
WO2021142904A1 (zh) | 2021-07-22 |
JP7096431B2 (ja) | 2022-07-05 |
CN111291631A (zh) | 2020-06-16 |
JP2022520511A (ja) | 2022-03-31 |
CN111291631B (zh) | 2023-11-07 |
TW202129535A (zh) | 2021-08-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR20210093875A (ko) | 비디오 분석 방법 및 연관된 모델 훈련 방법, 기기, 장치 | |
US20220215227A1 (en) | Neural Architecture Search Method, Image Processing Method And Apparatus, And Storage Medium | |
CN109840531A (zh) | 训练多标签分类模型的方法和装置 | |
CN112396002A (zh) | 一种基于SE-YOLOv3的轻量级遥感目标检测方法 | |
CN108510402A (zh) | 险种信息推荐方法、装置、计算机设备及存储介质 | |
CN113807399B (zh) | 一种神经网络训练方法、检测方法以及装置 | |
CN111352965B (zh) | 序列挖掘模型的训练方法、序列数据的处理方法及设备 | |
CN110826379B (zh) | 一种基于特征复用与YOLOv3的目标检测方法 | |
CN110765865B (zh) | 基于改进的yolo算法的水下目标检测方法 | |
CN110686633B (zh) | 一种滑坡位移预测方法、装置及电子设备 | |
CN110222718B (zh) | 图像处理的方法及装置 | |
CN114332578A (zh) | 图像异常检测模型训练方法、图像异常检测方法和装置 | |
CN113095370B (zh) | 图像识别方法、装置、电子设备及存储介质 | |
CN112884742A (zh) | 一种基于多算法融合的多目标实时检测、识别及跟踪方法 | |
KR102093577B1 (ko) | 학습네트워크를 이용한 예측 영상 생성 방법 및 예측 영상 생성 장치 | |
CN117037215B (zh) | 人体姿态估计模型训练方法、估计方法、装置及电子设备 | |
CN112883227B (zh) | 一种基于多尺度时序特征的视频摘要生成方法和装置 | |
CN112036381B (zh) | 视觉跟踪方法、视频监控方法及终端设备 | |
EP3995992A1 (en) | Method and system for detecting an action in a video clip | |
CN117237756A (zh) | 一种训练目标分割模型的方法、目标分割方法及相关装置 | |
KR20220058915A (ko) | 이미지 검출 및 관련 모델 트레이닝 방법, 장치, 기기, 매체 및 프로그램 | |
CN114565092A (zh) | 一种神经网络结构确定方法及其装置 | |
CN116994114A (zh) | 一种基于改进YOLOv8的轻量化家居小目标检测模型构建方法 | |
CN116758331A (zh) | 物体检测方法、装置及存储介质 | |
KR102462966B1 (ko) | Yolo 알고리즘을 사용하는 장치의 성능 향상 방법 |