KR20210093875A - 비디오 분석 방법 및 연관된 모델 훈련 방법, 기기, 장치 - Google Patents

비디오 분석 방법 및 연관된 모델 훈련 방법, 기기, 장치 Download PDF

Info

Publication number
KR20210093875A
KR20210093875A KR1020217013635A KR20217013635A KR20210093875A KR 20210093875 A KR20210093875 A KR 20210093875A KR 1020217013635 A KR1020217013635 A KR 1020217013635A KR 20217013635 A KR20217013635 A KR 20217013635A KR 20210093875 A KR20210093875 A KR 20210093875A
Authority
KR
South Korea
Prior art keywords
offset
information
feature map
video
dimensional feature
Prior art date
Application number
KR1020217013635A
Other languages
English (en)
Korean (ko)
Inventor
하오 샤오
유 리우
Original Assignee
베이징 센스타임 테크놀로지 디벨롭먼트 컴퍼니 리미티드
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 베이징 센스타임 테크놀로지 디벨롭먼트 컴퍼니 리미티드 filed Critical 베이징 센스타임 테크놀로지 디벨롭먼트 컴퍼니 리미티드
Publication of KR20210093875A publication Critical patent/KR20210093875A/ko

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/41Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/29Graphical models, e.g. Bayesian networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/049Temporal neural networks, e.g. delay elements, oscillating neurons or pulsed inputs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/46Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/44008Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving operations for analysing video streams, e.g. detecting features or characteristics in the video stream

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Biophysics (AREA)
  • Mathematical Physics (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Image Analysis (AREA)
KR1020217013635A 2020-01-17 2020-03-10 비디오 분석 방법 및 연관된 모델 훈련 방법, 기기, 장치 KR20210093875A (ko)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CN202010053048.4A CN111291631B (zh) 2020-01-17 2020-01-17 视频分析方法及其相关的模型训练方法、设备、装置
CN202010053048.4 2020-01-17
PCT/CN2020/078656 WO2021142904A1 (zh) 2020-01-17 2020-03-10 视频分析方法及其相关的模型训练方法、设备、装置

Publications (1)

Publication Number Publication Date
KR20210093875A true KR20210093875A (ko) 2021-07-28

Family

ID=71025430

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020217013635A KR20210093875A (ko) 2020-01-17 2020-03-10 비디오 분석 방법 및 연관된 모델 훈련 방법, 기기, 장치

Country Status (5)

Country Link
JP (1) JP7096431B2 (zh)
KR (1) KR20210093875A (zh)
CN (1) CN111291631B (zh)
TW (1) TWI761813B (zh)
WO (1) WO2021142904A1 (zh)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111695519B (zh) * 2020-06-12 2023-08-08 北京百度网讯科技有限公司 关键点定位方法、装置、设备以及存储介质
CN112417952B (zh) * 2020-10-10 2022-11-11 北京理工大学 一种车辆碰撞防控系统的环境视频信息可用性测评方法
CN112464898A (zh) * 2020-12-15 2021-03-09 北京市商汤科技开发有限公司 事件检测方法及装置、电子设备和存储介质
CN112949449B (zh) * 2021-02-25 2024-04-19 北京达佳互联信息技术有限公司 交错判断模型训练方法及装置和交错图像确定方法及装置

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104199902A (zh) 2014-08-27 2014-12-10 中国科学院自动化研究所 一种线性动态系统的相似性度量计算方法
US10223582B2 (en) * 2014-10-28 2019-03-05 Watrix Technology Gait recognition method based on deep learning
US9626803B2 (en) * 2014-12-12 2017-04-18 Qualcomm Incorporated Method and apparatus for image processing in augmented reality systems
CN108229522B (zh) 2017-03-07 2020-07-17 北京市商汤科技开发有限公司 神经网络的训练方法、属性检测方法、装置及电子设备
CN108229280B (zh) * 2017-04-20 2020-11-13 北京市商汤科技开发有限公司 时域动作检测方法和系统、电子设备、计算机存储介质
US10707837B2 (en) 2017-07-06 2020-07-07 Analog Photonics LLC Laser frequency chirping structures, methods, and applications
WO2019035854A1 (en) * 2017-08-16 2019-02-21 Kla-Tencor Corporation MACHINE LEARNING IN RELATION TO METROLOGY MEASUREMENTS
US10430654B1 (en) * 2018-04-20 2019-10-01 Surfline\Wavetrak, Inc. Automated detection of environmental measures within an ocean environment using image data
CN109919025A (zh) * 2019-01-30 2019-06-21 华南理工大学 基于深度学习的视频场景文本检测方法、系统、设备及介质
CN110084742B (zh) * 2019-05-08 2024-01-26 北京奇艺世纪科技有限公司 一种视差图预测方法、装置及电子设备
CN110660082B (zh) * 2019-09-25 2022-03-08 西南交通大学 一种基于图卷积与轨迹卷积网络学习的目标跟踪方法

Also Published As

Publication number Publication date
TWI761813B (zh) 2022-04-21
WO2021142904A1 (zh) 2021-07-22
JP7096431B2 (ja) 2022-07-05
CN111291631A (zh) 2020-06-16
JP2022520511A (ja) 2022-03-31
CN111291631B (zh) 2023-11-07
TW202129535A (zh) 2021-08-01

Similar Documents

Publication Publication Date Title
KR20210093875A (ko) 비디오 분석 방법 및 연관된 모델 훈련 방법, 기기, 장치
US20220215227A1 (en) Neural Architecture Search Method, Image Processing Method And Apparatus, And Storage Medium
CN109840531A (zh) 训练多标签分类模型的方法和装置
CN112396002A (zh) 一种基于SE-YOLOv3的轻量级遥感目标检测方法
CN108510402A (zh) 险种信息推荐方法、装置、计算机设备及存储介质
CN113807399B (zh) 一种神经网络训练方法、检测方法以及装置
CN111352965B (zh) 序列挖掘模型的训练方法、序列数据的处理方法及设备
CN110826379B (zh) 一种基于特征复用与YOLOv3的目标检测方法
CN110765865B (zh) 基于改进的yolo算法的水下目标检测方法
CN110686633B (zh) 一种滑坡位移预测方法、装置及电子设备
CN110222718B (zh) 图像处理的方法及装置
CN114332578A (zh) 图像异常检测模型训练方法、图像异常检测方法和装置
CN113095370B (zh) 图像识别方法、装置、电子设备及存储介质
CN112884742A (zh) 一种基于多算法融合的多目标实时检测、识别及跟踪方法
KR102093577B1 (ko) 학습네트워크를 이용한 예측 영상 생성 방법 및 예측 영상 생성 장치
CN117037215B (zh) 人体姿态估计模型训练方法、估计方法、装置及电子设备
CN112883227B (zh) 一种基于多尺度时序特征的视频摘要生成方法和装置
CN112036381B (zh) 视觉跟踪方法、视频监控方法及终端设备
EP3995992A1 (en) Method and system for detecting an action in a video clip
CN117237756A (zh) 一种训练目标分割模型的方法、目标分割方法及相关装置
KR20220058915A (ko) 이미지 검출 및 관련 모델 트레이닝 방법, 장치, 기기, 매체 및 프로그램
CN114565092A (zh) 一种神经网络结构确定方法及其装置
CN116994114A (zh) 一种基于改进YOLOv8的轻量化家居小目标检测模型构建方法
CN116758331A (zh) 物体检测方法、装置及存储介质
KR102462966B1 (ko) Yolo 알고리즘을 사용하는 장치의 성능 향상 방법