SG11201909887RA - Methods and apparatuses for recognizing video and training, electronic device and medium - Google Patents

Methods and apparatuses for recognizing video and training, electronic device and medium

Info

Publication number
SG11201909887RA
SG11201909887RA SG11201909887RA SG11201909887RA SG 11201909887R A SG11201909887R A SG 11201909887RA SG 11201909887R A SG11201909887R A SG 11201909887RA SG 11201909887R A SG11201909887R A SG 11201909887RA
Authority
SG
Singapore
Prior art keywords
video
key frame
frame
features
key
Prior art date
Application number
Inventor
Tangcongrui He
Hongwei Qin
Original Assignee
Beijing Sensetime Technology Development Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Sensetime Technology Development Co Ltd filed Critical Beijing Sensetime Technology Development Co Ltd
Publication of SG11201909887RA publication Critical patent/SG11201909887RA/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/41Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/48Matching video sequences
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • G06F18/253Fusion techniques of extracted features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T5/00Image enhancement or restoration
    • G06T5/50Image enhancement or restoration using two or more images, e.g. averaging or subtraction
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/46Extracting features or characteristics from the video content, e.g. video fingerprints, representative shots or key frames
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/49Segmenting video sequences, i.e. computational techniques such as parsing or cutting the sequence, low-level clustering or determining units such as shots or scenes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/222Studio circuitry; Studio devices; Studio equipment
    • H04N5/262Studio circuits, e.g. for mixing, switching-over, change of character of image, other special effects ; Cameras specially adapted for the electronic generation of special effects
    • H04N5/265Mixing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10016Video; Image sequence
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/20Special algorithmic details
    • G06T2207/20212Image combination
    • G06T2207/20221Image fusion; Image merging

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Data Mining & Analysis (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Signal Processing (AREA)
  • Computing Systems (AREA)
  • Image Analysis (AREA)

Abstract

METHODS AND APPARATUSES FOR RECOGNIZING VIDEO AND TRAINING, ELECTRONIC DEVICE AND MEDIUM 5 A method and an apparatus for recognizing and training a video, an electronic device and a storage medium include: extracting features of a first key frame in a video; performing fusion on the features of the first key frame and fusion features of a second key frame in the video to obtain fusion features of the first key frame, where a detection sequence of the second key frame in the video precedes that of the first key 10 frame; and performing detection on the first key frame according to the fusion features of the first key frame to obtain an object detection result of the first key frame. Through iterative multi-frame feature fusion, information contained in shared features of these key frames in the video can be enhanced, thereby improving frame recognition accuracy and video recognition efficiency. 15 [Figure 1]
SG11201909887R 2017-12-13 2018-10-16 Methods and apparatuses for recognizing video and training, electronic device and medium SG11201909887RA (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201711329718.5A CN108229336B (en) 2017-12-13 2017-12-13 Video recognition and training method and apparatus, electronic device, program, and medium
PCT/CN2018/110500 WO2019114405A1 (en) 2017-12-13 2018-10-16 Video recognition and training method and apparatus, electronic device and medium

Publications (1)

Publication Number Publication Date
SG11201909887RA true SG11201909887RA (en) 2019-11-28

Family

ID=62652263

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11201909887R SG11201909887RA (en) 2017-12-13 2018-10-16 Methods and apparatuses for recognizing video and training, electronic device and medium

Country Status (6)

Country Link
US (1) US10909380B2 (en)
JP (1) JP6837158B2 (en)
KR (1) KR102365521B1 (en)
CN (2) CN108229336B (en)
SG (1) SG11201909887RA (en)
WO (1) WO2019114405A1 (en)

Families Citing this family (45)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108229336B (en) * 2017-12-13 2021-06-04 北京市商汤科技开发有限公司 Video recognition and training method and apparatus, electronic device, program, and medium
CN108810620B (en) * 2018-07-18 2021-08-17 腾讯科技(深圳)有限公司 Method, device, equipment and storage medium for identifying key time points in video
CN109344703B (en) * 2018-08-24 2021-06-25 深圳市商汤科技有限公司 Object detection method and device, electronic equipment and storage medium
CN109389086B (en) * 2018-10-09 2021-03-05 北京科技大学 Method and system for detecting unmanned aerial vehicle image target
CN111353597B (en) * 2018-12-24 2023-12-05 杭州海康威视数字技术股份有限公司 Target detection neural network training method and device
CN111383245B (en) * 2018-12-29 2023-09-22 北京地平线机器人技术研发有限公司 Video detection method, video detection device and electronic equipment
CN109886951A (en) * 2019-02-22 2019-06-14 北京旷视科技有限公司 Method for processing video frequency, device and electronic equipment
CN111754544B (en) * 2019-03-29 2023-09-05 杭州海康威视数字技术股份有限公司 Video frame fusion method and device and electronic equipment
CN109977912B (en) * 2019-04-08 2021-04-16 北京环境特性研究所 Video human body key point detection method and device, computer equipment and storage medium
CN110060264B (en) * 2019-04-30 2021-03-23 北京市商汤科技开发有限公司 Neural network training method, video frame processing method, device and system
CN110427800B (en) * 2019-06-17 2024-09-10 平安科技(深圳)有限公司 Video object acceleration detection method, device, server and storage medium
CN110149482B (en) * 2019-06-28 2021-02-02 Oppo广东移动通信有限公司 Focusing method, focusing device, electronic equipment and computer readable storage medium
CN112199978B (en) * 2019-07-08 2024-07-26 北京地平线机器人技术研发有限公司 Video object detection method and device, storage medium and electronic equipment
CN110503076B (en) * 2019-08-29 2023-06-30 腾讯科技(深圳)有限公司 Video classification method, device, equipment and medium based on artificial intelligence
CN110751022B (en) * 2019-09-03 2023-08-22 平安科技(深圳)有限公司 Urban pet activity track monitoring method based on image recognition and related equipment
CN110738108A (en) * 2019-09-09 2020-01-31 北京地平线信息技术有限公司 Target object detection method, target object detection device, storage medium and electronic equipment
CN110807379B (en) * 2019-10-21 2024-08-27 腾讯科技(深圳)有限公司 Semantic recognition method, semantic recognition device and computer storage medium
CN110751646A (en) * 2019-10-28 2020-02-04 支付宝(杭州)信息技术有限公司 Method and device for identifying damage by using multiple image frames in vehicle video
CN110933429B (en) * 2019-11-13 2021-11-12 南京邮电大学 Video compression sensing and reconstruction method and device based on deep neural network
CN110909655A (en) * 2019-11-18 2020-03-24 上海眼控科技股份有限公司 Method and equipment for identifying video event
CN110841287B (en) * 2019-11-22 2023-09-26 腾讯科技(深圳)有限公司 Video processing method, apparatus, computer readable storage medium and computer device
CN112862828B (en) * 2019-11-26 2022-11-18 华为技术有限公司 Semantic segmentation method, model training method and device
CN111062395B (en) * 2019-11-27 2020-12-18 北京理工大学 Real-time video semantic segmentation method
CN111629262B (en) * 2020-05-08 2022-04-12 Oppo广东移动通信有限公司 Video image processing method and device, electronic equipment and storage medium
CN111582185B (en) * 2020-05-11 2023-06-30 北京百度网讯科技有限公司 Method and device for recognizing images
CN111652081B (en) * 2020-05-13 2022-08-05 电子科技大学 Video semantic segmentation method based on optical flow feature fusion
CN111881726B (en) * 2020-06-15 2022-11-25 马上消费金融股份有限公司 Living body detection method and device and storage medium
CN111783784A (en) * 2020-06-30 2020-10-16 创新奇智(合肥)科技有限公司 Method and device for detecting building cavity, electronic equipment and storage medium
CN111860400B (en) * 2020-07-28 2024-06-07 平安科技(深圳)有限公司 Face enhancement recognition method, device, equipment and storage medium
CN112036446B (en) * 2020-08-06 2023-12-12 汇纳科技股份有限公司 Method, system, medium and device for fusing target identification features
CN112085097A (en) * 2020-09-09 2020-12-15 北京市商汤科技开发有限公司 Image processing method and device, electronic equipment and storage medium
CN112115299B (en) * 2020-09-17 2024-08-13 北京百度网讯科技有限公司 Video searching method, video searching device, video recommending method, electronic equipment and storage medium
CN112241470B (en) * 2020-09-24 2024-02-02 北京影谱科技股份有限公司 Video classification method and system
CN112435653B (en) * 2020-10-14 2024-07-30 北京地平线机器人技术研发有限公司 Voice recognition method and device and electronic equipment
CN112418104B (en) * 2020-11-24 2024-08-02 深圳云天励飞技术股份有限公司 Pedestrian tracking method and related equipment
CN112528786B (en) * 2020-11-30 2023-10-31 北京百度网讯科技有限公司 Vehicle tracking method and device and electronic equipment
CN112766215B (en) * 2021-01-29 2024-08-09 北京字跳网络技术有限公司 Face image processing method and device, electronic equipment and storage medium
CN112561912B (en) * 2021-02-20 2021-06-01 四川大学 Medical image lymph node detection method based on priori knowledge
CN113011371A (en) * 2021-03-31 2021-06-22 北京市商汤科技开发有限公司 Target detection method, device, equipment and storage medium
US20220383509A1 (en) * 2021-05-21 2022-12-01 Honda Motor Co., Ltd. System and method for learning temporally consistent video synthesis using fake optical flow
CN113674189B (en) * 2021-08-17 2024-09-20 Oppo广东移动通信有限公司 Image processing method, apparatus, electronic device, and computer-readable storage medium
CN113963287A (en) * 2021-09-15 2022-01-21 北京百度网讯科技有限公司 Scoring model obtaining and video identifying method, device and storage medium
CN114120166B (en) * 2021-10-14 2023-09-22 北京百度网讯科技有限公司 Video question-answering method and device, electronic equipment and storage medium
CN114528923B (en) * 2022-01-25 2023-09-26 山东浪潮科学研究院有限公司 Video target detection method, device, equipment and medium based on time domain context
CN115115822B (en) * 2022-06-30 2023-10-31 小米汽车科技有限公司 Vehicle-end image processing method and device, vehicle, storage medium and chip

Family Cites Families (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07181024A (en) * 1993-12-24 1995-07-18 Canon Inc Method and apparatus for measuring three-dimensional profile
JP4181473B2 (en) * 2003-10-15 2008-11-12 日本放送協会 Video object trajectory synthesis apparatus, method and program thereof
US8021160B2 (en) * 2006-07-22 2011-09-20 Industrial Technology Research Institute Learning assessment method and device using a virtual tutor
US8135221B2 (en) 2009-10-07 2012-03-13 Eastman Kodak Company Video concept classification using audio-visual atoms
CN101673404B (en) * 2009-10-19 2015-03-04 北京中星微电子有限公司 Target detection method and device
CN102014295B (en) * 2010-11-19 2012-11-28 嘉兴学院 Network sensitive video detection method
CN102682302B (en) * 2012-03-12 2014-03-26 浙江工业大学 Human body posture identification method based on multi-characteristic fusion of key frame
US8989503B2 (en) * 2012-08-03 2015-03-24 Kodak Alaris Inc. Identifying scene boundaries using group sparsity analysis
US9129399B2 (en) * 2013-03-11 2015-09-08 Adobe Systems Incorporated Optical flow with nearest neighbor field fusion
US9892745B2 (en) * 2013-08-23 2018-02-13 At&T Intellectual Property I, L.P. Augmented multi-tier classifier for multi-modal voice activity detection
WO2015038749A1 (en) * 2013-09-13 2015-03-19 Arris Enterprises, Inc. Content based video content segmentation
US10262426B2 (en) * 2014-10-31 2019-04-16 Fyusion, Inc. System and method for infinite smoothing of image sequences
KR20160099289A (en) * 2015-02-12 2016-08-22 대전대학교 산학협력단 Method and system for video search using convergence of global feature and region feature of image
CN105005772B (en) * 2015-07-20 2018-06-12 北京大学 A kind of video scene detection method
KR102444712B1 (en) * 2016-01-12 2022-09-20 한국전자통신연구원 System for automatically re-creating a personal media with Multi-modality feature and method thereof
US9805255B2 (en) * 2016-01-29 2017-10-31 Conduent Business Services, Llc Temporal fusion of multimodal data from multiple data acquisition systems to automatically recognize and classify an action
US20170277955A1 (en) * 2016-03-23 2017-09-28 Le Holdings (Beijing) Co., Ltd. Video identification method and system
BR102016007265B1 (en) * 2016-04-01 2022-11-16 Samsung Eletrônica da Amazônia Ltda. MULTIMODAL AND REAL-TIME METHOD FOR FILTERING SENSITIVE CONTENT
JP6609505B2 (en) * 2016-04-06 2019-11-20 Kddi株式会社 Image composition apparatus and program
CN106599907B (en) * 2016-11-29 2019-11-29 北京航空航天大学 The dynamic scene classification method and device of multiple features fusion
CN107392917B (en) * 2017-06-09 2021-09-28 深圳大学 Video significance detection method and system based on space-time constraint
CN107463881A (en) * 2017-07-07 2017-12-12 中山大学 A kind of character image searching method based on depth enhancing study
CN107463949B (en) * 2017-07-14 2020-02-21 北京协同创新研究院 Video action classification processing method and device
CN108229336B (en) * 2017-12-13 2021-06-04 北京市商汤科技开发有限公司 Video recognition and training method and apparatus, electronic device, program, and medium

Also Published As

Publication number Publication date
WO2019114405A1 (en) 2019-06-20
CN110546645B (en) 2023-09-19
CN108229336A (en) 2018-06-29
US20190266409A1 (en) 2019-08-29
KR102365521B1 (en) 2022-02-21
CN110546645A (en) 2019-12-06
JP2020512647A (en) 2020-04-23
JP6837158B2 (en) 2021-03-03
US10909380B2 (en) 2021-02-02
CN108229336B (en) 2021-06-04
KR20190126366A (en) 2019-11-11

Similar Documents

Publication Publication Date Title
SG11201909887RA (en) Methods and apparatuses for recognizing video and training, electronic device and medium
PH12019501009A1 (en) Face liveness detection method and apparatus, and electronic device
SG11201901766YA (en) Electronic device, method and system of identity verification and computer readable storage medium
SG11201900263SA (en) Method, device and server for recognizing characters of claim document, and storage medium
PH12018501058A1 (en) Order clustering and malicious information combating method and apparatus
SG11201909139TA (en) Methods and apparatuses for recognizing dynamic gesture, and control methods and apparatuses using gesture interaction
SG11201913865PA (en) Method and apparatus for recognizing sequence in image, electronic device, and storage medium
WO2019133928A8 (en) Hierarchical, parallel models for extracting in real time high-value information from data streams and system and method for creation of same
SG11202105174XA (en) Text sequence recognition method and apparatus, electronic device, and storage medium
SG11201809816YA (en) Vehicle identification method and apparatus
SG11202002078UA (en) Method and apparatus for training semantic segmentation model, computer device, and storage medium
MX2019004994A (en) Method and apparatus for verifying documents and identity.
MX2016003724A (en) Picture scene determining method and apparatus, and server.
SG11201809210VA (en) Face image data collection method, apparatus, terminal device and storage medium
EP4116940A3 (en) Method and apparatus for processing image, electronic device and storage medium
MY182985A (en) Keyframe scheduling method and apparatus, electronic device, program and medium
MX2017011632A (en) System for distributing metadata embedded in video.
MX2016003774A (en) Fingerprint recognition method and device.
HK1175358A2 (en) Apparatus and method for recognizing content using audio signal
EP3364337A3 (en) Persistent feature descriptors for video
SG11202103969XA (en) Attribute value recovery method and device, storage medium, and electronic device
SG11201909071UA (en) Image processing methods and apparatuses, computer readable storage media and eletronic devices
EP3822842A3 (en) Method and apparatus for generating semantic representation model, electronic device, and storage medium
MX2021009164A (en) Pet food recommendation devices and methods.
HK1100586A1 (en) Apparatus and method for handwriting recognition