SG11201909887RA

SG11201909887RA - Methods and apparatuses for recognizing video and training, electronic device and medium

Info

Publication number: SG11201909887RA
Authority: SG
Inventors: Tangcongrui He; Hongwei Qin
Original assignee: Beijing Sensetime Technology Development Co Ltd
Priority date: 2017-12-13
Filing date: 2018-10-16
Publication date: 2019-11-28
Also published as: WO2019114405A1; CN110546645B; CN108229336A; US20190266409A1; KR102365521B1; CN110546645A; JP2020512647A; JP6837158B2; US10909380B2; CN108229336B; KR20190126366A

Abstract

METHODS AND APPARATUSES FOR RECOGNIZING VIDEO AND TRAINING, ELECTRONIC DEVICE AND MEDIUM 5 A method and an apparatus for recognizing and training a video, an electronic device and a storage medium include: extracting features of a first key frame in a video; performing fusion on the features of the first key frame and fusion features of a second key frame in the video to obtain fusion features of the first key frame, where a detection sequence of the second key frame in the video precedes that of the first key 10 frame; and performing detection on the first key frame according to the fusion features of the first key frame to obtain an object detection result of the first key frame. Through iterative multi-frame feature fusion, information contained in shared features of these key frames in the video can be enhanced, thereby improving frame recognition accuracy and video recognition efficiency. 15 [Figure 1]