ZA202307784B

ZA202307784B - Video data processing method and apparatus, electronic device, and storage medium

Info

Publication number: ZA202307784B
Application number: ZA2023/07784A
Authority: ZA
Inventors: Shaojun Quan; Ge Lin; Xiaoyan Chen; Shaoling Liang
Original assignee: Longse Tech Co Ltd
Priority date: 2022-03-23
Filing date: 2023-08-08
Publication date: 2024-03-27
Also published as: CN114387567B; WO2023179429A1; CN114387567A

Abstract

The present application is applicable to the field of multimedia technology, and provides a video data processing method and apparatus, an electronic device, and a storage medium. The method includes: inputting a target video into a multi-modal feature extraction model in response to a type identification instruction of the target video, and outputting modal features of a plurality of different modalities corresponding to each video image frame in the target video; generating a fusion feature corresponding to each modal feature respectively based on a preset mutual causal relationship between the different modalities; constructing a modal object diagram corresponding to the target video according to fusion features of all video image frames in various modalities, determining an attention feature corresponding to the target video through the modal object diagram, the attention feature fusing the fusion features of the plurality of modalities; determining a video type of the target video based on the attention feature. With the above method, the accuracy of video surveillance is improved, and the labor cost of video surveillance is also reduced.