CN110719520A - Video information extraction method, system and device - Google Patents

Publication number
CN110719520A
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910930076.7A
Other languages
Chinese (zh)
Inventor
彭岳松
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Lajin Zhongbo Technology Co ltd
Original Assignee
Tianmai Juyuan (hangzhou) Media Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2019-09-29
Filing date
2019-09-29
Publication date
2020-01-21
Application filed by Tianmai Juyuan (hangzhou) Media Technology Co Ltd
Priority to CN201910930076.7A
Publication of CN110719520A

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/431 Generation of visual interfaces for content selection or interaction; Content or additional data rendering
    • H04N21/44 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/44004 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving video buffer management, e.g. video decoder buffer or video display buffer
    • H04N21/4402 Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Television Signal Processing For Recording (AREA)

Abstract

The invention discloses a video information extraction method, system and device. The method comprises the following steps: receiving a video file and parsing it based on AVAsset to obtain basic information of the video file; and capturing key frames according to a preset time and size to obtain key frame information. The system comprises an analysis module, a screenshot module and a simplification module. The device comprises a memory and a processor for executing the video information extraction method. With the invention, a user can obtain video information conveniently and quickly. The method, system and device can be widely applied in the field of video parsing.

Description

Video information extraction method, system and device
Technical Field
The present invention relates to the field of video parsing, and in particular, to a method, a system, and an apparatus for extracting video information.
Background
Existing methods for extracting key frames are simple, but they sometimes lose important key frames or extract repeated, similar key frames, which occupies a large amount of storage space.
Disclosure of Invention
In order to solve the above technical problems, an object of the present invention is to provide a video information extraction method, system and device, so that a user can obtain video information quickly.
The first technical scheme adopted by the invention is as follows: a video information extraction method comprises the following steps:
receiving a video file, and parsing the video file based on AVAsset to obtain basic information of the video file;
and capturing key frames according to a preset time and size to obtain key frame information.
Further, the method also comprises the following step:
further extracting from the key frames to obtain key frames with redundancy removed.
Further, the step of receiving the video file and parsing the video file based on AVAsset to obtain the basic information of the video file specifically includes:
receiving a video file, and creating an AVAsset object from the video file;
and accessing the AVAsset object, and acquiring the basic information of the video file according to preset AVAsset attributes.
Further, the step of capturing key frames according to the preset time and size to obtain key frame information specifically includes:
accessing the AVAsset object;
obtaining the maximum screenshot size according to the display size of the current device, and capturing an image according to the preset time and the maximum screenshot size, the captured image serving as a key frame;
and acquiring supplementary information of the key frame, and storing the key frame and its supplementary information locally to obtain the key frame information.
Further, the basic information comprises video duration, video size and video cover map.
Further, the step of further extracting the key frame to obtain the key frame with redundancy removed specifically includes:
dividing the key frames into multiple types of video segments; calculating the average mutual information amount of each video segment; extracting the maximum frame and the minimum frame of each video segment, i.e. the frames with the largest and smallest mutual information amount; if the difference between the maximum frame and the minimum frame of a video segment is greater than the video frequency of that video segment, selecting both the maximum frame and the minimum frame as the final key frames of the video segment; otherwise, selecting, from the maximum frame and the minimum frame, the frame closest to the average mutual information amount as the final key frame of the video segment.
Further, the supplementary information of the key frame includes a difference frame and a bidirectional difference frame.
The second technical scheme adopted by the invention is as follows: a video information extraction system comprises:
an analysis module, used for receiving the video file and parsing the video file based on AVAsset to obtain the basic information of the video file;
a screenshot module, used for capturing key frames according to the preset time and size to obtain the key frame information;
and a simplification module, used for further extracting from the key frames to obtain key frames with redundancy removed.
The third technical scheme adopted by the invention is as follows: a video information extraction device comprises:
at least one processor;
at least one memory for storing at least one program;
when the at least one program is executed by the at least one processor, the at least one processor is caused to implement the video information extraction method.
The method, the system and the device have the following advantages: the video file is parsed based on the AVAsset class, and the basic information and key frame information are extracted from the video, so that a user can conveniently and quickly learn the content of the video, and redundant video information is reduced to save storage space.
Drawings
FIG. 1 is a flow chart of the steps of a video information extraction method of the present invention;
FIG. 2 is a block diagram of a video information extraction system according to the present invention.
Detailed Description
The invention is described in further detail below with reference to the figures and the specific embodiments. The step numbers in the following embodiments are provided only for convenience of illustration, the order between the steps is not limited at all, and the execution order of each step in the embodiments can be adapted according to the understanding of those skilled in the art.
As shown in fig. 1, the present invention provides a video information extraction method, which includes the following steps:
S101, receiving a video file, and parsing the video file based on AVAsset to obtain basic information of the video file;
Specifically, an incoming video file is received, an AVAsset object is created, and the video file is loaded. AVAsset is an abstract class that the system provides specifically for modeling timed audiovisual media.
S102, capturing key frames according to a preset time and size to obtain key frame information;
Specifically, a key frame is a visual image, i.e. a video frame with complete image data; it is stored in a dedicated local data file and read later. The preset time can be set to equal intervals or to specific required times, and the size is limited by the display size of the device.
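As a non-limiting illustration of the equal-interval option described above, the capture times can be sketched as follows. This is a minimal Python sketch, not the AVAsset-based implementation; the function name `capture_times` and its parameters are hypothetical, and starting at time 0 is an assumption.

```python
def capture_times(duration_s, interval_s):
    """Return equally spaced capture times (in seconds) inside [0, duration_s).

    Assumption: the first capture is taken at time 0 and the end of the
    video is excluded. Both arguments are hypothetical parameter names.
    """
    if duration_s <= 0 or interval_s <= 0:
        raise ValueError("duration_s and interval_s must be positive")
    times = []
    k = 0
    # Multiply instead of accumulating to avoid floating-point drift.
    while k * interval_s < duration_s:
        times.append(k * interval_s)
        k += 1
    return times


if __name__ == "__main__":
    print(capture_times(10, 4))
```

Capturing at specific required times would simply replace the generated list with the user-supplied times.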
Further as a preferred embodiment of the method, the method further comprises the following steps:
S103, further extracting from the key frames to obtain key frames with redundancy removed.
Specifically, for various reasons the extraction of key frames from a video inevitably produces redundant key frames. The selected key frames are therefore further filtered according to certain rules, such as the temporal distance between the extracted key frames, and a clustering method is used to further extract the key frames.
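The temporal-distance rule mentioned above could, for example, take the following form. This is only a sketch under stated assumptions: the text does not fix the rule, so the greedy "keep a frame only if it is at least `min_gap_s` after the last kept frame" policy, the function name and the parameter are all hypothetical.

```python
def prune_by_time_gap(timestamps, min_gap_s):
    """Drop key frames that follow the previously kept key frame too closely.

    timestamps: capture times of the key frames, in seconds.
    min_gap_s:  assumed minimum temporal distance between kept key frames.
    """
    kept = []
    for t in sorted(timestamps):
        # Keep the first frame, then only frames far enough from the last kept one.
        if not kept or t - kept[-1] >= min_gap_s:
            kept.append(t)
    return kept


if __name__ == "__main__":
    print(prune_by_time_gap([0, 0.5, 2, 2.4, 5], 1))
```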
As a further preferred embodiment of the method, the step of parsing the video file based on AVAsset to obtain the basic information of the video file specifically includes:
receiving a video file, and creating an AVAsset object from the video file;
and accessing the AVAsset object, and acquiring the basic information of the video file according to preset AVAsset attributes.
Specifically, the basic model of an audiovisual media file is extended by creating an AVAsset object, with the AVAsset attributes set according to the video information that the user needs to obtain.
As a preferred embodiment of the method, the step of capturing key frames according to the preset time and size to obtain the key frame information specifically includes:
accessing the AVAsset object;
obtaining the maximum screenshot size according to the display size of the current device, and capturing an image according to the preset time and the maximum screenshot size, the captured image serving as a key frame;
and acquiring supplementary information of the key frame, and storing the key frame and its supplementary information locally to obtain the key frame information.
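One possible way to derive the maximum screenshot size from the display size is sketched below. The text only states that the size is limited by the display; preserving the aspect ratio and never upscaling are assumptions of this sketch, and `max_screenshot_size` is a hypothetical name.

```python
def max_screenshot_size(video_w, video_h, display_w, display_h):
    """Largest (width, height) that fits the display.

    Assumptions: the aspect ratio of the video is preserved and the
    screenshot is never larger than the original video frame.
    """
    if min(video_w, video_h, display_w, display_h) <= 0:
        raise ValueError("all dimensions must be positive")
    # Scale factor limited by both display dimensions and capped at 1.0.
    scale = min(display_w / video_w, display_h / video_h, 1.0)
    return (max(1, int(video_w * scale)), max(1, int(video_h * scale)))


if __name__ == "__main__":
    print(max_screenshot_size(1920, 1080, 960, 960))
```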
Further as a preferred embodiment of the method, the basic information includes a video duration, a video size, and a video cover map.
Specifically, the basic information obtained depends on which AVAsset attributes are set; further attributes may be set as needed to acquire more information.
Further as a preferred embodiment of the method, the step of further extracting the key frame to obtain the key frame with redundancy removed specifically includes:
dividing the key frames into multiple types of video segments; calculating the average mutual information amount of each video segment; extracting the maximum frame and the minimum frame of each video segment, i.e. the frames with the largest and smallest mutual information amount; if the difference between the maximum frame and the minimum frame of a video segment is greater than the video frequency of that video segment, selecting both the maximum frame and the minimum frame as the final key frames of the video segment; otherwise, selecting, from the maximum frame and the minimum frame, the frame closest to the average mutual information amount as the final key frame of the video segment.
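The selection rule of this step can be sketched for a single video segment as follows. This is a sketch under assumptions, not the claimed implementation: the "difference between the maximum frame and the minimum frame" is read as the difference of their mutual information amounts, and the hypothetical parameter `threshold` stands in for the quantity the text calls the video frequency of the segment.

```python
def select_final_key_frames(mi_values, threshold):
    """Return the positions of the final key frame(s) within one segment.

    mi_values: mutual information amount of each frame in the segment.
    threshold: assumed stand-in for the segment's comparison value.
    """
    if not mi_values:
        return []
    mean_mi = sum(mi_values) / len(mi_values)
    positions = range(len(mi_values))
    i_max = max(positions, key=lambda i: mi_values[i])  # maximum frame
    i_min = min(positions, key=lambda i: mi_values[i])  # minimum frame
    if mi_values[i_max] - mi_values[i_min] > threshold:
        # Large spread: keep both extreme frames.
        return sorted({i_max, i_min})
    # Small spread: keep whichever extreme frame is closer to the average.
    return [min((i_max, i_min), key=lambda i: abs(mi_values[i] - mean_mi))]


if __name__ == "__main__":
    print(select_final_key_frames([0.9, 0.2, 0.5], 0.5))
    print(select_final_key_frames([0.9, 0.2, 0.5], 1.0))
```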
Specifically, the mutual information amount between adjacent frames is used as the characteristic parameter of inter-frame change: the smaller the mutual information amount, the larger the inter-frame change. A threshold is set, and the key frames are divided into multiple types of video segments according to the threshold. In each type, the frames with the maximum mutual information amount, the minimum mutual information amount, and the mutual information amount closest to the average are selected as candidate key frames. Finally, the final key frames are selected from the candidate frames and stored locally.
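For illustration, the mutual information amount between two adjacent frames and the threshold-based division can be sketched in Python. Frames are assumed to be flat lists of grey levels of equal length, and cutting a new segment whenever the mutual information falls below the threshold is one assumed reading of "divided according to the threshold"; the function names are hypothetical.

```python
from collections import Counter
from math import log2


def mutual_information(frame_a, frame_b):
    """Mutual information (in bits) between two equally sized grey-level frames."""
    if len(frame_a) != len(frame_b) or not frame_a:
        raise ValueError("frames must be non-empty and of equal length")
    n = len(frame_a)
    joint = Counter(zip(frame_a, frame_b))  # joint grey-level histogram
    hist_a = Counter(frame_a)               # marginal histograms
    hist_b = Counter(frame_b)
    mi = 0.0
    for (a, b), count in joint.items():
        # p(a,b) * log2( p(a,b) / (p(a) * p(b)) ), written with raw counts
        mi += (count / n) * log2(count * n / (hist_a[a] * hist_b[b]))
    return mi


def split_segments(frames, threshold):
    """Group frame positions; a low mutual information starts a new segment."""
    if not frames:
        return []
    segments = [[0]]
    for i in range(1, len(frames)):
        if mutual_information(frames[i - 1], frames[i]) < threshold:
            segments.append([i])      # large inter-frame change
        else:
            segments[-1].append(i)    # similar to the previous frame
    return segments


if __name__ == "__main__":
    frames = [[0, 0, 1, 1], [0, 0, 1, 1], [0, 1, 0, 1], [0, 1, 0, 1]]
    print(split_segments(frames, 0.5))
```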
Further as a preferred embodiment of the method, the supplementary information of the key frame includes a difference frame and a bidirectional difference frame.
Specifically, a difference frame records the difference between a frame and the previous frame, and can serve as a reference frame when subsequent images are encoded. On decoding, the difference defined by the frame is superimposed on the previously buffered picture to generate the final picture.
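The relationship between a difference frame and the buffered picture can be illustrated with a small sketch. Frames are assumed to be flat lists of pixel values, and the per-pixel subtraction is an illustrative simplification rather than the encoding used by any particular codec; the bidirectional difference frame is not sketched because the text does not define it. Function names are hypothetical.

```python
def difference_frame(previous, current):
    """Per-pixel difference between a frame and the previous frame."""
    if len(previous) != len(current):
        raise ValueError("frames must have the same number of pixels")
    return [c - p for p, c in zip(previous, current)]


def reconstruct(buffered, diff):
    """Superimpose a difference frame on the buffered picture."""
    if len(buffered) != len(diff):
        raise ValueError("frame and difference must have the same length")
    return [b + d for b, d in zip(buffered, diff)]


if __name__ == "__main__":
    prev, cur = [10, 20, 30], [12, 18, 30]
    diff = difference_frame(prev, cur)
    print(diff, reconstruct(prev, diff) == cur)
```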
As shown in FIG. 2, a video information extraction system includes:
an analysis module, used for receiving the video file and parsing the video file based on AVAsset to obtain the basic information of the video file;
and a screenshot module, used for capturing key frames according to the preset time and size to obtain the key frame information.
Preferably, the video information extraction system further includes:
a simplification module, used for further extracting from the key frames to obtain key frames with redundancy removed.
The specific embodiment of the invention is as follows:
An input video file is received, an AVAsset object is created, and the AVAsset attributes are set as needed. The video file is parsed according to the preset attributes to obtain its basic information, and key frames are captured at the same time. The difference frames and bidirectional difference frames of the captured key frames are then obtained to provide more complete key frame data. Finally, the key frames are further extracted by a clustering method to reduce redundant key frames, and the resulting key frames are stored locally.
The contents in the above method embodiments are all applicable to the present apparatus embodiment, the functions specifically implemented by the present apparatus embodiment are the same as those in the above method embodiments, and the advantageous effects achieved by the present apparatus embodiment are also the same as those achieved by the above method embodiments.
From the above, the present invention has the advantages of:
1. Key frames are used to represent the video, which makes the video more concise, reduces the large amount of redundant information between video frames, and saves storage space.
2. An index can be established from the low-level information of the key frames, so that a user can conveniently query the video content.
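As one hypothetical illustration of the second advantage, an index over key frames could be keyed by capture time. The text does not specify which low-level information is indexed, so the use of timestamps, the class name `KeyFrameIndex` and the entry layout are assumptions of this sketch.

```python
from bisect import bisect_right


class KeyFrameIndex:
    """Look up the key frame in effect at a given playback time."""

    def __init__(self, entries):
        # entries: iterable of (time_s, label) pairs, in any order
        self._entries = sorted(entries)
        self._times = [t for t, _ in self._entries]

    def lookup(self, time_s):
        """Return the latest key frame at or before time_s, or None."""
        i = bisect_right(self._times, time_s) - 1
        return self._entries[i] if i >= 0 else None


if __name__ == "__main__":
    index = KeyFrameIndex([(10.0, "b"), (0.0, "a"), (25.0, "c")])
    print(index.lookup(12.5))
```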
While the preferred embodiments of the present invention have been illustrated and described, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.

Claims (10)

1. A video information extraction method, characterized by comprising the following steps:
receiving a video file, and parsing the video file based on AVAsset to obtain basic information of the video file;
and capturing key frames according to a preset time and size to obtain key frame information.
2. The method of claim 1, further comprising:
further extracting from the key frames to obtain key frames with redundancy removed.
3. The method of claim 1, wherein the step of receiving the video file and parsing the video file based on AVAsset to obtain the basic information of the video file specifically includes:
receiving a video file, and creating an AVAsset object from the video file;
and accessing the AVAsset object, and acquiring the basic information of the video file according to preset AVAsset attributes.
4. The method of claim 3, wherein the step of capturing key frames according to the preset time and size to obtain the key frame information specifically includes:
accessing the AVAsset object;
obtaining the maximum screenshot size according to the display size of the current device, and capturing an image according to the preset time and the maximum screenshot size, the captured image serving as a key frame;
and acquiring supplementary information of the key frame, and storing the key frame and its supplementary information locally to obtain the key frame information.
5. A method as claimed in claim 3, wherein: the basic information comprises video duration, video size and a video cover picture.
6. The method of claim 2, wherein the step of further extracting from the key frames to obtain key frames with redundancy removed specifically includes:
dividing the key frames into multiple types of video segments; calculating the average mutual information amount of each video segment; extracting the maximum frame and the minimum frame of each video segment, i.e. the frames with the largest and smallest mutual information amount; if the difference between the maximum frame and the minimum frame of a video segment is greater than the video frequency of that video segment, selecting both the maximum frame and the minimum frame as the final key frames of the video segment; otherwise, selecting, from the maximum frame and the minimum frame, the frame closest to the average mutual information amount as the final key frame of the video segment.
7. The method of claim 4, wherein the supplementary information of the key frame includes a difference frame and a bidirectional difference frame.
8. A video information extraction system, characterized by comprising:
an analysis module, used for receiving the video file and parsing the video file based on AVAsset to obtain the basic information of the video file;
and a screenshot module, used for capturing key frames according to the preset time and size to obtain the key frame information.
9. The video information extraction system according to claim 8, further comprising:
a simplification module, used for further extracting from the key frames to obtain key frames with redundancy removed.
10. A video information extraction device, characterized by comprising:
at least one processor;
at least one memory for storing at least one program;
when the at least one program is executed by the at least one processor, the at least one processor is caused to implement the video information extraction method as claimed in any one of claims 1 to 7.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910930076.7A CN110719520A (en) 2019-09-29 2019-09-29 Video information extraction method, system and device

Publications (1)

Publication Number Publication Date
CN110719520A true CN110719520A (en) 2020-01-21

Family

ID=69212030

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910930076.7A Pending CN110719520A (en) 2019-09-29 2019-09-29 Video information extraction method, system and device

Country Status (1)

Country Link
CN (1) CN110719520A (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2008236390A (en) * 2007-03-20 2008-10-02 Asv:Kk Video seek control method in video reproduction system
CN102780919A (en) * 2012-08-24 2012-11-14 乐视网信息技术(北京)股份有限公司 Method for carrying out video location and displaying through key frame
CN104540000A (en) * 2014-12-04 2015-04-22 广东欧珀移动通信有限公司 Method for generating dynamic thumbnail and terminal
CN105072456A (en) * 2015-07-14 2015-11-18 中国科学院信息工程研究所 Hadoop-based ciphertext video flow processing method, device, server and system
CN105139421A (en) * 2015-08-14 2015-12-09 西安西拓电气股份有限公司 Video key frame extracting method of electric power system based on amount of mutual information
CN107231581A (en) * 2016-03-25 2017-10-03 中国电信股份有限公司 Method, system and streaming media playing for video playback control server
CN109660852A (en) * 2017-10-10 2019-04-19 武汉斗鱼网络科技有限公司 Video previewing method, storage medium, equipment and system before recorded video publication

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112528748A (en) * 2020-11-16 2021-03-19 南京邮电大学 Method for identifying and intercepting static slide from video
CN112528748B (en) * 2020-11-16 2022-08-26 南京邮电大学 Method for identifying and intercepting static slide from video

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20221010

Address after: Room 1602, 16th Floor, Building 18, Yard 6, Wenhuayuan West Road, Beijing Economic and Technological Development Zone, Daxing District, Beijing 100176

Applicant after: Beijing Lajin Zhongbo Technology Co.,Ltd.

Address before: 310000 room 650, building 3, No. 16, Zhuantang science and technology economic block, Xihu District, Hangzhou City, Zhejiang Province

Applicant before: Tianmai Juyuan (Hangzhou) Media Technology Co.,Ltd.

WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20200121