CN113205010A - Intelligent disaster-exploration on-site video frame efficient compression system and method based on target clustering - Google Patents

Intelligent disaster-exploration on-site video frame efficient compression system and method based on target clustering Download PDF

Info

Publication number
CN113205010A
CN113205010A CN202110418264.9A CN202110418264A CN113205010A CN 113205010 A CN113205010 A CN 113205010A CN 202110418264 A CN202110418264 A CN 202110418264A CN 113205010 A CN113205010 A CN 113205010A
Authority
CN
China
Prior art keywords
frame
target
video
video frame
data block
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110418264.9A
Other languages
Chinese (zh)
Other versions
CN113205010B (en
Inventor
陈寿平
何建宗
刘仁恭
刘通
袁展图
钟荣富
冼庆祺
赵善龙
萧镜辉
林钦文
李文丁
王宇斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Dongguan Power Supply Bureau of Guangdong Power Grid Co Ltd
Original Assignee
Dongguan Power Supply Bureau of Guangdong Power Grid Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dongguan Power Supply Bureau of Guangdong Power Grid Co Ltd filed Critical Dongguan Power Supply Bureau of Guangdong Power Grid Co Ltd
Priority to CN202110418264.9A priority Critical patent/CN113205010B/en
Publication of CN113205010A publication Critical patent/CN113205010A/en
Application granted granted Critical
Publication of CN113205010B publication Critical patent/CN113205010B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/26Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion
    • G06V10/267Segmentation of patterns in the image field; Cutting or merging of image elements to establish the pattern region, e.g. clustering-based techniques; Detection of occlusion by performing operations on regions, e.g. growing, shrinking or watersheds
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/30Noise filtering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/46Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
    • G06V10/462Salient features, e.g. scale invariant feature transforms [SIFT]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/49Segmenting video sequences, i.e. computational techniques such as parsing or cutting the sequence, low-level clustering or determining units such as shots or scenes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/50Context or environment of the image
    • G06V20/52Surveillance or monitoring of activities, e.g. for recognising suspicious objects
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation

Abstract

The invention discloses an intelligent disaster-exploration on-site video frame efficient compression system based on target clustering, wherein a frame sequence extraction module is used for obtaining a monitoring video frame sequence; the video frame segmentation module is used for obtaining a foreground data block and a target data block; the video frame judgment module is used for obtaining a target frame set and a monitoring video background frame set; the target frame clustering module is used for carrying out clustering analysis on target frames in the monitoring video target frame set; the video frame compression module is used for obtaining target frame compression data and background frame compression data. The invention can eliminate information redundancy, improve the compression ratio, accelerate the compression speed and further improve the video data transmission speed.

Description

Intelligent disaster-exploration on-site video frame efficient compression system and method based on target clustering
Technical Field
The invention relates to the technical field of intelligent disaster exploration emergency command site video information processing, in particular to a system and a method for efficiently compressing a video frame of an intelligent disaster exploration site based on target clustering.
Background
In the field of electric power emergency repair command, a plurality of monitoring devices based on intelligent sensing are deployed on site and are responsible for collecting site data, the site data are uploaded to a command center in real time through a wireless network, a central server decompresses, analyzes and excavates video data, the command center manages and controls the site in real time, and decision support is provided for emergency command.
At present, video monitoring is one of important means for collecting power emergency repair data, and real-time uploading of massive high-definition video data is completed, so that timeliness of emergency repair is directly limited. In a disaster site, in consideration of factors such as reduction of reliability of a wireless channel, limitation of broadband resources, reduction of video definition and the like, great challenges are brought to reliable real-time transmission of massive high-definition video data, and a high-magnification video frame compression method needs to be researched to reduce transmission of redundant information in video stream and realize reliable and rapid transmission of video stream data.
However, existing video compression methods, such as h.264/h.265, implement content compression based on content similarity, and have a large dependency on hardware storage resources and a low compression rate in an application process, which further limits a throughput rate in a video decompression process. Therefore, the research on the high-magnification video frame compression method suitable for the fixed position monitoring equipment in the intelligent disaster exploration site has important significance in ensuring the video stream compression rate, reducing the video data transmission amount, increasing the video decompression throughput, realizing the safe, reliable and rapid transmission of the video stream, and providing support for the auxiliary decision of the emergency repair command center.
Disclosure of Invention
The invention aims to provide a system and a method for efficiently compressing video frames of an intelligent disaster-exploration site based on target clustering, which can solve the problems of high dependence on hardware storage resources, low compression ratio and low video decompression throughput in the application process of the conventional video compression technology.
In order to achieve the purpose, the invention designs an intelligent disaster-exploration field video frame high-efficiency compression system based on target clustering, which is characterized in that: the video frame clustering system comprises a frame sequence extraction module, a video frame segmentation module, a video frame judgment module, a target frame clustering module and a video frame compression module;
the frame sequence extraction module is used for converting the monitoring video data in a preset time period into a corresponding monitoring video frame sequence;
the video frame segmentation module is used for carrying out image segmentation on the monitoring video frame sequence by adopting an improved watershed algorithm based on gray threshold segmentation to form a foreground data block and a target data block;
the video frame judging module is used for judging each frame of the monitoring video frame sequence according to the sequence of the frame sequence, judging whether each frame of the monitoring video frame sequence contains a target data block, wherein the frames containing the target data block form a monitoring video target frame set arranged according to a time sequence, the frames not containing the target data block form a monitoring video background frame set arranged according to the time sequence, the monitoring video target frame set contains a foreground data block and a target data block, and the monitoring video background frame set only contains the foreground data block;
the target frame clustering module is used for carrying out clustering analysis on target frames in the monitoring video target frame set, dividing the target frames containing the same target into a subset, and carrying out segmentation processing on the monitoring video frame sequence according to a clustering result so that each segment of video frame contains the same target;
the video frame compression module is used for compressing the video frame segments containing the same target by adopting a video frame compression method based on target frame combination to obtain target frame compression data; the video frame compression module is further used for compressing the monitoring video background frame set by adopting a video frame compression method based on background frame combination to obtain background frame compression data.
The invention has the beneficial effects that:
the invention divides the video frame into the target data block set and the background data block set through the image segmentation, the target clustering and other technologies, and respectively compresses the video frame, and the adoption of the separate compression method is favorable for eliminating information redundancy, improving the compression ratio, accelerating the compression speed and further improving the video data transmission speed because the similarity between the blocks in the target data block set is extremely high and the similarity between the blocks in the background data block set is also high.
Drawings
FIG. 1 is a schematic structural view of the present invention;
FIG. 2 is a flowchart of an efficient compression method for video frames in an intelligent disaster-exploration site based on target clustering according to an embodiment of the present invention;
FIG. 3 is a flowchart of an image region segmentation method based on threshold segmentation according to an embodiment of the present invention;
FIG. 4 is a flowchart of a video compression method based on H.264 encoding rules according to an embodiment of the present invention;
the video frame reconstruction system comprises a frame sequence extraction module 1, a video frame segmentation module 2, a video frame judgment module 3, a target frame clustering module 4, a video frame compression module 5 and a video reconstruction module 6.
Detailed Description
The invention is described in further detail below with reference to the following figures and specific examples:
as shown in fig. 1, the system for efficiently compressing video frames in an intelligent disaster-exploration site based on target clustering is characterized in that: the video frame clustering system comprises a frame sequence extraction module 1, a video frame segmentation module 2, a video frame judgment module 3, a target frame clustering module 4 and a video frame compression module 5;
the frame sequence extraction module 1 is used for converting the monitoring video data in a preset time period into a corresponding monitoring video frame sequence;
the video frame segmentation module 2 is used for carrying out image segmentation on the monitoring video frame sequence by adopting an improved watershed algorithm based on gray threshold segmentation to form a foreground data block and a target data block, and the operation process is simple and stable in the mode, and a better image segmentation result can be obtained;
the video frame judging module 3 is used for judging each frame of the monitoring video frame sequence according to the sequence of the frame sequence, judging whether each frame of the monitoring video frame sequence contains a target data block, wherein the frames containing the target data block form a monitoring video target frame set arranged according to a time sequence, the frames not containing the target data block form a monitoring video background frame set arranged according to the time sequence, the monitoring video target frame set contains a foreground data block and a target data block, the monitoring video background frame set only contains the foreground data block, and the video frames are divided into the foreground data block and the target data block according to an image segmentation result and are compressed separately, so that the compression speed can be effectively increased;
the target frame clustering module 4 is used for clustering target frames in the monitoring video target frame set, dividing the target frames containing the same target into a subset, performing segmentation processing on the monitoring video frame sequence according to a clustering result to enable each segment of video frame to contain the same target, clustering the target frames, and dividing the frame sequence containing the same type of target into a subset through clustering, so that the higher subsequent compression speed is facilitated;
the video frame compression module 5 is used for compressing the video frame segments containing the same target by adopting a video frame compression method based on target frame combination to obtain target frame compression data; the video frame compression module 5 is further configured to perform compression processing on the monitoring video background frame set by using a video frame compression method based on background frame merging to obtain background frame compression data, and separately compress the target frame and the background frame, so as to reduce data redundancy and accelerate compression speed and efficiency.
In the above technical solution, the video reconstruction module 6 is further included, and the video reconstruction module 6 is configured to perform inverse quantization and decoding on the target frame compressed data and the background frame compressed data; according to different data block division rules, image frame recombination is carried out to obtain a complete target frame set and a complete background frame set; and then, by overlapping the boundary frames in the target frame data and the background frame data and deleting redundant data, the reconstruction of the whole monitoring video data can be completed.
In the above technical solution, the video frame segmentation module 2 adopts an improved watershed algorithm based on gray threshold segmentation to perform image segmentation on a monitored video frame sequence, and the specific method for forming the foreground data block and the target data block comprises:
firstly, performing median filtering denoising pretreatment on an input video frame sequence;
then, sequentially adopting an improved watershed model to perform image threshold segmentation on the filtered and denoised frame image sequence, wherein the segmentation process comprises the following steps: traversing the image, obtaining gradient data of each pixel of a frame image sequence by using a Sobel operator, performing flood treatment, starting flood treatment from the minimum gradient, marking basin regions, merging the regions with the number of pixel points smaller than a given threshold, performing initial marking of the frame image sequence according to the gray level of the pixel points, wherein one marking point represents the central point of a region with the gray level within a preset range, then performing region merging according to the Euclidean distance between the marking points, generating new marking points after merging, performing super change, updating a marking region table, finally obtaining a binary image of the image, wherein the white region of the binary image is a target data block, the black region is a foreground data block, segmenting the image by using an improved watershed model, and dividing the video frame into a foreground data block and a target data block according to a segmentation result, the compression is carried out separately, and the compression speed can be effectively accelerated.
In the above technical solution, the specific process of the target frame clustering module 4 performing the segmentation processing on the monitored video frame sequence according to the clustering result is as follows: extracting SIFT (Scale-invariant feature transform) feature points of adjacent target data blocks in a monitored video frame sequence, and determining SIFT feature vectors of targets; determining the similarity between the adjacent target data blocks according to the matching number of SIFT feature points of the two target data blocks; extracting key frames according to a preset similarity threshold, and extracting all key frame sequences for the target data block; and taking the key frame as a dividing line, sequentially dividing the monitoring video frame sequence into a plurality of different video segments, wherein each segment contains the same target, and carrying out SIFT feature extraction on the images for target clustering.
In the above technical solution, the video frame compression module 5 is configured to perform quantization and coding on video frame segments containing the same target by using a video frame compression method based on target frame merging and using an h.264 coding rule to perform compression processing, so as to obtain target frame compression data.
The video frame compression module 5 is further configured to perform quantization and coding compression processing on the monitoring video background frame set by using a video frame compression method based on background frame merging and by using an h.264 coding rule, so as to obtain target frame compression data.
In the above technical solution, the video reconstruction module 6 is configured to perform inverse quantization and decoding on the target frame compressed data and the background frame compressed data according to an h.264 coding rule.
An intelligent disaster-exploration on-site video frame efficient compression method based on target clustering is characterized in that:
step 1: converting monitoring video data in a preset time period into a corresponding monitoring video frame sequence;
step 2: performing image segmentation on the monitoring video frame sequence by adopting an improved watershed algorithm based on gray threshold segmentation to form a foreground data block and a target data block;
and step 3: judging each frame of the monitoring video frame sequence according to the sequence of the frame sequence, and judging whether each frame of the monitoring video frame sequence contains a target data block, wherein the frames containing the target data block form a monitoring video target frame set arranged according to a time sequence, the frames not containing the target data block form a monitoring video background frame set arranged according to the time sequence, the monitoring video target frame set contains a foreground data block and a target data block, and the monitoring video background frame set only contains the foreground data block;
and 4, step 4: performing cluster analysis on target frames in a monitoring video target frame set, dividing the target frames containing the same target into a subset, and performing segmentation processing on the monitoring video frame sequence according to a clustering result to enable each segment of video frame to contain the same target;
and 5: compressing the video frame segments containing the same target by adopting a video frame compression method based on target frame combination to obtain target frame compressed data; and compressing the monitoring video background frame set by adopting a video frame compression method based on background frame merging to obtain background frame compressed data.
The specific method for obtaining the monitoring video frame sequence in the step 1 is to select and download a field monitoring video, intercept a section of the field monitoring video as original video data, perform size scaling processing on the image, and sequentially number the video frames according to a time sequence to obtain a video frame image sequence with uniform size.
In step 3, for the monitoring device of the fixed airplane position, the video images monitored normally in daily life are almost the same and can be regarded as background frames, when abnormal conditions such as foreign matter invasion, serious damage of the monitoring device or disaster occur in the monitoring area, new targets appear in the monitoring images, and the image frames can be regarded as target frames. Therefore, whether the video frames contain related targets, such as foreign matters of the power transmission line, typical disasters and the like, can be sequentially judged according to the segmentation result, and the video frames are determined to be target frames and background frames to form a target frame set and a background frame set;
in the step 5, the video segments containing the same target are compressed by a video frame compression method based on target frame merging, which includes the following steps:
firstly, dividing a target frame into a minimum data block containing a current target and a plurality of remaining background blocks by adopting a target-based dividing mode, and dividing all data frames containing the same target in a set by adopting the same dividing rule to obtain a divided data block set;
then, adopting a video compression method in an H.264 coding rule to carry out quantization processing on all frames in the data block, removing high-frequency components, and outputting a transformed signal set;
and finally, coding the data block by adopting a video compression method in an H.264 coding rule to form an I, P, B frame sequence, outputting the I, P, B frame sequence and compressing video frames, wherein I \ P \ B are three frames defined in an H264 protocol respectively, the I frame is a complete coding frame, a frame which only contains difference part coding and is generated by the I frame is a P frame, and frames which refer to the front frame and the rear frame coding are B frames.
For the monitoring video background frame set, a video frame compression method based on background frame merging is adopted for compression, the compression method is also based on a video compression method specified in an H.264 coding rule, firstly, data block division is carried out, secondly, quantization and coding are carried out on the data block, compression is completed, and data transmission is carried out.
Referring to fig. 2, this embodiment is completed according to the following steps: extracting monitored video data according to a disaster site, and extracting a section of video as a video frame sequence (frame 1, frame 2, …, frame 1000) to be processed, wherein the length of the video frame sequence is 1000; dividing the video frame by adopting an image region division method based on threshold value division, and respectively dividing the ith frame into miA data block; classifying all frames according to whether the image frames contain targets or not, and dividing the frames into a target frame set and a background frame set; for a target frame set, a video frame segmentation method based on SIFT feature point matching is adopted to segment target data blocks, and the target data blocks are sequentially divided intoThe p segments are sequentially encoded and compressed by adopting an H.264-based encoding rule to obtain a compressed target data block; and for the background frame set, directly adopting an H.264 coding rule to carry out coding compression to obtain a compressed background data block. The specific implementation method comprises the following steps:
collecting the monitoring video data of the emergency repair site, intercepting a section of video as a video section to be processed, and then extracting a frame sequence of the video section to form an image frame sequence with the length of N.
Referring to fig. 3, the method for segmenting each frame of image by using an image region segmentation method based on threshold segmentation to obtain data blocks includes the following steps: first, a frame image is subjected to denoising processing. Performing median filtering denoising pretreatment on an input video frame sequence; and secondly, sequentially adopting an improved watershed model to perform image threshold segmentation on the frame image sequence. Then, outputting the image frame segmentation result; and finally, sequentially carrying out next frame image segmentation processing, repeating the steps until the last frame of the sequence is reached, and outputting a video frame sequence segmentation result.
According to the segmentation result, whether the video frame includes related targets, such as transmission line foreign objects, power equipment, typical disasters, etc., is sequentially determined, and finally the target frame set and the background frame set are formed by determining the frames numbered as 185, 386-488, 765-1000 as the target data frame and the frames numbered as 1-154, 186-385, 489-764 as the background data frame.
For all image frames in the target frame set, a video frame segmentation method based on SIFT feature point matching is adopted to segment the target data blocks, and the process specifically comprises the following steps: extracting SIFT feature points of adjacent target blocks, and determining SIFT feature vectors of the targets; determining the similarity between the adjacent target blocks according to the matching number of the feature points of the two target blocks; extracting key frames, and extracting all key frame sequences for the target data block; and taking the key frame as a dividing line, and sequentially dividing the target data block into a plurality of different target sections. Finally, the third segment 765-1000 in the target frame set is subdivided into 2 video segments, namely {765-890} and {891-1000 }.
Referring to fig. 4, video segments containing the same target are compressed by a video frame compression method based on target frame merging, and first, video frames belonging to the same type of target in a target frame set are divided by the same division rule by a data block division method based on the target; each data block is then quantized and encoded for video compression based on h.264 coding rules. And aiming at the background frame set, adopting a video frame compression method based on background frame merging to compress all data blocks in the background data block set, firstly adopting an average division principle to divide the data blocks of the background data, and then adopting H.264 to quantize and change the data blocks to compress the video. Resulting in compressed video data.
And performing video reconstruction on the compressed video data. According to the above process, two different types of compressed data can be obtained: target frame compressed data and background frame compressed data. The video reconstruction process is as follows: firstly, according to the H.264 coding rule, respectively carrying out inverse quantization and decoding on two types of compressed data; then, according to different data block division rules, image frame recombination is carried out to obtain a complete target frame set and a complete background frame set; and then, by overlapping the boundary frames in the target frame data and the background frame data and deleting redundant data, the reconstruction of the whole video segment can be completed.
Further, the video frame compression result is finally output.
In summary, the video frame is divided into the target data block set and the background data block set by the technologies of image segmentation, target clustering and the like, and the target data block set and the background data block set are combined and compressed respectively.
Details not described in this specification are within the skill of the art that are well known to those skilled in the art.

Claims (10)

1. An intelligent disaster-exploration on-site video frame efficient compression system based on target clustering is characterized in that: the video frame clustering system comprises a frame sequence extraction module (1), a video frame segmentation module (2), a video frame judgment module (3), a target frame clustering module (4) and a video frame compression module (5);
the frame sequence extraction module (1) is used for converting monitoring video data in a preset time period into a corresponding monitoring video frame sequence;
the video frame segmentation module (2) is used for carrying out image segmentation on the monitoring video frame sequence by adopting an improved watershed algorithm based on gray threshold segmentation to form a foreground data block and a target data block;
the video frame judging module (3) is used for judging each frame of the monitoring video frame sequence according to the sequence of the frame sequence, judging whether each frame of the monitoring video frame sequence contains a target data block, wherein the frames containing the target data block form a monitoring video target frame set arranged according to a time sequence, the frames not containing the target data block form a monitoring video background frame set arranged according to the time sequence, the monitoring video target frame set contains a foreground data block and a target data block, and the monitoring video background frame set only contains the foreground data block;
the target frame clustering module (4) is used for clustering and analyzing target frames in the monitoring video target frame set, dividing the target frames containing the same target into a subset, and performing segmentation processing on the monitoring video frame sequence according to a clustering result to enable each segment of video frame to contain the same target;
the video frame compression module (5) is used for compressing the video frame segments containing the same target by adopting a video frame compression method based on target frame combination to obtain target frame compression data; the video frame compression module (5) is also used for compressing the monitoring video background frame set by adopting a video frame compression method based on background frame combination to obtain background frame compression data.
2. The intelligent disaster-exploration on-site video frame efficient compression system based on target clustering as claimed in claim 1, wherein: the video reconstruction method comprises a video reconstruction module (6), wherein the video reconstruction module (6) is used for carrying out inverse quantization and decoding on target frame compressed data and background frame compressed data; according to different data block division rules, image frame recombination is carried out to obtain a complete target frame set and a complete background frame set; and then, by overlapping the boundary frames in the target frame data and the background frame data and deleting redundant data, the reconstruction of the whole monitoring video data can be completed.
3. The intelligent disaster-exploration on-site video frame efficient compression system based on target clustering as claimed in claim 1, wherein: the video frame segmentation module (2) adopts an improved watershed algorithm based on gray threshold segmentation to perform image segmentation on a monitoring video frame sequence to form a foreground data block and a target data block, and the specific method comprises the following steps:
firstly, performing median filtering denoising pretreatment on an input video frame sequence;
then, sequentially adopting an improved watershed model to perform image threshold segmentation on the filtered and denoised frame image sequence, wherein the segmentation process comprises the following steps: obtaining gradient data of each pixel of a frame image sequence by utilizing a Sobel operator, carrying out flooding treatment from the minimum gradient, marking a basin region, then carrying out merging treatment on regions with the number of pixel points smaller than a given threshold value, carrying out initial marking on the frame image sequence according to the gray level of the pixel points, wherein one marking point represents the central point of a region with the gray level within a preset range, then carrying out region merging according to the Euclidean distance between the marking points, generating new marking points for super change after merging, updating a marking region table, and finally obtaining a binary image of the image, wherein the white region of the binary image is a target data block, and the black region is a foreground data block.
4. The intelligent disaster-exploration on-site video frame efficient compression system based on target clustering as claimed in claim 1, wherein: the specific process of the target frame clustering module (4) for carrying out segmentation processing on the monitoring video frame sequence according to the clustering result is as follows: extracting SIFT feature points of adjacent target data blocks in a monitoring video frame sequence, and determining SIFT feature vectors of targets; determining the similarity between the adjacent target data blocks according to the matching number of SIFT feature points of the two target data blocks; extracting key frames according to a preset similarity threshold, and extracting all key frame sequences for the target data block; the key frame is taken as a dividing line, the monitoring video frame sequence is sequentially divided into a plurality of different video segments, and each segment contains the same target.
5. The intelligent disaster-exploration on-site video frame efficient compression system based on target clustering as claimed in claim 1, wherein: the video frame compression module (5) is used for carrying out quantization and coding on video frame segments containing the same target by using a video frame compression method based on target frame combination by using an H.264 coding rule for compression processing, and obtaining target frame compression data.
6. The intelligent disaster-exploration on-site video frame efficient compression system based on target clustering as claimed in claim 1, wherein: the video frame compression module (5) is also used for carrying out quantization and coding compression processing on the monitoring video background frame set by adopting a video frame compression method based on background frame combination by utilizing an H.264 coding rule to obtain target frame compression data.
7. The intelligent disaster-exploration on-site video frame efficient compression system based on target clustering as claimed in claim 1, wherein: the video reconstruction module (6) is used for carrying out inverse quantization and decoding on the target frame compressed data and the background frame compressed data according to an H.264 coding rule.
8. An intelligent disaster-exploration on-site video frame efficient compression method based on target clustering is characterized in that:
step 1: converting monitoring video data in a preset time period into a corresponding monitoring video frame sequence;
step 2: performing image segmentation on the monitoring video frame sequence by adopting an improved watershed algorithm based on gray threshold segmentation to form a foreground data block and a target data block;
and step 3: judging each frame of the monitoring video frame sequence according to the sequence of the frame sequence, and judging whether each frame of the monitoring video frame sequence contains a target data block, wherein the frames containing the target data block form a monitoring video target frame set arranged according to a time sequence, the frames not containing the target data block form a monitoring video background frame set arranged according to the time sequence, the monitoring video target frame set contains a foreground data block and a target data block, and the monitoring video background frame set only contains the foreground data block;
and 4, step 4: performing cluster analysis on target frames in a monitoring video target frame set, dividing the target frames containing the same target into a subset, and performing segmentation processing on the monitoring video frame sequence according to a clustering result to enable each segment of video frame to contain the same target;
and 5: compressing the video frame segments containing the same target by adopting a video frame compression method based on target frame combination to obtain target frame compressed data; and compressing the monitoring video background frame set by adopting a video frame compression method based on background frame merging to obtain background frame compressed data.
9. The intelligent disaster-exploration on-site video frame efficient compression method based on target clustering according to claim 8, characterized in that: the specific method for obtaining the monitoring video frame sequence in the step 1 is to select and download a field monitoring video, intercept a section of the field monitoring video as original video data, perform size scaling processing on the image, and sequentially number the video frames according to a time sequence to obtain a video frame image sequence with uniform size.
10. The intelligent disaster-exploration on-site video frame efficient compression method based on target clustering according to claim 8, characterized in that:
in the step 5, the video segments containing the same target are compressed by a video frame compression method based on target frame merging, which includes the following steps:
firstly, dividing a target frame into a minimum data block containing a current target and a plurality of remaining background blocks by adopting a target-based dividing mode, and dividing all data frames containing the same target in a set by adopting the same dividing rule to obtain a divided data block set;
then, adopting a video compression method in an H.264 coding rule to carry out quantization processing on all frames in the data block, removing high-frequency components, and outputting a transformed signal set;
and finally, coding the data blocks by adopting a video compression method in the H.264 coding rule to form I, P, B frame sequences, outputting the sequences and realizing the compression of the video frames.
For the monitoring video background frame set, a video frame compression method based on background frame merging is adopted for compression, the compression method is also based on a video compression method specified in an H.264 coding rule, firstly, data block division is carried out, secondly, quantization and coding are carried out on the data block, compression is completed, and data transmission is carried out.
CN202110418264.9A 2021-04-19 2021-04-19 Intelligent disaster-exploration on-site video frame efficient compression system and method based on target clustering Active CN113205010B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110418264.9A CN113205010B (en) 2021-04-19 2021-04-19 Intelligent disaster-exploration on-site video frame efficient compression system and method based on target clustering

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110418264.9A CN113205010B (en) 2021-04-19 2021-04-19 Intelligent disaster-exploration on-site video frame efficient compression system and method based on target clustering

Publications (2)

Publication Number Publication Date
CN113205010A true CN113205010A (en) 2021-08-03
CN113205010B CN113205010B (en) 2023-02-28

Family

ID=77027417

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110418264.9A Active CN113205010B (en) 2021-04-19 2021-04-19 Intelligent disaster-exploration on-site video frame efficient compression system and method based on target clustering

Country Status (1)

Country Link
CN (1) CN113205010B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114157873A (en) * 2021-11-25 2022-03-08 中国通信建设第四工程局有限公司 Video compression method and video compression system

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101610411A (en) * 2009-07-16 2009-12-23 中国科学技术大学 A kind of method and system of video sequence mixed encoding and decoding
WO2010146522A2 (en) * 2009-06-14 2010-12-23 Rafael Advanced Defense Systems Ltd. Systems and methods for streaming and archiving video with geographic anchoring of frame contents
CN103093481A (en) * 2013-01-28 2013-05-08 中国科学院上海微系统与信息技术研究所 Moving object detection method under static background based on watershed segmentation
US20150249829A1 (en) * 2011-09-15 2015-09-03 Libre Communications Inc. Method, Apparatus and Computer Program Product for Video Compression
CN105407353A (en) * 2014-09-11 2016-03-16 腾讯科技(深圳)有限公司 Image compression method and apparatus
CN106203277A (en) * 2016-06-28 2016-12-07 华南理工大学 Fixed lens real-time monitor video feature extracting method based on SIFT feature cluster
WO2018187622A1 (en) * 2017-04-05 2018-10-11 Lyrical Labs Holdings, Llc Video processing and encoding
CN111901604A (en) * 2020-09-29 2020-11-06 创新奇智(南京)科技有限公司 Video compression method, video reconstruction method, corresponding devices, camera and video processing equipment
CN112085031A (en) * 2020-09-11 2020-12-15 河北工程大学 Target detection method and system

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010146522A2 (en) * 2009-06-14 2010-12-23 Rafael Advanced Defense Systems Ltd. Systems and methods for streaming and archiving video with geographic anchoring of frame contents
CN101610411A (en) * 2009-07-16 2009-12-23 中国科学技术大学 A kind of method and system of video sequence mixed encoding and decoding
US20150249829A1 (en) * 2011-09-15 2015-09-03 Libre Communications Inc. Method, Apparatus and Computer Program Product for Video Compression
CN103093481A (en) * 2013-01-28 2013-05-08 中国科学院上海微系统与信息技术研究所 Moving object detection method under static background based on watershed segmentation
CN105407353A (en) * 2014-09-11 2016-03-16 腾讯科技(深圳)有限公司 Image compression method and apparatus
CN106203277A (en) * 2016-06-28 2016-12-07 华南理工大学 Fixed lens real-time monitor video feature extracting method based on SIFT feature cluster
WO2018187622A1 (en) * 2017-04-05 2018-10-11 Lyrical Labs Holdings, Llc Video processing and encoding
CN112085031A (en) * 2020-09-11 2020-12-15 河北工程大学 Target detection method and system
CN111901604A (en) * 2020-09-29 2020-11-06 创新奇智(南京)科技有限公司 Video compression method, video reconstruction method, corresponding devices, camera and video processing equipment

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
刘尚武等: "大场景视频协同监控系统的研究与实现", 《工业控制计算机》 *
方帅等: "一种运动区域敏感的监控视频压缩算法", 《合肥工业大学学报(自然科学版)》 *
王小平: "基于运动背景的自适应视频对象分割算法", 《重庆邮电大学学报(自然科学版)》 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114157873A (en) * 2021-11-25 2022-03-08 中国通信建设第四工程局有限公司 Video compression method and video compression system

Also Published As

Publication number Publication date
CN113205010B (en) 2023-02-28

Similar Documents

Publication Publication Date Title
CN110225341B (en) Task-driven code stream structured image coding method
CN106231214B (en) High-speed CMOS sensor image approximate lossless compression method based on adjustable macro block
US8019171B2 (en) Vision-based compression
CN109996084B (en) HEVC intra-frame prediction method based on multi-branch convolutional neural network
US20230291909A1 (en) Coding video frame key points to enable reconstruction of video frame
Zhang et al. Msfc: Deep feature compression in multi-task network
CN113205010B (en) Intelligent disaster-exploration on-site video frame efficient compression system and method based on target clustering
CN111669588B (en) Ultra-high definition video compression coding and decoding method with ultra-low time delay
CN105825530B (en) Littoral zone high spectrum image distribution lossy coding and coding/decoding method based on area-of-interest
Zhao et al. Intelligent analysis oriented surveillance video coding
CN113221674A (en) Video stream key frame extraction system and method based on rough set reduction and SIFT
CN112153388A (en) Image compression method, device and related equipment
CN116896638A (en) Data compression coding technology for transmission operation detection scene
WO2023203509A1 (en) Image data compression method and device using segmentation and classification
CN112770116B (en) Method for extracting video key frame by using video compression coding information
US20230342986A1 (en) Autoencoder-based segmentation mask generation in an alpha channel
KR20220045920A (en) Method and apparatus for processing images/videos for machine vision
JP2022069398A (en) Video coding device and method, video decoding device and method, and codec system
CN113542771A (en) Video high-efficiency compression processing method based on content weight
CN113784147A (en) Efficient video coding method and system based on convolutional neural network
CN113194312A (en) Planetary science exploration image adaptive quantization coding system combined with visual saliency
Jawahar et al. Compression of leather images for automatic leather grading system using multiwavelet
CN111901606A (en) Video coding method for improving caption coding quality
Jiajia et al. Minimum structural similarity distortion for reversible data hiding
Bakkouri et al. Effective CU size decision algorithm based on depth map homogeneity for 3D-HEVC inter-coding

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant