CN115510272A - Computer data processing system based on big data analysis - Google Patents

Computer data processing system based on big data analysis Download PDF

Info

Publication number
CN115510272A
CN115510272A CN202211141160.9A CN202211141160A CN115510272A CN 115510272 A CN115510272 A CN 115510272A CN 202211141160 A CN202211141160 A CN 202211141160A CN 115510272 A CN115510272 A CN 115510272A
Authority
CN
China
Prior art keywords
data
video
download
analysis
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202211141160.9A
Other languages
Chinese (zh)
Other versions
CN115510272B (en
Inventor
钟泽灵
张灿龙
尹成鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Chuangyan Information Technology Co.,Ltd.
Original Assignee
Harbin Mengdong Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Harbin Mengdong Technology Co ltd filed Critical Harbin Mengdong Technology Co ltd
Priority to CN202211141160.9A priority Critical patent/CN115510272B/en
Publication of CN115510272A publication Critical patent/CN115510272A/en
Application granted granted Critical
Publication of CN115510272B publication Critical patent/CN115510272B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/75Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3409Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
    • G06F11/3419Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment by assessing time
    • G06F11/3423Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment by assessing time where the assessed time is active or idle time
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/7867Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, title and artist information, manually generated time, location and usage information, user ratings
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Abstract

The invention discloses a computer data processing system based on big data analysis, which comprises a storage space analysis module, a target data selection module and a state information analysis module, wherein the storage space analysis module acquires video download data stored by a current computer as analysis data, if the storage space occupied by the analysis data is larger than a storage space threshold value, the target data selection module analyzes the analysis data, if the time interval duration between the latest triggered time of certain video download data and the current time in the analysis data is larger than a duration threshold value, the video download data is the target data, and the state information analysis module analyzes the state information of the target data and judges whether the target data is deleted.

Description

Computer data processing system based on big data analysis
Technical Field
The invention relates to the technical field of computers, in particular to a computer data processing system based on big data analysis.
Background
Along with the development of science and technology, the number of people who use and purchase computers is also increasing continuously, and the use of computers brings great convenience to people. People download videos into a computer in advance, so that the videos are convenient for subsequent off-line watching, but the video data often occupy a larger storage space, and if the video data in the computer occupy the space, the running speed of the computer is slower, and the normal work of the computer is influenced.
In the prior art, the downloaded video data is generally selected and deleted manually by a user, but the situation often occurs, and the work of a computer is influenced when the user deletes the video data.
Disclosure of Invention
The present invention is directed to a computer data processing system based on big data analysis, so as to solve the problems mentioned in the background art.
In order to solve the technical problems, the invention provides the following technical scheme: a computer data processing system based on big data analysis comprises a storage space analysis module, a target data selection module and a state information analysis module, wherein the storage space analysis module acquires video download data stored by a current computer as analysis data, if the storage space occupied by the analysis data is larger than a storage space threshold value, the target data selection module is enabled to analyze the analysis data, if the time interval duration between the latest triggered time of a certain video download data and the current time in the analysis data is larger than a duration threshold value, the video download data is the target data, and the state information analysis module analyzes the state information of the target data and judges whether the target data is deleted.
The state information analysis module comprises a sorting module, a demarcation data selection module, a set dividing module, a set analysis module and a deletion control module, wherein the sorting module acquires download initiating time of video download data downloaded by a computer history, the video download data are sequentially sorted from front to back according to the download initiating time to obtain sort sorting, the demarcation data selection module selects the demarcation data in the sort sorting, if the time interval between the download initiating times of two adjacent video download data in the sort sorting is greater than an interval threshold value, the front one of the two video download data in the sort sorting is the demarcation data, the set dividing module divides a plurality of download sets according to the positions of the demarcation data in the sort sorting, wherein the video download data in one download set is the video download data between two adjacent demarcation data in the sort sorting, the back one of the two adjacent demarcation data in the sort sorting is the demarcation data, the set analysis module sets the download set in which the target data are located as a central set, wherein the download set in which the target data are located in the central set is the reference set, the reference set is used for inquiring whether the target data set influencing the download data in the target data, and the target data is a target data, and the target data is deleted control module; and when the target data is the second data, directly deleting the target data.
Further, the set analysis module includes an influence threshold comparison module, a preferred data selection module, a reference index calculation module, a first index comparison module and an effective analysis module, where the influence threshold comparison module is configured to count the number of influence sets, if the number of influence sets is smaller than the influence threshold, the target data is the first data, otherwise, the preferred data selection module obtains a condition that each video download data is effectively triggered, and if a time interval duration between a latest effective triggering time of a certain video download data and a current time is smaller than or equal to a duration threshold, the video download data is the preferred data, where a certain video download data is opened by a user and viewed at a certain time and is effectively triggered at the time, the reference index calculation module calculates a reference index of the center set
Figure BDA0003853607290000021
Wherein m is the number of reference sets, c i The number of data downloaded for the video in the ith reference set,
Figure BDA0003853607290000022
F i is the ith parameterNumber of preferred data, H, for video download data in an exam i The first index calculation module calculates a first index P = u/v of the center set for the number of the video download data in the ith reference set, wherein u is the number of the preferred data in the center set, and v is the number of the video download data in the center set, the first index comparison module compares the first index of the center set with the reference index, if the first index of the center set is smaller than the reference index, the target data is the first data, and otherwise, the effective analysis module analyzes that the video of the reference set is effectively triggered.
Further, the effective analysis module includes a passive data judgment module, a passive index calculation module and an attention passive index calculation module, where the passive data judgment module refers to a preferred data in the set in a recent preset time period, when a certain time of effective triggering of a certain video download data in the influence set is after the certain video download data in the influence set is effectively triggered, the preferred data is the attention data, the attention data effectively triggered at the time is the passive data, the passive index calculation module calculates an influence factor of the certain attention data for the time of the passive index by taking the continuous number of the video download data in the influence set which are effectively triggered before the certain attention data is effectively triggered when the certain attention data is the passive data, and calculates the passive index of the certain attention data
Figure BDA0003853607290000023
Wherein e is the number of times that the concerned data is passive data in the latest preset time period, N is the number of times that the concerned data is effectively triggered in the latest preset time period, w is the average number of influence factors when the concerned data is passive data in the latest preset time period, and the concerned passive index calculation and comparison module calculates the concerned passive index of the preferred data of the reference set
Figure BDA0003853607290000024
Wherein S is the number of concerned data in the preferred data, T x Is the average value of the passive indexes of all the concerned data, R is the number of the preferred data, if the concerned passive index of the preferred data of the reference set is smaller than the passive indexAnd (4) threshold value, wherein the target data is the first data, and otherwise, the target data is the second data.
Further, the data processing system adopts a data processing method, and the data processing method comprises the following steps:
acquiring the video download data stored in the current computer as analysis data, if the storage space occupied by the analysis data is larger than the storage space threshold value,
if the time interval duration between the last triggered time of a certain video downloading data and the current time is greater than the time duration threshold value in the analysis data, the video downloading data is the target data,
and analyzing the state information of the target data and judging whether the target data needs to be deleted or not.
Further, the analyzing the state information of the target data includes:
acquiring download initiation time of video download data downloaded by a computer in history, sorting the video download data in sequence from front to back according to the download initiation time to obtain sort order,
in the sorting order, if the time interval between the download initiation times of two adjacent video download data is greater than the interval threshold, the one of the two video download data that is positioned at the front in the sorting order is the delimiting data,
dividing a plurality of downloading sets according to the positions of the demarcation data in the sorting order, wherein the video downloading data included in one downloading set is the video downloading data between two adjacent demarcation data in the sorting order and the demarcation data positioned at the back of the two adjacent demarcation data in the sorting order,
setting the download set where the target data is located as a center set, wherein the download set located in front of the center set in the sort order is a reference set, the download set located behind the center set in the sort order is an influence set,
analyzing the center set, the reference set and the influence set, judging the type of the target data,
if the target data is the first data, pushing inquiry information about whether the target data is deleted or not to a user;
and if the target data is the second data, directly deleting the target data.
Further, the analyzing the center set, the reference set, and the influence set includes:
if the number of impact sets is less than the impact threshold, then the target data is the first data,
otherwise, acquiring the condition that each video downloading data is effectively triggered, if the time interval between the latest effective triggering time of a certain video downloading data and the current time is less than or equal to the time threshold, then the video downloading data is the preferred data, wherein, the certain video downloading data is opened by the user for watching at a certain time, and the certain video downloading data is effectively triggered at the time,
calculating a reference index for a set of centers
Figure BDA0003853607290000041
Where m is the number of reference sets, C i The number of data downloaded for the video in the ith reference set,
Figure BDA0003853607290000042
F i downloading data for the video in the ith reference set as the preferred number of data, H i The number of data downloaded for the video in the ith reference set,
calculating a first index P = u/v of the center set, wherein u is the number of preferred data in the center set, and v is the number of video downloading data in the center set;
if the first index of the central set is less than the reference index, then the target data is the first data,
otherwise, analyzing the condition that the video of the reference set is effectively triggered.
Further, the analyzing the video of the reference set that is effectively triggered includes:
if a certain preferred data in the reference set is effectively triggered within the last preset time period after a certain video download data in the influence set is effectively triggered, the preferred data is the concerned data, the concerned data which is effectively triggered is the passive data,
if the continuous number of the video download data in the influence set before being effectively triggered when certain concerned data is passive data is the influence factor of the passive data at the time, calculating the passive index of the certain concerned data
Figure BDA0003853607290000043
Wherein e is the number of times that the concerned data is passive data in the latest preset time period, N is the number of times that the concerned data is effectively triggered in the latest preset time period, w is the average of the influence factors when the concerned data is passive data in the latest preset time period,
passive index of interest for preferred data of reference set
Figure BDA0003853607290000044
Wherein S is the number of concerned data in the preferred data, T x The average value of the passive indexes of all the concerned data is shown, and R is the number of the preferred data;
the target data is the first data if the interest passivity index of the preferred data of the reference set is less than the passivity threshold, otherwise the target data is the second data.
Further, the time when the certain video download data is triggered last time includes:
if the video download data is viewed by the user, the time when the video download data was last triggered is the time when the video download data was last viewed,
otherwise, the time when the video download data is triggered last time is the time when the video download data is downloaded.
Compared with the prior art, the invention has the following beneficial effects: the invention analyzes the video which is not watched for a long time, judges the probability condition that the follow-up user watches the video again, and directly deletes the video under the condition of low probability, thereby reducing the occupation of idle video data to the storage space of the computer, ensuring the normal operation of the computer and improving the operation efficiency of the computer.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention and not to limit the invention. In the drawings:
FIG. 1 is a block diagram of a computer data processing system based on big data analysis according to the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1, the present invention provides the following technical solutions: a computer data processing system based on big data analysis comprises a storage space analysis module, a target data selection module and a state information analysis module, wherein the storage space analysis module acquires video download data stored by a current computer as analysis data, if the storage space occupied by the analysis data is larger than a storage space threshold value, the target data selection module is enabled to analyze the analysis data, if the time interval duration between the latest triggered time of a certain video download data and the current time in the analysis data is larger than a duration threshold value, the video download data is the target data, and the state information analysis module analyzes the state information of the target data and judges whether the target data is deleted.
The state information analysis module comprises a sorting module, a demarcation data selection module, a set dividing module, a set analysis module and a deletion control module, wherein the sorting module acquires download initiating time of video download data downloaded by a computer history, the video download data are sequentially sorted from front to back according to the download initiating time to obtain classification sorting, the demarcation data selection module selects the demarcation data in the classification sorting, if the time interval between the download initiating times of two adjacent video download data in the classification sorting is greater than an interval threshold value, the first demarcation data of the two video download data in the classification sorting is the demarcation data, the set dividing module divides a plurality of download sets according to the positions of the demarcation data in the classification sorting, wherein the video download data in one download set is the video download data between two adjacent demarcation data in the sorting and the demarcation data behind the two adjacent demarcation data in the classification sorting, the set analysis module sets the download set of target data in the sorting as a central set, wherein the download set in the classification sorting is a reference set, the classification set in the front of the central set affects the central set to the download set, and judges whether the target data are the download set which affects the download set and delete control module, and whether the target data are the target data, and the target data are the type of the deletion module; and when the target data is the second data, directly deleting the target data.
The set analysis module comprises an influence threshold comparison module, a preferred data selection module, a reference index calculation module, a first index comparison module and an effective analysis module, wherein the influence threshold comparison module is used for counting the number of influence sets, if the number of the influence sets is smaller than the influence threshold, target data is first data, otherwise, the preferred data selection module obtains the condition that each piece of video downloading data is effectively triggered, if the time interval duration between the latest effective triggering time of a certain piece of video downloading data and the current time is smaller than or equal to a duration threshold, the video downloading data is preferred data, wherein the certain piece of video downloading data is opened by a user and viewed by the user, the video downloading data is effectively triggered at the time, and the reference index calculation module calculates the reference index of a central set
Figure BDA0003853607290000061
Where m is the number of reference sets, C i The number of data downloaded for the video in the ith reference set,
Figure BDA0003853607290000062
F i number of preferred data, H, for video download in ith reference set i The first index calculation module calculates a first index P = u/v of the center set for the number of the video download data in the ith reference set, wherein u is the number of the preferred data in the center set, and v is the number of the video download data in the center set, the first index comparison module compares the first index of the center set with the reference index, if the first index of the center set is smaller than the reference index, the target data is the first data, and otherwise, the effective analysis module analyzes that the video of the reference set is effectively triggered.
The effective analysis module comprises a passive data judgment module, a passive index calculation module and an attention passive index calculation module, wherein the passive data judgment module is used for judging whether a certain time of certain preferred data in a reference set is effectively triggered within a latest preset time period under the condition that certain video downloading data in an influence set is effectively triggered, the certain preferred data is the attention data, the attention data effectively triggered at the time is the passive data, the continuous number of the video downloading data in the influence set before the certain time of the attention data is effectively triggered is the influence factor of the time of the passive data, and the passive index of the certain attention data is calculated
Figure BDA0003853607290000063
Wherein e is the number of times that the concerned data is passive data in the latest preset time period, N is the number of times that the concerned data is effectively triggered in the latest preset time period, w is the average number of influence factors when the concerned data is passive data in the latest preset time period, and the concerned passive index calculation and comparison module calculates the concerned passive index of the preferred data of the reference set
Figure BDA0003853607290000064
Wherein S is the number of concerned data in the preferred data, T x And R is the number of the preferred data, if the attention passive index of the preferred data of the reference set is smaller than a passive threshold value, the target data is the first data, otherwise, the target data is the second data.
The data processing system adopts a data processing method, and the data processing method comprises the following steps:
acquiring the video download data stored in the current computer as analysis data, when the storage space occupied by the analysis data is larger than the storage space threshold value,
if the time interval between the latest triggered time of a certain video downloading data and the current time in the analysis data is greater than a time threshold, the video downloading data is the target data, wherein the latest triggered time of the certain video downloading data comprises: if the video download data is watched by the user, the latest triggered time of the video download data is the latest watched time of the video download data by the user, otherwise, the latest triggered time of the video download data is the downloading finish time of the video download data; if a certain video download data is not opened for viewing since being downloaded, or is opened for viewing long before, it is likely that the user does not need to use the video download data;
and analyzing the state information of the target data and judging whether the target data needs to be deleted or not.
The state information of the analysis target data includes:
acquiring download initiation time of video download data downloaded by computer, sorting the video download data in sequence from front to back according to the download initiation time to obtain sort order,
in the sorting order, if the time interval between the download initiation times of two adjacent video download data is greater than the interval threshold, the one of the two video download data that is located at the front in the sorting order is the delimiting data, such as the sorting order is: the method comprises the following steps that 1, 2, 3 and 4 videos are displayed, wherein the time interval between download initiating times of the 1 and 2 videos is smaller than an interval threshold, the time interval between download initiating times of the 2 and 3 videos is smaller than the interval threshold, the time interval between download initiating times of the 3 and 4 videos is larger than the interval threshold, and the 3 videos are demarcation data;
dividing a plurality of downloading sets according to the positions of the demarcation data in the sorting order, wherein the video downloading data included in one downloading set is the video downloading data between two adjacent demarcation data in the sorting order and the demarcation data positioned at the back of the two adjacent demarcation data in the sorting order,
setting a download set where the target data is located as a central set, where a download set located in front of the central set in the classification ordering is a reference set, and a download set located behind the central set in the classification ordering is an influence set, for example, the classification ordering is: video 1, video 2, video 3, video 4, video 5, video 6, video 7, video 8, video 9, the target data is video 6, the delimiting data is video 3, video 6 respectively,
video 1, video 2, video 3 are a download set, video 4, video 5, video 6 are a download set, video 7, video 8, video 9 are a download set,
then video 1, video 2, video 3 are reference sets, video 4, video 5, video 6 are center sets, video 7, video 8, video 9 are influence sets,
analyzing the center set, the reference set and the influence set, judging the type of the target data,
if the target data is the first data, the probability that the target data is used by the user in the later period is high, and therefore inquiry information about whether the target data is deleted is sent to the user;
and if the target data is the second data, the probability that the target data is used by the user in the later period is very small, and the target data is directly deleted.
The analyzing the center set, the reference set and the influence set comprises:
if the number of influence sets is less than the influence threshold, which indicates that the newly downloaded video download data is less, the user may remember the target data, and may possibly use the video download data, then the target data is the first data,
otherwise, acquiring the condition that each video downloading data is effectively triggered, if the time interval between the latest effective triggering time of a certain video downloading data and the current time is less than or equal to the time threshold, the video downloading data is the preferred data, wherein the video downloading data is effectively triggered when the certain video downloading data is opened and viewed by the user at a certain time,
calculating a reference index for a set of centers
Figure BDA0003853607290000081
Wherein m is the number of reference sets, C i The number of data downloaded for the video in the ith reference set,
Figure BDA0003853607290000082
F i number of preferred data, H, for video download in ith reference set i The number of data downloaded for the video in the ith reference set,
calculating a first index P = u/v of the center set, wherein u is the number of preferred data in the center set, and v is the number of video downloading data in the center set;
if the first index of the center set is smaller than the reference index, the user can watch the previously downloaded data even if new video download data are stored in the computer, and the first index of the center set is smaller than the reference index, the probability that the user watches the video download data of the center set at a later stage is higher, the target data is the first data, the possibility that the user watches the target data is judged according to the watching situation of the previously video download data, and if the user still often watches the previously video download data, the possibility that the user watches the target data is higher, so that the user needs to be asked whether to delete the data to prevent mistaken deletion; the more data in the reference set, the more the contribution of the analysis should be to his reference, andto pass through
Figure BDA0003853607290000083
As a weight, the rationality of the reference index is improved, so that the judgment accuracy is improved; in practice, a threshold value may also be set according to the reference index, and if the first index of the central set is smaller than the threshold value, the target data is the first data;
otherwise, analyzing the condition that the video of the reference set is effectively triggered; the analyzing the video of the reference set for effective triggering comprises:
if a certain preferred data in the reference set is effectively triggered for a certain time within the last preset time period after a certain video download data in the influence set is effectively triggered, the preferred data is the attention data, the attention data which is effectively triggered for the time is passive data, for example, video 1, video 2 and video 3 are reference sets, video 7, video 8 and video 9 are influence sets, if a certain time within the last preset time period is that video 2 is seen after video 7 and video 9 are seen, video 2 is the attention data, and video 2 is passive data, if no video in any influence set is seen before video 2 is seen within the last preset time period, video 2 is not passive data;
if the continuous number of the video download data in the influence set before being effectively triggered when certain concerned data is passive data is the influence factor of the passive data at the time, calculating the passive index of the certain concerned data
Figure BDA0003853607290000091
Wherein e is the number of times that the data of interest is passive data in the last preset time period, N is the total number of times that the data of interest is effectively triggered in the last preset time period, w is the average of the influence factors when the data of interest is passive data in the last preset time period, for example, the video 2 is watched 3 times in the last time period, where two times the video 2 is watched after the influence set is watched, e =2, N =3, and when e/N is smaller, it indicates that the number of times that a certain data of interest is actively watched is the sameThe comparison is more, the active viewing of the user is stronger, when w is larger, it indicates that the user views a lot of videos influencing the concentration and then views the videos of the reference set, and when w is smaller, it indicates that the user easily thinks that the user views the videos of the reference set, so that the passive index is smaller, the initiative of the user in viewing the videos of the reference set is stronger, and the viewing probability is higher;
passive index of interest for preferred data of reference set
Figure BDA0003853607290000092
Wherein S is the number of concerned data in the preferred data, T x The average value of the passive indexes of all the concerned data is shown, and R is the number of the preferred data;
Figure BDA0003853607290000093
the smaller the preferred data, the more the actively viewed video download data is, i.e., the preferred data is
Figure BDA0003853607290000094
The smaller the probability, the greater the probability that the user will actively watch the target data subsequently;
if the attention passivity index of the preferred data of the reference set is smaller than a passivity threshold, the smaller the attention passivity index is, the higher the probability that the user actively watches the previous video is, the target data is the first data, otherwise, the target data is the second data.
It is noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus.
Finally, it should be noted that: although the present invention has been described in detail with reference to the foregoing embodiments, it will be apparent to those skilled in the art that changes may be made in the embodiments and/or equivalents thereof without departing from the spirit and scope of the invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (9)

1. The utility model provides a computer data processing system based on big data analysis, characterized in that, processing system includes storage space analysis module, target data selection module and state information analysis module, storage space analysis module obtains the video download data of current computer storage and selects the module as the analysis data, if the storage space that the analysis data occupy is greater than the storage space threshold value, makes target data selection module carry out the analysis to the analysis data, if there is some video download data to be triggered last time in the analysis data and the time interval duration of current time is greater than the duration threshold value, then this video download data is the target data, state information analysis module analysis target data's state information judges whether to delete target data.
2. A computer data processing system based on big data analysis according to claim 1, characterized in that: the state information analysis module comprises a sorting module, a demarcation data selection module, a set dividing module, a set analysis module and a deletion control module, wherein the sorting module acquires download initiating time of video download data downloaded by a computer in history, the video download data are sequentially sorted from front to back according to the download initiating time to obtain classification sorting, the demarcation data selection module selects the demarcation data in the classification sorting, if the time interval between the download initiating times of two adjacent video download data in the classification sorting is greater than an interval threshold value, the front demarcation data of the two video download data in the classification sorting is the demarcation data, the set dividing module divides a plurality of download sets according to the positions of the demarcation data in the classification sorting, wherein the video download data in one download set is the video download data between two adjacent demarcation data in the sorting and the rear demarcation data of the two adjacent demarcation data in the classification sorting, the set analysis module sets the download set of the target data as a central set, wherein the download set in the classification sorting is a reference set, the classification sorting set is a reference set, and the classification sorting set is a target data analysis module judges whether the target download set affects the download set and the deletion control data, and the deletion control module pushes the target data to the target data; and when the target data is the second data, directly deleting the target data.
3. A computer data processing system based on big data analysis according to claim 2, characterized in that: the set analysis module comprises an influence threshold comparison module, a preferred data selection module, a reference index calculation module, a first index comparison module and an effective analysis module, wherein the influence threshold comparison module is used for counting the number of influence sets, if the number of the influence sets is smaller than the influence threshold, target data is first data, otherwise, the preferred data selection module acquires the condition that each piece of video downloading data is effectively triggered, if the time interval duration between the latest effective triggering time of a certain piece of video downloading data and the current time is smaller than or equal to a time duration threshold, the certain piece of video downloading data is preferred data, wherein the certain piece of video downloading data is opened by a user and observed and is effectively triggered at the time of the certain piece of video downloading data, and the reference index calculation module calculates the reference index of a central set
Figure FDA0003853607280000011
Where m is the number of reference sets, C i The number of data downloaded for the video in the ith reference set,
Figure FDA0003853607280000012
F i number of preferred data, H, for video download in ith reference set i The first index calculation module calculates a first index P = u/v of the center set for the number of the video downloading data in the ith reference set, wherein u is the number of the preferred data in the center set, v is the number of the video downloading data in the center set, the first index comparison module compares the first index of the center set with the reference index, if the first index of the center set is smaller than the reference index, the target data is the first data, and otherwise, the effective analysis module analyzes the condition that the video of the reference set is effectively triggered.
4. A computer data processing system based on big data analysis according to claim 3, characterized in that: the effective analysis module comprises a passive data judgment module, a passive index calculation module and an attention passive index calculation module, wherein the passive data judgment module is used for judging whether a certain preferred data in a reference set is effectively triggered in a latest preset time period or not when a certain video download data in an influence set is effectively triggered, the preferred data is the attention data, the attention data effectively triggered in the time is the passive data, the continuous number of the video download data in the influence set before the video download data are effectively triggered in the influence set before the certain attention data are effectively triggered when the certain attention data are the passive data is the influence factor of the time of the passive data, and the passive index calculation module is used for calculating the passive index of the certain attention data
Figure FDA0003853607280000021
Wherein e is the number of times that the concerned data is passive data in the latest preset time period, N is the number of times that the concerned data is effectively triggered in the latest preset time period, w is the average number of influence factors when the concerned data is passive data in the latest preset time period, and the concerned passive index calculation and comparison module calculates the concerned passive index of the preferred data of the reference set
Figure FDA0003853607280000022
Wherein S is the number of concerned data in the optimized data, T x And R is the number of the preferred data, if the concerned passive index of the preferred data of the reference set is less than a passive threshold value, the target data is the first data, otherwise, the target data is the second data.
5. A computer data processing system based on big data analysis according to claim 1, characterized in that: the data processing system adopts a data processing method, and the data processing method comprises the following steps:
acquiring the video download data stored in the current computer as analysis data, if the storage space occupied by the analysis data is larger than the storage space threshold value,
if the time interval between the last triggered time of certain video downloading data and the current time in the analysis data is longer than the time threshold, the video downloading data is the target data,
and analyzing the state information of the target data and judging whether the target data needs to be deleted or not.
6. The computer data processing system based on big data analysis according to claim 5, wherein: the state information of the analysis target data includes:
acquiring download initiation time of video download data downloaded by a computer in history, sorting the video download data in sequence from front to back according to the download initiation time to obtain sort order,
in the sorting order, if the time interval between the download initiation times of two adjacent video download data is greater than the interval threshold, the one of the two video download data that is positioned at the front in the sorting order is the delimiting data,
dividing a plurality of downloading sets according to the positions of the demarcation data in the sorting order, wherein the video downloading data included in one downloading set is the video downloading data between two adjacent demarcation data in the sorting order and the demarcation data positioned at the back of the two adjacent demarcation data in the sorting order,
setting the download set where the target data is located as a center set, wherein the download set located in front of the center set in the sort order is a reference set, the download set located behind the center set in the sort order is an influence set,
analyzing the center set, the reference set and the influence set, judging the type of the target data,
if the target data is the first data, pushing inquiry information about whether the target data is deleted or not to the user;
and if the target data is the second data, directly deleting the target data.
7. The computer data processing system based on big data analysis according to claim 6, wherein: the analyzing the center set, the reference set and the influence set comprises:
if the number of impact sets is less than the impact threshold, then the target data is the first data,
otherwise, acquiring the condition that each video downloading data is effectively triggered, if the time interval between the latest effective triggering time of a certain video downloading data and the current time is less than or equal to the time threshold, then the video downloading data is the preferred data, wherein, the certain video downloading data is opened by the user for watching at a certain time, and the certain video downloading data is effectively triggered at the time,
calculating a reference index for a set of centers
Figure FDA0003853607280000031
Where m is the number of reference sets, C i The number of data downloaded for the video in the ith reference set,
Figure FDA0003853607280000032
F i downloading data for the video in the ith reference set as the preferred number of data, H i For the video in the ith reference setThe number of the downloaded data is,
calculating a first index P = u/v of the center set, wherein u is the number of preferred data in the center set, and v is the number of video downloading data in the center set;
if the first index of the center set is less than the reference index, then the target data is the first data,
otherwise, analyzing the condition that the video of the reference set is effectively triggered.
8. A computer data processing system based on big data analysis according to claim 7, characterized in that: the analyzing the video of the reference set which is effectively triggered comprises the following steps:
if a certain preferred data in the reference set is effectively triggered within the last preset time period after a certain video download data in the influence set is effectively triggered, the preferred data is the concerned data, the concerned data which is effectively triggered is the passive data,
if the continuous number of the video download data in the influence set before being effectively triggered when certain concerned data is passive data is the influence factor of the passive data at the time, calculating the passive index of the certain concerned data
Figure FDA0003853607280000041
Wherein e is the number of times that the concerned data is passive data in the latest preset time period, N is the number of times that the concerned data is effectively triggered in the latest preset time period, w is the average number of influence factors when the concerned data is passive data in the latest preset time period,
passive index of interest for preferred data of reference set
Figure FDA0003853607280000042
Wherein S is the number of concerned data in the preferred data, T x The average value of the passive indexes of all the concerned data is shown, and R is the number of the preferred data;
the target data is the first data if the interest passivity index of the preferred data of the reference set is less than the passivity threshold, otherwise the target data is the second data.
9. A computer data processing system based on big data analysis according to claim 8, characterized in that: the time when the certain video downloading data is triggered last time comprises the following steps:
if the video download data is viewed by the user, the time when the video download data was last triggered is the time when the video download data was last viewed,
otherwise, the time when the video download data is triggered last time is the time when the video download data is downloaded.
CN202211141160.9A 2022-09-20 2022-09-20 Computer data processing system based on big data analysis Active CN115510272B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211141160.9A CN115510272B (en) 2022-09-20 2022-09-20 Computer data processing system based on big data analysis

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211141160.9A CN115510272B (en) 2022-09-20 2022-09-20 Computer data processing system based on big data analysis

Publications (2)

Publication Number Publication Date
CN115510272A true CN115510272A (en) 2022-12-23
CN115510272B CN115510272B (en) 2023-07-14

Family

ID=84503265

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211141160.9A Active CN115510272B (en) 2022-09-20 2022-09-20 Computer data processing system based on big data analysis

Country Status (1)

Country Link
CN (1) CN115510272B (en)

Citations (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060022048A1 (en) * 2000-06-07 2006-02-02 Johnson William J System and method for anonymous location based services
CN1737937A (en) * 2004-08-16 2006-02-22 上海乐金广电电子有限公司 Method for deleting recorded programme in digital video recorder
JP2007043240A (en) * 2005-07-29 2007-02-15 Toshiba Corp Video recording and reproducing apparatus
JP2011059191A (en) * 2009-09-07 2011-03-24 Brother Industries Ltd Karaoke apparatus
CN102395955A (en) * 2009-02-13 2012-03-28 美国能量变换公司 Data center control
US8311863B1 (en) * 2009-02-24 2012-11-13 Accenture Global Services Limited Utility high performance capability assessment
CN105487790A (en) * 2014-08-02 2016-04-13 苹果公司 Context-specific user interfaces
US20160165307A1 (en) * 2008-01-15 2016-06-09 British Broadcasting Corporation Accessing broadcast media
CN105867845A (en) * 2016-03-28 2016-08-17 乐视控股(北京)有限公司 Application storage space management method and device
CN106325770A (en) * 2016-08-23 2017-01-11 成都卡莱博尔信息技术股份有限公司 Method for processing long-time idle data
CN111432246A (en) * 2020-03-23 2020-07-17 广州市百果园信息技术有限公司 Method, device and storage medium for pushing video data
CN112433993A (en) * 2020-11-16 2021-03-02 连邦网络科技服务南通有限公司 Network data processing and analyzing system based on computer
US20210124521A1 (en) * 2019-10-29 2021-04-29 EMC IP Holding Company LLC Method, device, and program product for managing storage space in storage system
CN112925804A (en) * 2021-05-12 2021-06-08 北京优炫软件股份有限公司 Database maintenance method and device
CN114281899A (en) * 2021-12-28 2022-04-05 浙江汇鼎华链科技有限公司 User data distributed cloud storage method and system based on network big data

Patent Citations (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060022048A1 (en) * 2000-06-07 2006-02-02 Johnson William J System and method for anonymous location based services
CN1737937A (en) * 2004-08-16 2006-02-22 上海乐金广电电子有限公司 Method for deleting recorded programme in digital video recorder
JP2007043240A (en) * 2005-07-29 2007-02-15 Toshiba Corp Video recording and reproducing apparatus
US20160165307A1 (en) * 2008-01-15 2016-06-09 British Broadcasting Corporation Accessing broadcast media
CN102395955A (en) * 2009-02-13 2012-03-28 美国能量变换公司 Data center control
US8311863B1 (en) * 2009-02-24 2012-11-13 Accenture Global Services Limited Utility high performance capability assessment
JP2011059191A (en) * 2009-09-07 2011-03-24 Brother Industries Ltd Karaoke apparatus
CN105487790A (en) * 2014-08-02 2016-04-13 苹果公司 Context-specific user interfaces
CN105867845A (en) * 2016-03-28 2016-08-17 乐视控股(北京)有限公司 Application storage space management method and device
CN106325770A (en) * 2016-08-23 2017-01-11 成都卡莱博尔信息技术股份有限公司 Method for processing long-time idle data
US20210124521A1 (en) * 2019-10-29 2021-04-29 EMC IP Holding Company LLC Method, device, and program product for managing storage space in storage system
CN111432246A (en) * 2020-03-23 2020-07-17 广州市百果园信息技术有限公司 Method, device and storage medium for pushing video data
CN112433993A (en) * 2020-11-16 2021-03-02 连邦网络科技服务南通有限公司 Network data processing and analyzing system based on computer
CN112925804A (en) * 2021-05-12 2021-06-08 北京优炫软件股份有限公司 Database maintenance method and device
CN114281899A (en) * 2021-12-28 2022-04-05 浙江汇鼎华链科技有限公司 User data distributed cloud storage method and system based on network big data

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
HUDA S. GOIAN等: "Popularity-Based Video Caching Techniques for Cache-Enabled Networks: A Survey" *
郝春媚: "涉密检查系统中信息检索技术研究" *

Also Published As

Publication number Publication date
CN115510272B (en) 2023-07-14

Similar Documents

Publication Publication Date Title
CN109492560A (en) Facial image Feature fusion, device and storage medium based on time scale
CN111327966B (en) Video double-speed playing method, system and storage medium
CN112463859B (en) User data processing method and server based on big data and business analysis
CN111107319B (en) Target tracking method, device and system based on regional camera
CN106951446A (en) Financial Information method for pushing and device
CN111510368B (en) Family group identification method, device, equipment and computer readable storage medium
CN113779381A (en) Resource recommendation method and device, electronic equipment and storage medium
CN111783743A (en) Image clustering method and device
CN113645488A (en) Internet television video recommendation system and method based on user data
CN112199530A (en) Multi-dimensional face library picture automatic updating method, system, equipment and medium
CN101464883A (en) Contents-retrieving apparatus and method
CN114140696A (en) Commodity identification system optimization method, commodity identification system optimization device, commodity identification equipment and storage medium
CN115510272A (en) Computer data processing system based on big data analysis
CN111898559B (en) Method and device for improving face recognition speed and electronic equipment
CN108764369A (en) Character recognition method, device based on data fusion and computer storage media
CN112651366A (en) Method and device for processing number of people in passenger flow, electronic equipment and storage medium
CN113158953B (en) Personnel searching method, device, equipment and medium
CN111507424B (en) Data processing method and device
CN114417884A (en) Intelligent display device and method for information technology consultation service
CN114296775A (en) Intelligent operation and maintenance method and system based on big data
CN114373209A (en) Video-based face recognition method and device, electronic equipment and storage medium
CN112215386A (en) Personnel activity prediction method and device and computer readable storage medium
CN107169071B (en) Video searching method and device
CN114679490B (en) Information message content intelligent pushing system and method based on big data
CN115086772B (en) Video desensitization method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20230621

Address after: Room 803, No. 65, Xiatang West Road, Tongxin Community, Dengfeng Street, Yuexiu District, Guangzhou, Guangdong 510000

Applicant after: Guangzhou Jinhu Intelligent Technology Co.,Ltd.

Address before: No. 5-26-4, Floor 1-8, No. 177, Dongzhi Road, Daowai District, Harbin, Heilongjiang 150050

Applicant before: Harbin Mengdong Technology Co.,Ltd.

GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20230826

Address after: Room B113, Room 2803-2810, No. 140-148 Tiyu East Road, Tianhe District, Guangzhou City, Guangdong Province, 510000 (not intended for use as a factory building) (office only)

Patentee after: Guangzhou Chuangyan Information Technology Co.,Ltd.

Address before: Room 803, No. 65, Xiatang West Road, Tongxin Community, Dengfeng Street, Yuexiu District, Guangzhou, Guangdong 510000

Patentee before: Guangzhou Jinhu Intelligent Technology Co.,Ltd.