CN109669779A - For determining the cleaning path of data, the method and apparatus of cleaning data - Google Patents

For determining the cleaning path of data, the method and apparatus of cleaning data Download PDF

Info

Publication number
CN109669779A
CN109669779A CN201811587961.1A CN201811587961A CN109669779A CN 109669779 A CN109669779 A CN 109669779A CN 201811587961 A CN201811587961 A CN 201811587961A CN 109669779 A CN109669779 A CN 109669779A
Authority
CN
China
Prior art keywords
data
cleaning
routing information
client
application
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811587961.1A
Other languages
Chinese (zh)
Other versions
CN109669779B (en
Inventor
孔柏林
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Carbon Blue Network Technology Co Ltd
Original Assignee
Shanghai Carbon Blue Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Carbon Blue Network Technology Co Ltd filed Critical Shanghai Carbon Blue Network Technology Co Ltd
Priority to CN201811587961.1A priority Critical patent/CN109669779B/en
Publication of CN109669779A publication Critical patent/CN109669779A/en
Application granted granted Critical
Publication of CN109669779B publication Critical patent/CN109669779B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5011Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resources being hardware resources other than CPUs, Servers and Terminals
    • G06F9/5022Mechanisms to release resources
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The embodiment of the present application discloses the method and apparatus cleared up path, clear up data for determining data.For determining that a specific embodiment of the method in the cleaning path of data includes: to obtain user in first time period using installing in client using the first behavioral data generated;The First Eigenvalue is determined based on the first behavioral data, wherein the First Eigenvalue is used to characterize the service condition of application;The First Eigenvalue is input to preset data scrubbing model, obtains the first cleaning routing information, wherein data scrubbing model is for determining cleaning routing information;The first cleaning routing information is sent to client, so that client is based on the first cleaning routing information and carries out data scrubbing.The embodiment determines cleaning path based on the behavioral data of user, without scanning the application installed in client one by one, substantially reduces the time for determining cleaning path, helps to improve data scrubbing efficiency.

Description

For determining the cleaning path of data, the method and apparatus of cleaning data
Technical field
The invention relates to field of computer technology, and in particular to for determining cleaning path, the cleaning number of data According to method and apparatus.
Background technique
Function in the clients such as mobile phone is more and more, provides convenience for the life and work of people.People can be Miscellaneous application is installed, to support the realization of client different function in client.Installation applying on the client During operation, some ephemeral datas can be generated, and are stored under the specified path in the memory space of client.Nonce A large amount of memory space in client can be occupied according to long-term accumulation, causes client operation slow.Therefore, it is necessary to ephemeral data It is cleared up in time, to discharge the memory space of client.
Existing data scrubbing mode is usually that the data scrubbing installed in client application is utilized to scan client one by one The application of upper installation, with the store path and occupied space of the data that each application of determination generates.It is to be scanned to finish, it will each answer It is presented to the user with the occupied space of the data of generation.When the data that certain applications generate are cleared up in user's selection, meeting The data stored under the store path of the data of these application generations are cleared up.
Summary of the invention
The embodiment of the present application proposes the method and apparatus cleared up path, clear up data for determining data.
In a first aspect, some embodiments of the present application provide a kind of method for determining the cleaning path of data, answer For server-side, comprising: obtain user in first time period using installing in client using the first behavior generated Data;The First Eigenvalue is determined based on the first behavioral data, wherein the First Eigenvalue is used to characterize the service condition of application;It will The First Eigenvalue is input to preset data scrubbing model, obtains the first cleaning routing information, wherein data scrubbing model is used for Determine cleaning routing information;The first cleaning routing information is sent to client, so that client is based on the first cleaning routing information Carry out data scrubbing.
In some embodiments, this method further include: it is clear based on the first cleaning routing information progress data to receive client The data scrubbing information sent after reason;Cleaning ratio is determined based on data scrubbing information;If clearing up ratio is greater than pre-set ratio threshold Value optimizes data scrubbing model using the First Eigenvalue and the first cleaning routing information as training sample, is optimized Data scrubbing model afterwards.
In some embodiments, this method further include: if cleaning ratio is less than or equal to pre-set ratio threshold value, obtain user Using using the second behavioral data generated in second time period;First behavioral data and the second behavioral data are merged, And it is determined based on the behavioral data after merging and merges characteristic value;Characteristic value will be merged and be input to data scrubbing model, closed And clear up routing information;It is sent to client and merges cleaning routing information, so that client is based on merging cleaning routing information again Secondary carry out data scrubbing.
In some embodiments, the first behavior data packet includes the mark of application and uses duration.
In some embodiments, the First Eigenvalue includes at least one of the following: the cleaning road of the frequency of use of application, application Diameter, using generate data speed, using generate data occupied space.
In some embodiments, the first cleaning routing information includes the cleaning path of application, or the cleaning including applying Confidence level and cleaning path.
In some embodiments, training obtains data scrubbing model as follows: obtaining training sample, wherein instruction Practicing sample includes sample characteristics and sample cleaning routing information;Equation of linear regression is constructed by independent variable of sample characteristics, Wherein, the corresponding weight coefficient of each independent variable of equation of linear regression;It is returned by independent variable constitutive logic of linear equation Function;Logistic regression function is trained using sample cleaning routing information as output using sample characteristics as input, Obtain data scrubbing model.
In some embodiments, the numerical value of weight coefficient is solved by maximum Likelihood.
Second aspect, some embodiments of the present application provide a kind of method for clearing up data, are applied to client, Include: collection and report of user in first time period using being installed in client using the first behavioral data generated; In response to receiving the first cleaning routing information of server-side transmission determined based on the first behavioral data, to the first cleaning path The data stored under information corresponding first cleaning path are cleared up.
In some embodiments, under to the first cleaning path indicated by the first cleaning routing information the data that store into After row cleaning, further includes: generate data scrubbing information, and be sent to server-side.
The third aspect, some embodiments of the present application provide it is a kind of for determine data cleaning path device, if It is placed in server-side, comprising: acquiring unit is configured to obtain user in first time period using the application installed in client First behavioral data generated;Determination unit is configured to determine the First Eigenvalue based on the first behavioral data, wherein the One characteristic value is used to characterize the service condition of application;Input unit is configured to for the First Eigenvalue to be input to preset data Model is cleared up, the first cleaning routing information is obtained, wherein data scrubbing model is for determining cleaning routing information;Transmission unit, It is configured to send the first cleaning routing information to client, so that client is clear based on the first cleaning routing information progress data Reason.
Fourth aspect, some embodiments of the present application provide a kind of for clearing up the device of data, are set to client, Include: reporting unit, is configured to collect and report of user is given birth in first time period using the application installed in client At the first behavioral data;Unit is cleared up, is configured in response to receive the true based on the first behavioral data of server-side transmission The first fixed cleaning routing information carries out the data stored under the first cleaning routing information corresponding first cleaning path clear Reason.
5th aspect, some embodiments of the present application provide a kind of computer equipment, which includes: one Or multiple processors;Storage device stores one or more programs thereon;When one or more programs are handled by one or more Device executes, so that one or more processors realize method such as implementation any in first aspect described in or realization such as the The method of any implementation description in two aspects.
6th aspect, some embodiments of the present application provide a kind of computer-readable medium, are stored thereon with computer Program realizes method or realization as described in implementation any in first aspect when the computer program is executed by processor Such as the method described in implementation any in second aspect.
The application's is provided by the above embodiment for determining the method and apparatus of data cleared up path, clear up data, User is obtained first in first time period using installing in client using the first behavioral data generated;It is based on later First behavioral data determines the First Eigenvalue;Then the First Eigenvalue is input to preset data scrubbing model, to obtain One cleaning routing information;The first cleaning routing information finally is sent to client, so that client is based on the first cleaning path letter Breath carries out data scrubbing.Cleaning path is determined based on the behavioral data of user, substantially reduces the time for determining cleaning path, has Help improve data scrubbing efficiency.Meanwhile it also avoiding in the prior art during the application installed on scanning client User needs to undergo the generation for the case where waiting and interrupt scanning for a long time, data scrubbing is caused to fail.
Detailed description of the invention
By reading a detailed description of non-restrictive embodiments in the light of the attached drawings below, the application's is other Feature, objects and advantages will become more apparent upon:
Fig. 1 is that some embodiments of the application can be applied to exemplary system architecture figure therein;
Fig. 2 is the flow chart according to one embodiment of the method in the cleaning path for determining data of the application;
Fig. 3 is the flow chart according to another embodiment of the method in the cleaning path for determining data of the application;
Fig. 4 is the flow chart according to one embodiment of the method for clearing up data of the application;
Fig. 5 is adapted for the structural schematic diagram for the computer system for realizing the computer equipment of the embodiment of the present application.
Specific embodiment
The application is described in further detail with reference to the accompanying drawings and examples.It is understood that this place is retouched The specific embodiment stated is used only for explaining related invention, rather than the restriction to the invention.It also should be noted that in order to Convenient for description, part relevant to related invention is illustrated only in attached drawing.
It should be noted that in the absence of conflict, the features in the embodiments and the embodiments of the present application can phase Mutually combination.The application is described in detail below with reference to the accompanying drawings and in conjunction with the embodiments.
Fig. 1 is shown can be using the method cleared up routing information, clear up data for determining data of the application Exemplary system architecture 100.
As shown in Figure 1, system architecture 100 may include equipment 101,102 and network 103.Network 103 is in equipment 101, the medium of communication link is provided between 102.Network 103 may include various connection types, such as wired, wireless communication link Road or fiber optic cables etc..
Equipment 101,102 can be support network connection to provide the hardware device or software of various network services.When When equipment is hardware, it can be various electronic equipments, including but not limited to smart phone, tablet computer, portable meter on knee Calculation machine, desktop computer and server etc..At this moment, as hardware device, the distribution of multiple equipment composition may be implemented into Formula device cluster, also may be implemented into individual equipment.When equipment is software, may be mounted in above-mentioned cited equipment.This When, as software, it may be implemented into for example for providing the multiple softwares or software module of Distributed Services, also may be implemented At single software or software module.It is not specifically limited herein.
In practice, equipment can provide corresponding network by installing corresponding client application or server-side application Service.Equipment can be presented as client after being mounted with client application in network communications.Correspondingly, it is installing After server-side application, it can be presented as server-side in network communications.
As an example, equipment 101 is presented as client, and equipment 102 is presented as server-side in Fig. 1.Specifically, if Standby 101 can be the client for being equipped with data scrubbing application, and equipment 103 can be the background server of data scrubbing application. The available user of background server of data scrubbing application is given birth in first time period using the application installed in client At the first behavioral data;The First Eigenvalue is determined based on the first behavioral data;The First Eigenvalue is input to preset data Model is cleared up, the first cleaning routing information is obtained;The first cleaning routing information is sent to client, so that client is based on first It clears up routing information and carries out data scrubbing.
It should be noted that for determining that the method in the cleaning path of data can be by setting provided by the embodiment of the present application Standby 102 execute, and the method for clearing up data can be executed by equipment 101.
It should be understood that the number of network and equipment in Fig. 1 is only schematical.According to needs are realized, can have Any number of network and equipment.
With continued reference to Fig. 2, it illustrates a realities according to the method in the cleaning path for determining data of the application Apply the process 200 of example.The method for being used to determine the cleaning path of data is applied to server-side, comprising the following steps:
Step 201, user is obtained in first time period using installing in client using the first behavior generated Data.
In the present embodiment, for determining executing subject (such as the equipment shown in FIG. 1 of the method in the cleaning path of data 102) available user is using installing in client (such as equipment 101 shown in FIG. 1) using behavioral data generated. In some embodiments, in the case where user authorizes, above-mentioned executing subject can just obtain the first behavioral data.In general, when using When family needs to clear up the ephemeral data stored in client, it can use client to above-mentioned executing subject and send data Cleaning request.For example, installed in client when the user clicks data scrubbing application in data scrubbing button when, client can To send data scrubbing request to above-mentioned executing subject.If receiving the data scrubbing request of client transmission, it is believed that on It states executing subject and obtains user's authorization, can just obtain the first behavioral data at this time.Wherein, the row generated in first time period It can be used as the first behavioral data for data.Wherein, first time period can be preset any time section, for example, First time period can be time interval corresponding to the week before current time.
In general, various applications can be equipped in client, such as data scrubbing application, the application of shopping class, web page browsing Device application, searching class application, instant messaging tools, mailbox client, social platform software etc..Appoint for what is installed in client Anticipate one and apply, when user using this in application, the behavioral data of the application can be generated.Wherein, the behavioral data of the application It may include the mark of the application and the use duration of the application.Therefore, the first behavioral data may include user at first Between the mark of application that uses in section and use duration.
In some embodiments, client can be with real-time collecting behavioral data, and is uploaded to above-mentioned hold in real time or periodically Row main body.
In some embodiments, client can integrate SDK (Software Development Kit, software development work Tool packet) module.In this way, whenever client is collected into behavioral data, it can be with real-time report to SDK platform.SDK platform can lead to It crosses the RESTful API that reports to be provided the behavioral data that client reports is sent to data storing platform in real time and deposit Storage.Above-mentioned executing subject can be by obtaining behavioral data with the interface or message queue of data storing platform.Wherein, RESTful API refers to the API of REST (REpresentational State Transfer, the transfer of performance layer state) style (Application Programming Interface, application programming interface).
Step 202, the First Eigenvalue is determined based on the first behavioral data.
In the present embodiment, above-mentioned executing subject can analyze the first behavioral data, determine the First Eigenvalue.Its In, the First Eigenvalue can be used for characterizing the service condition of application, and including but not limited at least one of following: application uses frequency Rate, application cleaning path, using generate the speed of data, using the occupied space etc. for generating data.
In general, when the first behavioral data may include the mark for the application that user uses in first time period and use It is long.Here, each application used in first time period for user, above-mentioned executing subject can be based on the first behavioral data It counts the access times of the application and uses total duration, to calculate the frequency of use of the application.Above-mentioned executing subject may be used also To analyze in advance multiple applications, the cleaning path for determining multiple applications, generates accounting for for data at the speed for generating data With space, then stored.For each application that user uses in first time period, above-mentioned executing subject can will be from The cleaning paths of pre-stored multiple applications, the speed for generating data generate in the occupied spaces of data and determine the application Cleaning path, generate data speed, generate data occupied space.
Step 203, the First Eigenvalue is input to preset data scrubbing model, obtains the first cleaning routing information.
In the present embodiment, the First Eigenvalue can be input to preset data scrubbing model by above-mentioned executing subject, from And obtain the first cleaning routing information.In some embodiments, the first cleaning routing information can only include the cleaning path of application. At this point, the first cleaning routing information cleaning path for including be usually installed in client, user makes in first time period With the cleaning path of the higher application of frequency.In some embodiments, the first cleaning routing information can both include the clear of application Line of reasoning diameter, and the cleaning confidence level including corresponding application.At this point, the cleaning path that the first cleaning routing information includes is usually visitor The cleaning path of application installed on the end of family, that user uses in first time period.The cleaning confidence level of application, which can be, answers The probability being cleaned is needed with generated data.In general, the frequency of use of application is higher, it is faster to generate the speed of data, produce The occupied space of raw data is bigger, and the probability for needing to be cleaned is bigger, conversely, the probability being cleaned is with regard to smaller.
Here, data scrubbing model be determined for cleaning routing information, characteristic feature value and cleaning routing information it Between corresponding relationship.
In some embodiments, data scrubbing model can be those skilled in the art can be to a large amount of characteristic values and correspondence Cleaning routing information it is for statistical analysis, and obtain be stored with multiple characteristic values with it is corresponding cleaning routing information it is corresponding Relation table.In this way, above-mentioned executing subject can calculate the phase between the First Eigenvalue and each characteristic value in mapping table Like degree, and it is based on similarity result, the first cleaning routing information is inquired from the mapping table.For example, from corresponding relationship Cleaning routing information corresponding with the highest characteristic value of the First Eigenvalue similarity is inquired in table, as the first cleaning path letter Breath.
In some embodiments, data scrubbing model can be using various machine learning methods and training sample to existing Machine learning model (such as various neural networks) carry out Training obtained from.For example, data scrubbing model can be with Training obtains as follows:
Firstly, obtaining training sample.
Wherein, training sample may include sample characteristics and sample cleaning routing information.Sample characteristics may include But be not limited at least one of following: the frequency of use of sample application, the cleaning path of sample application, sample application generate data Speed, sample application generate occupied space of data etc..Sample cleaning routing information may include that the cleaning of sample application is set The cleaning path of reliability and sample application.
Later, equation of linear regression is constructed by independent variable of sample characteristics.
Wherein, each independent variable of equation of linear regression can correspond to a weight coefficient.As an example, work as sample When characteristic value only includes this characteristic value of frequency of use of sample application, the equation of linear regression of construction may is that z=b+ ax.Wherein, x is the independent variable of equation of linear regression, characterizes the frequency of use of application, and a is weight coefficient corresponding with x, and b is also Weight coefficient.As another example, when sample characteristics includes the frequency of use of sample application, the cleaning road of sample application Diameter, sample application generate the speed of data, sample application generates the n such as the occupied space of data (n is greater than 1 integer) a feature When value, the equation of linear regression of construction may is that z=β01×x12×x23×x3+...+βn×xn.Wherein, x1、x2、 x3…xnIt is the independent variable of equation of linear regression, characterizes n characteristic value, β1、β2、β3…βnIt is and x1、x2、x3…xnCorresponding power Weight coefficient, β0It also is weight coefficient.
Then, using linear equation as independent variable constitutive logic regression function.
Continue by taking an example in previous step as an example, logistic regression function may is thatContinue above one For another example in step, logistic regression function be may is thatFurther data Conversion can be write as:
Finally, using sample characteristics as input, using sample cleaning routing information as export, to logistic regression function into Row training, obtains data scrubbing model.
Here it is possible to using sample characteristics as the value of independent variable, using sample cleaning routing information as the value of dependent variable It brings logistic regression function into, finds out the numerical value of the weight coefficient in logical function, the numerical value of the weight coefficient found out is brought into and is patrolled Collecting regression function can be obtained data scrubbing model.In some embodiments, the numerical value of weight coefficient can be through maximum seemingly What right estimation method solved.Wherein, maximal possibility estimation (maximum likelihood estimation, MLE) is a kind of The important and universal method for seeking estimator.Maximum Likelihood clearly uses probabilistic model, and target is to find energy Enough phylogenetic trees that observation data are generated with high probability.Here, maximum Likelihood is exactly to make in training sample The probability that data occur is big as far as possible.
Step 204, to client send first cleaning routing information so that client be based on first cleaning routing information into Row data scrubbing.
In the present embodiment, the first cleaning routing information can be sent to client by above-mentioned executing subject, such as be passed through First cleaning routing information is sent to client by API.In this way, client can be counted according to the first cleaning routing information According to cleaning.In some embodiments, in the case that the first cleaning routing information only includes the cleaning path of application, client can It is cleared up with the data stored under the cleaning path to application included by the first cleaning routing information.In some embodiments In, it had not only included the cleaning path of application in the first cleaning routing information, but also in the case where the cleaning confidence level including applying, visitor Family end can select the application that cleaning confidence level is greater than default cleaning confidence threshold value (such as 60%), and to selected taking-up The data stored under the cleaning path of application are cleared up.
The method provided by the above embodiment for determining the cleaning path of data of the application, acquisition user is the first Using being installed in client using the first behavioral data generated in one period;It is determined later based on the first behavioral data The First Eigenvalue;Then the First Eigenvalue is input to preset data scrubbing model, to obtain the first cleaning routing information;Most The first cleaning routing information is sent to client afterwards, so that client is based on the first cleaning routing information and carries out data scrubbing.Base Cleaning path is determined in the behavioral data of user, is substantially reduced the time for determining cleaning path, is helped to improve data scrubbing Efficiency.Meanwhile also avoid in the prior art scan client on install application during user need to undergo long when Between waiting and interrupt scanning, the generation for the case where causing data scrubbing to fail.
With further reference to Fig. 3, it illustrates according to the another of the method in the cleaning path for determining data of the application The process 300 of a embodiment.The method for being used to determine the cleaning path of data is applied to server-side, comprising the following steps:
Step 301, user is obtained in first time period using installing in client using the first behavior generated Data.
Step 302, the First Eigenvalue is determined based on the first behavioral data.
Step 303, the First Eigenvalue is input to preset data scrubbing model, obtains the first cleaning routing information.
Step 304, to client send first cleaning routing information so that client be based on first cleaning routing information into Row data scrubbing.
In the present embodiment, the behaviour of the concrete operations of step 301-304 and step 201-204 in embodiment shown in Fig. 2 Make essentially identical, details are not described herein.
Step 305, it receives client and the data scrubbing sent after data scrubbing letter is carried out based on the first cleaning routing information Breath.
In the present embodiment, client (such as equipment 101 shown in FIG. 1) is based on the first cleaning routing information and carries out data It is clear data can be sent to the executing subject (such as equipment 102 shown in FIG. 1) for determining the cleaning path of data after cleaning Manage information.Wherein, data scrubbing information may include that the memory space released after data scrubbing and data scrubbing are spent Time.It should be noted that the first cleaning routing information is described in detail with reference to the embodiment shown in FIG. 2, it is no longer superfluous herein It states.
Step 306, cleaning ratio is determined based on data scrubbing information.
In the present embodiment, above-mentioned executing subject can determine cleaning ratio based on data scrubbing information.For example, above-mentioned hold Row main body can obtain the average speed that the client within the past period generates data first, and client is then generated number According to average speed and the product of first time period the occupied spaces of data is generated as client in first time period, finally The ratio that the memory space and client that release after data scrubbing generate the occupied space of data in first time period is made To clear up ratio.
Step 307, determine whether cleaning ratio is greater than pre-set ratio threshold value.
In the present embodiment, above-mentioned executing subject can compare cleaning ratio with preset ratio threshold value (such as 80%) Compared with if cleaning ratio illustrates to have reached cleaning effect, continues to execute step 308 greater than pre-set ratio threshold value;If clearing up ratio Less than or equal to pre-set ratio threshold value, then cleaning effect has not yet been reached in explanation, continues to execute step 309.
Step 308, data scrubbing model is carried out using the First Eigenvalue and the first cleaning routing information as training sample Optimization, the data scrubbing model after being optimized.
In the present embodiment, if cleaning ratio is greater than pre-set ratio threshold value, above-mentioned executing subject can be by the First Eigenvalue As input, data scrubbing model is optimized using the first cleaning routing information as input, to adjust data scrubbing model Weight coefficient numerical value, thus the data scrubbing model after being optimized.In general, the data scrubbing model after optimization is determined Cleaning routing information precision it is higher.It should be noted that the First Eigenvalue and data scrubbing model are referring to shown in Fig. 2 It is described in detail in embodiment, details are not described herein.
Step 309, user is obtained in second time period using using the second behavioral data generated.
In the present embodiment, if cleaning ratio is less than or equal to pre-set ratio threshold value, the above-mentioned available use of executing subject Family is in second time period using using the second behavioral data generated.Wherein, second time period is usually first time period A period later.Behavioral data generated can be used as the second behavioral data in second time period.It needs to illustrate , behavioral data is described in detail with reference to the embodiment shown in FIG. 2, and details are not described herein.
Step 310, the first behavioral data and the second behavioral data are merged, and is determined based on the behavioral data after merging Merge characteristic value.
In the present embodiment, above-mentioned executing subject can merge the first behavioral data and the second behavioral data, Yi Jiji Behavioral data after merging, which determines, merges characteristic value.Here, the operation for merging characteristic value and determining the First Eigenvalue are determined Operate essentially identical, details are not described herein.
Step 311, characteristic value will be merged and is input to data scrubbing model, obtain merging cleaning routing information.
In the present embodiment, merging characteristic value can be input to data scrubbing model by above-mentioned executing subject, to obtain Merge cleaning routing information.Here, it obtains merging the operation of cleaning routing information and obtains the operation of the first cleaning routing information Essentially identical, details are not described herein.
Step 312, it is sent to client and merges cleaning routing information, so that client is based on merging cleaning routing information again Secondary carry out data scrubbing.
In the present embodiment, merging cleaning routing information can be sent to client by above-mentioned executing subject.In this way, client End can carry out data scrubbing according to cleaning routing information is merged.Here, client is carried out according to cleaning routing information is merged The operation that the operation of data scrubbing and client carry out data scrubbing according to the first cleaning routing information is essentially identical, herein no longer It repeats.
It should be noted that can be returned after client carries out data scrubbing again based on merging cleaning routing information It returns and continues to execute step 305, to continue to optimize data scrubbing model, or client is made to carry out data scrubbing again. It loops back and forth like this, the precision that data scrubbing model determines cleaning routing information is continuously improved.
From figure 3, it can be seen that the method for pushed information compared with the corresponding embodiment of Fig. 2, in the present embodiment Process 300 increase step 305-312.The scheme of the present embodiment description is in the case where reaching cleaning effect as a result, by the One characteristic value and the first cleaning routing information continue to optimize as training sample to data scrubbing model, clear to improve data Manage the precision that model determines cleaning routing information.In the case where not up to cleaning effect, more behavioral data weights are collected It is new to determine cleaning routing information, so that client is carried out data scrubbing again.It loops back and forth like this, data scrubbing model is continuously improved Determine the precision of cleaning routing information.
With continued reference to Fig. 4, it illustrates the processes according to one embodiment of the method for clearing up data of the application 400.The method for being used to clear up data is applied to client, comprising the following steps:
Step 401, collect and report of user in first time period using installing in client using generated the One behavioral data.
In the present embodiment, it can be received for clearing up the executing subject (such as equipment 101 shown in FIG. 1) of the method for data Collect user in first time period using installing in client using the first behavioral data generated, and is reported.Its In, the first behavioral data can directly be reported to server-side (such as equipment 102 shown in FIG. 1), and it is flat can also to report to SDK Platform is sent to data storing platform by SDK platform and is stored.In this way, server-side can be by connecing with data storing platform Mouth or message queue obtain the first behavioral data.It should be noted that first time period and the first behavioral data are referring to Fig. 2 It is described in detail in illustrated embodiment, details are not described herein.
Step 402, in response to receiving believing based on the first cleaning path that the first behavioral data determines for server-side transmission Breath clears up the data stored in the first cleaning routing information corresponding first cleaning path.
In the present embodiment, the first cleaning routing information that server-side can be determined based on the first behavioral data, and send To above-mentioned executing subject.Above-mentioned executing subject, can be to the first cleaning path letter after receiving the first cleaning routing information The data stored in corresponding first cleaning path are ceased to be cleared up.In some embodiments, when first clears up routing information only In the case where cleaning path including application, client can be to the cleaning path of application included by the first cleaning routing information The data of lower storage are cleared up.In some embodiments, in the cleaning path that the first cleaning routing information had not only included application, but also In the case where cleaning confidence level including application, client can select cleaning confidence level and be greater than default cleaning confidence threshold value The application of (such as 60%), and the data stored under the cleaning path of the application of selected taking-up are cleared up.
In some embodiments, after carrying out data scrubbing, above-mentioned executing subject can also generate data scrubbing information, And it is sent to server-side.It should be noted that data scrubbing information is described in detail with reference to the embodiment shown in FIG. 2, herein It repeats no more.
The method provided by the above embodiment for clearing up data of the application collects first and report of user is at first Between in section using being installed in client using generated first behavioral data;Then receive server-side transmission based on In the case where the first cleaning routing information that first behavioral data determines, to the first cleaning routing information corresponding first cleaning road The data stored under diameter are cleared up.Cleaning path is determined based on the behavioral data of user by server-side, substantially reduces determination The time for clearing up path, help to improve data scrubbing efficiency.Meanwhile it also avoiding pacifying in scanning client in the prior art User needs to undergo the hair for the case where waiting and interrupt scanning for a long time, data scrubbing is caused to fail during the application of dress It is raw.
Below with reference to Fig. 5, it illustrates the computer equipments for being suitable for being used to realize the embodiment of the present application (such as shown in Fig. 1 Equipment 101 or equipment 102) computer system 500 structural schematic diagram.Computer equipment shown in Fig. 5 is only one A example, should not function to the embodiment of the present application and use scope bring any restrictions.
As shown in figure 5, computer system 500 includes central processing unit (CPU) 501, it can be read-only according to being stored in Program in memory (ROM) 502 or be loaded into the program in random access storage device (RAM) 503 from storage section 508 and Execute various movements appropriate and processing.In RAM 503, also it is stored with system 500 and operates required various programs and data. CPU 501, ROM 502 and RAM 503 are connected with each other by bus 504.Input/output (I/O) interface 505 is also connected to always Line 504.
I/O interface 505 is connected to lower component: the importation 506 including keyboard, mouse etc.;It is penetrated including such as cathode The output par, c 507 of spool (CRT), liquid crystal display (LCD) etc. and loudspeaker etc.;Storage section 508 including hard disk etc.; And the communications portion 509 of the network interface card including LAN card, modem etc..Communications portion 509 via such as because The network of spy's net executes communication process.Driver 510 is also connected to I/O interface 505 as needed.Detachable media 511, such as Disk, CD, magneto-optic disk, semiconductor memory etc. are mounted on as needed on driver 510, in order to read from thereon Computer program be mounted into storage section 508 as needed.
Particularly, in accordance with an embodiment of the present disclosure, it may be implemented as computer above with reference to the process of flow chart description Software program.For example, embodiment of the disclosure includes a kind of computer program product comprising be carried on computer-readable medium On computer program, which includes the program code for method shown in execution flow chart.In such reality It applies in example, which can be downloaded and installed from network by communications portion 509, and/or from detachable media 511 are mounted.When the computer program is executed by central processing unit (CPU) 501, limited in execution the present processes Above-mentioned function.It should be noted that computer-readable medium described herein can be computer-readable signal media or Computer-readable medium either the two any combination.Computer-readable medium for example can be --- but it is unlimited In system, device or the device of --- electricity, magnetic, optical, electromagnetic, infrared ray or semiconductor, or any above combination.It calculates The more specific example of machine readable medium can include but is not limited to: electrical connection, portable meter with one or more conducting wires Calculation machine disk, hard disk, random access storage device (RAM), read-only memory (ROM), erasable programmable read only memory (EPROM or flash memory), optical fiber, portable compact disc read-only memory (CD-ROM), light storage device, magnetic memory device or The above-mentioned any appropriate combination of person.In this application, computer-readable medium, which can be, any includes or storage program has Shape medium, the program can be commanded execution system, device or device use or in connection.And in the application In, computer-readable signal media may include in a base band or as carrier wave a part propagate data-signal, wherein Carry computer-readable program code.The data-signal of this propagation can take various forms, including but not limited to electric Magnetic signal, optical signal or above-mentioned any appropriate combination.Computer-readable signal media can also be computer-readable Jie Any computer-readable medium other than matter, the computer-readable medium can be sent, propagated or transmitted for being held by instruction Row system, device or device use or program in connection.The program code for including on computer-readable medium It can transmit with any suitable medium, including but not limited to: wireless, electric wire, optical cable, RF etc. or above-mentioned any conjunction Suitable combination.
The calculating of the operation for executing the application can be write with one or more programming languages or combinations thereof Machine program code, described program design language include object-oriented programming language-such as Java, Smalltalk, C+ +, further include conventional procedural programming language-such as " C " language or similar programming language.Program code can Fully to execute, partly execute on the user computer on the user computer, be executed as an independent software package, Part executes on the remote computer or executes on a remote computer or server completely on the user computer for part. In situations involving remote computers, remote computer can pass through the network of any kind --- including local area network (LAN) Or wide area network (WAN)-is connected to subscriber computer, or, it may be connected to outer computer (such as utilize Internet service Provider is connected by internet).
Flow chart and block diagram in attached drawing are illustrated according to the system of the various embodiments of the application, method and computer journey The architecture, function and operation in the cards of sequence product.In this regard, each box in flowchart or block diagram can generation A part of one module, program segment or code of table, a part of the module, program segment or code include one or more use The executable instruction of the logic function as defined in realizing.It should also be noted that in some implementations as replacements, being marked in box The function of note can also occur in a different order than that indicated in the drawings.For example, two boxes succeedingly indicated are actually It can be basically executed in parallel, they can also be executed in the opposite order sometimes, and this depends on the function involved.Also it to infuse Meaning, the combination of each box in block diagram and or flow chart and the box in block diagram and or flow chart can be with holding The dedicated hardware based system of functions or operations as defined in row is realized, or can use specialized hardware and computer instruction Combination realize.
Being described in unit involved in the embodiment of the present application can be realized by way of software, can also be by hard The mode of part is realized.Described unit also can be set in the processor, for example, can be described as: a kind of processor packet Include acquiring unit, determination unit, input unit and transmission unit.Wherein, the title of these units not structure under certain conditions The restriction of the pairs of unit itself, for example, acquiring unit is also described as " obtaining user in first time period using visitor The unit using the first behavioral data generated installed on the end of family ".In another example can be described as: a kind of processor includes Reporting unit and cleaning unit.Wherein, the title of these units does not constitute the restriction to the unit itself under certain conditions, For example, reporting unit is also described as " collecting and report of user being answered in first time period using what is installed in client With the unit of the first behavioral data generated ".
As on the other hand, present invention also provides a kind of computer-readable medium, which be can be Included in computer equipment described in above-described embodiment;It is also possible to individualism, and is set without the computer is incorporated In standby.Above-mentioned computer-readable medium carries one or more program, when said one or multiple programs are by the calculating When machine equipment executes, so that the computer equipment: obtaining user and apply institute using what is installed in client in first time period The first behavioral data generated;The First Eigenvalue is determined based on the first behavioral data, wherein the First Eigenvalue is for characterizing application Service condition;The First Eigenvalue is input to preset data scrubbing model, obtains the first cleaning routing information, wherein number According to cleaning model for determining cleaning routing information;The first cleaning routing information is sent to client, so that client is based on the One cleaning routing information carries out data scrubbing.Or make the computer equipment: collecting and report of user is in first time period Using being installed in client using the first behavioral data generated;In response to receive server-side transmission based on the first row For the first cleaning routing information that data determine, the data stored in path are cleared up to the first cleaning routing information corresponding first It is cleared up.
Above description is only the preferred embodiment of the application and the explanation to institute's application technology principle.Those skilled in the art Member is it should be appreciated that invention scope involved in the application, however it is not limited to technology made of the specific combination of above-mentioned technical characteristic Scheme, while should also cover in the case where not departing from foregoing invention design, it is carried out by above-mentioned technical characteristic or its equivalent feature Any combination and the other technical solutions formed.Such as features described above has similar function with (but being not limited to) disclosed herein Can technical characteristic replaced mutually and the technical solution that is formed.

Claims (12)

1. a kind of method for determining the cleaning path of data, is applied to server-side, comprising:
User is obtained in first time period using installing in client using the first behavioral data generated;
The First Eigenvalue is determined based on first behavioral data, wherein the First Eigenvalue is for characterizing the application Service condition;
The First Eigenvalue is input to preset data scrubbing model, obtains the first cleaning routing information, wherein the number According to cleaning model for determining cleaning routing information;
The first cleaning routing information is sent to the client, so that the client is based on first cleaning path letter Breath carries out data scrubbing.
2. according to the method described in claim 1, wherein, the method also includes:
It receives the client and the data scrubbing information sent after data scrubbing is carried out based on the first cleaning routing information;
Cleaning information determines cleaning ratio based on the data;
If the cleaning ratio be greater than pre-set ratio threshold value, using the First Eigenvalue and it is described first cleaning routing information as Training sample optimizes the data scrubbing model, the data scrubbing model after being optimized.
3. according to the method described in claim 2, wherein, the method also includes:
If the cleaning ratio is less than or equal to the pre-set ratio threshold value, obtains user and answered in second time period using described With the second behavioral data generated;
First behavioral data and second behavioral data are merged, and is determined and is merged based on the behavioral data after merging Characteristic value;
The merging characteristic value is input to the data scrubbing model, obtains merging cleaning routing information;
The merging is sent to the client and clears up routing information, so that the client is based on merging cleaning path letter Breath carries out data scrubbing again.
4. according to the method described in claim 1, wherein, when the first behavior data packet includes the mark and use of the application It is long.
5. according to the method described in claim 1, wherein, the First Eigenvalue includes at least one of the following: the application Frequency of use, the cleaning path of the application, the speed of application generation data, the occupancy of application generation data are empty Between.
6. according to the method described in claim 1, wherein, the first cleaning routing information includes the cleaning road of the application Diameter, or the cleaning confidence level including the application and cleaning path.
7. method described in one of -6 according to claim 1, wherein the data scrubbing model is trained as follows It arrives:
Obtain training sample, wherein the training sample includes sample characteristics and sample cleaning routing information;
Equation of linear regression is constructed by independent variable of the sample characteristics, wherein each of described equation of linear regression becomes certainly Measure a corresponding weight coefficient;
Using the linear equation as independent variable constitutive logic regression function;
Using the sample characteristics as input, using sample cleaning routing information as output, to the logistic regression letter Number is trained, and obtains the data scrubbing model.
8. according to the method described in claim 7, wherein, the numerical value of the weight coefficient is asked by maximum Likelihood It solves.
9. a kind of method for clearing up data is applied to client, comprising:
Collect and report of user in first time period using installing in the client using the first behavior number generated According to;
In response to receiving the first cleaning routing information of server-side transmission determined based on first behavioral data, to described The data stored under first cleaning routing information corresponding first cleaning path are cleared up.
10. according to the method described in claim 9, wherein, clearing up first indicated by routing information described described first After the data stored under cleaning path are cleared up, further includes:
Data scrubbing information is generated, and is sent to the server-side.
11. a kind of computer equipment, comprising:
One or more processors;
Storage device stores one or more programs thereon;
When one or more of programs are executed by one or more of processors, so that one or more of processors are real The now method such as method described in any one of claims 1-8 or realization as described in claim 9 or 10.
12. a kind of computer readable storage medium, is stored thereon with computer program, the computer program is executed by processor Method of the Shi Shixian such as method described in any one of claims 1-8 or realization as described in claim 9 or 10.
CN201811587961.1A 2018-12-25 2018-12-25 Method and device for determining cleaning path of data and cleaning data Active CN109669779B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811587961.1A CN109669779B (en) 2018-12-25 2018-12-25 Method and device for determining cleaning path of data and cleaning data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811587961.1A CN109669779B (en) 2018-12-25 2018-12-25 Method and device for determining cleaning path of data and cleaning data

Publications (2)

Publication Number Publication Date
CN109669779A true CN109669779A (en) 2019-04-23
CN109669779B CN109669779B (en) 2023-05-26

Family

ID=66146062

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811587961.1A Active CN109669779B (en) 2018-12-25 2018-12-25 Method and device for determining cleaning path of data and cleaning data

Country Status (1)

Country Link
CN (1) CN109669779B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112632051A (en) * 2020-12-25 2021-04-09 中国工商银行股份有限公司 Neural network-based database cleaning method and system

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103646086A (en) * 2013-12-13 2014-03-19 北京奇虎科技有限公司 Junk file cleaning method and device
CN105868367A (en) * 2016-03-30 2016-08-17 北京奇虎科技有限公司 Method, server and system for clearing away root paths in Android device
CN106201601A (en) * 2016-06-30 2016-12-07 北京奇虎科技有限公司 A kind of file clean-up method, electronic equipment and server
CN106202166A (en) * 2016-06-24 2016-12-07 北京奇虎科技有限公司 The method for cleaning of file, device and corresponding client
CN108337358A (en) * 2017-09-30 2018-07-27 广东欧珀移动通信有限公司 Using method for cleaning, device, storage medium and electronic equipment
CN108427737A (en) * 2018-02-28 2018-08-21 上海连尚网络科技有限公司 A kind of data clearing method, equipment and computer-readable medium
CN108932140A (en) * 2018-07-13 2018-12-04 重庆邮电大学 The method of cleaning background application based on Android user behavior habit

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103646086A (en) * 2013-12-13 2014-03-19 北京奇虎科技有限公司 Junk file cleaning method and device
CN105868367A (en) * 2016-03-30 2016-08-17 北京奇虎科技有限公司 Method, server and system for clearing away root paths in Android device
CN106202166A (en) * 2016-06-24 2016-12-07 北京奇虎科技有限公司 The method for cleaning of file, device and corresponding client
CN106201601A (en) * 2016-06-30 2016-12-07 北京奇虎科技有限公司 A kind of file clean-up method, electronic equipment and server
CN108337358A (en) * 2017-09-30 2018-07-27 广东欧珀移动通信有限公司 Using method for cleaning, device, storage medium and electronic equipment
CN108427737A (en) * 2018-02-28 2018-08-21 上海连尚网络科技有限公司 A kind of data clearing method, equipment and computer-readable medium
CN108932140A (en) * 2018-07-13 2018-12-04 重庆邮电大学 The method of cleaning background application based on Android user behavior habit

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112632051A (en) * 2020-12-25 2021-04-09 中国工商银行股份有限公司 Neural network-based database cleaning method and system

Also Published As

Publication number Publication date
CN109669779B (en) 2023-05-26

Similar Documents

Publication Publication Date Title
CN109460513A (en) Method and apparatus for generating clicking rate prediction model
CN109460514A (en) Method and apparatus for pushed information
CN109190114A (en) Method and apparatus for generating return information
CN109118224A (en) Proof of work method, apparatus, medium and the electronic equipment of block chain network
CN103425703B (en) A kind for the treatment of method and apparatus of the network information
CN109976997A (en) Test method and device
CN111131356B (en) Method and apparatus for generating information
CN110334036A (en) A kind of method and apparatus for realizing data cached scheduling
CN109460652A (en) For marking the method, equipment and computer-readable medium of image pattern
CN109548161A (en) A kind of method, apparatus and terminal device of wireless resource scheduling
CN109725988A (en) A kind of method for scheduling task and device
CN108737486A (en) Information-pushing method and device
CN110324232A (en) Efficient multimedia messaging providers method and system based on user of service's situation in instant messaging service
CN108600329A (en) For pushed information, the method and apparatus for showing information
CN109669779A (en) For determining the cleaning path of data, the method and apparatus of cleaning data
CN110135903A (en) Advertisement position ranks method, system, medium and electronic equipment
CN109492687A (en) Method and apparatus for handling information
CN109388684A (en) Method and apparatus for generating information
CN113283830B (en) Method, device, equipment and computer readable medium for generating waybill information sequence
CN110032283A (en) The method and apparatus that a kind of pair of associational word is ranked up
EP4105869A1 (en) Method and apparatus for outputting information
CN113361739B (en) Method and device for generating picking path
Elahi et al. Online context-aware task assignment in mobile crowdsourcing via adaptive discretization
CN111949860B (en) Method and apparatus for generating a relevance determination model
CN112749822A (en) Method and device for generating route

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant