WO2021259197A1 - File processing method and apparatus, storage medium, and terminal - Google Patents

File processing method and apparatus, storage medium, and terminal Download PDF

Info

Publication number
WO2021259197A1
WO2021259197A1 PCT/CN2021/101220 CN2021101220W WO2021259197A1 WO 2021259197 A1 WO2021259197 A1 WO 2021259197A1 CN 2021101220 W CN2021101220 W CN 2021101220W WO 2021259197 A1 WO2021259197 A1 WO 2021259197A1
Authority
WO
WIPO (PCT)
Prior art keywords
file
cleaning
terminal
cleaned
result
Prior art date
Application number
PCT/CN2021/101220
Other languages
French (fr)
Chinese (zh)
Inventor
李文娟
易明
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2021259197A1 publication Critical patent/WO2021259197A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/16File or folder operations, e.g. details of user interfaces specifically adapted to file systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/16File or folder operations, e.g. details of user interfaces specifically adapted to file systems
    • G06F16/162Delete operations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting

Definitions

  • the present disclosure relates to, but is not limited to, the field of communications.
  • the cleaning model in the terminal can only mechanically scan the files in the terminal and display it to the user in the form of a list without distinction. It cannot push the list of files to be cleaned in a targeted manner, resulting in the accuracy of the cleaning model. Very low, users can only manually select the content they need to clean up one by one, and the cleaning efficiency is low.
  • the present disclosure provides a file processing method, including: using a second cleaning model to analyze a file stored in a terminal to determine whether the file needs to be cleaned, wherein the second cleaning model is using training data pair
  • the training data is obtained by training the first cleaning model, and the training data includes: the first operation feature of the target object on the first cleaning result, where the first cleaning result is the first cleaning model to the terminal It is obtained by analyzing the stored files, the first cleaning result indicates the file to be cleaned, and the first operation feature is used to indicate the operation of the target object on the file of the specified type in the first cleaning result
  • files of the same type have the same file feature; in the case where it is determined that the file needs to be cleaned, a second cleaning result is displayed, where the second cleaning result indicates the file to be cleaned.
  • the present disclosure also provides a file processing device, including: an analysis module configured to analyze a file stored in a terminal using a second cleaning model to determine whether the file needs to be cleaned, wherein the second cleaning
  • the model is obtained by training a first cleaning model using training data.
  • the training data includes: the first operation feature of the target object on the first cleaning result, wherein the first cleaning result is obtained through the first cleaning result.
  • the model is obtained by analyzing the files stored in the terminal, the first cleaning result indicates the file to be cleaned, and the first operating feature is used to instruct the target object to check the first cleaning result.
  • Operating characteristics of files of a specified type, files of the same type have the same file characteristics; the display module is configured to display a second cleaning result when it is determined that the file needs to be cleaned, wherein the second cleaning result indicates The files to be cleaned up.
  • the present disclosure also provides a computer-readable storage medium in which a computer program is stored, wherein the computer program implements any of the methods described herein when executed by a processor.
  • the present disclosure also provides a terminal, including a memory and a processor, and a computer program is stored in the memory, and the processor is configured to run the computer program to execute any method described herein.
  • FIG. 1 is a block diagram of the hardware structure of a terminal that implements the file processing method of the present disclosure
  • Figure 2 is a flowchart of a file processing method according to the present disclosure
  • Fig. 3 is a structural block diagram of a file processing device according to the present disclosure.
  • Figure 4 is a schematic structural diagram of a terminal cleaning system according to the present disclosure.
  • FIG. 5 is a schematic diagram of recording user behavior according to the input module of the present disclosure.
  • Fig. 6 is a schematic diagram of analyzing and processing user behaviors according to the processing module of the present disclosure
  • FIG. 7 is a schematic diagram of outputting cleaning results according to the output module of the present disclosure.
  • FIG. 8 is a schematic flowchart of a file cleaning method according to the present disclosure.
  • FIG. 1 is a hardware structure block diagram of a terminal that implements the file processing method of the present disclosure.
  • the mobile terminal may include one or more (only one is shown in FIG. 1) processor 102 (the processor 102 may include, but is not limited to, a processing device such as a microprocessor MCU or a programmable logic device FPGA) And the memory 104 for storing data, wherein the above-mentioned mobile terminal may also include a transmission device 106 and an input/output device 108 for communication functions.
  • the structure shown in FIG. 1 is only for illustration, and does not limit the structure of the above-mentioned mobile terminal.
  • the mobile terminal may also include more or fewer components than shown in FIG. 1, or have a different configuration from that shown in FIG.
  • the memory 104 may be used to store computer programs, for example, software programs and modules of application software, such as the computer programs corresponding to the file processing method in the present disclosure.
  • the processor 102 executes various computer programs by running the computer programs stored in the memory 104. Functional application and data processing, that is, to achieve the above-mentioned methods.
  • the memory 104 may include a high-speed random access memory, and may also include a non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory.
  • the memory 104 may further include a memory remotely provided with respect to the processor 102, and these remote memories may be connected to the mobile terminal through a network. Examples of the aforementioned networks include, but are not limited to, the Internet, corporate intranets, local area networks, mobile communication networks, and combinations thereof.
  • the transmission device 106 is used to receive or send data via a network.
  • the above-mentioned specific examples of the network may include a wireless network provided by a communication provider of a mobile terminal.
  • the transmission device 106 includes a network adapter (Network Interface Controller, NIC for short), which can be connected to other network devices through a base station to communicate with the Internet.
  • the transmission device 106 may be a radio frequency (Radio Frequency, referred to as RF) module, which is used to communicate with the Internet in a wireless manner.
  • RF Radio Frequency
  • FIG. 2 is a flowchart of the method for processing files according to the present disclosure. As shown in FIG. 2, the method may include the following steps S202 and S204.
  • a second cleaning model is used to analyze a file stored in the terminal to determine whether the file needs to be cleaned.
  • the second cleaning model is obtained by training the first cleaning model using training data.
  • the training data includes: the first operation feature of the target object on the first cleaning result, where the first cleaning result is obtained by analyzing the files stored in the terminal through the first cleaning model, and the first cleaning result
  • the file to be cleaned is indicated, and the first operation characteristic is used to indicate the operation characteristic of the target object for the file of the specified type in the first clean-up result, and files of the same type have the same file characteristic.
  • step S204 in the case where it is determined that the file needs to be cleaned, a second cleanup result is displayed, where the second cleanup result indicates the file to be cleaned.
  • the cleaning model since the cleaning model is trained according to the operating characteristics of the cleaning result of the target object, the cleaning model is continuously optimized with the operating characteristics of the target object, so that the final cleaning result determined by the cleaning model is more in line with the behavior habits of the target object. Therefore, It can solve the problem of low accuracy of pushing the files to be cleaned in related technologies, and achieve the technical effect of improving the accuracy of pushing the files to be cleaned.
  • the cleanup model may be a cleanup application or cleanup code.
  • files of one type with the same file characteristics may be referred to as the same type of file or the same type of file.
  • the same file feature can be the same file name, belonging to the same application, belonging to the same type of application, belonging to the same webpage, belonging to the same type of webpage, and the content of the file containing the same object (for example, it contains characters). , For another example, including a specific person), belonging to the same contact object, etc.
  • the cleaning model may be trained through machine learning using training data.
  • the operating characteristics included in the training data may include: operating behavior and the file type to which the operating behavior points.
  • a terminal may be a terminal with a communication function, and the terminal may communicate with other devices through a network or a connection line or a connection interface.
  • the terminal in the present disclosure may include but is not limited to at least one of the following: mobile phones (such as Android phones, iOS phones, etc.), notebook computers, tablet computers, handheld computers, MID (Mobile Internet Devices), PAD, desktop computers , Smart TV, smart home equipment, etc.
  • the aforementioned networks may include, but are not limited to: wired networks, wireless networks, where the wired networks include: local area networks, metropolitan area networks, and wide area networks, and the wireless networks include: Bluetooth, WIFI, and other networks that implement wireless communication.
  • the wired networks include: local area networks, metropolitan area networks, and wide area networks
  • the wireless networks include: Bluetooth, WIFI, and other networks that implement wireless communication.
  • the method before using the second cleaning model to analyze the files stored in the terminal, the method further includes: using training data to train the first cleaning model to obtain the second cleaning model, wherein, the training of the first cleaning model using the training data includes: training the first cleaning model when the first operation feature indicates that the probability of the first type of file being retained is higher than the probability of the second type of file being retained.
  • the model preferentially determines the second type of file as the file to be cleaned; and/or, in the case that the first operating characteristic indicates that the probability of the third type of file being cleaned is higher than the probability of the fourth type of file being cleaned, train the The first cleaning model preferentially determines the file of the third type as the file to be cleaned.
  • the first operation feature is obtained according to the target object's operation behavior on the first cleaning result and the file pointed to by the operation behavior.
  • the training data further includes: a second operating feature of the target object on the file stored in the terminal, wherein the second operating feature is used to indicate that the target object has the target object on the file stored in the terminal.
  • the operating characteristics of the stored files of the specified type are used to indicate that the target object has the target object on the file stored in the terminal.
  • the method before using the second cleaning model to analyze the files stored in the terminal, the method further includes: using training data to train the first cleaning model to obtain the second cleaning model, wherein, the training of the first cleaning model by using the training data includes: training the first cleaning model in the case that the second operating feature indicates that the fifth type of file is accessed more frequently than the sixth type of file is accessed. The model prioritizes this sixth type of file as the file to be cleaned up.
  • the second operating feature is obtained according to the target object's operating behavior on the file stored in the terminal and the file pointed to by the operating behavior.
  • the method further includes: using a third cleaning model to analyze a file stored in the terminal to determine whether the file needs to be cleaned, wherein the first cleaning model
  • the third cleaning model is obtained by training the second cleaning model using training data.
  • the training data includes: a third operation feature of the target object on the second cleaning result, where the third operation feature is used to indicate the The target object's operating characteristics of the file of the specified type in the second cleaning result; in the case where it is determined that the file needs to be cleaned, the third cleaning result is displayed, where the third cleaning result indicates the file to be cleaned.
  • the training of the cleaning model can be iterative. For example, each time the cleaning model outputs the cleaning result and receives the operation of the target object on the cleaning result, it can be based on this time (or as of the current time).
  • the operation feature of the target object retrains the cleaning model, where the operation feature may be the operation feature of the cleaning result and/or the operation feature of the file stored in the terminal.
  • the method further includes: using the second cleaning model to analyze a file stored in the terminal to determine whether the file in the second cleaning result needs to be sent to other than the terminal.
  • External storage device in the case where it is determined that the file needs to be sent to a storage device other than the terminal, the analysis result is displayed, where the analysis result indicates the file to be sent.
  • the method before using the second cleaning model to analyze the files stored in the terminal, the method further includes: using training data to train the first cleaning model to obtain the second cleaning model, wherein, the training of the first cleaning model using the training data includes: before the first operating feature and the second operating feature indicate that the seventh type of file is to be cleaned, the situation is sent to a storage device other than the terminal Next, train the first cleaning model to preferentially determine the seventh type of file as the file to be sent.
  • the method before using the second cleaning model to analyze the files stored in the terminal, the method further includes: receiving a start signal through the terminal, wherein the start signal is used to instruct to start the second cleaning model. 2. Clean up the model.
  • the order of the file to be cleaned is determined according to the probability of the file to be cleaned being cleaned.
  • the method further includes: receiving a target operation of the target object on the second cleaning result; and executing a corresponding operation on the second cleaning result according to the target operation operate.
  • the method according to the above embodiment can be implemented by means of software plus the necessary general hardware platform, of course, it can also be implemented by hardware, but in many cases the former is Better implementation.
  • the technical solution of the present disclosure essentially or the part that contributes to the existing technology can be embodied in the form of a software product, and the computer software product is stored in a storage medium (such as ROM/RAM, magnetic disk, The optical disc) includes several instructions to make a terminal device (which can be a mobile phone, a computer, a server, or a network device, etc.) execute the methods described in the various embodiments of the present disclosure.
  • the present disclosure also provides a file processing device, which is used to implement any of the above-mentioned methods, and those that have been explained will not be repeated.
  • the term "module” can implement a combination of software and/or hardware with predetermined functions.
  • the devices described in the following embodiments are preferably implemented by software, implementation by hardware or a combination of software and hardware is also possible and conceived.
  • Fig. 3 is a structural block diagram of a file processing device according to the present disclosure.
  • the device includes: a first analysis module 31 configured to use a second cleaning model to analyze files stored in the terminal to determine the Whether the file needs to be cleaned, wherein the second cleanup model is obtained by training the first cleanup model using training data, and the training data includes: the first operation feature of the target object on the first cleanup result, where the first cleanup model A cleaning result is obtained by analyzing the file stored in the terminal through the first cleaning model, the first cleaning result indicates the file to be cleaned, and the first operating feature is used to indicate that the target object has The operating characteristics of files of a specified type in the cleaning result, and files of the same type have the same file characteristics; the first display module 33 is configured to display the second cleaning result when it is determined that the file needs to be cleaned. The second cleaning result indicates the file to be cleaned.
  • the cleaning model since the cleaning model is trained according to the operating characteristics of the cleaning result of the target object, the cleaning model is continuously optimized with the operating characteristics of the target object, so that the final cleaning result determined by the cleaning model is more in line with the behavior habits of the target object. Therefore, It can solve the problem of low accuracy of pushing the files to be cleaned in related technologies, and achieve the technical effect of improving the accuracy of pushing the files to be cleaned.
  • the device further includes: a training module configured to use the training data to train the first cleaning model before using the second cleaning model to analyze the files stored in the terminal to obtain the The second cleaning model, wherein the training the first cleaning model using the training data includes: in the case that the first operating feature indicates that the probability of the first type of file being retained is higher than the probability of the second type of file being retained, Training the first cleaning model to prioritize determining the second type of file as a file to be cleaned; and/or, where the first operating feature indicates that the probability of the third type of file being cleaned is higher than the probability of the fourth type of file being cleaned In this case, the first cleaning model is trained to first determine the third type of files as files to be cleaned.
  • the first operation feature is obtained according to the target object's operation behavior on the first cleaning result and the file pointed to by the operation behavior.
  • the training data further includes: a second operating feature of the target object on the file stored in the terminal, wherein the second operating feature is used to indicate that the target object has the target object on the file stored in the terminal.
  • the operating characteristics of the stored files of the specified type are used to indicate that the target object has the target object on the file stored in the terminal.
  • the training module is further configured to use training data to train the first cleaning model to obtain the second cleaning model before analyzing the files stored in the terminal using the second cleaning model.
  • Model wherein the training the first cleaning model using the training data includes: training the first cleaning model when the second operating feature indicates that the fifth type of file is accessed more frequently than the sixth type of file is accessed.
  • a cleaning model prioritizes the sixth type of file as a file to be cleaned.
  • the second operating feature is obtained according to the target object's operating behavior on the file stored in the terminal and the file pointed to by the operating behavior.
  • the device further includes: a second analysis module configured to use the third cleaning model to analyze the file stored in the terminal to determine whether the file is required Is cleaned up, where the third clean-up model is obtained by training the second clean-up model using training data, and the training data includes: the third operation feature of the target object on the second clean-up result, wherein the first The third operating feature is used to indicate the operating feature of the target object on the file of the specified type in the second cleaning result; the second display module is configured to display the third cleaning result when it is determined that the file needs to be cleaned, where: The third cleaning result indicates the file to be cleaned.
  • a second analysis module configured to use the third cleaning model to analyze the file stored in the terminal to determine whether the file is required Is cleaned up, where the third clean-up model is obtained by training the second clean-up model using training data, and the training data includes: the third operation feature of the target object on the second clean-up result, wherein the first The third operating feature is used to indicate the operating feature of the target object on the file of the specified type in the
  • the training of the cleaning model can be iterative. For example, each time the cleaning model outputs the cleaning result and receives the operation of the target object on the cleaning result, it can be based on this time (or as of the current time).
  • the operation feature of the target object retrains the cleaning model, where the operation feature may be the operation feature of the cleaning result and/or the operation feature of the file stored in the terminal.
  • the device further includes: a third analysis module configured to use the second cleaning model to analyze the file stored in the terminal to determine whether the file in the second cleaning result is required Is sent to a storage device other than the terminal; the third display module is configured to display the analysis result when it is determined that the file needs to be sent to a storage device other than the terminal, where the analysis result indicates The file to be sent.
  • a third analysis module configured to use the second cleaning model to analyze the file stored in the terminal to determine whether the file in the second cleaning result is required Is sent to a storage device other than the terminal
  • the third display module is configured to display the analysis result when it is determined that the file needs to be sent to a storage device other than the terminal, where the analysis result indicates The file to be sent.
  • the training module is further configured to use training data to train the first cleaning model to obtain the second cleaning model before analyzing the files stored in the terminal using the second cleaning model.
  • a model, wherein the training of the first cleaning model using the training data includes: sending to a storage device other than the terminal before the first operating feature and the second operating feature indicate that the seventh type of file is to be cleaned
  • the first cleaning model is trained to preferentially determine the seventh type of file as the file to be sent.
  • the device further includes: a receiving module configured to receive a start signal before analyzing the file stored in the terminal using the second cleaning model, wherein the start signal is used to indicate the start The second cleaning model.
  • the order of the file to be cleaned is determined according to the probability of the file to be cleaned being cleaned.
  • the device further includes: an operation module configured to receive a target operation of the target object on the second cleaning result; and the second cleaning result according to the target operation Perform the corresponding operation on the cleanup result.
  • each of the above-mentioned modules can be implemented by software or hardware.
  • it can be implemented in the following way, but not limited to this: the above-mentioned modules are all located in the same processor; or, the above-mentioned modules are in any combination The forms are located in different processors.
  • the method provided in this embodiment can be executed in a smart mobile terminal or a personal computer. It should be pointed out that the execution of the method in this embodiment requires the read and write permissions of the terminal, that is, the system permissions within the security range, so that the user's operation information can be obtained, similar to the permissions of a mobile phone housekeeper.
  • an original file cleaning model is first placed in the terminal.
  • This model can be a cleaning mechanism model.
  • the default judgment parameters of the model include, but are not limited to: identifying images cached by the application, and images edited and modified by the user. Pictures with high browsing frequency, pictures in chat groups, pictures downloaded and saved by users, etc., can also identify various other file types, such as video, audio, and files.
  • This model will repeatedly collect and process the user's usage habits, so as to achieve long-term training, extract the characteristic points of the user's usage habits, and generate a new cleaning model after a stage.
  • the terminal When the cleanup is triggered, the terminal outputs a cleanup list for the user to choose according to the new cleanup model, where the cleanup list includes a list of content to be cleaned determined by the cleanup model.
  • the data generated by the user in the process of using the cleaning model can be used as input information in the embodiment of the present disclosure.
  • the embodiments of the present disclosure cannot accurately determine which data the user wants to delete and which data to retain at one time. Instead, it is more and more close to the user through repeated training.
  • the real intention is the purpose, so as to achieve the purpose of self-learning and intelligence.
  • the user can perform direct operations on the output results (that is, the cleanup list output by the terminal) (for example, check, uncheck, retain, one-click cleanup, delete one or more pieces of data, etc.), and the terminal records the user's operations on the output results and The content pointed to by the operation, for example, record the content information that is directly cleaned up (that is, the information of the data stored in the terminal), and also record the content information that the user checks or keeps, as part of the input data for the next retraining , And the method basis of retraining.
  • the terminal may record all the user's operations on the output result and the content pointed to by all operations; it may also record part of the user's operation on the output result and the content pointed to by the part of the operation.
  • Part of the operations that need to be recorded by the terminal may be preset, for example, may be operations related to the retention or deletion of content information, such as deletion operations and retention operations.
  • the input data for the next retraining consists of two parts: one part is the generation of new user data between the last and the next output, for example, after the last output is generated until the current The new user data generated in the time period before the output result is generated; the other part is the data that has not been deleted by the user in the last output result.
  • this part of the content can be determined to be that the user is not satisfied with the training result, or the training result is not the result that the user wants, so the next training is required to make The training model is more mature to achieve the goal of approaching the user's expected result next time.
  • the cleanup model provided in the present disclosure can provide the user to package the content that needs to be retained after outputting the result, so that the user can copy the content that needs to be retained in other storage devices but needs to be deleted locally to other storage devices. After the locale, you can safely delete these contents locally.
  • Fig. 4 is a schematic structural diagram of the terminal cleaning system according to the present disclosure.
  • the framework for implementing the method of the present disclosure may include: an input module, a processing module, an output module, and a user operation recording module.
  • the input module may record user behavior, and the user's operation and use behavior of the terminal is used as the initial input data of the cleaning model.
  • the input module has a dedicated big data storage space, which is configured to save or record the user's usage of each terminal application and the usage of each contact within a period of time. Exemplary, including but not limited to these content: the time period of using a certain application, the way of use, the generated cache file, the storage path of the file, the deletion of the file, and so on.
  • the processing module is configured to extract content that is frequently accessed by the user after training and learning the cleaning model for a period of time, such as contacts that frequently chat, web pages that are frequently browsed, and Watch videos, frequently listened to music apps, frequently used camera apps, etc., as well as different types of files generated after these corresponding apps and content are used, the path where the files exist, the size of the files, and so on.
  • the commonality of these file operations is extracted, for example: the downloaded video can be deleted after watching it, and the music file that is often listened to is not deleted.
  • the cache of frequently browsed web pages can be deleted, the cached videos of Moments can be deleted, the chat videos or pictures with whom are not deleted, the photos and videos of children, family, and friends can not be deleted, and the cached results of entertainment applications can be deleted, etc.
  • the processing module needs to perform processing on these files. Refine it and finally show it to users. For example, the user regularly packs and saves the children’s photos, videos, or travel photos, selfies, etc., before cleaning up the content on the terminal. Then, when the cleaning of this type of content is triggered, the content or the content needs to be cleaned up. The list of is displayed to the user, and allows the user to have an actionable plan.
  • a new cleaning model is generated after training, and the cleaning model calculates and presents the terminal user's operating data through a certain cleaning algorithm.
  • the output module is configured to provide an interface on the output terminal to display the output result of the training algorithm, that is, to provide a user cleanup list, and the cleanup list can be displayed according to a certain rule. For example, it can be displayed according to the user's possible cleaning priority: files that the user is likely to delete are the first priority, which can be ranked higher; the files that may be deleted are the second priority, and the ranking is slightly lower. And so on. It can also be displayed in accordance with the convergence accuracy level of the algorithm: the file that the algorithm estimates the most accurately can be safely deleted as the first priority, which can be ranked higher; the second is accurate, and the file that may require a little judgment from the user is the second Priority, sorted slightly later, and so on.
  • the display mode is not limited here, and all similar solutions fall within the protection scope of the present disclosure.
  • the output module may also display to the user the files extracted by the algorithm, which can be saved or packaged by the user, and provided for the user to operate.
  • the user can perform various operations on the cleanup list displayed on the output terminal, and the user operation recording module is configured to record user operations. If the user can clean up the displayed results with confidence, one-click cleaning can be performed; if the output result of the cleaning list displayed on the output terminal may need to be judged, after confirming, one or more items of content can be cleaned up or canceled.
  • the content that the user cancels to clean up loops to the input module, and enters the next round of training and learning as part of the input data.
  • the marking of this part of the content is particularly important, because it can be used as a parameter of the factor of the next revision of the algorithm, making the algorithm closer and closer to the user's intention.
  • the result of deleting the chat picture obtained by the algorithm is not the user's intention, so in the next algorithm, this part of the content may not appear in the content deleted by the user.
  • the list provided by the cleanup model is the content that the cleanup model thinks can be cleaned up. If the user cancels the cleanup, the algorithm determines that this part of the content should be deleted, but the user thinks it should be kept, so this part of the content will not appear in the user deleted content afterwards .
  • the content directly deleted by the user may be deleted once or kept for a period of time, so as to prevent the user from deleting it by mistake and wanting to retrieve it.
  • the algorithm shows that it is likely to be the file that the user wants to save, it is necessary to provide a channel for the user to package and save these files to other storage places.
  • the specific implementation method is not specifically limited here. For example, it can be checked, unchecked, combined and packaged, one-click packaged, saved to, sent to, etc.
  • the present disclosure provides a training-based terminal space cleaning system, including an input module, a processing module, and an output module.
  • Fig. 5 is a schematic diagram of the input module recording user behavior according to the present disclosure.
  • the input module is configured to record user behavior.
  • the user operates in the foreground and different databases in the background store various types of files generated during use.
  • the fields that need to be used include but are not limited to the fields in Figure 5, which can be specified according to actual needs.
  • the tables in the database are used as initial values. With the user's use, the content in each table is constantly updated, and there are records of different users' operations for subsequent training algorithms.
  • Fig. 6 is a schematic diagram of analyzing and processing user behaviors by a processing module according to the present disclosure.
  • the processing module is configured to analyze and process user behaviors.
  • the processing module uses the database to store all the user's processing behaviors of the background files, including the time of being deleted, saved, forwarded, and operation, etc., through these operations, according to the built-in cleaning algorithm, the user's usage habits and some
  • the tendentious behavior is to generate a new database.
  • the files in the old database are marked and classified, and finally a database file that can be basically used for subsequent output is formed.
  • FIG. 7 is a schematic diagram of the output module outputting the cleaning result according to the present disclosure.
  • the output module is configured to output the cleaning result.
  • the cleanup action is triggered, the cleanup result obtained by the training algorithm is displayed on the terminal user interface.
  • the cleanup list can be displayed according to certain rules. For example: it can be displayed according to the cleanup priority that the user may perform; it can also be displayed according to the convergence accuracy level of the algorithm, etc., without specific restrictions.
  • the user can delete part of the content, copy part of the content, and deselect part of the content when operating on the displayed interface.
  • the reserved content back-end database is specially marked as part of the input data for the next training.
  • FIG. 8 is a schematic flowchart of a file cleaning method according to the present disclosure. As shown in FIG. 8, the method may include steps S1 to S9.
  • step S1 the original cleaning algorithm is prefabricated.
  • step S2 the user operates the terminal.
  • step S3 the user's operation information on the file is obtained and stored.
  • step S4 commonality is extracted according to the user's operation over a period of time.
  • step S5 the pre-made original algorithm is used for algorithm training according to the extracted commonality to form a new algorithm, and the algorithm will be continuously revised according to the user's continuous operation in order to achieve the real goal of the user;
  • step S6 the trained algorithm has a result that can be output at any time.
  • the result of the algorithm is displayed to the user. It's just that this result may change over time, but it's all for better satisfying user needs.
  • step S7 the user deletes the given cleaning result, which means that the user approves the deletion of this part of the content, that is, the algorithm converges better for the part of the data.
  • step S8 the user cancels the deletion of the given cleaning result, which means that the user does not approve the deletion of this part of the content, that is to say, the part of the data algorithm needs to be continuously revised. Therefore, it enters the next cycle and continues to perform algorithm training.
  • step S9 the user agrees to the copy data provided, the user directly copies, and the packaging operation is performed in the background, and then the user deletes it.
  • the precondition is that the terminal has system permissions and a suitable cleaning algorithm is built-in, which can be the current mainstream algorithm.
  • the present disclosure is based on the original algorithm and is based on user habits.
  • the algorithm is continuously trained to reach an intelligent algorithm that is infinitely close to the real intention of the user.
  • check, uncheck, delete, copy, etc. operations are performed according to the displayed cleanup list, and the retained files continue to be trained for the next time and are used as algorithm corrections. factor.
  • the content user interface of the algorithm is not visible.
  • the user through the continuous training and learning of the algorithm, the user’s worries that users have not dared to clean up many pictures, videos, and files have been gradually solved to a certain extent; at the same time, it is convenient for the user to automatically integrate the required files for the user, and Clean up after copying or sharing to other places, users will not worry about things being lost, and a lot of space can be freed up.
  • the present disclosure also provides a computer-readable storage medium in which a computer program is stored, wherein the computer program is configured to execute any method described herein when running.
  • the foregoing computer-readable storage medium may include, but is not limited to: U disk, Read-Only Memory (Read-Only Memory, ROM for short), Random Access Memory (Random Access Memory, RAM for short) , Mobile hard drives, magnetic disks or optical discs and other media that can store computer programs.
  • U disk Read-Only Memory
  • ROM Read-Only Memory
  • RAM Random Access Memory
  • Mobile hard drives magnetic disks or optical discs and other media that can store computer programs.
  • the present disclosure also provides a terminal, including a memory and a processor, and a computer program is stored in the memory, and the processor is configured to run the computer program to execute any method described herein.
  • the aforementioned electronic device may further include a transmission device and an input-output device, wherein the transmission device is connected to the aforementioned processor, and the input-output device is connected to the aforementioned processor.
  • modules or steps of the present disclosure can be implemented by a general computing device, and they can be concentrated on a single computing device or distributed in a network composed of multiple computing devices. Above, they can be implemented with program codes executable by a computing device, so that they can be stored in a storage device for execution by the computing device, and in some cases, they can be executed in a different order than shown here. Or the described steps, or fabricate them into individual integrated circuit modules respectively, or fabricate multiple modules or steps of them into a single integrated circuit module to achieve. In this way, the present disclosure is not limited to any specific combination of hardware and software.

Abstract

The present application provides a file processing method and apparatus, a storage medium, and a terminal. The method comprises: using a second cleanup model to analyze a file stored in a terminal to determine whether the file needs to be cleaned up, wherein the second cleanup model is obtained by training a first cleanup model using training data, the training data comprises a first operation feature by a target object for a first cleanup result, the first cleanup result is obtained by analyzing, by means of the first cleanup model, the file stored in the terminal, the first cleanup result indicates a file to be cleaned up, the first operation feature is used for indicating the feature of an operation performed by the target object for a file of a specified type in the first cleanup result, and files of the same type have the same file feature; and if it is determined that the file needs to be cleaned up, displaying a second cleanup result, wherein the second cleanup result indicates the file to be cleaned up.

Description

文件的处理方法及装置、存储介质、终端File processing method and device, storage medium and terminal
相关申请的交叉引用Cross-references to related applications
本申请要求2020年6月22日提交给中国专利局的第202010575231.0号专利申请的优先权,其全部内容通过引用合并于此。This application claims the priority of the patent application No. 202010575231.0 submitted to the Chinese Patent Office on June 22, 2020, the entire content of which is incorporated herein by reference.
技术领域Technical field
本公开涉及但不限于通信领域。The present disclosure relates to, but is not limited to, the field of communications.
背景技术Background technique
相关技术中,终端中的清理模型只能够实现机械性的将终端中的文件扫描之后,以清单的形式无差别展示给用户,无法针对性的推送待清理文件的清单,导致清理模型的精确度很低,用户只能手动一条一条选择自己需要清理的内容,清理效率较低。In related technologies, the cleaning model in the terminal can only mechanically scan the files in the terminal and display it to the user in the form of a list without distinction. It cannot push the list of files to be cleaned in a targeted manner, resulting in the accuracy of the cleaning model. Very low, users can only manually select the content they need to clean up one by one, and the cleaning efficiency is low.
发明内容Summary of the invention
本公开提供了一种文件的处理方法,包括:使用第二清理模型对终端中所存储的文件进行分析,确定所述文件是否需要被清理,其中,所述第二清理模型为使用训练数据对第一清理模型进行训练所得到的,所述训练数据包括:目标对象对第一清理结果的第一操作特征,其中,所述第一清理结果是通过所述第一清理模型对所述终端中所存储的文件进行分析所得到的,所述第一清理结果指示了待清理的文件,所述第一操作特征用于指示所述目标对象对所述第一清理结果中指定类型的文件的操作特征,同一类型的文件具有相同的文件特征;在确定所述文件需要被清理的情况下,展示第二清理结果,其中,所述第二清理结果指示了待清理的所述文件。The present disclosure provides a file processing method, including: using a second cleaning model to analyze a file stored in a terminal to determine whether the file needs to be cleaned, wherein the second cleaning model is using training data pair The training data is obtained by training the first cleaning model, and the training data includes: the first operation feature of the target object on the first cleaning result, where the first cleaning result is the first cleaning model to the terminal It is obtained by analyzing the stored files, the first cleaning result indicates the file to be cleaned, and the first operation feature is used to indicate the operation of the target object on the file of the specified type in the first cleaning result Features: files of the same type have the same file feature; in the case where it is determined that the file needs to be cleaned, a second cleaning result is displayed, where the second cleaning result indicates the file to be cleaned.
本公开还提供了一种文件的处理装置,包括:分析模块,配置为使用第二清理模型对终端中所存储的文件进行分析,确定所述文件是否需要被清理,其中,所述第二清理模型为使用训练数据对第一清理模型进行训练所得到的,所述训练数据包括:目标对象对第一清理 结果的第一操作特征,其中,所述第一清理结果是通过所述第一清理模型对所述终端中所存储的文件进行分析所得到的,所述第一清理结果指示了待清理的文件,所述第一操作特征用于指示所述目标对象对所述第一清理结果中指定类型的文件的操作特征,同一类型的文件具有相同的文件特征;展示模块,配置为在确定所述文件需要被清理的情况下,展示第二清理结果,其中,所述第二清理结果指示了待清理的所述文件。The present disclosure also provides a file processing device, including: an analysis module configured to analyze a file stored in a terminal using a second cleaning model to determine whether the file needs to be cleaned, wherein the second cleaning The model is obtained by training a first cleaning model using training data. The training data includes: the first operation feature of the target object on the first cleaning result, wherein the first cleaning result is obtained through the first cleaning result. The model is obtained by analyzing the files stored in the terminal, the first cleaning result indicates the file to be cleaned, and the first operating feature is used to instruct the target object to check the first cleaning result. Operating characteristics of files of a specified type, files of the same type have the same file characteristics; the display module is configured to display a second cleaning result when it is determined that the file needs to be cleaned, wherein the second cleaning result indicates The files to be cleaned up.
本公开还提供了一种计算机可读存储介质,所述计算机可读存储介质中存储有计算机程序,其中,所述计算机程序在被处理器执行时实现本文所述任一项方法。The present disclosure also provides a computer-readable storage medium in which a computer program is stored, wherein the computer program implements any of the methods described herein when executed by a processor.
本公开还提供了一种终端,包括存储器和处理器,所述存储器中存储有计算机程序,所述处理器被设置为运行所述计算机程序以执行本文所述任一方法。The present disclosure also provides a terminal, including a memory and a processor, and a computer program is stored in the memory, and the processor is configured to run the computer program to execute any method described herein.
附图说明Description of the drawings
图1是实现本公开的文件的处理方法的终端的硬件结构框图;FIG. 1 is a block diagram of the hardware structure of a terminal that implements the file processing method of the present disclosure;
图2是根据本公开的文件的处理方法的流程图;Figure 2 is a flowchart of a file processing method according to the present disclosure;
图3是根据本公开的文件的处理装置的结构框图;Fig. 3 is a structural block diagram of a file processing device according to the present disclosure;
图4是根据本公开的终端清理系统的结构示意图;Figure 4 is a schematic structural diagram of a terminal cleaning system according to the present disclosure;
图5是根据本公开的输入模块记录用户行为的示意图;FIG. 5 is a schematic diagram of recording user behavior according to the input module of the present disclosure;
图6是根据本公开的处理模块对用户行为进行分析处理的示意图;Fig. 6 is a schematic diagram of analyzing and processing user behaviors according to the processing module of the present disclosure;
图7是根据本公开的输出模块输出清理结果的示意图;FIG. 7 is a schematic diagram of outputting cleaning results according to the output module of the present disclosure;
图8是根据本公开的文件清理方法的流程示意图。FIG. 8 is a schematic flowchart of a file cleaning method according to the present disclosure.
具体实施方式detailed description
下文中将参考附图并结合实施方式来详细说明本公开的实施方式。Hereinafter, the embodiments of the present disclosure will be described in detail with reference to the drawings and in combination with the embodiments.
需要说明的是,本公开的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定 的顺序或先后次序。It should be noted that the terms "first" and "second" in the specification and claims of the present disclosure and the above-mentioned drawings are used to distinguish similar objects, and are not necessarily used to describe a specific sequence or sequence.
本公开中所提供的方法可以在移动终端、计算机终端或者类似的运算装置中执行。以运行在移动终端上为例,图1是实现本公开的文件的处理方法的终端的硬件结构框图。如图1所示,移动终端可以包括一个或多个(图1中仅示出一个)处理器102(处理器102可以包括但不限于微处理器MCU或可编程逻辑器件FPGA等的处理装置)和用于存储数据的存储器104,其中,上述移动终端还可以包括用于通信功能的传输设备106以及输入输出设备108。本领域普通技术人员可以理解,图1所示的结构仅为示意,其并不对上述移动终端的结构造成限定。例如,移动终端还可包括比图1中所示更多或者更少的组件,或者具有与图1所示不同的配置。The method provided in the present disclosure can be executed in a mobile terminal, a computer terminal or a similar computing device. Taking running on a mobile terminal as an example, FIG. 1 is a hardware structure block diagram of a terminal that implements the file processing method of the present disclosure. As shown in FIG. 1, the mobile terminal may include one or more (only one is shown in FIG. 1) processor 102 (the processor 102 may include, but is not limited to, a processing device such as a microprocessor MCU or a programmable logic device FPGA) And the memory 104 for storing data, wherein the above-mentioned mobile terminal may also include a transmission device 106 and an input/output device 108 for communication functions. Those of ordinary skill in the art can understand that the structure shown in FIG. 1 is only for illustration, and does not limit the structure of the above-mentioned mobile terminal. For example, the mobile terminal may also include more or fewer components than shown in FIG. 1, or have a different configuration from that shown in FIG.
存储器104可用于存储计算机程序,例如,应用软件的软件程序以及模块,如本公开中的文件的处理方法对应的计算机程序,处理器102通过运行存储在存储器104内的计算机程序,从而执行各种功能应用以及数据处理,即实现上述的方法。存储器104可包括高速随机存储器,还可包括非易失性存储器,如一个或者多个磁性存储装置、闪存、或者其他非易失性固态存储器。在一些实例中,存储器104可进一步包括相对于处理器102远程设置的存储器,这些远程存储器可以通过网络连接至移动终端。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory 104 may be used to store computer programs, for example, software programs and modules of application software, such as the computer programs corresponding to the file processing method in the present disclosure. The processor 102 executes various computer programs by running the computer programs stored in the memory 104. Functional application and data processing, that is, to achieve the above-mentioned methods. The memory 104 may include a high-speed random access memory, and may also include a non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some examples, the memory 104 may further include a memory remotely provided with respect to the processor 102, and these remote memories may be connected to the mobile terminal through a network. Examples of the aforementioned networks include, but are not limited to, the Internet, corporate intranets, local area networks, mobile communication networks, and combinations thereof.
传输装置106用于经由一个网络接收或者发送数据。上述的网络具体实例可包括移动终端的通信供应商提供的无线网络。在一个实例中,传输装置106包括一个网络适配器(Network Interface Controller,简称为NIC),其可通过基站与其他网络设备相连从而可与互联网进行通讯。在一个实例中,传输装置106可以为射频(Radio Frequency,简称为RF)模块,其用于通过无线方式与互联网进行通讯。The transmission device 106 is used to receive or send data via a network. The above-mentioned specific examples of the network may include a wireless network provided by a communication provider of a mobile terminal. In one example, the transmission device 106 includes a network adapter (Network Interface Controller, NIC for short), which can be connected to other network devices through a base station to communicate with the Internet. In an example, the transmission device 106 may be a radio frequency (Radio Frequency, referred to as RF) module, which is used to communicate with the Internet in a wireless manner.
在本公开中提供了一种运行于上述终端的文件的处理方法,图2是根据本公开的文件的处理方法的流程图,如图2所示,该方法可以包括如下步骤S202和S204。The present disclosure provides a method for processing files running on the above-mentioned terminal. FIG. 2 is a flowchart of the method for processing files according to the present disclosure. As shown in FIG. 2, the method may include the following steps S202 and S204.
在步骤S202,使用第二清理模型对终端中所存储的文件进行分析,确定该文件是否需要被清理,其中,该第二清理模型为使用训练数据对第一清理模型进行训练所得到的,该训练数据包括:目标对象对第一清理结果的第一操作特征,其中,该第一清理结果是通过该第一清理模型对该终端中所存储的文件进行分析所得到的,该第一清理结果指示了待清理的文件,该第一操作特征用于指示该目标对象对该第一清理结果中指定类型的文件的操作特征,同一类型的文件具有相同的文件特征。In step S202, a second cleaning model is used to analyze a file stored in the terminal to determine whether the file needs to be cleaned. The second cleaning model is obtained by training the first cleaning model using training data. The training data includes: the first operation feature of the target object on the first cleaning result, where the first cleaning result is obtained by analyzing the files stored in the terminal through the first cleaning model, and the first cleaning result The file to be cleaned is indicated, and the first operation characteristic is used to indicate the operation characteristic of the target object for the file of the specified type in the first clean-up result, and files of the same type have the same file characteristic.
在步骤S204,在确定该文件需要被清理的情况下,展示第二清理结果,其中,该第二清理结果指示了待清理的该文件。In step S204, in the case where it is determined that the file needs to be cleaned, a second cleanup result is displayed, where the second cleanup result indicates the file to be cleaned.
通过上述步骤,由于根据目标对象对清理结果的操作特征对清理模型进行训练,用目标对象的操作特征不断优化清理模型,使得清理模型最终确定的清理结果更贴合目标对象的行为习惯,因此,可以解决相关技术中待清理文件推送的精确度较低的问题,达到提高待清理文件的推送的精确度的技术效果。Through the above steps, since the cleaning model is trained according to the operating characteristics of the cleaning result of the target object, the cleaning model is continuously optimized with the operating characteristics of the target object, so that the final cleaning result determined by the cleaning model is more in line with the behavior habits of the target object. Therefore, It can solve the problem of low accuracy of pushing the files to be cleaned in related technologies, and achieve the technical effect of improving the accuracy of pushing the files to be cleaned.
在一个示例性的实施方式中,清理模型可以是一种清理应用程序,或者清理代码。In an exemplary embodiment, the cleanup model may be a cleanup application or cleanup code.
在一个示例性的实施方式中,具有相同的文件特征的一类文件的可以被称为同一种类型文件或者同一种文件类型。其中,相同的文件特征可以是相同的文件名称、归属于相同的应用、归属于相同类型的应用、归属于相同的网页、归属于相同类型的网页、文件内容中包含相同的对象(例如包含人物,又如,包含特定人物)、归属于相同的联系对象等。In an exemplary embodiment, files of one type with the same file characteristics may be referred to as the same type of file or the same type of file. Among them, the same file feature can be the same file name, belonging to the same application, belonging to the same type of application, belonging to the same webpage, belonging to the same type of webpage, and the content of the file containing the same object (for example, it contains characters). , For another example, including a specific person), belonging to the same contact object, etc.
在一个示例性的实施方式中,清理模型可以是使用训练数据通过机器学习训练出的。训练数据所包括的操作特征可以包括:操作行为,以及该操作行为所指向的文件类型。In an exemplary embodiment, the cleaning model may be trained through machine learning using training data. The operating characteristics included in the training data may include: operating behavior and the file type to which the operating behavior points.
在本公开中,终端(也称为终端设备)可以是具有通信功能的终端,终端可以通过网络或者连接线、连接接口与其他设备进行通信连接。本公开中的终端可以包括但不限于以下至少之一:手机(如Android手机、iOS手机等)、笔记本电脑、平板电脑、掌上电脑、 MID(Mobile Internet Devices,移动互联网设备)、PAD、台式电脑、智能电视、智能家居设备等。上述网络可以包括但不限于:有线网络,无线网络,其中,该有线网络包括:局域网、城域网和广域网,该无线网络包括:蓝牙、WIFI及其他实现无线通信的网络。上述仅是一种示例,本公开中对此不作任何限定。In the present disclosure, a terminal (also referred to as a terminal device) may be a terminal with a communication function, and the terminal may communicate with other devices through a network or a connection line or a connection interface. The terminal in the present disclosure may include but is not limited to at least one of the following: mobile phones (such as Android phones, iOS phones, etc.), notebook computers, tablet computers, handheld computers, MID (Mobile Internet Devices), PAD, desktop computers , Smart TV, smart home equipment, etc. The aforementioned networks may include, but are not limited to: wired networks, wireless networks, where the wired networks include: local area networks, metropolitan area networks, and wide area networks, and the wireless networks include: Bluetooth, WIFI, and other networks that implement wireless communication. The above is only an example, and this disclosure does not make any limitation on this.
在一个示例性的实施方式中,在该使用第二清理模型对终端中所存储的文件进行分析之前,该方法还包括:使用训练数据对第一清理模型进行训练,得到该第二清理模型,其中,该使用训练数据对第一清理模型进行训练包括:在该第一操作特征指示了第一类型文件被保留的几率高于第二类型文件被保留的几率的情况下,训练该第一清理模型将该第二类型文件优先确定为待清理文件;和/或,在该第一操作特征指示了第三类型文件被清理的几率高于第四类型文件被清理的几率的情况下,训练该第一清理模型将该第三类型文件优先确定为待清理文件。In an exemplary embodiment, before using the second cleaning model to analyze the files stored in the terminal, the method further includes: using training data to train the first cleaning model to obtain the second cleaning model, Wherein, the training of the first cleaning model using the training data includes: training the first cleaning model when the first operation feature indicates that the probability of the first type of file being retained is higher than the probability of the second type of file being retained. The model preferentially determines the second type of file as the file to be cleaned; and/or, in the case that the first operating characteristic indicates that the probability of the third type of file being cleaned is higher than the probability of the fourth type of file being cleaned, train the The first cleaning model preferentially determines the file of the third type as the file to be cleaned.
在一个示例性的实施方式中,该第一操作特征是根据该目标对象对该第一清理结果的操作行为以及该操作行为所指向的文件所得到的。In an exemplary embodiment, the first operation feature is obtained according to the target object's operation behavior on the first cleaning result and the file pointed to by the operation behavior.
在一个示例性的实施方式中,该训练数据还包括:该目标对象对该终端中所存储的文件的第二操作特征,其中,该第二操作特征用于指示该目标对象对该终端中所存储的指定类型的文件的操作特征。In an exemplary embodiment, the training data further includes: a second operating feature of the target object on the file stored in the terminal, wherein the second operating feature is used to indicate that the target object has the target object on the file stored in the terminal. The operating characteristics of the stored files of the specified type.
在一个示例性的实施方式中,在该使用第二清理模型对终端中所存储的文件进行分析之前,该方法还包括:使用训练数据对第一清理模型进行训练,得到该第二清理模型,其中,该使用训练数据对第一清理模型进行训练包括:在该第二操作特征指示了第五类型文件被访问的频率高于第六类型文件被访问的频率的情况下,训练该第一清理模型将该第六类型文件优先确定为待清理文件。In an exemplary embodiment, before using the second cleaning model to analyze the files stored in the terminal, the method further includes: using training data to train the first cleaning model to obtain the second cleaning model, Wherein, the training of the first cleaning model by using the training data includes: training the first cleaning model in the case that the second operating feature indicates that the fifth type of file is accessed more frequently than the sixth type of file is accessed. The model prioritizes this sixth type of file as the file to be cleaned up.
在一个示例性的实施方式中,该第二操作特征是根据该目标对象对该终端中所存储的文件的操作行为以及该操作行为所指向的文件所得到的。In an exemplary embodiment, the second operating feature is obtained according to the target object's operating behavior on the file stored in the terminal and the file pointed to by the operating behavior.
在一个示例性的实施方式中,在该展示第二清理结果之后,该 方法还包括:使用第三清理模型对终端中所存储的文件进行分析,确定该文件是否需要被清理,其中,该第三清理模型为使用训练数据对该第二清理模型进行训练所得到的,该训练数据包括:该目标对象对该第二清理结果的第三操作特征,其中,该第三操作特征用于指示该目标对象对该第二清理结果中指定类型的文件的操作特征;在确定该文件需要被清理的情况下,展示第三清理结果,其中,该第三清理结果指示了待清理的该文件。In an exemplary embodiment, after the second cleaning result is displayed, the method further includes: using a third cleaning model to analyze a file stored in the terminal to determine whether the file needs to be cleaned, wherein the first cleaning model The third cleaning model is obtained by training the second cleaning model using training data. The training data includes: a third operation feature of the target object on the second cleaning result, where the third operation feature is used to indicate the The target object's operating characteristics of the file of the specified type in the second cleaning result; in the case where it is determined that the file needs to be cleaned, the third cleaning result is displayed, where the third cleaning result indicates the file to be cleaned.
需要说明的是,在本公开中,清理模型的训练可以是迭代的,例如,每次清理模型输出清理结果,并接收目标对象对清理结果的操作之后,都可以基于本次(或者截止当前)目标对象的操作特征对清理模型进行再次训练,其中,操作特征可以是对清理结果的操作特征和/或对终端中所存储的文件的操作特征。It should be noted that in the present disclosure, the training of the cleaning model can be iterative. For example, each time the cleaning model outputs the cleaning result and receives the operation of the target object on the cleaning result, it can be based on this time (or as of the current time). The operation feature of the target object retrains the cleaning model, where the operation feature may be the operation feature of the cleaning result and/or the operation feature of the file stored in the terminal.
在一个示例性的实施方式中,该方法还包括:使用该第二清理模型对该终端中所存储的文件进行分析,确定该第二清理结果中的该文件是否需要被发送至除该终端之外的存储设备;在确定该文件需要被发送至除该终端之外的存储设备的情况下,展示分析结果,其中,该分析结果指示了待发送的该文件。In an exemplary embodiment, the method further includes: using the second cleaning model to analyze a file stored in the terminal to determine whether the file in the second cleaning result needs to be sent to other than the terminal. External storage device; in the case where it is determined that the file needs to be sent to a storage device other than the terminal, the analysis result is displayed, where the analysis result indicates the file to be sent.
需要说明的是,某些情况下,一些文件可能需要被存储至终端之外的其他存储设备中,在将这种文件发送给其他存储设备之后,才会在本终端上删除这种文件。所以,在本公开中,可以在确定某文件为待删除文件时,也区分该文件是否需要被发送至其他存储设备。It should be noted that, in some cases, some files may need to be stored in other storage devices other than the terminal. After sending such files to other storage devices, such files will be deleted on this terminal. Therefore, in the present disclosure, when it is determined that a certain file is a file to be deleted, it can also be distinguished whether the file needs to be sent to other storage devices.
在一个示例性的实施方式中,在该使用第二清理模型对终端中所存储的文件进行分析之前,该方法还包括:使用训练数据对第一清理模型进行训练,得到该第二清理模型,其中,该使用训练数据对第一清理模型进行训练包括:在该第一操作特征与该第二操作特征指示了第七类型文件被清理之前,被发送至除该终端之外的存储设备的情况下,训练该第一清理模型将该第七类型文件优先确定为待发送的文件。In an exemplary embodiment, before using the second cleaning model to analyze the files stored in the terminal, the method further includes: using training data to train the first cleaning model to obtain the second cleaning model, Wherein, the training of the first cleaning model using the training data includes: before the first operating feature and the second operating feature indicate that the seventh type of file is to be cleaned, the situation is sent to a storage device other than the terminal Next, train the first cleaning model to preferentially determine the seventh type of file as the file to be sent.
在一个示例性的实施方式中,在使用第二清理模型对该终端中所存储的文件进行分析之前,该方法还包括:通过该终端接收启动信 号,其中,该启动信号用于指示启动该第二清理模型。In an exemplary embodiment, before using the second cleaning model to analyze the files stored in the terminal, the method further includes: receiving a start signal through the terminal, wherein the start signal is used to instruct to start the second cleaning model. 2. Clean up the model.
在一个示例性的实施方式中,该第二清理结果中,按照该待清理的该文件被清理的几率的高低确定该待清理的该文件的排序。In an exemplary embodiment, in the second cleaning result, the order of the file to be cleaned is determined according to the probability of the file to be cleaned being cleaned.
在一个示例性的实施方式中,在该展示第二清理结果之后,该方法还包括:接收该目标对象对该第二清理结果的目标操作;根据该目标操作对该第二清理结果执行对应的操作。In an exemplary embodiment, after the displaying of the second cleaning result, the method further includes: receiving a target operation of the target object on the second cleaning result; and executing a corresponding operation on the second cleaning result according to the target operation operate.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到根据上述实施方式的方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本公开的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光盘)中,包括若干指令用以使得一台终端设备(可以是手机,计算机,服务器,或者网络设备等)执行本公开各个实施方式所述的方法。Through the description of the above embodiments, those skilled in the art can clearly understand that the method according to the above embodiment can be implemented by means of software plus the necessary general hardware platform, of course, it can also be implemented by hardware, but in many cases the former is Better implementation. Based on this understanding, the technical solution of the present disclosure essentially or the part that contributes to the existing technology can be embodied in the form of a software product, and the computer software product is stored in a storage medium (such as ROM/RAM, magnetic disk, The optical disc) includes several instructions to make a terminal device (which can be a mobile phone, a computer, a server, or a network device, etc.) execute the methods described in the various embodiments of the present disclosure.
本公开还提供了一种文件的处理装置,该装置用于实现上述任一方法,已经进行过说明的不再赘述。如以下所使用的,术语“模块”可以实现预定功能的软件和/或硬件的组合。尽管以下实施方式所描述的装置较佳地以软件来实现,但是硬件,或者软件和硬件的组合的实现也是可能并被构想的。The present disclosure also provides a file processing device, which is used to implement any of the above-mentioned methods, and those that have been explained will not be repeated. As used below, the term "module" can implement a combination of software and/or hardware with predetermined functions. Although the devices described in the following embodiments are preferably implemented by software, implementation by hardware or a combination of software and hardware is also possible and conceived.
图3是根据本公开的文件的处理装置的结构框图,如图3所示,该装置包括:第一分析模块31,配置为使用第二清理模型对终端中所存储的文件进行分析,确定该文件是否需要被清理,其中,该第二清理模型为使用训练数据对第一清理模型进行训练所得到的,该训练数据包括:目标对象对第一清理结果的第一操作特征,其中,该第一清理结果是通过该第一清理模型对该终端中所存储的文件进行分析所得到的,该第一清理结果指示了待清理的文件,该第一操作特征用于指示该目标对象对该第一清理结果中指定类型的文件的操作特征,同一类型的文件具有相同的文件特征;第一展示模块33,配置为在确定该文件需要被清理的情况下,展示第二清理结果,其中,该第二清理结果指示了待清理的该文件。Fig. 3 is a structural block diagram of a file processing device according to the present disclosure. As shown in Fig. 3, the device includes: a first analysis module 31 configured to use a second cleaning model to analyze files stored in the terminal to determine the Whether the file needs to be cleaned, wherein the second cleanup model is obtained by training the first cleanup model using training data, and the training data includes: the first operation feature of the target object on the first cleanup result, where the first cleanup model A cleaning result is obtained by analyzing the file stored in the terminal through the first cleaning model, the first cleaning result indicates the file to be cleaned, and the first operating feature is used to indicate that the target object has The operating characteristics of files of a specified type in the cleaning result, and files of the same type have the same file characteristics; the first display module 33 is configured to display the second cleaning result when it is determined that the file needs to be cleaned. The second cleaning result indicates the file to be cleaned.
通过上述步骤,由于根据目标对象对清理结果的操作特征对清理模型进行训练,用目标对象的操作特征不断优化清理模型,使得清理模型最终确定的清理结果更贴合目标对象的行为习惯,因此,可以解决相关技术中待清理文件推送的精确度较低的问题,达到提高待清理文件的推送的精确度的技术效果。Through the above steps, since the cleaning model is trained according to the operating characteristics of the cleaning result of the target object, the cleaning model is continuously optimized with the operating characteristics of the target object, so that the final cleaning result determined by the cleaning model is more in line with the behavior habits of the target object. Therefore, It can solve the problem of low accuracy of pushing the files to be cleaned in related technologies, and achieve the technical effect of improving the accuracy of pushing the files to be cleaned.
在一个示例性的实施方式中,该装置还包括:训练模块,配置为在该使用第二清理模型对终端中所存储的文件进行分析之前,使用训练数据对第一清理模型进行训练,得到该第二清理模型,其中,该使用训练数据对第一清理模型进行训练包括:在该第一操作特征指示了第一类型文件被保留的几率高于第二类型文件被保留的几率的情况下,训练该第一清理模型将该第二类型文件优先确定为待清理文件;和/或,在该第一操作特征指示了第三类型文件被清理的几率高于第四类型文件被清理的几率的情况下,训练该第一清理模型将该第三类型文件优先确定为待清理文件。In an exemplary embodiment, the device further includes: a training module configured to use the training data to train the first cleaning model before using the second cleaning model to analyze the files stored in the terminal to obtain the The second cleaning model, wherein the training the first cleaning model using the training data includes: in the case that the first operating feature indicates that the probability of the first type of file being retained is higher than the probability of the second type of file being retained, Training the first cleaning model to prioritize determining the second type of file as a file to be cleaned; and/or, where the first operating feature indicates that the probability of the third type of file being cleaned is higher than the probability of the fourth type of file being cleaned In this case, the first cleaning model is trained to first determine the third type of files as files to be cleaned.
在一个示例性的实施方式中,该第一操作特征是根据该目标对象对该第一清理结果的操作行为以及该操作行为所指向的文件所得到的。In an exemplary embodiment, the first operation feature is obtained according to the target object's operation behavior on the first cleaning result and the file pointed to by the operation behavior.
在一个示例性的实施方式中,该训练数据还包括:该目标对象对该终端中所存储的文件的第二操作特征,其中,该第二操作特征用于指示该目标对象对该终端中所存储的指定类型的文件的操作特征。In an exemplary embodiment, the training data further includes: a second operating feature of the target object on the file stored in the terminal, wherein the second operating feature is used to indicate that the target object has the target object on the file stored in the terminal. The operating characteristics of the stored files of the specified type.
在一个示例性的实施方式中,该训练模块还配置为,在该使用第二清理模型对终端中所存储的文件进行分析之前,使用训练数据对第一清理模型进行训练,得到该第二清理模型,其中,该使用训练数据对第一清理模型进行训练包括:在该第二操作特征指示了第五类型文件被访问的频率高于第六类型文件被访问的频率的情况下,训练该第一清理模型将该第六类型文件优先确定为待清理文件。In an exemplary embodiment, the training module is further configured to use training data to train the first cleaning model to obtain the second cleaning model before analyzing the files stored in the terminal using the second cleaning model. Model, wherein the training the first cleaning model using the training data includes: training the first cleaning model when the second operating feature indicates that the fifth type of file is accessed more frequently than the sixth type of file is accessed. A cleaning model prioritizes the sixth type of file as a file to be cleaned.
在一个示例性的实施方式中,该第二操作特征是根据该目标对象对该终端中所存储的文件的操作行为以及该操作行为所指向的文件所得到的。In an exemplary embodiment, the second operating feature is obtained according to the target object's operating behavior on the file stored in the terminal and the file pointed to by the operating behavior.
在一个示例性的实施方式中,在该展示第二清理结果之后,该 装置还包括:第二分析模块,配置为使用第三清理模型对终端中所存储的文件进行分析,确定该文件是否需要被清理,其中,该第三清理模型为使用训练数据对该第二清理模型进行训练所得到的,该训练数据包括:该目标对象对该第二清理结果的第三操作特征,其中,该第三操作特征用于指示该目标对象对该第二清理结果中指定类型的文件的操作特征;第二展示模块,配置为在确定该文件需要被清理的情况下,展示第三清理结果,其中,该第三清理结果指示了待清理的该文件。In an exemplary embodiment, after the display of the second cleaning result, the device further includes: a second analysis module configured to use the third cleaning model to analyze the file stored in the terminal to determine whether the file is required Is cleaned up, where the third clean-up model is obtained by training the second clean-up model using training data, and the training data includes: the third operation feature of the target object on the second clean-up result, wherein the first The third operating feature is used to indicate the operating feature of the target object on the file of the specified type in the second cleaning result; the second display module is configured to display the third cleaning result when it is determined that the file needs to be cleaned, where: The third cleaning result indicates the file to be cleaned.
需要说明的是,在本公开中,清理模型的训练可以是迭代的,例如,每次清理模型输出清理结果,并接收目标对象对清理结果的操作之后,都可以基于本次(或者截止当前)目标对象的操作特征对清理模型进行再次训练,其中,操作特征可以是对清理结果的操作特征和/或对终端中所存储的文件的操作特征。It should be noted that in the present disclosure, the training of the cleaning model can be iterative. For example, each time the cleaning model outputs the cleaning result and receives the operation of the target object on the cleaning result, it can be based on this time (or as of the current time). The operation feature of the target object retrains the cleaning model, where the operation feature may be the operation feature of the cleaning result and/or the operation feature of the file stored in the terminal.
在一个示例性的实施方式中,该装置还包括:第三分析模块,配置为使用该第二清理模型对该终端中所存储的文件进行分析,确定该第二清理结果中的该文件是否需要被发送至除该终端之外的存储设备;第三展示模块,配置为在确定该文件需要被发送至除该终端之外的存储设备的情况下,展示分析结果,其中,该分析结果指示了待发送的该文件。In an exemplary embodiment, the device further includes: a third analysis module configured to use the second cleaning model to analyze the file stored in the terminal to determine whether the file in the second cleaning result is required Is sent to a storage device other than the terminal; the third display module is configured to display the analysis result when it is determined that the file needs to be sent to a storage device other than the terminal, where the analysis result indicates The file to be sent.
需要说明的是,某些情况下,一些文件可能需要被存储至终端之外的其他存储设备中,在将这种文件发送给其他存储设备之后,才会在本终端上删除这种文件。所以,在本公开中,可以在确定某文件为待删除文件时,也区分该文件是否需要被发送至其他存储设备。It should be noted that, in some cases, some files may need to be stored in other storage devices other than the terminal. After sending such files to other storage devices, such files will be deleted on this terminal. Therefore, in the present disclosure, when it is determined that a certain file is a file to be deleted, it can also be distinguished whether the file needs to be sent to other storage devices.
在一个示例性的实施方式中,该训练模块还配置为,在该使用第二清理模型对终端中所存储的文件进行分析之前,使用训练数据对第一清理模型进行训练,得到该第二清理模型,其中,该使用训练数据对第一清理模型进行训练包括:在该第一操作特征与该第二操作特征指示了第七类型文件被清理之前,被发送至除该终端之外的存储设备的情况下,训练该第一清理模型将该第七类型文件优先确定为待发送的文件。In an exemplary embodiment, the training module is further configured to use training data to train the first cleaning model to obtain the second cleaning model before analyzing the files stored in the terminal using the second cleaning model. A model, wherein the training of the first cleaning model using the training data includes: sending to a storage device other than the terminal before the first operating feature and the second operating feature indicate that the seventh type of file is to be cleaned In the case of training, the first cleaning model is trained to preferentially determine the seventh type of file as the file to be sent.
在一个示例性的实施方式中,该装置还包括:接收模块,配置为在使用第二清理模型对该终端中所存储的文件进行分析之前,接收启动信号,其中,该启动信号用于指示启动该第二清理模型。In an exemplary embodiment, the device further includes: a receiving module configured to receive a start signal before analyzing the file stored in the terminal using the second cleaning model, wherein the start signal is used to indicate the start The second cleaning model.
在一个示例性的实施方式中,该第二清理结果中,按照该待清理的该文件被清理的几率的高低确定该待清理的该文件的排序。In an exemplary embodiment, in the second cleaning result, the order of the file to be cleaned is determined according to the probability of the file to be cleaned being cleaned.
在一个示例性的实施方式中,在该展示第二清理结果之后,该装置还包括:操作模块,配置为接收该目标对象对该第二清理结果的目标操作;根据该目标操作对该第二清理结果执行对应的操作。In an exemplary embodiment, after the second cleaning result is displayed, the device further includes: an operation module configured to receive a target operation of the target object on the second cleaning result; and the second cleaning result according to the target operation Perform the corresponding operation on the cleanup result.
需要说明的是,上述各个模块是可以通过软件或硬件来实现的,对于后者,可以通过以下方式实现,但不限于此:上述模块均位于同一处理器中;或者,上述各个模块以任意组合的形式分别位于不同的处理器中。It should be noted that each of the above-mentioned modules can be implemented by software or hardware. For the latter, it can be implemented in the following way, but not limited to this: the above-mentioned modules are all located in the same processor; or, the above-mentioned modules are in any combination The forms are located in different processors.
示例实施方式Example implementation
以下结合具体场景对本公开实施方式作进一步的解释说明。The following further explains the embodiments of the present disclosure in combination with specific scenarios.
本实施方式所提供的方法可以在智能移动终端,个人电脑中执行。需要指出的是,本实施方式的方法的执行需要具有终端的读写权限,也就是拥有安全范围内的系统权限,这样才能获取到用户的操作信息,类似手机管家之类的权限。The method provided in this embodiment can be executed in a smart mobile terminal or a personal computer. It should be pointed out that the execution of the method in this embodiment requires the read and write permissions of the terminal, that is, the system permissions within the security range, so that the user's operation information can be obtained, similar to the permissions of a mobile phone housekeeper.
本实施方式先在终端置入一个原始的文件清理模型,这个模型可以是一种清理机制模型,该模型的默认判断参数,包括但不限于:识别应用缓存的图片、用户编辑修改过的图片,浏览频率高的图片,聊天群中的图片,用户下载保存的图片等,还可以识别其他各种文件类型,例如视频、音频、文件等。In this embodiment, an original file cleaning model is first placed in the terminal. This model can be a cleaning mechanism model. The default judgment parameters of the model include, but are not limited to: identifying images cached by the application, and images edited and modified by the user. Pictures with high browsing frequency, pictures in chat groups, pictures downloaded and saved by users, etc., can also identify various other file types, such as video, audio, and files.
这个模型会对用户的使用习惯进行反复的采集,处理,从而实现长期训练,提取用户使用习惯的特征点,并生成一个阶段后的新清理模型。当触发其清理时,终端根据新的清理模型,输出清理列表供用户选择,其中,清理列表中包括清理模型所确定的待清理内容的清单。用户在使用该清理模型的过程中产生的数据,可以做为本公开实施方式中的输入信息。This model will repeatedly collect and process the user's usage habits, so as to achieve long-term training, extract the characteristic points of the user's usage habits, and generate a new cleaning model after a stage. When the cleanup is triggered, the terminal outputs a cleanup list for the user to choose according to the new cleanup model, where the cleanup list includes a list of content to be cleaned determined by the cleanup model. The data generated by the user in the process of using the cleaning model can be used as input information in the embodiment of the present disclosure.
需要说明的是,本公开实施方式并不能实现一次就能准确的判断用户要对哪些数据进行删除,哪些数据进行保留,而是通过多次反复的训练,来达到越来越趋近于用户的真实意图为目的,从而达到自学习和智能的目的。It should be noted that the embodiments of the present disclosure cannot accurately determine which data the user wants to delete and which data to retain at one time. Instead, it is more and more close to the user through repeated training. The real intention is the purpose, so as to achieve the purpose of self-learning and intelligence.
用户可以对输出结果(即终端输出的清理列表)进行直接操作(例如勾选,去勾选,保留,一键清理,删除一条或多条数据等),终端记录用户对输出结果进行的操作以及操作所指向的内容,例如,记录直接清理的内容信息(即终端中所存储的数据的信息),也记录用户去勾选,或者保留的内容信息,以此作为下一次再训练的一部分输入数据,和再训练的方法依据。示例性的,终端可以记录用户对输出结果的全部操作以及全部操作所指向的内容;也可以记录用户对输出结果的部分操作以及该部分操作所指向的内容。需要终端所记录的部分操作可以是预先设定的,例如可以是与内容信息的保留或删除相关的操作,例如删除操作、保留操作。The user can perform direct operations on the output results (that is, the cleanup list output by the terminal) (for example, check, uncheck, retain, one-click cleanup, delete one or more pieces of data, etc.), and the terminal records the user's operations on the output results and The content pointed to by the operation, for example, record the content information that is directly cleaned up (that is, the information of the data stored in the terminal), and also record the content information that the user checks or keeps, as part of the input data for the next retraining , And the method basis of retraining. Exemplarily, the terminal may record all the user's operations on the output result and the content pointed to by all operations; it may also record part of the user's operation on the output result and the content pointed to by the part of the operation. Part of the operations that need to be recorded by the terminal may be preset, for example, may be operations related to the retention or deletion of content information, such as deletion operations and retention operations.
在一个示例性的实施方式中,下次再训练的输入数据,由两部分组成:一部分是上次和下次产生输出结果之间的新的用户数据生成,例如上次产生输出结果之后直至本次产生输出结果之前的时间段中产生的新的用户数据;另一部分是,上次输出结果中,未被用户删除的数据。需要说明的是,可以将这部分内容(即前述的两种数据)确定为是用户不满意训练结果,或者训练结果并不是用户想要的结果,所以需要再进行下次训练,以此来让训练模型更成熟,以达到下次趋近用户期望结果的目的。In an exemplary embodiment, the input data for the next retraining consists of two parts: one part is the generation of new user data between the last and the next output, for example, after the last output is generated until the current The new user data generated in the time period before the output result is generated; the other part is the data that has not been deleted by the user in the last output result. It should be noted that this part of the content (that is, the aforementioned two kinds of data) can be determined to be that the user is not satisfied with the training result, or the training result is not the result that the user wants, so the next training is required to make The training model is more mature to achieve the goal of approaching the user's expected result next time.
在一个示例性的实施方式中,本公开中所提供的清理模型可以在输出结果后提供用户打包需要保留的内容,以便让用户将需要保留到其他存储设备但在本地需要删除的内容拷贝到其他地方后,可以放心在本地删除这些内容。In an exemplary embodiment, the cleanup model provided in the present disclosure can provide the user to package the content that needs to be retained after outputting the result, so that the user can copy the content that needs to be retained in other storage devices but needs to be deleted locally to other storage devices. After the locale, you can safely delete these contents locally.
图4是根据本公开的终端清理系统的结构示意图,如图4所示,实现本公开方法的框架可以包括:输入模块、处理模块、输出模块和用户操作记录模块。Fig. 4 is a schematic structural diagram of the terminal cleaning system according to the present disclosure. As shown in Fig. 4, the framework for implementing the method of the present disclosure may include: an input module, a processing module, an output module, and a user operation recording module.
其中,在一个示例性的实施方式中,输入模块可以记录用户行 为,把用户对终端的操作和使用行为的记录作为清理模型的初始输入数据。输入模块有专门的大数据存储空间,配置为保存或者记录用户在一段时间内对终端各个应用的使用情况,以及对各联系人的使用情况。示例性的,包括但不限于这些内容:使用某应用的时间段,使用方式,产生的缓存文件,文件的保存路径,文件的删除情况等等。Among them, in an exemplary embodiment, the input module may record user behavior, and the user's operation and use behavior of the terminal is used as the initial input data of the cleaning model. The input module has a dedicated big data storage space, which is configured to save or record the user's usage of each terminal application and the usage of each contact within a period of time. Exemplary, including but not limited to these content: the time period of using a certain application, the way of use, the generated cache file, the storage path of the file, the deletion of the file, and so on.
在一个示例性的实施方式中,处理模块,配置为在对清理模型进行一段时间的训练和学习之后,提取出用户访问频率较高的内容,例如经常聊天的联系人,经常浏览的网页,经常看的视频,经常听的音乐类应用,经常用的拍照应用等等,以及这些对应应用、内容被使用后所产生的不同类型的文件,文件存在的路径,文件的大小等等。在用户使用一个月两个月三个月或更久以后,根据用户的使用习惯,提炼出对这些文件操作的共性,例如:下载的视频看完以后可以删除,经常听的音乐文件不删除,经常浏览的网页缓存可以删除,朋友圈的缓存视频可以删除,和谁的聊天视频或者图片不删除,孩子和家人,朋友的照片和视频不删除,娱乐类应用缓存结果可删除等等。In an exemplary embodiment, the processing module is configured to extract content that is frequently accessed by the user after training and learning the cleaning model for a period of time, such as contacts that frequently chat, web pages that are frequently browsed, and Watch videos, frequently listened to music apps, frequently used camera apps, etc., as well as different types of files generated after these corresponding apps and content are used, the path where the files exist, the size of the files, and so on. After the user uses it for one month, two months, three months or more, according to the user’s usage habits, the commonality of these file operations is extracted, for example: the downloaded video can be deleted after watching it, and the music file that is often listened to is not deleted. The cache of frequently browsed web pages can be deleted, the cached videos of Moments can be deleted, the chat videos or pictures with whom are not deleted, the photos and videos of children, family, and friends can not be deleted, and the cached results of entertainment applications can be deleted, etc.
在一个示例性的实施方式中,需要特别说明的是,如果算法学习到用户对特定的文件有很大概率会对其进行保存并拷贝或者发送到其他地方的内容,处理模块需要对这些文件进行提炼,并最终展示给用户。比如,用户定期会对孩子的照片,视频,或者旅游的照片,自拍等打包保存起来后才会清理终端上的该内容,那么,在触发清理该类型的内容时,需要将该内容或者该内容的清单展示给用户,并让用户有可操作方案。In an exemplary implementation, it needs to be specifically noted that if the algorithm learns that the user has a high probability of saving and copying a specific file or sending it to other places, the processing module needs to perform processing on these files. Refine it and finally show it to users. For example, the user regularly packs and saves the children’s photos, videos, or travel photos, selfies, etc., before cleaning up the content on the terminal. Then, when the cleaning of this type of content is triggered, the content or the content needs to be cleaned up. The list of is displayed to the user, and allows the user to have an actionable plan.
根据以上这些结果,训练后生成新的清理模型,清理模型通过一定的清理算法对终端用户的操作数据进行计算和呈现。Based on the above results, a new cleaning model is generated after training, and the cleaning model calculates and presents the terminal user's operating data through a certain cleaning algorithm.
在一个示例性的实施方式中,输出模块,配置为在输出端提供界面,展示训练算法的输出结果,即给出用户清理列表,清理列表可按某种规则进行显示。如:可以按照用户可能进行的清理优先级进行展示:用户很可能会删除的文件作为第一优先级,可以排序较靠前;有可能会删除的文件作为第二优先级,排序稍靠后,以此类推。也可以按照算法的收敛准确等级进行展示:算法预估最准确的用户可以放 心删除的文件作为第一优先级,可以排序较靠前;其次准确的,可能需要用户稍加判断的文件作为第二优先级,排序稍靠后,以此类推。这里不对显示方式进行限制,凡是类似的方案,都属于本公开的保护范围。In an exemplary embodiment, the output module is configured to provide an interface on the output terminal to display the output result of the training algorithm, that is, to provide a user cleanup list, and the cleanup list can be displayed according to a certain rule. For example, it can be displayed according to the user's possible cleaning priority: files that the user is likely to delete are the first priority, which can be ranked higher; the files that may be deleted are the second priority, and the ranking is slightly lower. And so on. It can also be displayed in accordance with the convergence accuracy level of the algorithm: the file that the algorithm estimates the most accurately can be safely deleted as the first priority, which can be ranked higher; the second is accurate, and the file that may require a little judgment from the user is the second Priority, sorted slightly later, and so on. The display mode is not limited here, and all similar solutions fall within the protection scope of the present disclosure.
在一个示例性的实施方式中,输出模块除了显示可清除的内容外,还可以展示给用户算法提取出来的,可供用户保存或者打包的文件,并供用户操作。In an exemplary embodiment, in addition to displaying the clearable content, the output module may also display to the user the files extracted by the algorithm, which can be saved or packaged by the user, and provided for the user to operate.
在一个示例性的实施方式中,用户可对输出端显示的清理列表进行各种操作,用户操作记录模块配置为记录用户操作。如果用户对显示结果可以放心清理,可以进行一键清理;如果对输出端显示的清理列表可能需要再判断的输出结果,进行确认后,可以对一条或多条内容进行清理或者取消清理。In an exemplary embodiment, the user can perform various operations on the cleanup list displayed on the output terminal, and the user operation recording module is configured to record user operations. If the user can clean up the displayed results with confidence, one-click cleaning can be performed; if the output result of the cleaning list displayed on the output terminal may need to be judged, after confirming, one or more items of content can be cleaned up or canceled.
用户取消清理的内容又循环到输入模块,作为一部分输入数据,进入下一轮的训练学习中。同时,这部分内容的标记特别重要,因为它可以作为下次修正算法的因子的参数,让算法越来越接近用户意图。比如,算法得到的和谁的聊天图片删除的结果其实并不是用户的意图,所以在下一次的算法中,这部分内容就可能不会出现在用户删除的内容中。清理模型提供的清单是清理模型认为可以被清除的内容,如果用户取消清理,说明算法确定这部分内容应该被删除,但是用户认为应该保留,所以这部分内容之后不会出现在用户删除的内容中。The content that the user cancels to clean up loops to the input module, and enters the next round of training and learning as part of the input data. At the same time, the marking of this part of the content is particularly important, because it can be used as a parameter of the factor of the next revision of the algorithm, making the algorithm closer and closer to the user's intention. For example, the result of deleting the chat picture obtained by the algorithm is not the user's intention, so in the next algorithm, this part of the content may not appear in the content deleted by the user. The list provided by the cleanup model is the content that the cleanup model thinks can be cleaned up. If the user cancels the cleanup, the algorithm determines that this part of the content should be deleted, but the user thinks it should be kept, so this part of the content will not appear in the user deleted content afterwards .
在一个示例性的实施方式中,用户直接删除的内容可以在此一次删除掉,或者保留一个时间段,以防止用户误删后想去找回。In an exemplary implementation, the content directly deleted by the user may be deleted once or kept for a period of time, so as to prevent the user from deleting it by mistake and wanting to retrieve it.
同时,为了方便用户放心清理,对算法显示出来很可能是用户想要保存的文件的情况下,需要提供通道供用户将这些文件打包保存到其他可存储的地方,具体实现方式这里不做具体限制,例如可以是勾选,去勾选,合并打包,一键打包,保存到,发送到等。At the same time, in order to facilitate users to clean up at ease, when the algorithm shows that it is likely to be the file that the user wants to save, it is necessary to provide a channel for the user to package and save these files to other storage places. The specific implementation method is not specifically limited here. For example, it can be checked, unchecked, combined and packaged, one-click packaged, saved to, sent to, etc.
在一个示例性的实施方式中,本公开提供了一种基于训练的终端空间清理系统,包括输入模块、处理模块和输出模块。In an exemplary embodiment, the present disclosure provides a training-based terminal space cleaning system, including an input module, a processing module, and an output module.
图5是根据本公开的输入模块记录用户行为的示意图,如图5所示,输入模块,配置为记录用户行为。用户在前台操作,后台分别 会有不同的数据库存储使用过程中产生的各种类型的文件,需要使用的字段包括但不限于图5中的字段,可以根据实际需要进行规定。同时在输入阶段,数据库中的表作为初始值,伴随着用户的使用,每个表中的内容都在不断的更新,并有用户不同操作的记录,供后续训练算法用。Fig. 5 is a schematic diagram of the input module recording user behavior according to the present disclosure. As shown in Fig. 5, the input module is configured to record user behavior. The user operates in the foreground and different databases in the background store various types of files generated during use. The fields that need to be used include but are not limited to the fields in Figure 5, which can be specified according to actual needs. At the same time, in the input stage, the tables in the database are used as initial values. With the user's use, the content in each table is constantly updated, and there are records of different users' operations for subsequent training algorithms.
图6是根据本公开的处理模块对用户行为进行分析处理的示意图,如图6所示,处理模块,配置为对用户行为进行分析处理。处理模块利用数据库存储了用户对后台文件的所有处理行为,包括曾经被删除,被保存,被转发以及操作的时间等等,通过这些操作,根据内置的清理算法,提炼出用户的使用习惯以及一些倾向性的行为,即生成为新的数据库,新的数据库中,对旧数据库中的文件进行标记,分类,最后形成一个基本可用于后续输出的数据库文件。Fig. 6 is a schematic diagram of analyzing and processing user behaviors by a processing module according to the present disclosure. As shown in Fig. 6, the processing module is configured to analyze and process user behaviors. The processing module uses the database to store all the user's processing behaviors of the background files, including the time of being deleted, saved, forwarded, and operation, etc., through these operations, according to the built-in cleaning algorithm, the user's usage habits and some The tendentious behavior is to generate a new database. In the new database, the files in the old database are marked and classified, and finally a database file that can be basically used for subsequent output is formed.
图7是根据本公开的输出模块输出清理结果的示意图,如图7所示,输出模块,配置为输出清理结果。当触发清理动作以后,根据训练算法得出的清理结果,在终端用户界面展现,根据之前阐述的,清理列表可按某种规则进行显示。如:可以按照用户可能进行的清理优先级进行展示;也可以按照算法的收敛准确等级进行展示等,都不做具体限制。用户在展示的界面进行操作,可以删除一部分内容,可以拷贝一部分内容,可以去勾选一部分内容,保留的内容后台数据库进行特殊标记,以作为下一次训练的一部分输入数据。FIG. 7 is a schematic diagram of the output module outputting the cleaning result according to the present disclosure. As shown in FIG. 7, the output module is configured to output the cleaning result. When the cleanup action is triggered, the cleanup result obtained by the training algorithm is displayed on the terminal user interface. According to the previous explanation, the cleanup list can be displayed according to certain rules. For example: it can be displayed according to the cleanup priority that the user may perform; it can also be displayed according to the convergence accuracy level of the algorithm, etc., without specific restrictions. The user can delete part of the content, copy part of the content, and deselect part of the content when operating on the displayed interface. The reserved content back-end database is specially marked as part of the input data for the next training.
在一个示例性的实施方式中,本公开提供了一种后台自动触发扫描的基于训练的移动终端空间清理功能的方法。图8是根据本公开的文件清理方法的流程示意图,如图8所示,所述方法可以包括步骤S1至步骤S9。In an exemplary embodiment, the present disclosure provides a method for a training-based space clearing function of a mobile terminal that automatically triggers a scan in the background. FIG. 8 is a schematic flowchart of a file cleaning method according to the present disclosure. As shown in FIG. 8, the method may include steps S1 to S9.
在步骤S1,预制原始清理算法。In step S1, the original cleaning algorithm is prefabricated.
在步骤S2,用户对终端进行操作。In step S2, the user operates the terminal.
在步骤S3,获取并存储用户对文件的操作信息。In step S3, the user's operation information on the file is obtained and stored.
在步骤S4,根据用户一段时间的操作进行共性提取。In step S4, commonality is extracted according to the user's operation over a period of time.
在步骤S5,用预制的原始算法按照提取得共性进行算法训练,形成新的算法,并且后续会根据用户不断的操作,而不断的修正算法, 以达到更趋于用户真实目的;In step S5, the pre-made original algorithm is used for algorithm training according to the extracted commonality to form a new algorithm, and the algorithm will be continuously revised according to the user's continuous operation in order to achieve the real goal of the user;
在步骤S6,训练的算法随时都有可以输出的结果,当用户调起清理或者后台触发清理时,算法的结果展现给用户。只是这个结果可能会随着时间的变化而不断变化,但是都是为了更好的满足用户需求。In step S6, the trained algorithm has a result that can be output at any time. When the user initiates a cleanup or triggers the cleanup in the background, the result of the algorithm is displayed to the user. It's just that this result may change over time, but it's all for better satisfying user needs.
在步骤S7,用户对给出的清理结果进行删除,意味着这部分内容,用户认可删除,也就是说对着部分的数据算法收敛较好。In step S7, the user deletes the given cleaning result, which means that the user approves the deletion of this part of the content, that is, the algorithm converges better for the part of the data.
在步骤S8,用户对给出的清理结果进行取消删除,意味着这部分内容,用户不认可删除,也就是说对着部分的数据算法需要继续修正。因此,它又进入到下一个循环,继续进行算法训练。In step S8, the user cancels the deletion of the given cleaning result, which means that the user does not approve the deletion of this part of the content, that is to say, the part of the data algorithm needs to be continuously revised. Therefore, it enters the next cycle and continues to perform algorithm training.
在步骤S9,用户对给出的拷贝数据比较认可,用户直接进行拷贝,后台进行打包操作,然后用户再进行删除。In step S9, the user agrees to the copy data provided, the user directly copies, and the packaging operation is performed in the background, and then the user deletes it.
在一个示例性的实施方式中,前置条件就是需要终端具有系统权限,并且内置一种适合的清理算法,可以是目前主流的算法,本公开是基于原始算法的基础上,按照用户习惯,对算法进行不断的训练,达到无限接近用户的真正意图的智能算法。In an exemplary embodiment, the precondition is that the terminal has system permissions and a suitable cleaning algorithm is built-in, which can be the current mainstream algorithm. The present disclosure is based on the original algorithm and is based on user habits. The algorithm is continuously trained to reach an intelligent algorithm that is infinitely close to the real intention of the user.
在一个示例性的实施方式中,在用户可操作界面,根据展示的清理列表,进行勾选,去勾选,删除,拷贝等操作,保留的文件又继续进行下一次训练,并作为算法的修正因子。算法的内容用户界面不可见。In an exemplary implementation, in the user-operable interface, check, uncheck, delete, copy, etc. operations are performed according to the displayed cleanup list, and the retained files continue to be trained for the next time and are used as algorithm corrections. factor. The content user interface of the algorithm is not visible.
在本实施方式中,通过算法的不断训练,学习,一定程度上逐步解决了用户一直以来不敢清理许多图片,视频,文件的担忧;同时又方便了用户,为用户自动整合需要的文件,并拷贝或者分享到其他地方后进行清理,用户就不会担心东西丢失,也可以释放大量空间。In this embodiment, through the continuous training and learning of the algorithm, the user’s worries that users have not dared to clean up many pictures, videos, and files have been gradually solved to a certain extent; at the same time, it is convenient for the user to automatically integrate the required files for the user, and Clean up after copying or sharing to other places, users will not worry about things being lost, and a lot of space can be freed up.
本公开还提供了一种计算机可读存储介质,该计算机可读存储介质中存储有计算机程序,其中,该计算机程序被设置为运行时执行本文描述的任一方法。The present disclosure also provides a computer-readable storage medium in which a computer program is stored, wherein the computer program is configured to execute any method described herein when running.
在一个示例性实施方式中,上述计算机可读存储介质可以包括但不限于:U盘、只读存储器(Read-Only Memory,简称为ROM)、随机存取存储器(Random Access Memory,简称为RAM)、移动硬盘、磁碟或者光盘等各种可以存储计算机程序的介质。In an exemplary embodiment, the foregoing computer-readable storage medium may include, but is not limited to: U disk, Read-Only Memory (Read-Only Memory, ROM for short), Random Access Memory (Random Access Memory, RAM for short) , Mobile hard drives, magnetic disks or optical discs and other media that can store computer programs.
本公开还提供了一种终端,包括存储器和处理器,该存储器中存储有计算机程序,该处理器被设置为运行计算机程序以执行本文所述任一方法。The present disclosure also provides a terminal, including a memory and a processor, and a computer program is stored in the memory, and the processor is configured to run the computer program to execute any method described herein.
在一个示例性实施方式中,上述电子装置还可以包括传输设备以及输入输出设备,其中,该传输设备和上述处理器连接,该输入输出设备和上述处理器连接。In an exemplary embodiment, the aforementioned electronic device may further include a transmission device and an input-output device, wherein the transmission device is connected to the aforementioned processor, and the input-output device is connected to the aforementioned processor.
本实施方式中的具体示例可以参考上述实施方式及示例性实施方式中所描述的示例,本实施方式在此不再赘述。For specific examples in this embodiment, reference may be made to the examples described in the foregoing embodiment and exemplary embodiments, and this embodiment will not be repeated here.
显然,本领域的技术人员应该明白,上述的本公开的各模块或各步骤可以用通用的计算装置来实现,它们可以集中在单个的计算装置上,或者分布在多个计算装置所组成的网络上,它们可以用计算装置可执行的程序代码来实现,从而,可以将它们存储在存储装置中由计算装置来执行,并且在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤,或者将它们分别制作成各个集成电路模块,或者将它们中的多个模块或步骤制作成单个集成电路模块来实现。这样,本公开不限制于任何特定的硬件和软件结合。Obviously, those skilled in the art should understand that the above-mentioned modules or steps of the present disclosure can be implemented by a general computing device, and they can be concentrated on a single computing device or distributed in a network composed of multiple computing devices. Above, they can be implemented with program codes executable by a computing device, so that they can be stored in a storage device for execution by the computing device, and in some cases, they can be executed in a different order than shown here. Or the described steps, or fabricate them into individual integrated circuit modules respectively, or fabricate multiple modules or steps of them into a single integrated circuit module to achieve. In this way, the present disclosure is not limited to any specific combination of hardware and software.
以上所述仅为本公开的优选实施方式而已,并不用于限制本公开,对于本领域的技术人员来说,本公开可以有各种更改和变化。凡在本公开的原则之内,所作的任何修改、等同替换、改进等,均应包含在本公开的保护范围之内。The foregoing descriptions are only preferred embodiments of the present disclosure, and are not intended to limit the present disclosure. For those skilled in the art, the present disclosure may have various modifications and changes. Any modification, equivalent replacement, improvement, etc. made within the principles of the present disclosure shall be included in the protection scope of the present disclosure.

Claims (15)

  1. 一种文件的处理方法,包括:A file processing method, including:
    使用第二清理模型对终端中所存储的文件进行分析,确定所述文件是否需要被清理,其中,所述第二清理模型为使用训练数据对第一清理模型进行训练所得到的,所述训练数据包括:目标对象对第一清理结果的第一操作特征,其中,所述第一清理结果是通过所述第一清理模型对所述终端中所存储的文件进行分析所得到的,所述第一清理结果指示了待清理的文件,所述第一操作特征用于指示所述目标对象对所述第一清理结果中指定类型的文件的操作特征,同一类型的文件具有相同的文件特征;Use the second cleaning model to analyze the files stored in the terminal to determine whether the files need to be cleaned, where the second cleaning model is obtained by training the first cleaning model using training data, and the training The data includes: the first operation characteristic of the target object on the first cleaning result, wherein the first cleaning result is obtained by analyzing the files stored in the terminal through the first cleaning model, and the first cleaning result A cleaning result indicates a file to be cleaned, and the first operating characteristic is used to indicate the operating characteristic of the target object on a file of a specified type in the first cleaning result, and files of the same type have the same file characteristics;
    在确定所述文件需要被清理的情况下,展示第二清理结果,其中,所述第二清理结果指示了待清理的所述文件。In the case where it is determined that the file needs to be cleaned up, a second cleanup result is displayed, where the second cleanup result indicates the file to be cleaned up.
  2. 根据权利要求1所述的方法,其中,在所述使用第二清理模型对终端中所存储的文件进行分析之前,所述方法还包括:The method according to claim 1, wherein, before said using the second cleaning model to analyze the files stored in the terminal, the method further comprises:
    使用训练数据对第一清理模型进行训练,得到所述第二清理模型,其中,所述使用训练数据对第一清理模型进行训练包括:Training the first cleaning model using training data to obtain the second cleaning model, wherein the training the first cleaning model using the training data includes:
    在所述第一操作特征指示了第一类型文件被保留的几率高于第二类型文件被保留的几率的情况下,训练所述第一清理模型将所述第二类型文件优先确定为待清理文件;和/或,In the case where the first operating characteristic indicates that the probability of the first type of file being retained is higher than the probability of the second type of file being retained, the first cleaning model is trained to prioritize the second type of file to be cleaned Documents; and/or,
    在所述第一操作特征指示了第三类型文件被清理的几率高于第四类型文件被清理的几率的情况下,训练所述第一清理模型将所述第三类型文件优先确定为待清理文件。In the case where the first operating feature indicates that the probability of the third type of file being cleaned is higher than the probability of the fourth type of file being cleaned, the first cleaning model is trained to prioritize the third type of file to be cleaned document.
  3. 根据权利要求1或2所述的方法,其中,所述第一操作特征是根据所述目标对象对所述第一清理结果的操作行为以及所述操作行为所指向的文件所得到的。The method according to claim 1 or 2, wherein the first operation characteristic is obtained according to the operation behavior of the target object on the first cleaning result and the file pointed to by the operation behavior.
  4. 根据权利要求1所述的方法,其中,所述训练数据还包括:所述目标对象对所述终端中所存储的文件的第二操作特征,其中,所述第二操作特征用于指示所述目标对象对所述终端中所存储的指定类型的文件的操作特征。The method according to claim 1, wherein the training data further comprises: a second operation characteristic of the target object on the file stored in the terminal, wherein the second operation characteristic is used to indicate the The operating characteristics of the target object on the specified type of file stored in the terminal.
  5. 根据权利要求4所述的方法,其中,在所述使用第二清理模型对终端中所存储的文件进行分析之前,所述方法还包括:The method according to claim 4, wherein, before said using the second cleaning model to analyze the files stored in the terminal, the method further comprises:
    使用训练数据对第一清理模型进行训练,得到所述第二清理模型,其中,所述使用训练数据对第一清理模型进行训练包括:Training the first cleaning model using training data to obtain the second cleaning model, wherein the training the first cleaning model using the training data includes:
    在所述第二操作特征指示了第五类型文件被访问的频率高于第六类型文件被访问的频率的情况下,训练所述第一清理模型将所述第六类型文件优先确定为待清理文件。In the case where the second operating feature indicates that the frequency of the fifth type of file being accessed is higher than the frequency of accessing the sixth type of file, the first cleaning model is trained to prioritize the sixth type of file to be cleaned document.
  6. 根据权利要求4或5所述的方法,其中,所述第二操作特征是根据所述目标对象对所述终端中所存储的文件的操作行为以及所述操作行为所指向的文件所得到的。The method according to claim 4 or 5, wherein the second operation characteristic is obtained according to the operation behavior of the target object on the file stored in the terminal and the file pointed to by the operation behavior.
  7. 根据权利要求1所述的方法,其中,在所述展示第二清理结果之后,所述方法还包括:The method according to claim 1, wherein, after the displaying the second cleaning result, the method further comprises:
    使用第三清理模型对终端中所存储的文件进行分析,确定所述文件是否需要被清理,其中,所述第三清理模型为使用训练数据对所述第二清理模型进行训练所得到的,所述训练数据包括:所述目标对象对所述第二清理结果的第三操作特征,其中,所述第三操作特征用于指示所述目标对象对所述第二清理结果中指定类型的文件的操作特征;Use the third cleaning model to analyze the files stored in the terminal to determine whether the files need to be cleaned, where the third cleaning model is obtained by training the second cleaning model using training data, so The training data includes: a third operating characteristic of the target object on the second cleaning result, wherein the third operating characteristic is used to indicate that the target object has a specific type of file in the second cleaning result. Operating characteristics;
    在确定所述文件需要被清理的情况下,展示第三清理结果,其中,所述第三清理结果指示了待清理的所述文件。In the case where it is determined that the file needs to be cleaned up, a third cleanup result is displayed, where the third cleanup result indicates the file to be cleaned up.
  8. 根据权利要求4所述的方法,还包括:The method according to claim 4, further comprising:
    使用所述第二清理模型对所述终端中所存储的文件进行分析,确定所述第二清理结果中的所述文件是否需要被发送至除所述终端之外的存储设备;Use the second cleaning model to analyze the files stored in the terminal to determine whether the files in the second cleaning result need to be sent to a storage device other than the terminal;
    在确定所述文件需要被发送至除所述终端之外的存储设备的情况下,展示分析结果,其中,所述分析结果指示了待发送的所述文件。In the case where it is determined that the file needs to be sent to a storage device other than the terminal, the analysis result is displayed, where the analysis result indicates the file to be sent.
  9. 根据权利要求8所述的方法,其中,在所述使用第二清理模型对终端中所存储的文件进行分析之前,所述方法还包括:8. The method according to claim 8, wherein, before said using the second cleaning model to analyze the files stored in the terminal, the method further comprises:
    使用训练数据对第一清理模型进行训练,得到所述第二清理模型,其中,所述使用训练数据对第一清理模型进行训练包括:Training the first cleaning model using training data to obtain the second cleaning model, wherein the training the first cleaning model using the training data includes:
    在所述第一操作特征与所述第二操作特征指示了第七类型文件被清理之前,被发送至除所述终端之外的存储设备的情况下,训练所述第一清理模型将所述第七类型文件优先确定为待发送的文件。In the case where the first operating feature and the second operating feature indicate that the seventh type of file is sent to a storage device other than the terminal before being cleared, the first cleaning model is trained to transfer the The seventh type of file is prioritized as the file to be sent.
  10. 根据权利要求1所述的方法,其中,在使用第二清理模型对所述终端中所存储的文件进行分析之前,所述方法还包括:The method according to claim 1, wherein, before using the second cleaning model to analyze the files stored in the terminal, the method further comprises:
    通过所述终端接收启动信号,其中,所述启动信号用于指示启动所述第二清理模型。A start signal is received through the terminal, where the start signal is used to instruct to start the second cleaning model.
  11. 根据权利要求1所述的方法,其中,所述第二清理结果中,按照所述待清理的所述文件被清理的几率的高低确定所述待清理的所述文件的排序。The method according to claim 1, wherein in the second cleaning result, the order of the files to be cleaned is determined according to the probability of the files to be cleaned being cleaned.
  12. 根据权利要求1所述的方法,其中,在所述展示第二清理结果之后,所述方法还包括:The method according to claim 1, wherein after the displaying the second cleaning result, the method further comprises:
    接收所述目标对象对所述第二清理结果的目标操作;Receiving the target operation of the target object on the second cleaning result;
    根据所述目标操作对所述第二清理结果执行对应的操作。Perform a corresponding operation on the second cleaning result according to the target operation.
  13. 一种文件的处理装置,包括:A file processing device, including:
    分析模块,配置为使用第二清理模型对终端中所存储的文件进行分析,确定所述文件是否需要被清理,其中,所述第二清理模型为使用训练数据对第一清理模型进行训练所得到的,所述训练数据包括:目标对象对第一清理结果的第一操作特征,其中,所述第一清理结果是通过所述第一清理模型对所述终端中所存储的文件进行分析所得到的,所述第一清理结果指示了待清理的文件,所述第一操作特征用于指示所述目标对象对所述第一清理结果中指定类型的文件的操作特征,同一类型的文件具有相同的文件特征;The analysis module is configured to use a second cleaning model to analyze files stored in the terminal to determine whether the files need to be cleaned, wherein the second cleaning model is obtained by training the first cleaning model using training data Yes, the training data includes: a first operation feature of the target object on the first cleaning result, wherein the first cleaning result is obtained by analyzing the files stored in the terminal through the first cleaning model , The first cleaning result indicates the file to be cleaned, and the first operating feature is used to indicate the operating feature of the target object on the file of the specified type in the first cleaning result, and files of the same type have the same File characteristics;
    展示模块,配置为在确定所述文件需要被清理的情况下,展示第二清理结果,其中,所述第二清理结果指示了待清理的所述文件。The display module is configured to display a second cleaning result when it is determined that the file needs to be cleaned, where the second cleaning result indicates the file to be cleaned.
  14. 一种计算机可读存储介质,其中,所述计算机可读存储介质中存储有计算机程序,其中,所述计算机程序在被处理器执行时实现权利要求1至12任一项中所述的方法。A computer-readable storage medium, wherein a computer program is stored in the computer-readable storage medium, wherein the computer program implements the method described in any one of claims 1 to 12 when executed by a processor.
  15. 一种终端,包括存储器和处理器,其中,所述存储器中存储有计算机程序,所述处理器被设置为运行所述计算机程序以执行权 利要求1至12任一项中所述的方法。A terminal includes a memory and a processor, wherein a computer program is stored in the memory, and the processor is configured to run the computer program to execute the method described in any one of claims 1 to 12.
PCT/CN2021/101220 2020-06-22 2021-06-21 File processing method and apparatus, storage medium, and terminal WO2021259197A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010575231.0A CN113900996A (en) 2020-06-22 2020-06-22 File processing method and device, storage medium and terminal
CN202010575231.0 2020-06-22

Publications (1)

Publication Number Publication Date
WO2021259197A1 true WO2021259197A1 (en) 2021-12-30

Family

ID=79186419

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/101220 WO2021259197A1 (en) 2020-06-22 2021-06-21 File processing method and apparatus, storage medium, and terminal

Country Status (2)

Country Link
CN (1) CN113900996A (en)
WO (1) WO2021259197A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117648298B (en) * 2024-01-30 2024-04-16 深圳市华瑞安科技有限公司 Intelligent management system, method and device for tablet personal computer

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108153862A (en) * 2017-12-22 2018-06-12 联想(北京)有限公司 File clean-up methods, devices and systems
CN108959488A (en) * 2018-06-22 2018-12-07 阿里巴巴集团控股有限公司 Safeguard the method and device of Question-Answering Model
CN110232000A (en) * 2018-03-05 2019-09-13 腾讯科技(深圳)有限公司 Data storage and management method and device

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108153862A (en) * 2017-12-22 2018-06-12 联想(北京)有限公司 File clean-up methods, devices and systems
CN110232000A (en) * 2018-03-05 2019-09-13 腾讯科技(深圳)有限公司 Data storage and management method and device
CN108959488A (en) * 2018-06-22 2018-12-07 阿里巴巴集团控股有限公司 Safeguard the method and device of Question-Answering Model

Also Published As

Publication number Publication date
CN113900996A (en) 2022-01-07

Similar Documents

Publication Publication Date Title
US20150248465A1 (en) Method and apparatus for processing history operation records of electronic terminal, and storage medium
CN109976668A (en) Data-erasure method, data deletion apparatus and computer readable storage medium
CN103888837A (en) Video information pushing method and device
US11531658B2 (en) Criterion-based retention of data object versions
KR20160003682A (en) Hydration and dehydration with placeholders
CN103150079A (en) Application icon management method and device based on Firefox OS (Operation System) platform
CN107729483B (en) Classification display method, classification display device, terminal and computer-readable storage medium
CN107016123B (en) File management method and terminal device
CN104965630A (en) Method and system for layout of desktop application icons
CN108416053A (en) Display processing method, TV and the readable storage medium storing program for executing of historical viewings record
CN107729946A (en) Picture classification method, device, terminal and storage medium
CN106354570A (en) Account information copying and pasting method and device
CN107729485B (en) Data management method, device, terminal and computer readable storage medium
US11514949B2 (en) Method and system for long term stitching of video data using a data processing unit
CN105824827A (en) File path storage and local file visiting method and apparatus
WO2021259197A1 (en) File processing method and apparatus, storage medium, and terminal
US20140289626A1 (en) Cloud based audio recording system
CN103123651A (en) Method of rapidly searching multiple same-kind paper, device and mobile equipment
US11200205B2 (en) Displaying an alert and options when deleting a file that is associated with a sequence of files
CN103995640B (en) Information transmitting apparatus and method for sending information
CN110008398B (en) Data classification management recommendation method and device
US10528536B1 (en) Managing data object versions in a storage service
CN103488382A (en) Picture playing method and terminal
CN108021464B (en) Bottom-pocketing processing method and device for application response data
CN110399451B (en) Full-text search engine caching method, system and device based on nonvolatile memory and readable storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21829462

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205 DATED 12/05/2023)

122 Ep: pct application non-entry in european phase

Ref document number: 21829462

Country of ref document: EP

Kind code of ref document: A1