CN116955291A - Intelligent file management method and system - Google Patents

Intelligent file management method and system Download PDF

Info

Publication number
CN116955291A
CN116955291A CN202310705793.6A CN202310705793A CN116955291A CN 116955291 A CN116955291 A CN 116955291A CN 202310705793 A CN202310705793 A CN 202310705793A CN 116955291 A CN116955291 A CN 116955291A
Authority
CN
China
Prior art keywords
file
digital
intelligent
management method
files
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310705793.6A
Other languages
Chinese (zh)
Inventor
刘文杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong 115 Technology Co ltd
Original Assignee
Guangdong 115 Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong 115 Technology Co ltd filed Critical Guangdong 115 Technology Co ltd
Priority to CN202310705793.6A priority Critical patent/CN116955291A/en
Publication of CN116955291A publication Critical patent/CN116955291A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/16File or folder operations, e.g. details of user interfaces specifically adapted to file systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/17Details of further file system functions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/17Details of further file system functions
    • G06F16/172Caching, prefetching or hoarding of files
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/906Clustering; Classification

Abstract

The application discloses an intelligent file management method and system, comprising the following steps: acquiring metadata information of the received digital file, and analyzing the digital file to acquire service data information recorded in the digital file; extracting a plurality of key features in the service data information; generating a master classification tag associated with the digital file based on the metadata information; generating a number of expanded classification tags associated with the digital file based on the number of key features; classifying and sorting the digital files according to the main classification labels and the plurality of expansion classification labels; based on the method, the digital files uploaded by the user can be automatically classified and arranged, so that the user does not need to spend time arranging the files, the file arranging efficiency is improved, file items related to file contents can be directly searched according to the expanded classification labels, the user can more comprehensively and directly know the files stored by the user, and error classification caused by manual classification can be avoided.

Description

Intelligent file management method and system
Technical Field
The application relates to the technical field of automatic data processing and file arrangement, in particular to an intelligent file management method and system.
Background
With the widespread use of cloud storage services, more and more users store various data files (including video, audio, documents, pictures, installation packages, compression packages, etc.) in the cloud. With the increasing amount of data, the problems of sorting, sorting and archiving personal file data, and quickly searching files are becoming more obvious. The current cloud storage system sorts the digital files in a tree-shaped directory structure, hierarchical division, custom labels, attribute classification and other modes so as to be used for users to review. In this regard, it takes a lot of time and effort for the user to perform basic definition, and particularly when the number of files is huge or the variety is numerous, sorting becomes more cumbersome. It is difficult for a user to take a lot of time to manually sort through the data. In addition, for the traditional file arrangement mode, only basic information such as file (folder) names, categories, sizes and the like is provided, and further expansion analysis is not performed on file contents, so that the contents are displayed too singly and are not fully displayed. The user can search the file only by the keyword, so that the searching is inconvenient, and particularly when the relevance between the file name and the content recorded in the file is not high, the efficiency of searching the target file by the user is extremely low.
Disclosure of Invention
The application aims to provide an intelligent file management method and system which can replace manual and automatic classification and arrangement of stored files and can be used for purposefully classifying the files in detail based on the content of file records for users to review.
In order to achieve the above object, the present application discloses an intelligent file management method, comprising:
acquiring metadata information of the received digital file, and analyzing the digital file to acquire service data information recorded in the digital file;
extracting a plurality of key features in the service data information;
generating a master classification tag associated with the digital file based on the metadata information;
generating a number of expanded classification tags associated with the digital file based on the number of key features;
and classifying and sorting the digital files according to the main classification labels and the extended classification labels.
Preferably, the metadata information is further supplemented based on a big data analysis mode and the service data information so as to perfect the metadata information of the digital file.
Preferably, the metadata information and the business data information of any two digital files are subjected to association analysis so as to obtain an association relation of the two digital files based on a certain characteristic; and when the digital file is displayed, synchronously displaying other digital files which have association relation with the digital file.
Preferably, an index is established based on metadata information and service data information of the digital file to obtain a file index library.
Preferably, key information in the service data information is extracted to generate summary information, and when the digital file is displayed, the summary information corresponding to the digital file is displayed at the same time.
Preferably, the digital file is also encrypted by an asymmetric encryption method.
Preferably, the method for parsing the digital file includes:
when the digital file is a video file, converting the video file into a plurality of frames of image files, and identifying the content in each frame of the image file based on an image processing technology so as to generate a text file for recording the content of the video file;
when the digital file is an audio file, converting the audio file into a text file for recording the audio content of the audio file;
and reading the business data information of the digital file from the text file.
The application also discloses an intelligent file management system comprising a system processor, wherein the system processor works based on the intelligent file management method according to any one of claims 1 to 7.
The application also discloses another intelligent file management system, which comprises:
one or more processors;
a memory;
and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the programs comprising instructions for performing the intelligent file management method as described above.
The application also discloses a computer readable storage medium comprising a computer program executable by a processor to perform the intelligent file management method as described above.
Compared with the prior art, the intelligent file management method disclosed by the technical scheme of the application automatically analyzes the service data information of the digital file after receiving the digital file, and further generates a main classification label and an extended classification label associated with the digital file through the metadata information and the service data information of the digital file, wherein the extended classification label is generated based on the service data information recorded by the digital file. Therefore, through the management method, the digital files uploaded by the user can be automatically classified and arranged, so that the user does not need to spend time arranging the files, the file arranging efficiency is improved, file items related to file contents can be directly searched according to the expanded classification labels, the user can more comprehensively and directly know the files stored by the user, and error classification caused by manual classification can be avoided.
Drawings
FIG. 1 is a flowchart of an intelligent file management method according to an embodiment of the present application.
Detailed Description
In order to describe the technical content, the constructional features, the achieved objects and effects of the present application in detail, the following description is made in connection with the embodiments and the accompanying drawings.
The embodiment discloses an intelligent file management method for classifying and sorting digital files in a file storage system, wherein the file storage system in the embodiment is a cloud storage, but not limited to the cloud storage.
As shown in fig. 1, the intelligent file management method in this embodiment includes the following steps:
s1: and acquiring metadata information of the received digital file, and analyzing the digital file to acquire service data information recorded in the digital file. Metadata, which is data describing data, is descriptive information on data and information resources, is data existing for describing related information of data, and includes file name, file type, creation time, modification time, etc. of a digital file. The service data information is file content recorded in a digital file, for example, for a certain text file, the service data information is text content recorded in the text file, and for each video file, the service data information is a video stream recorded in the video file and comprising a plurality of frames of images.
S2: extracting a plurality of key features in the service data information. In this step, the service data information is in a plain text format, and the key features include keywords or key sentences for describing related information such as topics, fields, industries, etc. related to the content recorded in digital files such as document files, video files, image files, audio files, etc.
S3: a primary classification tag associated with the digital file is generated based on the metadata information, and a number of extended classification tags associated with the digital file are generated based on the number of key features.
S4: and classifying and sorting the digital files according to the main classification labels and the plurality of expansion classification labels.
In this embodiment, the main category labels are a type of labels, such as including "me watch", "me listen", "album", "document", "software", "others", and so on.
And (5) expanding the classification labels into a plurality of class II labels subordinate to each main classification label. For example, the extended category label under the "me watch" primary category label includes: "scenario", "science fiction", "action", "comedy", "love", "adventure", "child", "dance", "animation", etc. The extended category label under the main category label "i listen" includes: "pop", "rock", "ballad", "electronic", "dance", "talk", "light music", "jazz", "country", etc. The extended category label under the main category label "document" includes: "A philosophy, religion"; "B social science general theory"; "C politics, law"; "D military"; "E economy"; "F culture, science, education, sports"; "G language, text"; "H literature"; "I art". The extended class label under the main class label "software" includes: "game", "video", "browser", "chat", "input method", "download", etc.
In addition, the extended category labels for images under "album" include "time", "place", "person", "subject", and the like. For photographs under the extended category label "time": the photos are classified according to the time sequence of the photo shooting, so that a user can easily find the photos in a certain time period. For photographs under the extended category label located at "place": the photos are classified according to the geographical position of the photos, so that a user can view all the photos taken in a certain place. For photographs under the extended category label of "people": by means of face recognition technology, people appearing in the photos are automatically classified, and a user can view all photos of a person in the photo album. For photographs under the extended category label of "subject": automatically identifying a subject in a photograph, such as a food, animal, building, etc., may allow a user to find all of the photographs that are related to a particular subject. For photographs under the extended category label "active": the photos are classified according to the occasions of shooting photos, such as weddings, birthday parties and the like, so that a user can view all photos of a certain activity.
Further, the method for analyzing the digital file comprises the following steps:
when the digital file is a video file, converting the video file into a plurality of frame image files, and identifying the content in each frame image file based on an image processing technology so as to generate a text file for recording the content of the video file;
when the digital file is an audio file, converting the audio file into a text file for recording the audio content of the audio file;
and reading the business data information of the digital file from the text file.
In this embodiment, other non-text files are converted into text files, and service data information is generated through the text files, so that the service data information is also in a plain text format, and subsequent processing of the service data information is facilitated.
Further, in order to avoid that the metadata information of some digital files is not full and affects the user's review, in this embodiment, the metadata information is further supplemented based on the big data analysis mode and the service data information, so as to perfect the metadata information of the digital files. In this embodiment, for video files, the metadata information that may be supplemented includes covers, profiles, types, time-related information, languages, related personnel information, regional or geographic locations, time durations, etc. For audio files, the metadata information that may be supplemented includes cover, style, scene, emotion, theme, language, lyrics, singer, author, duration, etc. For software files, the metadata information that may be supplemented includes cover, category, size, profile, version number, etc.
Of course, for a digital file that is not in large data, the metadata information of the digital file is discarded from being supplemented.
On the other hand, the association analysis is also carried out on the metadata information and the business data information of any two digital files so as to obtain the association relation of the two digital files based on a certain characteristic. When a digital file is displayed, other digital files having an association relationship with the digital file are synchronously displayed. Specifically, in this embodiment, the association analysis may be performed on the metadata information and the service data information of any two digital files through a natural language processing method, a graph theory method, an intelligent processing method based on machine learning, and a data mining method, so as to find the association relationship between two digital files, where the association relationship includes multiple aspects, such as author related, theme related, and the like of the files.
In still another aspect, for facilitating the user to quickly search for the target file, an index is established based on metadata information and service data information of the digital file to obtain a file index library.
In still another aspect, in order to enable a user to quickly browse basic information of a digital file to increase the speed of review and quickly find a desired file, the intelligent file management method in this embodiment further extracts key information in service data information to generate summary information, and when the digital file is displayed, the summary information corresponding to the digital file is displayed at the same time. Specifically, keywords, key sentences or paragraphs in the business data information are extracted through word frequency statistics and other methods, and the abstract information of the digital file is generated through splicing the keywords, the key sentences or the paragraphs.
In yet another aspect, to ensure the security of the stored digital file, the digital file is further encrypted by an asymmetric encryption method.
In summary, by the intelligent file management method disclosed by the application, the service data information of the digital file uploaded by the user is automatically analyzed, and then the main classification label and the extension classification label associated with the digital file are generated by the metadata information and the service data information of the digital file. And generating the association relation of the two digital files with the association based on the metadata information and the service data information. Therefore, through the management method, the digital files uploaded by the user can be automatically classified and arranged, so that the user does not need to spend time arranging the files, the file arranging efficiency is improved, file items related to file contents can be directly searched according to the expanded classification labels, the user can more comprehensively and directly know the files stored by the user, and error classification caused by manual classification can be avoided. In addition, through perfecting the metadata information of the digital file, the information displayed by the digital file is more complete and orderly, so that the user can search and review conveniently.
In another preferred embodiment of the present application, an intelligent file management system is also disclosed, the management system includes a system processor, and the system processor works based on the intelligent file management method in the above embodiment.
The present application also discloses another intelligent file management system comprising one or more processors, memory, and one or more programs, wherein one or more programs are stored in the memory and configured to be executed by the one or more processors, the programs comprising instructions for performing the intelligent file management method as described above. The processor may be a general-purpose central processing unit (Central Processing Unit, CPU), microprocessor, application specific integrated circuit (Application Specific Integrated Circuit, ASIC), or one or more integrated circuits for executing related programs to perform the functions required by the modules in the intelligent file management system of the present application or to perform the intelligent file management method of the method embodiment of the present application.
The application also discloses a computer readable storage medium comprising a computer program executable by a processor to perform the intelligent file management method as described above. The computer readable storage medium may be any available medium that can be accessed by a computer or a data storage device such as a server, data center, etc. that contains an integration of one or more available media. The usable medium may be a read-only memory (ROM), or a random-access memory (random access memory, RAM), or a magnetic medium, for example, a floppy disk, a hard disk, a magnetic tape, a magnetic disk, or an optical medium, for example, a digital versatile disk (digital versatiledisc, DVD), or a semiconductor medium, for example, a Solid State Disk (SSD), or the like.
Embodiments of the present application also disclose a computer program product or computer program comprising computer instructions stored in a computer readable storage medium. The processor of the electronic device reads the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions, so that the electronic device performs the above-described intelligent file management method.
The foregoing description of the preferred embodiments of the present application is not intended to limit the scope of the claims, which follow, as defined in the claims.

Claims (10)

1. An intelligent file management method, comprising:
acquiring metadata information of the received digital file, and analyzing the digital file to acquire service data information recorded in the digital file;
extracting a plurality of key features in the service data information;
generating a master classification tag associated with the digital file based on the metadata information;
generating a number of expanded classification tags associated with the digital file based on the number of key features;
and classifying and sorting the digital files according to the main classification labels and the extended classification labels.
2. The intelligent file management method according to claim 1, wherein said metadata information is supplemented based on a big data analysis mode and said business data information to perfect said metadata information of said digital file.
3. The intelligent file management method according to claim 1, wherein association analysis is performed on the metadata information and the service data information of any two digital files to obtain an association relationship of the two digital files based on a certain feature; and when the digital file is displayed, synchronously displaying other digital files which have association relation with the digital file.
4. The intelligent file management method according to claim 1, wherein an index is established based on metadata information and service data information of the digital file to obtain a file index library.
5. The intelligent file management method according to claim 1, wherein key information in said service data information is extracted to generate summary information, and when said digital file is displayed, said summary information corresponding thereto is displayed at the same time.
6. The intelligent file management method according to claim 1, wherein said digital file is further encrypted by an asymmetric encryption method.
7. The intelligent file management method according to claim 1, wherein the method of parsing the digital file comprises:
when the digital file is a video file, converting the video file into a plurality of frames of image files, and identifying the content in each frame of the image file based on an image processing technology so as to generate a text file for recording the content of the video file;
when the digital file is an audio file, converting the audio file into a text file for recording the audio content of the audio file;
and reading the business data information of the digital file from the text file.
8. An intelligent file management system comprising a system processor that operates based on the intelligent file management method of any one of claims 1 to 7.
9. An intelligent file management system, comprising:
one or more processors;
a memory;
and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the programs comprising instructions for performing the intelligent file management method of any of claims 1 to 7.
10. A computer readable storage medium comprising a computer program executable by a processor to perform the intelligent file management method of any of claims 1 to 7.
CN202310705793.6A 2023-06-14 2023-06-14 Intelligent file management method and system Pending CN116955291A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310705793.6A CN116955291A (en) 2023-06-14 2023-06-14 Intelligent file management method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310705793.6A CN116955291A (en) 2023-06-14 2023-06-14 Intelligent file management method and system

Publications (1)

Publication Number Publication Date
CN116955291A true CN116955291A (en) 2023-10-27

Family

ID=88453874

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310705793.6A Pending CN116955291A (en) 2023-06-14 2023-06-14 Intelligent file management method and system

Country Status (1)

Country Link
CN (1) CN116955291A (en)

Similar Documents

Publication Publication Date Title
CN106383887B (en) Method and system for collecting, recommending and displaying environment-friendly news data
US9489577B2 (en) Visual similarity for video content
US8788529B2 (en) Information sharing between images
US20080162561A1 (en) Method and apparatus for semantic super-resolution of audio-visual data
JP2013541793A (en) Multi-mode search query input method
KR20110007179A (en) Method and apparatus for searching a plurality of stored digital images
US10572528B2 (en) System and method for automatic detection and clustering of articles using multimedia information
TWI387890B (en) A method of converting a hypertext label language file into a plain text file
Sandhaus et al. Semantic analysis and retrieval in personal and social photo collections
Zaharieva et al. Automated social event detection in large photo collections
WO2015188719A1 (en) Association method and association device for structural data and picture
Liu et al. Event analysis in social multimedia: a survey
US20090125381A1 (en) Methods for identifying documents relating to a market
KR100876214B1 (en) Apparatus and method for context aware advertising and computer readable medium processing the method
KR101651963B1 (en) Method of generating time and space associated data, time and space associated data generation server performing the same and storage medium storing the same
Truong et al. Video search based on semantic extraction and locally regional object proposal
KR101934108B1 (en) Method for clustering and sharing images, and system and application implementing the same method
JP7395377B2 (en) Content search methods, devices, equipment, and storage media
Nixon et al. Multimodal video annotation for retrieval and discovery of newsworthy video in a news verification scenario
CN116955291A (en) Intelligent file management method and system
Burtner et al. Interactive visual comparison of multimedia data through type-specific views
Ruocco et al. Event clusters detection on flickr images using a suffix-tree structure
Choudhury et al. Detecting presence of personal events in twitter streams
Sheba et al. Event detection refinement using external tags for flickr collections
KR20080091738A (en) Apparatus and method for context aware advertising and computer readable medium processing the method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination