CN116955291A - Intelligent file management method and system - Google Patents
- Publication number
- CN116955291A (application CN202310705793.6A)
- Authority
- CN
- China
- Prior art keywords
- file
- digital
- intelligent
- management method
- files
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/16—File or folder operations, e.g. details of user interfaces specifically adapted to file systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/17—Details of further file system functions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
- G06F16/17—Details of further file system functions
- G06F16/172—Caching, prefetching or hoarding of files
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/906—Clustering; Classification
Abstract
The application discloses an intelligent file management method and system. The method comprises: acquiring metadata information of a received digital file, and parsing the digital file to obtain the service data information recorded in it; extracting a plurality of key features from the service data information; generating a main classification label associated with the digital file based on the metadata information; generating a plurality of extended classification labels associated with the digital file based on the key features; and classifying and sorting the digital file according to the main classification label and the extended classification labels. With this method, digital files uploaded by a user are classified and organized automatically, so the user does not need to spend time organizing them and file organization becomes more efficient. Files can be searched directly by content through the extended classification labels, the user gains a more complete and direct view of the stored files, and the misclassification that manual sorting can cause is avoided.
Description
Technical Field
The application relates to the technical field of automatic data processing and file arrangement, in particular to an intelligent file management method and system.
Background
With the widespread use of cloud storage services, more and more users store data files of many kinds (video, audio, documents, pictures, installation packages, compressed archives, and so on) in the cloud. As the volume of data grows, the problems of sorting and archiving personal files and of finding files quickly become increasingly apparent. Current cloud storage systems organize digital files through tree-shaped directory structures, hierarchical division, custom labels, attribute classification, and similar means for users to review. All of these require the user to spend considerable time and effort on basic definitions, and sorting becomes especially tedious when the files are numerous or varied; few users can afford that much time for manual organization. In addition, traditional file organization offers only basic information such as file (folder) name, category, and size, and performs no further analysis of file content, so what is displayed is limited and incomplete. The user can search for a file only by keyword, which is inconvenient; when a file's name has little to do with its recorded content, finding the target file is extremely inefficient.
Disclosure of Invention
The application aims to provide an intelligent file management method and system that classify and organize stored files automatically in place of manual sorting, and that classify files in detail according to their recorded content for users to review.
In order to achieve the above object, the present application discloses an intelligent file management method, comprising:
acquiring metadata information of the received digital file, and analyzing the digital file to acquire service data information recorded in the digital file;
extracting a plurality of key features in the service data information;
generating a main classification label associated with the digital file based on the metadata information;
generating a plurality of extended classification labels associated with the digital file based on the plurality of key features;
and classifying and sorting the digital file according to the main classification label and the extended classification labels.
Preferably, the metadata information is further supplemented based on big data analysis and the service data information, so as to complete the metadata information of the digital file.
Preferably, association analysis is performed on the metadata information and the service data information of any two digital files to obtain an association between the two digital files based on a shared feature; and when a digital file is displayed, the other digital files associated with it are displayed alongside it.
Preferably, an index is established based on metadata information and service data information of the digital file to obtain a file index library.
Preferably, key information in the service data information is extracted to generate summary information, and when the digital file is displayed, the summary information corresponding to the digital file is displayed at the same time.
Preferably, the digital file is also encrypted by an asymmetric encryption method.
Preferably, the method for parsing the digital file includes:
when the digital file is a video file, converting the video file into a plurality of frames of image files, and identifying the content in each frame of the image file based on an image processing technology so as to generate a text file for recording the content of the video file;
when the digital file is an audio file, converting the audio file into a text file for recording the audio content of the audio file;
and reading the service data information of the digital file from the text file.
The application also discloses an intelligent file management system comprising a system processor, wherein the system processor works based on the intelligent file management method according to any one of claims 1 to 7.
The application also discloses another intelligent file management system, which comprises:
one or more processors;
a memory;
and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the programs comprising instructions for performing the intelligent file management method as described above.
The application also discloses a computer readable storage medium comprising a computer program executable by a processor to perform the intelligent file management method as described above.
Compared with the prior art, the intelligent file management method disclosed in the present application automatically parses the service data information of a digital file once the file is received, and then generates a main classification label and extended classification labels associated with the file from its metadata information and service data information, the extended classification labels being generated from the content recorded in the file. With this method, digital files uploaded by a user are classified and organized automatically, so the user does not need to spend time organizing them and file organization becomes more efficient. Files can be searched directly by content through the extended classification labels, the user gains a more complete and direct view of the stored files, and the misclassification that manual sorting can cause is avoided.
Drawings
FIG. 1 is a flowchart of an intelligent file management method according to an embodiment of the present application.
Detailed Description
In order to describe the technical content, the constructional features, the achieved objects and effects of the present application in detail, the following description is made in connection with the embodiments and the accompanying drawings.
This embodiment discloses an intelligent file management method for classifying and sorting digital files in a file storage system. The file storage system in this embodiment is a cloud storage system, but the method is not limited to cloud storage.
As shown in FIG. 1, the intelligent file management method in this embodiment includes the following steps:
S1: Acquire metadata information of the received digital file, and parse the digital file to obtain the service data information recorded in it. Metadata is data that describes data, that is, descriptive information about data and information resources. For a digital file it includes the file name, file type, creation time, modification time, and so on. The service data information is the content recorded in the digital file. For example, for a text file it is the text recorded in the file, and for a video file it is the video stream, made up of a plurality of image frames, recorded in the file.
S2: Extract a plurality of key features from the service data information. In this step the service data information is in plain-text format, and the key features include keywords or key sentences that describe the topic, field, industry, and similar aspects of the content recorded in digital files such as document, video, image, and audio files.
S3: Generate a main classification label associated with the digital file based on the metadata information, and generate a plurality of extended classification labels associated with the digital file based on the key features.
S4: Classify and sort the digital file according to the main classification label and the extended classification labels.
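Steps S1 to S4 can be illustrated with a minimal Python sketch. Everything specific in it is an assumption made for illustration and is not taken from the application: the extension-to-label table, the keyword lists, and the names `DigitalFile`, `classify`, and `organize` are hypothetical, and simple keyword matching stands in for whatever feature extraction a real system would use.

```python
from collections import defaultdict
from dataclasses import dataclass, field
from pathlib import PurePath

# Hypothetical table: file type (metadata) -> main classification label.
MAIN_LABEL_BY_EXT = {".txt": "document", ".exe": "software", ".jpg": "album"}

# Hypothetical keyword lists; extended labels are subordinate to a main label.
EXTENDED_KEYWORDS = {
    "document": {"E economy": {"market", "inflation", "trade"}},
    "software": {"game": {"game", "player", "level"},
                 "browser": {"browser", "tabs"}},
}


@dataclass
class DigitalFile:
    name: str
    text: str  # service data, assumed to be already converted to plain text
    main_label: str = "others"
    extended_labels: list = field(default_factory=list)


def extract_key_features(text):
    """S2: treat lower-cased words as candidate key features."""
    return {word.strip(".,!?").lower() for word in text.split()}


def classify(f):
    """S1 to S3: read metadata, extract features, attach both label levels."""
    ext = PurePath(f.name).suffix.lower()                 # S1: file type
    f.main_label = MAIN_LABEL_BY_EXT.get(ext, "others")   # S3: main label
    features = extract_key_features(f.text)               # S2: key features
    candidates = EXTENDED_KEYWORDS.get(f.main_label, {})
    f.extended_labels = sorted(                           # S3: extended labels
        label for label, words in candidates.items() if features & words
    )
    return f


def organize(files):
    """S4: group file names by main label, then by extended label."""
    tree = defaultdict(lambda: defaultdict(list))
    for f in map(classify, files):
        for label in f.extended_labels or ["unlabeled"]:
            tree[f.main_label][label].append(f.name)
    return tree
```

Calling `organize` on a list of `DigitalFile` objects returns a two-level mapping that a user interface could render as main labels with their extended labels beneath them.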
In this embodiment, the main classification labels are first-level labels, for example "I watch", "I listen", "album", "document", "software", "others", and so on.
The extended classification labels are second-level labels subordinate to each main classification label. For example, the extended classification labels under the main classification label "I watch" include: "scenario", "science fiction", "action", "comedy", "love", "adventure", "child", "dance", "animation", etc. The extended classification labels under the main classification label "I listen" include: "pop", "rock", "ballad", "electronic", "dance", "talk", "light music", "jazz", "country", etc. The extended classification labels under the main classification label "document" include: "A philosophy, religion"; "B social science general theory"; "C politics, law"; "D military"; "E economy"; "F culture, science, education, sports"; "G language, text"; "H literature"; "I art". The extended classification labels under the main classification label "software" include: "game", "video", "browser", "chat", "input method", "download", etc.
In addition, the extended classification labels for images under "album" include "time", "place", "person", "subject", "activity", and the like. Under the extended classification label "time", photos are classified in the order in which they were taken, so that a user can easily find the photos from a given period. Under "place", photos are classified by the geographic location where they were taken, so that a user can view all photos taken at a given place. Under "person", the people appearing in the photos are classified automatically by means of face recognition technology, so that a user can view all photos of a given person in the album. Under "subject", the subject of each photo, such as food, an animal, or a building, is identified automatically, so that a user can find all photos related to a particular subject. Under "activity", photos are classified by the occasion on which they were taken, such as a wedding or a birthday party, so that a user can view all photos of a given activity.
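As one possible illustration of the "time" and "place" groupings for album images, the sketch below buckets photos by capture month and by place name. It assumes the capture time and place are already available as metadata; the `Photo` record and the month-level granularity are hypothetical choices, not details from the application.

```python
from collections import defaultdict
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class Photo:
    name: str
    taken_at: datetime  # assumed to come from the photo's metadata
    place: str          # assumed to be a resolved place name


def group_album(photos):
    """Group photo names under the "time" and "place" extended labels."""
    by_time = defaultdict(list)
    by_place = defaultdict(list)
    # Process in chronological order so each bucket is time-ordered.
    for photo in sorted(photos, key=lambda p: p.taken_at):
        by_time[photo.taken_at.strftime("%Y-%m")].append(photo.name)
        by_place[photo.place].append(photo.name)
    return {"time": dict(by_time), "place": dict(by_place)}
```

The "person", "subject", and "activity" groupings would need image recognition, which is outside the scope of this sketch.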
Further, the method for analyzing the digital file comprises the following steps:
when the digital file is a video file, converting the video file into a plurality of frame image files, and identifying the content in each frame image file based on an image processing technology so as to generate a text file for recording the content of the video file;
when the digital file is an audio file, converting the audio file into a text file for recording the audio content of the audio file;
and reading the service data information of the digital file from the text file.
In this embodiment, other non-text files are converted into text files, and service data information is generated through the text files, so that the service data information is also in a plain text format, and subsequent processing of the service data information is facilitated.
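The conversion of non-text files to text can be pictured as a dispatch on file kind. The sketch below is only an outline under stated assumptions: the application does not name any recognition technology, so frame splitting, frame description, and speech transcription are passed in as caller-supplied functions, and the stub implementations exist purely so the example runs.

```python
def parse_to_text(kind, payload, *, split_frames, describe_frame, transcribe):
    """Return the plain-text service data for a file of the given kind."""
    if kind == "video":
        # Video -> image frames -> one line of recognized content per frame.
        frames = split_frames(payload)
        return "\n".join(describe_frame(frame) for frame in frames)
    if kind == "audio":
        # Audio -> text recording the audio content.
        return transcribe(payload)
    if kind == "text":
        return payload.decode("utf-8")
    raise ValueError(f"unsupported file kind: {kind!r}")


# Stub recognizers for demonstration only; real ones would wrap image
# recognition and speech-to-text components.
def fake_split(payload):
    return payload.split(b"|")


def fake_describe(frame):
    return f"frame shows {frame.decode()}"


def fake_transcribe(payload):
    return payload.decode().upper()
```

Because every kind ends up as plain text, the later steps (feature extraction, indexing, summarization) only need to handle one format.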
Further, because incomplete metadata for some digital files would hinder the user's review, in this embodiment the metadata information is further supplemented based on big data analysis and the service data information, so as to complete the metadata information of the digital file. In this embodiment, for video files, the metadata information that may be supplemented includes cover, profile, type, time-related information, language, related personnel information, region or geographic location, duration, etc. For audio files, the metadata information that may be supplemented includes cover, style, scene, emotion, theme, language, lyrics, singer, author, duration, etc. For software files, the metadata information that may be supplemented includes cover, category, size, profile, version number, etc.
Of course, if a digital file has no corresponding record in the big data, supplementation of its metadata information is skipped.
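A minimal sketch of the supplementation rule follows, assuming the "big data" is available as a lookup table keyed by normalized title. That table, the `title` field, and the function name `supplement_metadata` are hypothetical; the application does not specify how matching records are found.

```python
def supplement_metadata(metadata, catalog):
    """Fill in missing metadata fields from a reference catalog.

    Existing non-empty values are never overwritten. If the catalog has no
    record for the file, the metadata is returned unchanged.
    """
    key = metadata.get("title", "").strip().lower()
    record = catalog.get(key)
    if record is None:
        return dict(metadata)  # no match: supplementation is skipped
    merged = dict(record)
    merged.update({k: v for k, v in metadata.items() if v not in (None, "")})
    return merged
```

Giving priority to the file's own values is a design choice of this sketch: it treats the reference data as a fallback rather than as a correction.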
On the other hand, association analysis is also performed on the metadata information and the service data information of any two digital files to obtain an association between the two files based on a shared feature. When a digital file is displayed, the other digital files associated with it are displayed alongside it. Specifically, in this embodiment, the association analysis may be performed through natural language processing methods, graph-theoretic methods, machine-learning-based intelligent processing methods, and data mining methods, so as to find the association between two digital files. The association may cover several aspects, such as a shared author or a related theme.
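To make pairwise association analysis concrete, the sketch below links two files when they share an author or when their keyword sets overlap enough. Jaccard similarity and the 0.3 threshold are stand-ins chosen for illustration; the application only names broad families of methods and does not prescribe a measure. The dictionary fields are likewise hypothetical.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity of two collections, treated as sets."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def find_associations(files, threshold=0.3):
    """Return (name, name, reasons) for every associated pair of files."""
    links = []
    for x, y in combinations(files, 2):
        reasons = []
        if x["author"] and x["author"] == y["author"]:
            reasons.append("author")
        if jaccard(x["keywords"], y["keywords"]) >= threshold:
            reasons.append("theme")
        if reasons:
            links.append((x["name"], y["name"], reasons))
    return links
```

Comparing every pair costs quadratic time in the number of files, so a real system would likely narrow the candidate pairs first, for example through an index.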
In still another aspect, to help the user find a target file quickly, an index is established based on the metadata information and service data information of the digital file to obtain a file index library.
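One common way to build such an index is an inverted index, which maps each token to the files containing it. The sketch below is an assumption about how the file index library could be organized, not a description of the applicant's implementation; the record layout (`metadata` and `text` fields) is hypothetical.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Split text into lower-cased word tokens."""
    return re.findall(r"\w+", text.lower())


def build_index(files):
    """Map every token in a file's metadata or content to that file's id."""
    index = defaultdict(set)
    for file_id, record in files.items():
        for value in (*record["metadata"].values(), record["text"]):
            for token in tokenize(str(value)):
                index[token].add(file_id)
    return index


def search(index, query):
    """Return ids of files that contain every token of the query."""
    tokens = tokenize(query)
    if not tokens:
        return set()
    result = set(index.get(tokens[0], ()))
    for token in tokens[1:]:
        result &= index.get(token, set())
    return result
```

Because content tokens are indexed alongside metadata tokens, a file can be found by what it contains even when its name is unrelated to its content.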
In still another aspect, so that a user can browse the basic information of a digital file quickly and find the desired file sooner, the intelligent file management method in this embodiment further extracts key information from the service data information to generate summary information, and displays the corresponding summary information whenever the digital file is displayed. Specifically, keywords, key sentences, or paragraphs in the service data information are extracted by word frequency statistics and other methods, and the summary information of the digital file is generated by splicing these keywords, key sentences, or paragraphs together.
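A word-frequency summary of the kind described can be sketched as follows: score each sentence by the total frequency of its words, keep the highest-scoring sentences, and splice them in their original order. The stop-word list, the sentence splitter, and the English-only tokenizer are simplifying assumptions for this example.

```python
import re
from collections import Counter

# Small illustrative stop-word list; a real system would use a fuller one.
STOPWORDS = {"the", "a", "an", "is", "are", "by", "of", "and", "to", "in"}


def words(text):
    """Lower-cased word tokens with stop words removed."""
    return [w for w in re.findall(r"[a-z']+", text.lower()) if w not in STOPWORDS]


def summarize(text, max_sentences=2):
    """Splice together the sentences whose words are most frequent overall."""
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]
    freq = Counter(words(text))
    scores = [sum(freq[w] for w in words(s)) for s in sentences]
    ranked = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)
    keep = sorted(ranked[:max_sentences])  # restore original sentence order
    return " ".join(sentences[i] for i in keep)
```

Summing raw frequencies favors longer sentences; normalizing by sentence length is a common refinement that this sketch leaves out.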
In yet another aspect, to ensure the security of the stored digital file, the digital file is further encrypted by an asymmetric encryption method.
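The application does not say which asymmetric algorithm is used. Purely to show the public-key and private-key roles, the sketch below implements textbook RSA with tiny primes. It is a toy and is not secure: the key is far too small and there is no padding. In practice a vetted cryptographic library would be used, and files are typically encrypted with a symmetric key that is in turn protected by the public key. All names and parameters here are assumptions.

```python
def make_toy_keypair(p=61, q=53, e=17):
    """Build a textbook RSA key pair from two small primes (insecure toy)."""
    n = p * q
    phi = (p - 1) * (q - 1)
    d = pow(e, -1, phi)  # modular inverse of e; requires Python 3.8+
    return (e, n), (d, n)


def encrypt(data, public_key):
    """Encrypt each byte separately with the public key."""
    e, n = public_key
    return [pow(byte, e, n) for byte in data]


def decrypt(blocks, private_key):
    """Recover the original bytes with the private key."""
    d, n = private_key
    return bytes(pow(block, d, n) for block in blocks)
```

Anyone holding the public key can encrypt a file, but only the holder of the private key can decrypt it, which is the property the asymmetric scheme relies on.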
In summary, the intelligent file management method disclosed in the present application automatically parses the service data information of a digital file uploaded by the user, and then generates a main classification label and extended classification labels associated with the file from its metadata information and service data information. In addition, associations between related digital files are generated from the metadata information and the service data information. With this method, digital files uploaded by a user are classified and organized automatically, so the user does not need to spend time organizing them and file organization becomes more efficient. Files can be searched directly by content through the extended classification labels, the user gains a more complete and direct view of the stored files, and the misclassification that manual sorting can cause is avoided. Furthermore, completing the metadata information of the digital files makes the displayed information more complete and orderly, which makes searching and reviewing easier for the user.
In another preferred embodiment of the present application, an intelligent file management system is also disclosed, the management system includes a system processor, and the system processor works based on the intelligent file management method in the above embodiment.
The present application also discloses another intelligent file management system comprising one or more processors, memory, and one or more programs, wherein one or more programs are stored in the memory and configured to be executed by the one or more processors, the programs comprising instructions for performing the intelligent file management method as described above. The processor may be a general-purpose central processing unit (Central Processing Unit, CPU), microprocessor, application specific integrated circuit (Application Specific Integrated Circuit, ASIC), or one or more integrated circuits for executing related programs to perform the functions required by the modules in the intelligent file management system of the present application or to perform the intelligent file management method of the method embodiment of the present application.
The application also discloses a computer readable storage medium comprising a computer program executable by a processor to perform the intelligent file management method as described above. The computer readable storage medium may be any available medium that can be accessed by a computer, or a data storage device, such as a server or data center, that integrates one or more available media. The available medium may be a read-only memory (ROM), a random-access memory (RAM), a magnetic medium (for example, a floppy disk, a hard disk, or a magnetic tape), an optical medium (for example, a digital versatile disc (DVD)), a semiconductor medium (for example, a solid-state disk (SSD)), or the like.
Embodiments of the present application also disclose a computer program product or computer program comprising computer instructions stored in a computer readable storage medium. The processor of the electronic device reads the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions, so that the electronic device performs the above-described intelligent file management method.
The foregoing description of the preferred embodiments is not intended to limit the scope of the present application, which is defined by the claims that follow.
Claims (10)
1. An intelligent file management method, comprising:
acquiring metadata information of the received digital file, and analyzing the digital file to acquire service data information recorded in the digital file;
extracting a plurality of key features in the service data information;
generating a main classification label associated with the digital file based on the metadata information;
generating a plurality of extended classification labels associated with the digital file based on the plurality of key features;
and classifying and sorting the digital file according to the main classification label and the extended classification labels.
2. The intelligent file management method according to claim 1, wherein said metadata information is supplemented based on big data analysis and said service data information to complete said metadata information of said digital file.
3. The intelligent file management method according to claim 1, wherein association analysis is performed on the metadata information and the service data information of any two digital files to obtain an association between the two digital files based on a shared feature; and when a digital file is displayed, the other digital files associated with it are displayed alongside it.
4. The intelligent file management method according to claim 1, wherein an index is established based on metadata information and service data information of the digital file to obtain a file index library.
5. The intelligent file management method according to claim 1, wherein key information in said service data information is extracted to generate summary information, and when said digital file is displayed, said summary information corresponding thereto is displayed at the same time.
6. The intelligent file management method according to claim 1, wherein said digital file is further encrypted by an asymmetric encryption method.
7. The intelligent file management method according to claim 1, wherein the method of parsing the digital file comprises:
when the digital file is a video file, converting the video file into a plurality of frames of image files, and identifying the content in each frame of the image file based on an image processing technology so as to generate a text file for recording the content of the video file;
when the digital file is an audio file, converting the audio file into a text file for recording the audio content of the audio file;
and reading the service data information of the digital file from the text file.
8. An intelligent file management system comprising a system processor that operates based on the intelligent file management method of any one of claims 1 to 7.
9. An intelligent file management system, comprising:
one or more processors;
a memory;
and one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the programs comprising instructions for performing the intelligent file management method of any of claims 1 to 7.
10. A computer readable storage medium comprising a computer program executable by a processor to perform the intelligent file management method of any of claims 1 to 7.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310705793.6A CN116955291A (en) | 2023-06-14 | 2023-06-14 | Intelligent file management method and system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310705793.6A CN116955291A (en) | 2023-06-14 | 2023-06-14 | Intelligent file management method and system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN116955291A true CN116955291A (en) | 2023-10-27 |
Family
ID=88453874
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310705793.6A Pending CN116955291A (en) | 2023-06-14 | 2023-06-14 | Intelligent file management method and system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN116955291A (en) |
-
2023
- 2023-06-14 CN CN202310705793.6A patent/CN116955291A/en active Pending
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |