CN116955286B - File searching and classifying management method, system and device - Google Patents

File searching and classifying management method, system and device

Info

Publication number
CN116955286B
Authority
CN
China
Prior art keywords
file
search
searching
index
files
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202311205644.XA
Other languages
Chinese (zh)
Other versions
CN116955286A (en)
Inventor
王延国
刘洋洋
杨尚伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhongfu Safety Technology Co Ltd
Original Assignee
Zhongfu Safety Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhongfu Safety Technology Co Ltd
Priority to CN202311205644.XA
Publication of CN116955286A
Application granted
Publication of CN116955286B
Active
Anticipated expiration

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/14Details of searching files based on file metadata
    • G06F16/148File search processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/13File access structures, e.g. distributed indices
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/16File or folder operations, e.g. details of user interfaces specifically adapted to file systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/17Details of further file system functions
    • G06F16/172Caching, prefetching or hoarding of files
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/18File system types
    • G06F16/182Distributed file systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/237Lexical tools
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Abstract

The application provides a file searching and classifying management method, system and device, belonging to the technical field of file searching. The method comprises: starting a file traversal operation and reading search parameters; analyzing the search parameters and updating a search word segmentation library according to the analysis result, the search word segmentation library comprising a full word segmentation library and an incremental word segmentation library; obtaining the traversal result and determining changes to the file system by comparing it with an index library; incrementally updating the index library according to the changes to the file system and the search parameters, and searching through the index; and classifying the files according to file features and storing them in a cache library. The application enables distributed storage, centralized management and quick searching of files.

Description

File searching and classifying management method, system and device
Technical Field
The application relates to the technical field of file searching, in particular to a method, a system and a device for file searching and classification management.
Background
With rising levels of informatization, networked and paperless office work has become widespread, so large numbers of electronic documents are scattered across the office computers in a network. Because electronic documents are easy to transmit, some exist in multiple copies and versions, and staff have difficulty finding a particular document. A method for quickly finding a required document is therefore needed, one that supports quick searching and management of local documents and, through networking, quick searching and classified management of files distributed across the whole network. Document searching and classification are the key modules of such a method.
In the prior art, quick search and classification management of files generally relies on the search tool carried by the system or on a third-party search tool (such as an evaluation tool). Although these search tools can find files, they still have a number of disadvantages. Specifically:
1. When an existing search tool searches files, only file names can be searched, not file contents. Every search must scan the full disk again, which continuously occupies disk IO, degrades host performance, takes a relatively long time, and cannot deliver results to the user promptly. Moreover, only local files can be searched, so files across the whole network cannot be managed in a unified way.
2. When a third-party search tool is used, an index can be provided, but the general-purpose word segmentation library it uses consumes a relatively large amount of space. In addition, search results can only be presented as a list, which is inconvenient to manage.
Disclosure of Invention
In view of the problems in the prior art, the application aims to provide a file searching and classifying management method, system and device that enable distributed storage, centralized management and quick searching of files.
To achieve this aim, the application adopts the following technical scheme:
A file searching and classifying management method comprises the following steps:
starting a file traversal operation and reading search parameters;
analyzing the search parameters, and updating a search word segmentation library according to the analysis result, the search word segmentation library comprising a full word segmentation library and an incremental word segmentation library;
obtaining the traversal result, and determining changes to the file system by comparing it with an index library;
determining a file search mode according to the changes to the file system and the analysis result of the search parameters, and performing the search.
Further, analyzing the search parameters and updating the search word segmentation library according to the analysis result comprises:
comparing the search parameters with the previously executed search parameters;
if the search parameters have changed, extracting the keywords of the search parameters;
generating the incremental word segmentation library from the keywords;
screening out the newly added keywords among the keywords, and adding them to the full word segmentation library.
Further, obtaining the traversal result and determining changes to the file system by comparing it with the index library comprises:
comparing the index library with the traversal result to find, in the file system, newly added files, files modified since the last search, and unchanged files.
Further, determining the file search mode according to the changes to the file system and the analysis result of the search parameters, and performing the search, comprises:
searching the newly added files and the files modified since the last search again using the full word segmentation library, and generating indexes;
for unchanged files, reading the analysis result of the search parameters;
if the search parameters have changed, searching the files using the incremental word segmentation library and incrementally updating the index library;
if the search parameters have not changed, no longer reading the original files, and obtaining the search result directly from the index library.
Further, comparing the index library with the traversal result to find newly added files, files modified since the last search, and unchanged files in the file system comprises:
obtaining the traversal result, and finding newly added files in the file system by the traversal search location, file size and file type; and finding files modified since the last search by a file version check code generated from the file size and modification time.
Further, obtaining the search result directly from the index library comprises:
caching search results in the index library, generating an identification code from the full path of a file to identify the file, and caching the file version check code, keyword hit status and classification status.
Further, the file system reads, analyzes and processes data in units of storage blocks.
Further, determining changes to the file system comprises:
judging whether the data of a storage block has changed by means of the CRC32 check algorithm, so as to determine changes to the file system.
Further, the search parameters include name keywords and content keywords of files.
Further, the index library uses a two-level index to cache the content index of the disk data; the first-level index stores check codes and pointers to index blocks, and the second-level index stores the index contents of the corresponding storage blocks.
Correspondingly, the application also discloses a file searching and classifying management system, which comprises:
a startup module configured to start the file traversal operation and read the search parameters;
a parameter analysis module configured to analyze the search parameters and update the search word segmentation library according to the analysis result, the search word segmentation library comprising a full word segmentation library and an incremental word segmentation library;
a file analysis module configured to obtain the traversal result and determine changes to the file system by comparing it with the index library;
a search module configured to determine the file search mode according to the changes to the file system and the analysis result of the search parameters, and to perform the search.
Correspondingly, the application also discloses a file searching and classifying management device, which comprises:
a memory for storing a computer program;
a processor for implementing the steps of the file searching and classifying management method described above when executing the computer program.
Compared with the prior art, the application has the following beneficial effects:
1. The application builds a search word segmentation library tailored to the user's system and searches with it, which greatly improves search speed.
2. The application indexes and segments only the keywords or feature phrases that the user cares about, which supports searching file contents while saving the disk space occupied by the index library.
3. The application searches through the index and thereby avoids reading the original files; when data must be read, sector data is read in blocks, which improves IO performance, reduces disk reads and writes, and further saves resources.
4. The method searches quickly and has little impact on disk IO, which reduces the impact on host performance and hence on the user's work.
5. When the method is applied to a whole-network file system, files can be classified by feature phrases during searching, and the index libraries on all hosts are collected over the network, achieving distributed storage, centralized management and quick searching of files across the whole network.
It can be seen that the present application has outstanding substantial features and significant advances over the prior art, as well as the benefits of its implementation.
Drawings
In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings that are required to be used in the embodiments or the description of the prior art will be briefly described below, and it is obvious that the drawings in the following description are only embodiments of the present application, and that other drawings can be obtained according to the provided drawings without inventive effort for a person skilled in the art.
Fig. 1 is a flow chart of the method according to an embodiment of the present application.
Fig. 2 is a schematic diagram of storage block check codes according to an embodiment of the present application.
Fig. 3 is a schematic diagram of storage blocks according to an embodiment of the present application.
Fig. 4 is a structural diagram of the system according to an embodiment of the present application.
In the figures: 101, startup module; 102, parameter analysis module; 103, file analysis module; 104, search module.
Detailed Description
The following describes specific embodiments of the present application with reference to the drawings.
The file searching and classifying management method shown in Fig. 1 comprises the following steps:
S1: Start the file traversal operation and read the search parameters.
The search parameters include name keywords and content keywords of files, so the method can search both file names and file contents.
It should be noted in particular that the files searched and managed by the method include both files that currently exist on the host and deleted files.
S2: Analyze the search parameters and update the search word segmentation library according to the analysis result. The search word segmentation library comprises a full word segmentation library and an incremental word segmentation library.
Specifically, the search parameters are compared with the previously executed search parameters. If the search parameters have changed, the keywords of the search parameters are first extracted, and the incremental word segmentation library is then generated from these keywords; finally, the newly added keywords among them are screened out and added to the full word segmentation library.
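The update in step S2 can be sketched as follows. This is a minimal illustration, not the implementation of the application: the function name `update_word_libraries` and the representation of search parameters as plain keyword lists are hypothetical, and the sketch assumes that the incremental word segmentation library holds only the keywords not yet present in the full word segmentation library.

```python
def update_word_libraries(new_params, executed_params, full_library):
    """Compare search parameters and update the word segmentation libraries.

    new_params / executed_params: iterables of keyword strings (assumed form).
    full_library: set of every keyword searched so far; extended in place.
    Returns (params_changed, incremental_library).
    """
    keywords = set(new_params)
    if keywords == set(executed_params):
        # Parameters unchanged: no incremental library is needed.
        return False, set()
    # Assumption: the incremental library holds only the newly added keywords.
    incremental = keywords - full_library
    full_library |= incremental
    return True, incremental


full = {"budget"}
changed, incremental = update_word_libraries(["budget", "contract"], ["budget"], full)
print(changed, incremental, sorted(full))
```

With this layout, an unchanged file only has to be scanned for the keywords in the incremental library, because every other keyword has already been indexed.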
It should be noted that if the file to be searched is a deleted file, its actual data still exists on the disk until it is overwritten, and the search can be performed by reading the sector data directly. However, a system contains a large number of system files, archive files, installation packages and other external files. Such files are numerous and large, are stored at fixed locations, and are not modified after creation; on disk they appear as long runs of contiguous sectors that are written once and never modified again. Based on these characteristics, the method uses the index library to avoid searching these locations repeatedly, which improves search speed. Accordingly, for deleted files, the full word segmentation library and the incremental word segmentation library of the current search are maintained in the same way.
S3: Obtain the traversal result and determine changes to the file system by comparing it with the index library.
Specifically, the index library is compared with the traversal result to find newly added files, files modified since the last search, and unchanged files in the file system.
As an example, for files that currently exist on the host, the file traversal finds newly added files, or files that have not been searched before, by search location, file size and file type, and finds recently modified files by a file version check code generated from the file size and modification time.
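The comparison for existing files can be sketched as follows. The sketch is illustrative only: the description does not state how the file version check code is computed, so deriving it with CRC32 over the size and modification time is an assumption, and the names `version_check_code` and `classify_files` are hypothetical. For brevity, new files are detected by path alone rather than by search location, file size and file type.

```python
import zlib


def version_check_code(size, mtime):
    """Check code from file size and modification time (CRC32 is an assumption)."""
    return zlib.crc32(f"{size}:{mtime}".encode("utf-8"))


def classify_files(traversal, index):
    """Split traversed files into added, modified and unchanged lists.

    traversal: {path: (size, mtime)} from the current file traversal.
    index:     {path: check code} stored in the index library.
    """
    added, modified, unchanged = [], [], []
    for path, (size, mtime) in sorted(traversal.items()):
        code = version_check_code(size, mtime)
        if path not in index:
            added.append(path)        # never indexed before
        elif index[path] != code:
            modified.append(path)     # size or modification time differs
        else:
            unchanged.append(path)    # can be served from the index library
    return added, modified, unchanged
```

In practice the `traversal` mapping could be filled from `os.stat` results (`st_size`, `st_mtime`) while walking the directory tree.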
As an example, for deleted files, changes can be determined from changes in the data of the storage blocks. Specifically, the CRC32 check algorithm is used to judge whether the data of a storage block has changed.
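The storage block check can be sketched with the CRC32 function of the Python standard library. The function name `changed_blocks` and the in-memory `bytes` buffer standing in for raw disk data are hypothetical; a real implementation would read the sectors of each storage block from the device.

```python
import zlib


def changed_blocks(data, block_size, stored_crcs):
    """Return the numbers of storage blocks whose CRC32 differs from the stored one.

    data:        raw bytes standing in for the disk contents (assumption).
    block_size:  size of one storage block in bytes.
    stored_crcs: list of CRC32 values recorded at the previous search.
    """
    changed = []
    for offset in range(0, len(data), block_size):
        block_no = offset // block_size
        crc = zlib.crc32(data[offset:offset + block_size])
        # A block with no stored check code counts as changed (never indexed).
        if block_no >= len(stored_crcs) or stored_crcs[block_no] != crc:
            changed.append(block_no)
    return changed
```

Only the blocks returned by such a check need to be read and searched again; all others can be answered from the index library.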
It should be noted that each storage block is a run of contiguous sectors. Under the NTFS file system, a storage block is an integer multiple of a cluster in size and starts at the first sector of a cluster; under the EXT file system, a storage block corresponds to one block group by default. The file system reads, analyzes and processes data in units of storage blocks, which significantly improves system performance compared with the existing sector-by-sector processing.
S4: Determine the file search mode according to the changes to the file system and the analysis result of the search parameters, and perform the search.
Specifically, newly added files and files modified since the last search are searched again using the full word segmentation library, and indexes are generated for them. For unchanged files, the analysis result of the search parameters obtained in step S2 is read first. If the search parameters have changed, the files are searched using the incremental word segmentation library and the index library is updated incrementally; if the search parameters have not changed, the original files are not read again and the search result is obtained directly from the index library.
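The decision in step S4 reduces to a small dispatch on two inputs. The sketch below is illustrative; the state labels and the returned mode names are hypothetical and chosen only for readability.

```python
def choose_search_mode(file_state, params_changed):
    """Pick the search mode for one file.

    file_state:     "added", "modified" or "unchanged" (result of step S3).
    params_changed: True if the search parameters differ from the last run (step S2).
    """
    if file_state in ("added", "modified"):
        # The file content must be read and indexed with every known keyword.
        return "full_library_search"
    if file_state == "unchanged":
        if params_changed:
            # Only the new keywords need to be searched; the index is extended.
            return "incremental_library_search"
        # Nothing changed: answer from the index without touching the file.
        return "index_lookup"
    raise ValueError(f"unknown file state: {file_state}")
```

Only the first two modes read the original file, and the second reads it for the new keywords alone, which is where the reduction in disk IO comes from.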
It should be noted that this step not only completes the search but also performs word segmentation on the terms the user cares about, and the index library is maintained through each incremental search. Compared with a general-purpose index library, this index library contains only the segmented words that the user cares about, which reduces space consumption and improves retrieval speed. The index library supports two search strategies. In the first, a single group of keywords is set, and the search determines whether any of the keywords occur in a file. In the second, keywords are set in groups, each group corresponding to one file classification and forming a feature phrase; several groups can be set so that files are classified while they are searched.
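The second strategy, classification by grouped keywords, can be sketched as follows. The function name `classify_by_feature_phrases` and the example categories are hypothetical, and the sketch assumes that a single keyword hit is enough to assign a file to a classification; the description does not specify the matching rule.

```python
def classify_by_feature_phrases(text, groups):
    """Return the classifications hit by a text, with the keywords that matched.

    text:   file name or file content as a string.
    groups: {classification: [keywords]}, one feature phrase per classification.
    """
    hits = {}
    for category, keywords in groups.items():
        matched = [kw for kw in keywords if kw in text]
        if matched:  # assumption: one hit is enough to assign the classification
            hits[category] = matched
    return hits


groups = {"finance": ["invoice", "budget"], "hr": ["resume", "payroll"]}
print(classify_by_feature_phrases("Q3 budget and invoice list", groups))
```

The first strategy is the special case of a single group, where only the presence of a hit matters.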
As an example, for files that currently exist on the host, search results are cached in the index library: an identification code generated from the file's full path identifies the file, and the file version check code, keyword hit status and classification status are cached.
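One possible shape for such a cache entry is sketched below. The description does not say how the identification code is derived from the full path, so the truncated SHA-256 digest used here is an assumption, as are the names `CacheEntry`, `file_id` and `cached_result`.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class CacheEntry:
    version_code: int                                 # file version check code
    keyword_hits: dict = field(default_factory=dict)  # keyword -> hit or not
    categories: list = field(default_factory=list)    # classification status


def file_id(full_path):
    """Identification code from the full path (SHA-256 prefix is an assumption)."""
    return hashlib.sha256(full_path.encode("utf-8")).hexdigest()[:16]


def cached_result(cache, full_path, current_version_code):
    """Return the cached entry if the file version is unchanged, otherwise None."""
    entry = cache.get(file_id(full_path))
    if entry is not None and entry.version_code == current_version_code:
        return entry
    return None
```

A `None` result signals that the file is new or modified and must be searched again with the full word segmentation library.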
As an example, for deleted files, when the data of a storage block has changed, the block is searched again and the index is updated; when the data has not changed and there are no new segmented words, the search result is returned directly from the index library. The index library uses a two-level index to cache the content index of the disk data: the first-level index stores check codes and pointers to index blocks, and the second-level index stores the index contents of the corresponding storage blocks.
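The two-level structure can be sketched in memory as follows. This is a simplified illustration under stated assumptions: the class names are hypothetical, the "pointer" is a list position rather than a disk offset, and the index content of a block is reduced to the set of keywords found in it.

```python
import zlib
from dataclasses import dataclass


@dataclass
class FirstLevelEntry:
    check_code: int   # CRC32 of the storage block
    index_block: int  # pointer (here: list position) into the second level


class TwoLevelIndex:
    def __init__(self):
        self.first = {}   # storage block number -> FirstLevelEntry
        self.second = []  # index contents, one item per indexed storage block

    def update(self, block_no, data, keywords):
        """Re-index a block only if its CRC32 changed; return True if re-indexed."""
        crc = zlib.crc32(data)
        entry = self.first.get(block_no)
        if entry is not None and entry.check_code == crc:
            return False  # unchanged block: keep the cached index content
        hits = {kw for kw in keywords if kw.encode("utf-8") in data}
        if entry is None:
            self.second.append(hits)
            self.first[block_no] = FirstLevelEntry(crc, len(self.second) - 1)
        else:
            self.second[entry.index_block] = hits
            entry.check_code = crc
        return True

    def lookup(self, block_no):
        """Return the cached index content of a storage block."""
        return self.second[self.first[block_no].index_block]
```

The sketch does not handle the case where the keywords change while the block data stays the same; in the method described here, that case is covered by the incremental word segmentation library.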
It should be noted that, referring to Figs. 2 and 3, storage blocks can be divided in two modes: fixed division and dynamic division. Fixed division suits the disks of personal office computers, while dynamic division suits the disks of servers dedicated to storing documents. In fixed division, every storage block has the same size, so the start and end sector numbers of each storage block can be obtained by calculation. In dynamic division, each storage block is identified by its starting position; the indexes are reorganized after each search, and contiguous index blocks that have remained unchanged for a long time are merged into one block. Taking a 1 TB disk in fixed division mode with 128 MB storage blocks as an example, 8192 index entries are required; each entry holds a 4-byte check code and a 4-byte hit pointer, so 64 KB of space is required in total.
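The figures in this example can be checked with a short calculation. The sketch interprets the sizes as binary units (1 TB as 2^40 bytes, 128 MB as 2^27 bytes) and assumes 512-byte sectors for the sector-range calculation; the function names are hypothetical.

```python
TIB = 1024 ** 4
MIB = 1024 ** 2
SECTOR_BYTES = 512  # assumption: 512-byte sectors


def first_level_size(disk_bytes, block_bytes, entry_bytes=8):
    """Return (number of index entries, first-level index size in bytes).

    entry_bytes: 4-byte check code plus 4-byte hit pointer.
    """
    entries = disk_bytes // block_bytes
    return entries, entries * entry_bytes


def block_sector_range(block_no, block_bytes, sector_bytes=SECTOR_BYTES):
    """Start and end sector numbers of a storage block under fixed division."""
    sectors_per_block = block_bytes // sector_bytes
    start = block_no * sectors_per_block
    return start, start + sectors_per_block - 1


entries, size = first_level_size(TIB, 128 * MIB)
print(entries, size // 1024)  # index entries, size in KB
```

The calculation gives 8192 entries and 65536 bytes, that is 64 KB, which matches the figures stated above.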
In addition, the method can be applied both to a single host and to a whole-network file system. In the latter case, the file content indexes of all hosts are collected over the network and stored on an index server. This supports quick searching and classified management of local files, and quick searching and classified management of files across the whole network through access to the index server.
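The collection of host indexes on an index server can be sketched as a merge into one keyword-to-location mapping. The description does not specify the transfer protocol or the storage format, so the data shapes and the function name `merge_host_indexes` are assumptions made for illustration.

```python
def merge_host_indexes(host_indexes):
    """Merge per-host content indexes into one whole-network index.

    host_indexes: {host: {path: set of keywords hit in that file}}.
    Returns {keyword: [(host, path), ...]} sorted by host and then path.
    """
    merged = {}
    for host in sorted(host_indexes):
        for path in sorted(host_indexes[host]):
            for keyword in host_indexes[host][path]:
                merged.setdefault(keyword, []).append((host, path))
    return merged


network_index = merge_host_indexes({
    "pc2": {"/b.txt": {"budget"}},
    "pc1": {"/a.txt": {"budget", "contract"}},
})
print(network_index["budget"])
```

A whole-network search then becomes a lookup in the merged mapping, while the files themselves remain on their original hosts.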
Therefore, the application discloses a file searching and classifying management method which can realize the distributed storage, centralized management and quick searching of files.
Correspondingly, as shown in Fig. 4, the application also discloses a file searching and classifying management system, which comprises: a startup module 101, a parameter analysis module 102, a file analysis module 103, and a search module 104.
The startup module 101 is configured to start the file traversal operation and read the search parameters.
The parameter analysis module 102 is configured to analyze the search parameters and update the search word segmentation library according to the analysis result; the search word segmentation library comprises a full word segmentation library and an incremental word segmentation library.
The file analysis module 103 is configured to obtain the traversal result and determine changes to the file system by comparing it with the index library.
The search module 104 is configured to determine the file search mode according to the changes to the file system and the analysis result of the search parameters, and to perform the search.
Correspondingly, the application also discloses a file searching and classifying management device, which comprises:
a memory for storing a computer program;
a processor for implementing the steps of the file searching and classifying management method described above when executing the computer program.
It will be apparent to those skilled in the art that the techniques of the embodiments of the present application may be implemented in software plus a necessary general-purpose hardware platform. Based on this understanding, the technical solution in the embodiments of the present application, in essence or in the part that contributes to the prior art, may be embodied in the form of a software product. The software product is stored in a storage medium capable of storing program code, such as a USB flash drive, a removable hard disk, a read-only memory (ROM), a random access memory (RAM), a magnetic disk or an optical disk, and includes several instructions for causing a computer terminal (which may be a personal computer, a server, a second terminal, a network terminal, or the like) to execute all or part of the steps of the method described in the embodiments of the present application. For parts that are the same or similar between the embodiments in this specification, the embodiments may be referred to one another. In particular, since the terminal embodiment is substantially similar to the method embodiment, its description is relatively brief, and the description of the method embodiment should be consulted for the relevant points.
In the several embodiments provided by the present application, it should be understood that the disclosed systems and methods may be implemented in other ways. For example, the system embodiments described above are merely illustrative: the division into units is merely a division by logical function, and other divisions are possible in actual implementation; for example, multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. In addition, the mutual coupling, direct coupling or communication connection shown or discussed may be an indirect coupling or communication connection through some interface, system or unit, and may be electrical, mechanical or in another form.
The units described as separate components may or may not be physically separate, and components shown as units may or may not be physical units; they may be located in one place or distributed over a plurality of network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional modules in the embodiments of the present application may be integrated in one processing unit, or each module may exist alone physically, or two or more modules may be integrated in one unit.
Similarly, the processing units in the embodiments of the present application may be integrated in one functional module, or each processing unit may exist alone physically, or two or more processing units may be integrated in one functional module.
The application will be further described with reference to the accompanying drawings and specific embodiments. It is to be understood that these examples are illustrative of the present application and are not intended to limit the scope of the present application. Further, it will be understood that various changes and modifications may be made by those skilled in the art after reading the teachings of the application, and equivalents thereof fall within the scope of the application as defined by the claims.

Claims (7)

1. A file searching and classifying management method, comprising:
starting a file traversal operation and reading search parameters;
analyzing the search parameters, and updating a search word segmentation library according to the analysis result, the search word segmentation library comprising a full word segmentation library and an incremental word segmentation library;
obtaining the traversal result, and determining changes to the file system by comparing it with an index library;
determining a file search mode according to the changes to the file system and the analysis result of the search parameters, and performing the search;
wherein obtaining the traversal result and determining changes to the file system by comparing it with the index library comprises:
comparing the index library with the traversal result to find, in the file system, newly added files, files modified since the last search, and unchanged files;
wherein determining the file search mode according to the changes to the file system and the analysis result of the search parameters, and performing the search, comprises:
searching the newly added files and the files modified since the last search again using the full word segmentation library, and generating indexes;
for unchanged files, reading the analysis result of the search parameters;
if the search parameters have changed, searching the files using the incremental word segmentation library and incrementally updating the index library;
if the search parameters have not changed, no longer reading the original files, and obtaining the search result directly from the index library;
wherein analyzing the search parameters and updating the search word segmentation library according to the analysis result comprises:
comparing the search parameters with the previously executed search parameters;
if the search parameters have changed, extracting the keywords of the search parameters;
generating the incremental word segmentation library from the keywords;
screening out the newly added keywords among the keywords, and adding them to the full word segmentation library.
2. The file searching and classifying management method according to claim 1, wherein comparing the index library with the traversal result to find newly added files, files modified since the last search, and unchanged files in the file system comprises:
obtaining the traversal result, and finding newly added files in the file system by the traversal search location, file size and file type; and finding files modified since the last search by a file version check code generated from the file size and modification time.
3. The file searching and classifying management method according to claim 1, wherein obtaining the search result directly from the index library comprises:
caching search results in the index library, generating an identification code from the full path of a file to identify the file, and caching the file version check code, keyword hit status and classification status.
4. The file searching and classifying management method according to claim 1, wherein the file system reads, analyzes and processes data in units of storage blocks;
and determining changes to the file system comprises:
judging whether the data of a storage block has changed by means of the CRC32 check algorithm, so as to determine changes to the file system.
5. The file searching and classifying management method according to claim 1, wherein the search parameters include: name keywords and content keywords of files;
the index library uses a two-level index to cache the content index of the disk data; the first-level index stores check codes and pointers to index blocks, and the second-level index stores the index contents of the corresponding storage blocks.
6. A document searching and sorting management system, comprising:
the starting module is configured to start traversing operation of the file and read search parameters;
the parameter analysis module is configured to analyze the search parameters and update the search word bank according to the analysis result; the search word stock comprises a full-quantity word stock and an increment word stock;
the file analysis module is configured to acquire a traversing result and determine the change of a file system by comparing an index library;
the searching module is configured to determine a file searching mode according to the change of the file system and the analysis result of the searching parameters and search the file;
the file analysis module is specifically configured to:
finding, by comparing the index library with the traversal result, newly added files, files modified since the last search, and unchanged files in the file system;
the searching module is specifically configured to:
searching the newly added files and the files modified since the last search again using the full-quantity word stock, and generating indexes;
for unchanged files, reading the analysis result of the search parameters;
if the search parameters have changed, searching the file using the increment word stock, and incrementally updating the index library;
if the search parameters have not changed, the original file is not read again, and the search result is obtained directly from the index library;
the parameter analysis module is specifically configured to:
comparing the search parameter with the executed search parameter;
if the search parameters are changed, extracting keywords of the search parameters;
generating the increment word stock according to the keywords;
and screening out the newly added keywords among the keywords, and adding the newly added keywords to the full-quantity word stock.
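The interplay between the parameter analysis module and the searching module in this claim can be sketched as below. `Lexicon`, `SearchMode` and `choose_mode` are hypothetical names, and the sketch assumes that the search parameters count as "changed" only when they contain keywords not yet in the full-quantity word stock; how removed keywords are handled is not stated in the claim and is not modelled.

```python
from enum import Enum
from typing import Iterable, Set


class SearchMode(Enum):
    FULL = "full"                # search the file with the full-quantity word stock
    INCREMENTAL = "incremental"  # search the file with the increment word stock only
    CACHED = "cached"            # take the result directly from the index library


class Lexicon:
    """Search word stock made of a full-quantity part and an increment part."""

    def __init__(self) -> None:
        self.full: Set[str] = set()
        self.incremental: Set[str] = set()

    def update(self, keywords: Iterable[str]) -> bool:
        """Analyse the current keywords; return True if the parameters changed."""
        new_keywords = set(keywords) - self.full
        self.incremental = new_keywords   # keywords not searched before
        self.full |= new_keywords         # screened-out new keywords join the full stock
        return bool(new_keywords)


def choose_mode(file_state: str, params_changed: bool) -> SearchMode:
    """Pick the search mode from the file state and the parameter analysis."""
    if file_state in ("new", "modified"):
        return SearchMode.FULL
    if params_changed:
        return SearchMode.INCREMENTAL
    return SearchMode.CACHED
```

For an unchanged file and unchanged parameters the function returns `SearchMode.CACHED`, which corresponds to the case where the original file is not read at all.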
7. A file searching and classifying management device, comprising:
a memory for storing a computer program;
a processor for implementing the steps of the file searching and classifying management method according to any one of claims 1 to 5 when executing the computer program.
CN202311205644.XA 2023-09-19 2023-09-19 File searching and classifying management method, system and device Active CN116955286B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202311205644.XA CN116955286B (en) 2023-09-19 2023-09-19 File searching and classifying management method, system and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202311205644.XA CN116955286B (en) 2023-09-19 2023-09-19 File searching and classifying management method, system and device

Publications (2)

Publication Number Publication Date
CN116955286A (en) 2023-10-27
CN116955286B (en) 2023-12-15

Family

ID=88454890

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202311205644.XA Active CN116955286B (en) 2023-09-19 2023-09-19 File searching and classifying management method, system and device

Country Status (1)

Country Link
CN (1) CN116955286B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107229649A (en) * 2016-03-25 2017-10-03 腾讯科技(深圳)有限公司 Data update system and method
CN108846121A (en) * 2018-06-27 2018-11-20 中国建设银行股份有限公司 A kind of data search method and device
CN110162522A (en) * 2019-05-22 2019-08-23 武汉市公安局 A kind of distributed data search system and method
CN113535642A (en) * 2021-08-05 2021-10-22 统信软件技术有限公司 File searching method and computing device

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
NO327653B1 (en) * 2007-12-20 2009-09-07 Fast Search & Transfer As Procedure for dynamically updating an index and a search engine that implements the same
CN102023991A (en) * 2009-09-21 2011-04-20 中兴通讯股份有限公司 Method and device for updating indexes on terminal and sorting search results on the basis of updated indexes

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
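Overview of petabyte-scale file system search (P级文件系统搜索概述); Zhang Yuzhi (张妤芝); Liu Haitao (刘海涛); Journal of Computer Applications (计算机应用) (03); full text *
Research on a standards full-text retrieval system based on Solr technology (基于Solr技术的标准全文检索系统研究); Yu Yang (于阳); Computer Programming Skills & Maintenance (电脑编程技巧与维护) (05); full text *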

Also Published As

Publication number Publication date
CN116955286A (en) 2023-10-27

Similar Documents

Publication Publication Date Title
US20040205044A1 (en) Method for storing inverted index, method for on-line updating the same and inverted index mechanism
KR101972645B1 (en) Clustering storage method and device
US10678654B2 (en) Systems and methods for data backup using data binning and deduplication
CN107577436B (en) Data storage method and device
US9575972B2 (en) Contiguous file allocation in an extensible file system
US7853598B2 (en) Compressed storage of documents using inverted indexes
US20070124277A1 (en) Index and Method for Extending and Querying Index
US20150363447A1 (en) Minimizing index maintenance costs for database storage regions using hybrid zone maps and indices
US20030120647A1 (en) Method and apparatus for indexing document content and content comparison with World Wide Web search service
US8472289B2 (en) Static TOC indexing system and method
US8225060B2 (en) Data de-duplication by predicting the locations of sub-blocks within the repository
CN113535670B (en) Virtual resource mirror image storage system and implementation method thereof
CN110569245A (en) Fingerprint index prefetching method based on reinforcement learning in data de-duplication system
Amur et al. Design of a write-optimized data store
Čech et al. Pivot-based approximate k-NN similarity joins for big high-dimensional data
Tulkinbekov et al. CaseDB: Lightweight key-value store for edge computing environment
US8156126B2 (en) Method for the allocation of data on physical media by a file system that eliminates duplicate data
EP3343395B1 (en) Data storage method and apparatus for mobile terminal
CN116955286B (en) File searching and classifying management method, system and device
CN111831691A (en) Data reading and writing method and device, electronic equipment and storage medium
US8775746B2 (en) Information processing system and method
US7822736B2 (en) Method and system for managing an index arrangement for a directory
Chawathe Effective whitelisting for filesystem forensics
CN116048396B (en) Data storage device and storage control method based on log structured merging tree
CN113641681B (en) Space self-adaptive mass data query method

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant