WO2016169152A1 - File acceleration method and device - Google Patents

File acceleration method and device

Info

Publication number
WO2016169152A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
file
index
data file
data
Prior art date
Application number
PCT/CN2015/086404
Other languages
English (en)
French (fr)
Inventor
谢东
刘君
喻红宇
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司
Publication of WO2016169152A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10 File systems; File servers
    • G06F16/14 Details of searching files based on file metadata
    • G06F16/148 File search processing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10 File systems; File servers
    • G06F16/13 File access structures, e.g. distributed indices
    • G06F16/137 Hash-based

Definitions

  • This application relates to, but is not limited to, the field of communications.
  • The file system is responsible for organizing and allocating the space of the file storage device, managing file storage, and protecting and retrieving the stored files.
  • In the related art, when storing a data file, the file system stores the data file in a directory only by the name or node of the data file.
  • As a result, a query can only be performed by name or node, and data files are queried inefficiently.
  • This application proposes a file acceleration method and device to solve the technical problem of inefficient data file queries and achieve efficient and orderly management.
  • a file acceleration method comprising:
  • when an index creation instruction is received, extracting association information of each data file to be indexed in the file system, where the association information includes attribute information and/or content information of the data file;
  • Index information of each of the data files is generated based on the extracted association information.
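  • when the association information is the attribute information of the data file, the step of generating the index information of each of the data files based on the extracted association information includes: computing the attribute information of each data file according to a preset algorithm, determining location information of each data file in the file system, and generating the index information of each data file from the location information;
  • the preset algorithm is a hash algorithm;
  • the location information is a pointer location;
  • when the association information is the content information of the data file, the step of generating the index information of each of the data files based on the extracted association information includes:
  • each row of each data file is segmented by the column separator of the data file to generate the data segments of each data file;
  • the content information corresponding to each data segment and its address information are obtained;
  • index information of each of the data files is generated based on the content information corresponding to each data segment and its address information;
  • when the association information is both the attribute information and the content information of the data file, the step of generating the index information of each of the data files based on the extracted association information includes: extracting the attribute information of each data file, determining location information of each data file in the file system from the extracted attribute information, and generating the index information of each data file from the location information;
  • in each data file, each row is segmented by the column separator of the data file to generate the data segments of each data file; the content information corresponding to each data segment and its address information are obtained; the location information of each piece of content information in the corresponding data file is determined; and index information of each of the data files is generated based on the location information and the content information.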
  • the file acceleration method further includes:
  • When a new data file is detected in the file system, the new data file is used as a data file to be indexed to trigger an index creation instruction.
  • the file acceleration method further includes:
  • a file acceleration device comprising:
  • the extraction module is configured to: when the creation instruction of the index is received, extract association information of each data file of the index to be created in the file system, where the association information includes attribute information and/or content information of the data file;
  • a generating module configured to: generate index information of each of the data files based on the extracted association information.
  • the generating module includes:
  • a processing unit configured to: segment each row of each data file separately by a column separator of the data file to generate each data segment of each data file;
  • the obtaining unit is configured to: obtain content information corresponding to each data segment and address information thereof;
  • the generating unit is configured to: generate index information of each of the data files based on content information corresponding to each of the data segments and address information thereof.
  • the file acceleration device further includes:
  • the triggering module is configured to: when a new data file is detected in the file system, use the new data file as a data file to be indexed to trigger an index creation instruction.
  • the file acceleration device further includes:
  • the processing module is configured to: when receiving the data file query instruction, analyze the received query instruction, and extract a preset index item based on the query instruction;
  • the extracting module is further configured to: extract the keyword corresponding to the index item based on the extracted index item;
  • an obtaining module configured to: obtain corresponding index information based on the extracted keywords, and acquire the data file corresponding to the index information according to the index information.
  • a computer readable storage medium storing computer executable instructions for performing the method of any of the above.
  • With the file acceleration method and device, when an index creation instruction is received, association information of each data file to be indexed in the file system is extracted, and index information of each data file is generated based on the extracted association information. When a user queries a data file and its content, the index information directly locates the corresponding data file and the position of its content, so the data file and its content can be queried quickly, the query efficiency of data files improves, and efficient and orderly management is achieved.
  • FIG. 1 is a schematic flow chart of a first embodiment of a file acceleration method according to the present invention
  • FIG. 2 is a schematic diagram of a refinement process of step S20 in FIG. 1;
  • FIG. 3 is a schematic flowchart of a second embodiment of a file acceleration method according to the present invention.
  • FIG. 4 is a schematic diagram of functional modules of a first embodiment of a file acceleration device according to the present invention.
  • FIG. 5 is a schematic diagram of a refinement function module of the generation module 20 in FIG. 4;
  • FIG. 6 is a schematic diagram of functional modules of a second embodiment of a file acceleration device according to the present invention.
  • Embodiments of the present invention provide a file acceleration method.
  • FIG. 1 is a schematic flowchart diagram of a first embodiment of a file acceleration method according to the present invention.
  • step S10 when the creation instruction of the index is received, the association information of each data file of the index to be created in the file system is extracted, and the association information includes attribute information and/or content information of the data file;
  • When a new data file is detected in the file system, the new data file is used as a data file to be indexed to trigger an index creation instruction. The index creation instruction can be triggered in the following ways:
  • In the timed trigger mode, the terminal searches the file system once per preset time period, for example every 12 hours. If a new data file is detected within the preset time period, the new data file found is used as the data file to be indexed to trigger the index creation instruction.
  • In the manual trigger mode, when a new data file appears in the file system, the terminal outputs prompt information on a preset display interface. The prompt may be a prompt window on the preset interface (for example, displaying "A new data file appears in the system!") or a shaking window in a preset area of the terminal interface, telling the user that the file system has a new data file. When it is detected that the user touches the preset index creation icon, the new data file is used as the data file to be indexed and the index creation instruction is triggered.
  • the method further includes presetting the file system management mode and configuring the file system operating parameters, which include: a. basic parameters of the file system, such as the update mode of the index, the way system memory is allocated, the number of system threads, and whether the index runs in the background when the system starts; b. index-related parameters, such as the index storage location and the data file types the index covers (for example, which types of folders the index includes or excludes).
  • the terminal can extract the data statistics generated while the system runs and present them to the user. By analyzing these statistics, the user can optimize the management mode of the file system and configure the file system operating parameters so that the system runs in an optimal state.
  • the data file includes a voicemail data file, a system log file, a text file, or a binary file.
  • the attribute information includes the name, size, creation time, or type of the data file, and the content information includes key fields of the content of the data file.
  • Step S20 Generate index information of each of the data files based on the extracted association information.
  • the embodiment of generating the index information of each of the data files based on the extracted association information includes:
  • when the extracted association information is the attribute information of the data files, each piece of attribute information may be computed by a preset algorithm, for example a hash algorithm, to determine location information of each data file in the file system, and index information of each data file is generated from the location information. For example, the attribute information is computed with a preset hash algorithm, the position pointer of the corresponding index data on the index tree is obtained from the result, and the index data is added at the obtained position pointer and saved, generating the index information of each data file.
  • In this case, when a data file is accessed, it can be located through the index information associated with its attribute information. For example, the user enters preset index information and triggers a query instruction, and the data file corresponding to the index information is quickly located according to the association between the index information and the data file.
  • the step S20 includes:
  • Step S21: segment each line of each data file by the column separator of the data file to generate the data segments of each data file;
  • in this embodiment, the extracted association information is the content information of the data file, for example the content information of a log file. Because the rows of the log file have a fixed format, each line can be split by the preset column separator into a preset number of data segments, generating the data segments of each data file.
  • Typical log content includes the following fields: time, log level, calling number, message number, operation type, reason for failure, etc., where the content information includes the following fields: time, calling number, message number, and the like.
  • Step S22 acquiring content information corresponding to each data segment and address information thereof;
  • Step S23 Generate index information of each of the data files based on content information corresponding to each of the data segments and address information thereof.
  • each data file is first acquired and segmented to obtain its data segments; the key content in each data segment and the address information corresponding to each key content are then extracted, and the correspondence between each piece of content information and its address information is determined. That is, the pointer position of each piece of content information in the corresponding data file is obtained, and each piece of content information is written at the obtained pointer position to generate the index information of each data file.
  • each data file may correspond to one index, or may correspond to multiple indexes. When the index corresponding to the data file is multiple indexes, index information of each data file is generated according to multiple indexes.
  • when the extracted association information includes both the attribute information and the content information of the data file, the attribute information of each data file may be extracted first, and the location information of each data file in the file system is determined from the extracted attribute information to generate the index information of each data file; that is, the pointer position of each data file in the file system is obtained, and the index information of each data file is generated according to the obtained pointer position. Within each data file, each line may then be split by the preset column separator into a preset number of data segments.
  • the location information may be a pointer position, and each piece of content information is written at the obtained pointer position to generate the index information of each data file.
  • the data files to be indexed are opened in turn, the attribute information or content information is extracted from each opened file, and the index information of the corresponding data file is created from that attribute or content information.
  • there may be more than one piece of attribute information, and a piece of index information may be a combination of several attributes; similarly, there may be more than one piece of content information, and a piece of index information may be a combination of several contents.
  • With the file acceleration method of this embodiment, when an index creation instruction is received, association information of each data file to be indexed in the file system is extracted, and index information of each data file is generated based on the extracted association information. When a user queries a data file and its content, the index information directly locates the corresponding data file and the position of its content, so the data file and its content can be queried quickly, the query efficiency of data files improves, and efficient and orderly management is achieved.
  • the file acceleration method further includes:
  • Step S30 when receiving the data file query instruction, analyzing the received query instruction, and extracting a preset index item based on the query instruction;
  • Step S40 extracting keywords corresponding to the index items based on the extracted index items
  • Step S50 Acquire corresponding index information based on the extracted keywords.
  • Step S60 Acquire a data file corresponding to the index information according to the index information.
  • an optional method is: when the data query instruction is received, the index item corresponding to the query instruction is extracted; if the query instruction is a name query, the index item is the file name of the data file. The keyword corresponding to the index item is then extracted (for example, the keyword "system"), the corresponding index information is obtained according to the keyword, and the data file corresponding to the index information is obtained according to the index information.
  • With pre-created indexes, data files can be quickly queried, sorted, and presented, which solves the problem that query and sort operations take a long time during file use and improves the efficiency of querying log data files. For example, when a data file is accessed, the index information is used first to locate the corresponding data file and jump to it quickly, which makes user queries easier.
  • For example, when searching for a file by attribute information such as the file name, the data file can be located directly in the file system through the index; when querying log data content, the relevant index information is found according to the query condition, the location index is found from the index information, and the data content is then accessed directly through the location index. Because the index structure can be accessed efficiently, accessing the data content is more efficient than with existing technology.
  • the received query instruction is analyzed and keywords are extracted based on it; there may be one or more keywords. When multiple keywords are extracted, the corresponding index information is obtained based on all of them and the data is queried through the index, so the result for multiple keywords can be obtained in a single pass, making data file queries more efficient.
  • Embodiments of the present invention provide a file acceleration apparatus.
  • FIG. 4 is a schematic diagram of functional modules of a first embodiment of a file acceleration device according to the present invention.
  • the embodiment provides a file acceleration device, and the file acceleration device includes:
  • the extraction module 10 is configured to: when receiving the creation instruction of the index, extract association information of each data file of the index to be created in the file system, where the association information includes attribute information and/or content information of the data file;
  • the file acceleration device further includes a triggering module configured to: when a new data file is detected in the file system, use the new data file as a data file to be indexed to trigger an index creation instruction. The index creation instruction can be triggered in the following ways:
  • In the timed trigger mode, the file system is searched once per preset time period, for example every 12 hours. If a new data file is detected within the preset time period, the new data file found is used as the data file to be indexed to trigger the index creation instruction.
  • In the manual trigger mode, when a new data file appears in the file system, prompt information is output on the preset display interface. The prompt may be a prompt window on the preset interface (for example, displaying "A new data file appears in the system!") or a shaking window in a preset area of the terminal interface, telling the user that the file system has a new data file. When it is detected that the user touches the preset index creation icon, the new data file is used as the data file to be indexed and the index creation instruction is triggered.
  • the management mode of the file system is preset and the operating parameters of the file system are configured, including: a. basic parameters of the file system, such as the update mode of the index, the system memory allocation mode, the number of system threads, and whether the index runs in the background when the system starts; b. index-related parameters, such as the index storage location and the data file types the index covers (for example, which types of folders the index includes or excludes).
  • the extraction module 10 may extract data statistics generated during system operation, and display the generated data statistics information to the user. By analyzing these statistics, the user can optimize the management of the file system and configure the file system operating parameters to optimize the system running status.
  • the data file includes a voicemail data file, a system log file, a text file, or a binary file.
  • the attribute information includes the name, size, creation time, or type of the data file, and the content information includes key fields of the content of the data file.
  • the generating module 20 is configured to: generate index information of each of the data files based on the extracted related information.
  • the embodiment that the generating module 20 generates the index information of each of the data files based on the extracted association information includes:
  • when the extracted association information is the attribute information of the data files, each piece of attribute information may be computed by a preset algorithm, for example a hash algorithm, to determine location information of each data file in the file system, and the generating module 20 generates index information of each data file from the location information. For example, the attribute information is computed with a preset hash algorithm, the position pointer of the corresponding index data on the index tree is obtained from the result, and the index data is added at the obtained position pointer and saved, so that the generating module 20 generates the index information of each data file.
  • In this case, when a data file is accessed, it can be located through the index information associated with its attribute information. For example, the user enters preset index information and triggers a query instruction, and the data file corresponding to the index information is quickly located according to the association between the index information and the data file.
  • the generating module 20 includes:
  • the processing unit 21 is configured to: segment each line of each data file by the column separator of the data file to generate the data segments of each data file;
  • the processing unit 21 may split each line by the preset column separator into a preset number of data segments to generate the data segments of each data file.
  • Those skilled in the art will understand that in the voice mail system the log file is structured: its content has a fixed table format with clear line and column separators.
  • Typical log content includes the following fields: time, log level, calling number, message number, operation type, reason for failure, etc., where the content information includes the following fields: time, calling number, message number, and the like.
  • the obtaining unit 22 is configured to: obtain content information corresponding to each data segment and address information thereof;
  • the generating unit 23 is configured to generate index information of each of the data files based on the content information corresponding to each of the data segments and the address information thereof.
  • the obtaining unit 22 first acquires each data file and segments it to obtain the data segments of each data file, then extracts the key content in each data segment and the address information corresponding to each key content, and determines the correspondence between each piece of content information and its address information. That is, the obtaining unit 22 obtains the pointer position of each piece of content information in the corresponding data file and writes each piece of content information at the obtained pointer position, so that the generating unit 23 generates the index information of each data file.
  • It can be understood that each data file may correspond to one index or to multiple indexes. When a data file corresponds to multiple indexes, the generating unit 23 generates the index information of the data file according to the multiple indexes.
  • when the association information extracted by the extraction module 10 includes both the attribute information and the content information of the data file, the extraction module 10 first extracts the attribute information of each data file and determines, from each piece of extracted attribute information, the location information of the corresponding data file in the file system, so that the generating unit 23 generates the index information of each data file; that is, the extraction module 10 obtains the pointer position of each data file in the file system, and the generating unit 23 generates the index information of each data file according to the obtained pointer position. Within each data file, the processing unit 21 splits each line into a preset number of data segments according to the preset column separator; the obtaining unit 22 obtains the content information and address information of those data segments and determines the location information of each piece of content information in the corresponding data file; and the generating unit 23 generates the index information of each data file based on the location information and the content information.
  • the data files to be indexed are opened in turn, the attribute information or content information is extracted from each opened file, and the index information of the corresponding data file is created from that attribute or content information.
  • there may be more than one piece of attribute information, and a piece of index information may be a combination of several attributes; similarly, there may be more than one piece of content information, and a piece of index information may be a combination of several contents.
  • With the file acceleration device of this embodiment, when an index creation instruction is received, association information of each data file to be indexed in the file system is extracted, and index information of each data file is generated based on the extracted association information. When a user queries a data file and its content, the index information directly locates the corresponding data file and the position of its content, so the data file and its content can be queried quickly, the query efficiency of data files improves, and efficient and orderly management is achieved.
  • the file acceleration device further includes:
  • the processing module 30 is configured to: when receiving the data file query instruction, analyze the received query instruction, and extract a preset index item based on the query instruction;
  • the extracting module 10 is further configured to: extract the keyword corresponding to the index item based on the extracted index item;
  • the obtaining module 40 is configured to: acquire corresponding index information based on the extracted keywords; and acquire a data file corresponding to the index information according to the index information.
  • when receiving the data query instruction, the processing module 30 extracts the index item corresponding to the query instruction; if the query instruction is a name query, the index item is the file name of the data file. The extracting module 10 then extracts the keyword corresponding to the index item (for example, the keyword "system"), and the corresponding index information is obtained according to the keyword, so that the obtaining module 40 acquires the data file corresponding to the index information.
  • With pre-created indexes, data files can be quickly queried, sorted, and presented, which solves the problem that query and sort operations take a long time during file use and improves the efficiency of querying log data files. For example, when a data file is accessed, the index information is used first to locate the corresponding data file and jump to it quickly, which makes user queries easier.
  • For example, when searching for a file by attribute information such as the file name, the data file can be located directly in the file system through the index; when querying log data content, the relevant index information is found according to the query condition, the location index is found from the index information, and the data content is then accessed directly through the location index. Because the index structure can be accessed efficiently, accessing the data content is more efficient than with existing technology.
  • when receiving the data file query instruction, the processing module 30 analyzes it and extracts keywords based on it; there may be one or more keywords. When multiple keywords are extracted, the obtaining module 40 obtains the corresponding index information based on all of them and queries the data through the index, so the result for multiple keywords can be obtained in a single pass, making data file queries more efficient.
  • all or part of the steps of the above embodiments may also be implemented using integrated circuits. The steps may be fabricated as individual integrated circuit modules, or several modules or steps may be fabricated as a single integrated circuit module.
  • the devices, function modules, and functional units in the above embodiments can be implemented on general-purpose computing devices; they can be concentrated on a single computing device or distributed over a network of multiple computing devices.
  • When the devices, function modules, and functional units in the above embodiments are implemented as software function modules and sold or used as stand-alone products, they can be stored in a computer readable storage medium.
  • the above mentioned computer readable storage medium may be a read only memory, a magnetic disk or an optical disk or the like.
  • With the file acceleration method and device, when an index creation instruction is received, association information of each data file to be indexed in the file system is extracted, and index information of each data file is generated based on the extracted association information. When a user queries a data file and its content, the index information directly locates the corresponding data file and the position of its content, so the data file and its content can be queried quickly, the query efficiency of data files improves, and efficient and orderly management is achieved.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

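A file acceleration method and apparatus. The method includes: upon receiving an index creation instruction, extracting association information of each data file to be indexed in a file system, the association information including attribute information and/or content information of the data file (S10); and generating index information for each data file based on the extracted association information (S20).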

Description

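File Acceleration Method and Apparatus
Technical Field
This application relates to, but is not limited to, the field of communications.
Background
With the development of science and technology, the amount of data we face keeps growing. Computers, video cameras, portable hard drives, USB flash drives, and other devices commonly used in work and daily life have capacities of hundreds of gigabytes or hundreds of terabytes and store massive numbers of data files. Typically, a voice mailbox system adds several hundred gigabytes of new data per day, spread across tens of thousands of data files. The overall scale of the data file system is very large, file management becomes difficult, and sorting or querying data files is often inconvenient.
The file system is responsible for organizing and allocating the space of the file storage device, managing file storage, and protecting and retrieving stored files. In the related art, when storing data files the file system places them in directories only by name or node; when a user queries data files, the query can only be made by name or node, so query efficiency is low.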
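Summary
The following is an overview of the subject matter described in detail herein. This overview is not intended to limit the scope of the claims.
This document proposes a file acceleration method and apparatus that address the technical problem of low data file query efficiency and achieve efficient and orderly management.
A file acceleration method includes:
upon receiving an index creation instruction, extracting association information of each data file to be indexed in a file system, the association information including attribute information and/or content information of the data file; and
generating index information for each data file based on the extracted association information.
Optionally, when the association information is the attribute information of the data file, the step of generating index information for each data file based on the extracted association information includes:
computing on the attribute information of each data file according to a preset algorithm to determine location information of each data file in the file system, and generating the index information of each data file from the location information.
Optionally, the preset algorithm is a hash algorithm, and the location information is a pointer position.
Optionally, when the association information is the content information of the data file, the step of generating index information for each data file based on the extracted association information includes:
segmenting each line of each data file by the column delimiter of the data file to generate the data segments of each data file;
obtaining the content information corresponding to each data segment and its address information; and
generating the index information of each data file based on the content information corresponding to each data segment and its address information.
Optionally, when the association information is the attribute information and the content information of the data file, the step of generating index information for each data file based on the extracted association information includes:
extracting the attribute information of each data file, determining location information of each data file in the file system according to each piece of extracted attribute information, and generating the index information of each data file from the location information; and
in each data file, segmenting each line by the column delimiter of the data file to generate the data segments of each data file; obtaining the content information corresponding to each data segment and its address information, determining location information of each piece of content information within the corresponding data file, and generating the index information of each data file based on the location information and the content information.
Optionally, before the step of extracting, upon receiving an index creation instruction, association information of each data file to be indexed in the file system, the association information including attribute information and/or content information of the data file, the file acceleration method further includes:
upon detecting that a new data file has appeared in the file system, taking the new data file as a data file to be indexed, so as to trigger the index creation instruction.
Optionally, the file acceleration method further includes:
upon receiving a data file query instruction, analyzing the received query instruction and extracting a preset index item based on the query instruction;
extracting the keyword corresponding to the index item based on the extracted index item;
obtaining the corresponding index information based on the extracted keyword; and
obtaining the data file corresponding to the index information according to the index information.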
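A file acceleration apparatus includes:
an extraction module configured to: upon receiving an index creation instruction, extract association information of each data file to be indexed in a file system, the association information including attribute information and/or content information of the data file; and
a generation module configured to: generate index information for each data file based on the extracted association information.
Optionally, when the association information is the content information of the data file, the generation module includes:
a processing unit configured to: segment each line of each data file by the column delimiter of the data file to generate the data segments of each data file;
an obtaining unit configured to: obtain the content information corresponding to each data segment and its address information; and
a generation unit configured to: generate the index information of each data file based on the content information corresponding to each data segment and its address information.
Optionally, the file acceleration apparatus further includes:
a trigger module configured to: upon detecting that a new data file has appeared in the file system, take the new data file as a data file to be indexed, so as to trigger the index creation instruction.
Optionally, the file acceleration apparatus further includes:
a processing module configured to: upon receiving a data file query instruction, analyze the received query instruction and extract a preset index item based on the query instruction;
the extraction module is further configured to: extract the keyword corresponding to the index item based on the extracted index item; and
an obtaining module configured to: obtain the corresponding index information based on the extracted keyword; and obtain the data file corresponding to the index information according to the index information.
A computer-readable storage medium stores computer-executable instructions, the computer-executable instructions being used to perform any one of the methods above.
With the file acceleration method and apparatus proposed by the embodiments of the present invention, when an index creation instruction is received, association information of each data file to be indexed in the file system is extracted, and index information for each data file is generated based on the extracted association information. As a result, when a user queries a data file and its content, the index information directly locates the corresponding data file and the position of its content, so the data file and its content can be found quickly; this improves the query efficiency of data files and achieves efficient and orderly management.
Other aspects will become apparent upon reading and understanding the drawings and the detailed description.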
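Brief Description of the Drawings
FIG. 1 is a schematic flowchart of a first embodiment of the file acceleration method of the present invention;
FIG. 2 is a detailed schematic flowchart of step S20 in FIG. 1;
FIG. 3 is a schematic flowchart of a second embodiment of the file acceleration method of the present invention;
FIG. 4 is a schematic diagram of the functional modules of a first embodiment of the file acceleration apparatus of the present invention;
FIG. 5 is a detailed schematic diagram of the functional modules of the generation module 20 in FIG. 4;
FIG. 6 is a schematic diagram of the functional modules of a second embodiment of the file acceleration apparatus of the present invention.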
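Detailed Description of the Embodiments
An embodiment of the present invention provides a file acceleration method.
Referring to FIG. 1, FIG. 1 is a schematic flowchart of the first embodiment of the file acceleration method of the present invention.
Step S10: upon receiving an index creation instruction, extract association information of each data file to be indexed in the file system, the association information including attribute information and/or content information of the data file.
In this embodiment, before step S10, the method includes a step of, upon detecting that a new data file has appeared in the file system, taking the new data file as a data file to be indexed so as to trigger the index creation instruction. The index creation instruction may be triggered in the following ways:
1) Automatic triggering: upon detecting that a new data file has appeared in the file system, the new data file is taken as a data file to be indexed, which triggers the index creation instruction.
2) Timed triggering: the terminal may be preset to search the file system once per preset time period, for example once every 12 hours; if a new data file is detected within the preset time period, the new data file found by the search is taken as a data file to be indexed, which triggers the index creation instruction.
3) Manual triggering: upon detecting that a new data file has appeared in the file system, the terminal outputs prompt information on a preset display interface. The prompt may be a prompt window shown on the preset interface (for example, displaying "A new data file has appeared in the system!") or a shaking window shown in a preset area of the terminal interface, to notify the user that a new data file has appeared in the file system; when the user is detected touching a preset index creation icon, the new data file is taken as a data file to be indexed and the index creation instruction is triggered.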
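The timed trigger described above could be sketched roughly as follows. This is only an illustration under stated assumptions: the function names `find_new_files` and `scan_and_trigger`, the in-memory `indexed` set, and the `create_index` callback are hypothetical and do not come from the application, which does not say how new files are detected.

```python
import os


def find_new_files(root, indexed):
    """Return the paths under `root` that are not yet in the `indexed` set, sorted."""
    new_files = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if path not in indexed:
                new_files.append(path)
    return sorted(new_files)


def scan_and_trigger(root, indexed, create_index):
    """Run one scheduled scan: hand every new file to `create_index` and remember it."""
    new_files = find_new_files(root, indexed)
    for path in new_files:
        # Stands in for "triggering the index creation instruction" for this file.
        create_index(path)
        indexed.add(path)
    return new_files
```

A scheduler (for example, one that fires every 12 hours) would call `scan_and_trigger` once per period; the automatic and manual modes differ only in what causes the call.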
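In this embodiment, before step S10, the method further includes a step of presetting the management mode of the file system and configuring the operating parameters of the file system, which includes: a. configuring the basic parameters of the file system, such as how indexes in the file system are updated, how system memory is allocated, the number of system threads, and whether the index runs in the background when the system starts; b. configuring index-related parameters, such as the index storage location and the data file types covered by the index (for example, which types of folders the index includes or excludes). The terminal can extract statistics generated while the system runs and present them to the user; by analyzing these statistics, the user can tune the management mode and the operating parameters of the file system so that the system runs in its best state.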
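As an illustration of the two groups of parameters (a and b), a configuration object might look like the sketch below. Every field name and default value here is an assumption made for the example; the application lists the kinds of parameters but gives no names, formats, or defaults.

```python
import os
from dataclasses import dataclass, field


@dataclass
class IndexConfig:
    # a. Basic file system parameters (hypothetical names and defaults).
    update_mode: str = "automatic"        # "automatic", "timed", or "manual"
    thread_count: int = 4
    run_in_background: bool = True
    # b. Index-related parameters (hypothetical names and defaults).
    index_storage_path: str = "/var/lib/file_index"
    included_types: set = field(default_factory=lambda: {".log", ".txt"})
    excluded_dirs: set = field(default_factory=set)

    def should_index(self, path: str) -> bool:
        """Apply the include/exclude rules to one file path."""
        parts = set(os.path.normpath(path).split(os.sep))
        if parts & self.excluded_dirs:
            return False
        return os.path.splitext(path)[1].lower() in self.included_types
```

Keeping the include/exclude decision in one method means the trigger and the index builder can share the same rule.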
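The data files include voice mailbox data files, system log files, text files, or binary files. The attribute information includes the name, size, creation time, or type of the data file, and the content information includes key fields of the data file content.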
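Step S20: generate index information for each data file based on the extracted association information.
In this embodiment, the ways of generating the index information of each data file based on the extracted association information include:
1) First implementation: when the extracted association information is the attribute information of the data file, a preset algorithm, for example a hash algorithm, may be applied to each piece of attribute information to determine the location information of each data file in the file system, and the index information of each data file is generated from that location information. For example, the attribute information is computed with the preset hash algorithm; from the result, the position pointer of the corresponding index data in the index tree is obtained, and the index data is added at that position pointer and saved, thereby generating the index information of each data file. It will be appreciated that, when data files are queried by file attribute information such as the file name, the query can be located through the index information associated with that attribute information; for example, the user enters preset index information and triggers a query instruction, and the data file corresponding to the index information is located quickly through the association between the index information and the data file.
2) Second implementation: referring to FIG. 2, when the extracted association information is the content information of the data file, step S20 includes:
Step S21: segment each line of each data file by the column delimiter of the data file to generate the data segments of each data file.
In this embodiment, when the extracted association information is the content information of the data file, for example the content information of a log file, because the lines of a log file have a fixed format, each line can be split by the preset column delimiter into a preset number of data segments, generating the data segments of each data file. Those skilled in the art will understand that in a voice mailbox system, log files are structured: the content has a fixed tabular format with explicit row and column delimiters. Typical log content includes the following fields: time, log level, calling number, message number, operation type, failure reason, and so on; of these, the content information includes the following fields: time, calling number, message number, and so on.
Step S22: obtain the content information corresponding to each data segment and its address information.
Step S23: generate the index information of each data file based on the content information corresponding to each data segment and its address information.
In this embodiment, each data file is first obtained and segmented to obtain multiple data segments of each data file; the key content in each data segment is extracted, and at the same time the address information corresponding to each piece of key content is extracted, establishing the correspondence between each piece of content information and its address information. In other words, the pointer position of each piece of content information within the corresponding data file is obtained, and each piece of content information is written at its pointer position to generate the index information of each data file. It will be appreciated that each data file may correspond to one index or to multiple indexes; when it corresponds to multiple indexes, the index information of the data file is generated from those indexes.
3) Third implementation: when the extracted association information includes both the attribute information and the content information of the data file, the attribute information of each data file may be extracted first, and the location information of each corresponding data file in the file system is determined from each piece of extracted attribute information to generate the index information of each data file; that is, the pointer position of each data file in the file system is obtained, and the index information of each data file is generated from that pointer position. Within each data file, each line may be split by the preset column delimiter into a preset number of data segments; the content information and address information of those data segments are obtained, the location information of each piece of content information within the corresponding data file is determined, and the index information of each data file is generated based on the location information and the content information. The location information may be a pointer position; each piece of content information is written at its pointer position to generate the index information of each data file.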
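The attribute-based first implementation (hash the attribute, use the result as a position, store the index data there) could be sketched as below. This is a minimal sketch under assumptions: the application speaks of a position pointer in an "index tree" without specifying the structure, so a flat array of hash buckets is substituted here, and the class name `AttributeIndex`, the use of SHA-1, and the bucket count are all illustrative choices.

```python
import hashlib


class AttributeIndex:
    """Maps an attribute value (e.g. a file name) to file locations via a hash slot."""

    def __init__(self, bucket_count=1024):
        self.buckets = [[] for _ in range(bucket_count)]

    def _slot(self, attribute):
        # Hash the attribute and reduce it to a bucket position (the "pointer position").
        digest = hashlib.sha1(attribute.encode("utf-8")).digest()
        return int.from_bytes(digest[:8], "big") % len(self.buckets)

    def add(self, attribute, location):
        # Store the index entry at the computed position.
        self.buckets[self._slot(attribute)].append((attribute, location))

    def lookup(self, attribute):
        # Only one bucket is scanned, so lookup cost does not grow with the file count
        # as long as buckets stay short.
        return [loc for attr, loc in self.buckets[self._slot(attribute)] if attr == attribute]
```

For example, after `idx.add("vm_20150420.log", "/data/vm/vm_20150420.log")`, calling `idx.lookup("vm_20150420.log")` returns the stored path without walking the directory tree.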
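In this embodiment, upon receiving the index creation instruction, the data files to be indexed are opened one by one, attribute information or content information is extracted from each opened file, and the index information of the corresponding data file is created from that attribute information or content information. There may be more than one piece of attribute information, and a given piece of index information may be a combination of several attributes; likewise, there may be more than one piece of content information, and a given piece of index information may be a combination of several content items.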
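The content-based implementation (steps S21 to S23) can be illustrated with the sketch below, which splits each log line on a column delimiter and records the byte offset of the line as its address information. The field order follows the typical log fields named in the description, but the `|` delimiter, the field names, and the function names are assumptions made for the example.

```python
from collections import defaultdict

# Assumed column order of one log line; only the INDEXED fields are treated as content information.
FIELDS = ["time", "level", "caller", "message_id", "operation", "failure_reason"]
INDEXED = ("time", "caller", "message_id")


def build_content_index(path, delimiter="|"):
    """Map (field, value) to the byte offsets of the lines that contain it."""
    index = defaultdict(list)
    offset = 0
    with open(path, "rb") as f:
        for raw in f:
            # S21: split the line into data segments by the column delimiter.
            segments = raw.decode("utf-8").rstrip("\r\n").split(delimiter)
            row = dict(zip(FIELDS, segments))
            # S22/S23: pair each piece of content with its address and store it.
            for name in INDEXED:
                if name in row:
                    index[(name, row[name])].append(offset)
            offset += len(raw)
    return index


def read_line_at(path, offset):
    """Jump straight to a recorded offset and return that line."""
    with open(path, "rb") as f:
        f.seek(offset)
        return f.readline().decode("utf-8").rstrip("\r\n")
```

Reading in binary mode keeps the offsets exact, so `read_line_at` can seek directly to a matching record instead of scanning the file.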
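With the file acceleration method proposed in this embodiment, when an index creation instruction is received, association information of each data file to be indexed in the file system is extracted, and index information for each data file is generated based on the extracted association information. As a result, when a user queries a data file and its content, the index information directly locates the corresponding data file and the position of its content, so the data file and its content can be found quickly; this improves the query efficiency of data files and achieves efficient and orderly management.
Referring to FIG. 3, a second embodiment of the file acceleration method of the present invention is proposed on the basis of the first embodiment. In this embodiment, the file acceleration method further includes:
Step S30: upon receiving a data file query instruction, analyze the received query instruction and extract a preset index item based on the query instruction;
Step S40: extract the keyword corresponding to the index item based on the extracted index item;
Step S50: obtain the corresponding index information based on the extracted keyword;
Step S60: obtain the data file corresponding to the index information according to the index information.
In this embodiment, optionally, upon receiving a data query instruction, the index item corresponding to the query instruction is extracted; for example, if the query instruction is a query by name, the index item is the file name of the data file. The keyword corresponding to the index item is then extracted according to the index item (for example, the keyword "system"), the corresponding index information is obtained according to the keyword, and the data file corresponding to the index information is obtained according to the index information.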
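Steps S30 to S60 could be sketched as a small lookup pipeline. The `type:text` instruction format, the mapping from query type to index item, and the nested-dictionary index layout are all assumptions for illustration; the application does not define the format of a query instruction.

```python
# Hypothetical mapping from query type to the preset index item it refers to.
QUERY_TYPE_TO_INDEX_ITEM = {"name": "file_name", "time": "created_at"}


def run_query(instruction, indexes):
    """Resolve an instruction such as 'name:system' against per-item indexes.

    `indexes` has the shape {index_item: {keyword: [file paths]}}.
    """
    query_type, _, text = instruction.partition(":")
    index_item = QUERY_TYPE_TO_INDEX_ITEM[query_type]            # S30: index item
    keyword = text.strip()                                       # S40: keyword
    index_info = indexes.get(index_item, {}).get(keyword, [])    # S50: index information
    return list(index_info)                                      # S60: matching data files
```

With `indexes = {"file_name": {"system": ["/var/log/system.log"]}}`, the call `run_query("name:system", indexes)` returns the path of the matching file, and an unknown keyword returns an empty list.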
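It will be appreciated that, during file use, because the index is kept in order, queries, sorting, and display can be performed quickly through the pre-built index. This addresses the long time taken by query and sort operations during file use and improves the query efficiency for log data files. For example, when a data file is accessed, the index information is used first: it locates the corresponding data file and jumps to it quickly, which is convenient for the user. For example, when searching for a file by attribute information such as the file name, the data file can be located directly in the file system through the index; or, when querying log data content, the relevant index information is found according to the query condition, the location index is then found from that index information, and the data content is finally accessed directly through the location index. Because the index structure supports efficient access, accessing the data content is more efficient than with current techniques.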
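To illustrate why an ordered index makes querying and sorting fast, the sketch below keeps keys sorted and answers range queries by binary search. The class name `SortedIndex` and the use of Python's `bisect` module are choices made for this example, not something the application specifies.

```python
import bisect


class SortedIndex:
    """Keeps (key, location) pairs ordered by key so results come back already sorted."""

    def __init__(self):
        self._keys = []
        self._values = []

    def add(self, key, location):
        # Insert at the position that keeps the keys in ascending order.
        pos = bisect.bisect_right(self._keys, key)
        self._keys.insert(pos, key)
        self._values.insert(pos, location)

    def range(self, low, high):
        """Return the locations whose key satisfies low <= key <= high, in key order."""
        start = bisect.bisect_left(self._keys, low)
        end = bisect.bisect_right(self._keys, high)
        return self._values[start:end]
```

Because the keys are already ordered, a time-range query or a sorted listing needs no separate sort step at query time.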
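In this embodiment, optionally, upon receiving a data file query instruction, the received query instruction is analyzed and keywords are extracted from it. There may be one or more keywords; when multiple keywords are extracted, the corresponding index information is obtained based on them and the data is queried through the index. In this embodiment, the multiple extracted keywords yield the result in a single pass, making the data file query more efficient.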
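One way to read "multiple keywords yield the result in a single pass" is to intersect the index entries of all keywords. The sketch below assumes that AND semantics, which the application does not state, and uses a hypothetical index that maps each keyword to a set of locations.

```python
def query_all(index, keywords):
    """Return the sorted locations that match every keyword.

    `index` maps a keyword to the set of locations where it occurs.
    """
    postings = [index.get(k, set()) for k in keywords]
    if not postings:
        return []
    # Start from the smallest set so each intersection stays cheap.
    postings.sort(key=len)
    result = set(postings[0])
    for p in postings[1:]:
        result &= p
    return sorted(result)
```

Only the index is consulted; the data files themselves are opened afterwards, and only for the locations that survive the intersection.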
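An embodiment of the present invention provides a file acceleration apparatus.
Referring to FIG. 4, FIG. 4 is a schematic diagram of the functional modules of the first embodiment of the file acceleration apparatus of the present invention.
This embodiment proposes a file acceleration apparatus, which includes:
an extraction module 10 configured to: upon receiving an index creation instruction, extract association information of each data file to be indexed in the file system, the association information including attribute information and/or content information of the data file.
In this embodiment, the file acceleration apparatus further includes a trigger module configured to: upon detecting that a new data file has appeared in the file system, take the new data file as a data file to be indexed so as to trigger the index creation instruction. The index creation instruction may be triggered in the following ways:
1) Automatic triggering: upon detecting that a new data file has appeared in the file system, the new data file is taken as a data file to be indexed, which triggers the index creation instruction.
2) Timed triggering: a search of the file system may be preset to run once per preset time period, for example once every 12 hours; if a new data file is detected within the preset time period, the new data file found by the search is taken as a data file to be indexed, which triggers the index creation instruction.
3) Manual triggering: upon detecting that a new data file has appeared in the file system, prompt information is output on a preset display interface. The prompt may be a prompt window shown on the preset interface (for example, displaying "A new data file has appeared in the system!") or a shaking window shown in a preset area of the terminal interface, to notify the user that a new data file has appeared in the file system; when the user is detected touching a preset index creation icon, the new data file is taken as a data file to be indexed and the index creation instruction is triggered.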
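This embodiment includes a step of presetting the management mode of the file system and configuring the operating parameters of the file system, which includes: a. configuring the basic parameters of the file system, such as how indexes in the file system are updated, how system memory is allocated, the number of system threads, and whether the index runs in the background when the system starts; b. configuring index-related parameters, such as the index storage location and the data file types covered by the index (for example, which types of folders the index includes or excludes). The extraction module 10 can extract statistics generated while the system runs and present them to the user; by analyzing these statistics, the user can tune the management mode and the operating parameters of the file system so that the system runs in its best state.
The data files include voice mailbox data files, system log files, text files, or binary files. The attribute information includes the name, size, creation time, or type of the data file, and the content information includes key fields of the data file content.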
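a generation module 20 configured to: generate index information for each data file based on the extracted association information.
In this embodiment, the ways in which the generation module 20 generates the index information of each data file based on the extracted association information include:
1) First implementation: when the extracted association information is the attribute information of the data file, a preset algorithm, for example a hash algorithm, may be applied to each piece of attribute information to determine the location information of each data file in the file system, and the generation module 20 generates the index information of each data file from that location information. For example, the attribute information is computed with the preset hash algorithm; from the result, the position pointer of the corresponding index data in the index tree is obtained, and the index data is added at that position pointer and saved, so that the generation module 20 can generate the index information of each data file. It will be appreciated that, when data files are queried by file attribute information such as the file name, the query can be located through the index information associated with that attribute information; for example, the user enters preset index information and triggers a query instruction, and the data file corresponding to the index information is located quickly through the association between the index information and the data file.
2) Second implementation: referring to FIG. 5, when the extracted association information is the content information of the data file, the generation module 20 includes:
a processing unit 21 configured to: segment each line of each data file by the column delimiter of the data file to generate the data segments of each data file.
In this embodiment, when the extracted association information is the content information of the data file, for example the content information of a log file, because the lines of a log file have a fixed format, the processing unit 21 can split each line by the preset column delimiter into a preset number of data segments, generating the data segments of each data file. Those skilled in the art will understand that in a voice mailbox system, log files are structured: the content has a fixed tabular format with explicit row and column delimiters. Typical log content includes the following fields: time, log level, calling number, message number, operation type, failure reason, and so on; of these, the content information includes the following fields: time, calling number, message number, and so on.
an obtaining unit 22 configured to: obtain the content information corresponding to each data segment and its address information;
a generation unit 23 configured to: generate the index information of each data file based on the content information corresponding to each data segment and its address information.
In this embodiment, the obtaining unit 22 first obtains each data file and segments it to obtain multiple data segments of each data file, extracts the key content in each data segment and, at the same time, the address information corresponding to each piece of key content, and establishes the correspondence between each piece of content information and its address information. In other words, the obtaining unit 22 obtains the pointer position of each piece of content information within the corresponding data file and writes each piece of content information at its pointer position, so that the generation unit 23 can generate the index information of each data file. It will be appreciated that each data file may correspond to one index or to multiple indexes; when it corresponds to multiple indexes, the generation unit 23 generates the index information of the data file from those indexes.
3) Third implementation: when the association information extracted by the extraction module 10 includes both the attribute information and the content information of the data file, the extraction module 10 first extracts the attribute information of each data file and, from each piece of extracted attribute information, determines the location information of each corresponding data file in the file system, so that the generation unit 23 can generate the index information of each data file; that is, the extraction module 10 obtains the pointer position of each data file in the file system, and the generation unit 23 generates the index information of each data file from that pointer position. Within each data file, the processing unit 21 may split each line by the preset column delimiter into a preset number of data segments; the obtaining unit 22 obtains the content information and address information of those data segments and determines the location information of each piece of content information within the corresponding data file, and the generation unit 23 generates the index information of each data file based on the location information and the content information. The location information may be a pointer position; each piece of content information is written at its pointer position so that the generation unit 23 can generate the index information of each data file.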
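In this embodiment, upon receiving the index creation instruction, the data files to be indexed are opened one by one, attribute information or content information is extracted from each opened file, and the index information of the corresponding data file is created from that attribute information or content information. There may be more than one piece of attribute information, and a given piece of index information may be a combination of several attributes; likewise, there may be more than one piece of content information, and a given piece of index information may be a combination of several content items.
With the file acceleration apparatus proposed in this embodiment, when an index creation instruction is received, association information of each data file to be indexed in the file system is extracted, and index information for each data file is generated based on the extracted association information. As a result, when a user queries a data file and its content, the index information directly locates the corresponding data file and the position of its content, so the data file and its content can be found quickly; this improves the query efficiency of data files and achieves efficient and orderly management.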
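The division of work between the extraction module 10 and the generation module 20 might be sketched as two small classes wired together. The class names, method names, and the dictionary-based index are hypothetical and cover only attribute information (name, size, type), to keep the example short.

```python
import os


class ExtractionModule:
    """Plays the role of extraction module 10: pulls attribute information from a file."""

    def extract(self, path):
        stat = os.stat(path)
        return {
            "name": os.path.basename(path),
            "size": stat.st_size,
            "type": os.path.splitext(path)[1],
        }


class GenerationModule:
    """Plays the role of generation module 20: turns attribute information into index entries."""

    def __init__(self):
        self.index = {}

    def generate(self, path, info):
        for attribute in ("name", "type"):
            self.index.setdefault((attribute, info[attribute]), []).append(path)


class FileAccelerator:
    """Wires the two modules together for one index creation instruction."""

    def __init__(self):
        self.extraction = ExtractionModule()
        self.generation = GenerationModule()

    def on_create_index(self, paths):
        for path in paths:
            self.generation.generate(path, self.extraction.extract(path))
        return self.generation.index
```

Keeping extraction and generation separate mirrors the module split in the description and lets a content-based generator be swapped in without touching the extraction side.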
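Referring to FIG. 6, a second embodiment of the file acceleration apparatus of the present invention is proposed on the basis of the first embodiment. In this embodiment, the file acceleration apparatus further includes:
a processing module 30 configured to: upon receiving a data file query instruction, analyze the received query instruction and extract a preset index item based on the query instruction;
the extraction module 10 is further configured to: extract the keyword corresponding to the index item based on the extracted index item;
an obtaining module 40 configured to: obtain the corresponding index information based on the extracted keyword; and obtain the data file corresponding to the index information according to the index information.
In this embodiment, optionally, upon receiving a data query instruction, the processing module 30 extracts the index item corresponding to the query instruction; for example, if the query instruction is a query by name, the index item is the file name of the data file. The extraction module 10 then extracts the keyword corresponding to the index item according to the index item (for example, the keyword "system"), and the corresponding index information is obtained according to the keyword, so that the obtaining module 40 can obtain the data file corresponding to the index information.
It will be appreciated that, during file use, because the index is kept in order, queries, sorting, and display can be performed quickly through the pre-built index. This addresses the long time taken by query and sort operations during file use and improves the query efficiency for log data files. For example, when a data file is accessed, the index information is used first: it locates the corresponding data file and jumps to it quickly, which is convenient for the user. For example, when searching for a file by attribute information such as the file name, the data file can be located directly in the file system through the index; or, when querying log data content, the relevant index information is found according to the query condition, the location index is then found from that index information, and the data content is finally accessed directly through the location index. Because the index structure supports efficient access, accessing the data content is more efficient than with current techniques.
In this embodiment, optionally, upon receiving a data file query instruction, the processing module 30 analyzes the received query instruction and extracts keywords from it. There may be one or more keywords; when multiple keywords are extracted, the obtaining module 40 obtains the corresponding index information based on them and queries the data through the index. In this embodiment, the multiple extracted keywords yield the result in a single pass, making the data file query more efficient.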
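It should be noted that, in this document, the terms "include", "comprise", or any other variant thereof are intended to cover non-exclusive inclusion, so that a process, method, article, or apparatus that includes a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to such a process, method, article, or apparatus. Without further limitation, an element qualified by the phrase "including a ..." does not exclude the presence of additional identical elements in the process, method, article, or apparatus that includes that element.
The serial numbers of the above embodiments of the present invention are for description only and do not indicate the relative merits of the embodiments.
Those of ordinary skill in the art will understand that all or some of the steps of the above embodiments can be implemented as a computer program flow. The computer program can be stored in a computer-readable storage medium and executed on a corresponding hardware platform (such as a system, equipment, apparatus, or device); when executed, it performs one of the steps of the method embodiments or a combination thereof.
Optionally, all or some of the steps of the above embodiments may also be implemented using integrated circuits. The steps may be fabricated as individual integrated circuit modules, or several of the modules or steps may be fabricated as a single integrated circuit module.
The devices/function modules/functional units in the above embodiments can be implemented using general-purpose computing devices; they can be concentrated on a single computing device or distributed over a network of multiple computing devices.
When the devices/function modules/functional units in the above embodiments are implemented in the form of software function modules and sold or used as stand-alone products, they can be stored in a computer-readable storage medium. The above-mentioned computer-readable storage medium may be a read-only memory, a magnetic disk, an optical disk, or the like.
Industrial Applicability
With the file acceleration method and apparatus proposed by the embodiments of the present invention, when an index creation instruction is received, association information of each data file to be indexed in the file system is extracted, and index information for each data file is generated based on the extracted association information. As a result, when a user queries a data file and its content, the index information directly locates the corresponding data file and the position of its content, so the data file and its content can be found quickly; this improves the query efficiency of data files and achieves efficient and orderly management.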

Claims (12)

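  1. A file acceleration method, comprising:
    upon receiving an index creation instruction, extracting association information of each data file to be indexed in a file system, the association information comprising attribute information and/or content information of the data file; and
    generating index information for each data file based on the extracted association information.
  2. The file acceleration method according to claim 1, wherein, when the association information is the attribute information of the data file, the step of generating index information for each data file based on the extracted association information comprises:
    computing on the attribute information of each data file according to a preset algorithm to determine location information of each data file in the file system, and generating the index information of each data file from the location information.
  3. The file acceleration method according to claim 2, wherein the preset algorithm is a hash algorithm and the location information is a pointer position.
  4. The file acceleration method according to claim 1, wherein, when the association information is the content information of the data file, the step of generating index information for each data file based on the extracted association information comprises:
    segmenting each line of each data file by the column delimiter of the data file to generate the data segments of each data file;
    obtaining the content information corresponding to each data segment and its address information; and
    generating the index information of each data file based on the content information corresponding to each data segment and its address information.
  5. The file acceleration method according to claim 1, wherein, when the association information is the attribute information and the content information of the data file, the step of generating index information for each data file based on the extracted association information comprises:
    extracting the attribute information of each data file, determining location information of each data file in the file system according to each piece of extracted attribute information, and generating the index information of each data file from the location information; and
    in each data file, segmenting each line by the column delimiter of the data file to generate the data segments of each data file; obtaining the content information corresponding to each data segment and its address information, determining location information of each piece of content information within the corresponding data file, and generating the index information of each data file based on the location information and the content information.
  6. The file acceleration method according to claim 1, wherein, before the step of extracting, upon receiving an index creation instruction, association information of each data file to be indexed in the file system, the association information comprising attribute information and/or content information of the data file, the file acceleration method further comprises:
    upon detecting that a new data file has appeared in the file system, taking the new data file as a data file to be indexed, so as to trigger the index creation instruction.
  7. The file acceleration method according to claim 1, further comprising:
    upon receiving a data file query instruction, analyzing the received query instruction and extracting a preset index item based on the query instruction;
    extracting the keyword corresponding to the index item based on the extracted index item;
    obtaining the corresponding index information based on the extracted keyword; and
    obtaining the data file corresponding to the index information according to the index information.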
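  8. A file acceleration apparatus, comprising:
    an extraction module configured to: upon receiving an index creation instruction, extract association information of each data file to be indexed in a file system, the association information comprising attribute information and/or content information of the data file; and
    a generation module configured to: generate index information for each data file based on the extracted association information.
  9. The file acceleration apparatus according to claim 8, wherein, when the association information is the content information of the data file, the generation module comprises:
    a processing unit configured to: segment each line of each data file by the column delimiter of the data file to generate the data segments of each data file;
    an obtaining unit configured to: obtain the content information corresponding to each data segment and its address information; and
    a generation unit configured to: generate the index information of each data file based on the content information corresponding to each data segment and its address information.
  10. The file acceleration apparatus according to claim 8, further comprising:
    a trigger module configured to: upon detecting that a new data file has appeared in the file system, take the new data file as a data file to be indexed, so as to trigger the index creation instruction.
  11. The file acceleration apparatus according to claim 8, further comprising:
    a processing module configured to: upon receiving a data file query instruction, analyze the received query instruction and extract a preset index item based on the query instruction;
    the extraction module being further configured to: extract the keyword corresponding to the index item based on the extracted index item; and
    an obtaining module configured to: obtain the corresponding index information based on the extracted keyword; and obtain the data file corresponding to the index information according to the index information.
  12. A computer-readable storage medium storing computer-executable instructions, the computer-executable instructions being used to perform the method according to any one of claims 1 to 7.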
PCT/CN2015/086404 2015-04-20 2015-08-07 File acceleration method and apparatus WO2016169152A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510188685.1A CN106156183A (zh) 2015-04-20 2015-04-20 File acceleration method and apparatus
CN201510188685.1 2015-04-20

Publications (1)

Publication Number Publication Date
WO2016169152A1 true WO2016169152A1 (zh) 2016-10-27

Family

ID=57142786

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/086404 WO2016169152A1 (zh) 2015-04-20 2015-08-07 File acceleration method and apparatus

Country Status (2)

Country Link
CN (1) CN106156183A (zh)
WO (1) WO2016169152A1 (zh)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
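CN101408876A (zh) * 2007-10-09 2009-04-15 ZTE Corporation Method and system for full-text retrieval of electronic documents
CN101430685A (zh) * 2007-11-06 2009-05-13 Schneider Electric Industries SAS Download method and system thereof
CN101916164A (zh) * 2010-08-11 2010-12-15 ZTE Corporation Mobile terminal and file browsing method implemented thereby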
US20140181119A1 (en) * 2012-12-26 2014-06-26 Industrial Technology Research Institute Method and system for accessing files on a storage system

Also Published As

Publication number Publication date
CN106156183A (zh) 2016-11-23

Similar Documents

Publication Publication Date Title
US11194779B2 (en) Generating an index for a table in a database background
US10169471B2 (en) Generating and executing query language statements from natural language
US8468146B2 (en) System and method for creating search index on cloud database
US9507762B1 (en) Converting portions of documents between structured and unstructured data formats to improve computing efficiency and schema flexibility
JP2019194882A (ja) Implementation of semi-structured data as a first-class database element
US9552161B2 (en) Repetitive data block deleting system and method
US20140310324A1 (en) Systems and methods for automatically sorting and indexing electronic files
US20140244606A1 (en) Method, apparatus and system for storing, reading the directory index
WO2018097846A1 (en) Edge store designs for graph databases
US20140052734A1 (en) Computing device and method for creating data indexes for big data
CN105760418B (zh) Method and system for cross-column search of relational database tables
KR101234795B1 (ko) Content browsing apparatus and method
US11645234B2 (en) Rule-based collections of subset(s) of metadata in response to a trigger event occurring
BR102013016924A2 (pt) Method and apparatus for prioritizing metadata
US20090100039A1 (en) Extensible mechanism for grouping search results
CN111045994B (zh) File classification and retrieval method and system based on a KV database
US11250002B2 (en) Result set output criteria
Atwal et al. Shining a light on Spotlight: Leveraging Apple's desktop search utility to recover deleted file metadata on macOS
US9286349B2 (en) Dynamic search system
WO2016169152A1 (zh) File acceleration method and apparatus
US10318507B2 (en) Optimizing tables with too many columns in a database
US10997144B2 (en) Reducing write amplification in buffer trees
US9330130B1 (en) Get row index of record in dynamic data table
US20140081986A1 (en) Computing device and method for generating sequence indexes for data files
JPWO2019176011A1 (ja) Search statement utilization device and search statement utilization method

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15889640

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15889640

Country of ref document: EP

Kind code of ref document: A1