CN111767436B

CN111767436B - HASH index data storage and reading method and system

Info

Publication number: CN111767436B
Application number: CN202010581485.3A
Authority: CN
Inventors: 王金山
Original assignee: Beijing Si Tech Information Technology Co Ltd
Current assignee: Beijing Si Tech Information Technology Co Ltd
Priority date: 2020-06-23
Filing date: 2020-06-23
Publication date: 2023-11-10
Anticipated expiration: 2040-06-23
Also published as: CN111767436A

Abstract

The invention discloses a method and a system for storing and reading HASH index data, wherein index records are stored in memory values corresponding to the HASH index data; the number of index records is determined by the data of how many HASH values are repeated; the index data format is sequentially arranged with an index mark, an index value, a next pointer and a next deleted record pointer. The storage method of HASH index data comprises the following steps: storing the repeated data records of a plurality of HASH values in the same linked list; when the index record is inserted, when the index mark of the head index record of the linked list is in the use state, the head index record is directly multiplexed; when the index mark of the head index record of the linked list is in a deleted state, the record pointed by the next pointer of the head index record is fetched. The method solves the problem that the insertion and deletion cannot be fast when the repeatability of certain values of the HASH index is very high.

Description

HASH index data storage and reading method and system

Technical Field

The invention relates to the technical field of index data of a memory database, in particular to a method and a system for storing and reading HASH index data.

Background

In order to be able to quickly find a particular record from a huge number of memory records, an index needs to be created for frequently accessed fields. For equivalence lookup, a HASH index is typically used.

Conventional HASH indexes, when HASH values are repeated, are typically stored using a linked list, i.e., the repeated values are stored on the same linked list. If the repeatability of an index value is high, the corresponding linked list is particularly long. For the usage scenario of the memory database, frequent insert-delete operations are generally performed on the index record, and for high performance consideration, delete operations are not true delete, but delete marks are marked on corresponding records, so that the memory can be reused by subsequent insert operations. Therefore, when an index record is inserted, because of the need to multiplex previously deleted memory space, it is necessary to sequentially traverse the entire linked list to find a deleted location and then place a new record.

In the case where the amount of table data is large, such as when the repeated data exceeds 100 ten thousand records, the insertion becomes very slow.

Disclosure of Invention

Aiming at the problem that insertion and deletion cannot be quickly performed when the repeatability of certain values of the HASH index is very high at present, the invention provides a method and a system for storing and reading HASH index data.

The invention discloses a storage and reading method of HASH index data, wherein index records are stored in memory values corresponding to the HASH index data;

the number of index records is determined by the data of how many HASH values are repeated;

the index data format is sequentially arranged with an index mark, an index value, a next pointer and a next deleted record pointer.

Preferably, the storing method of HASH index data includes:

storing the repeated data records of a plurality of HASH values in the same linked list;

when the index record is inserted, when the index mark of the head index record of the linked list is 1, directly multiplexing the head index record; when the index mark of the head index record of the linked list is 0, the record pointed by the next pointer of the head index record is taken out, if the record pointed by the next pointer of the head index record is empty, no reusable record is indicated, and a new record is directly inserted into the head; if the record pointed to by the next pointer of the head index record is not null, multiplexing the record pointed to by the next pointer of the head index record.

Preferably, an "index flag" of 1 of the index record indicates that the current index record status is in use, and an "index flag" of 0 of the index record indicates that the current index record status is deleted.

Preferably, the method for reading HASH index data includes inquiring and reading data according to HASH values, index marks and index values; the specific process is as follows:

firstly, indexing according to the HASH value, then inquiring in the index record corresponding to the HASH value, wherein the inquiring process is to read the index mark firstly, read the index value in the index record with the index mark of 1, and compare with the inquired index value until the corresponding index record is found.

Preferably, the deletion method of HASH index data includes:

when the index record is deleted, the index space is not released, the index record is not deleted from the index linked list, and the index mark is modified to be 0;

when one index record is deleted, if the index record is not at the head of the linked list, the record pointed by the next pointer of the head record is pointed to the currently deleted index record, and the next deleted record pointer of the currently deleted index record is pointed to the record pointed by the next pointer of the original linked list head record.

A HASH index data storage and reading system at least comprises a processor and a memory, wherein the memory stores an executable program of the method; the processor runs the executable program of the method to index the memory data.

Compared with the prior art, the invention has the beneficial effects that:

after the HASH index data storage and reading method and system are adopted, the whole index chain table is prevented from being traversed when data are inserted, the corresponding index file is found through a direct memory location mode, and when the repeatability is more than 100 ten thousand, the execution efficiency can be improved by 100 times. The efficiency of the HASH index data is effectively improved, positive influence is generated on the application HASH index data, and the reliability and the practicability of the HASH index data are improved.

Drawings

Fig. 1 is a schematic diagram of a method for storing and reading HASH index data according to the present invention.

Detailed Description

For the purpose of making the objects, technical solutions and advantages of the embodiments of the present invention more apparent, the technical solutions of the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention, and it is apparent that the described embodiments are some embodiments of the present invention, but not all embodiments of the present invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.

The invention is described in further detail below with reference to the attached drawing figures:

referring to fig. 1, a method for storing and reading HASH index data, wherein an index record is stored in a memory value corresponding to the HASH index data;

the index data format is sequentially arranged index marks, index values, next pointers and next deleted record pointers next2.

In specific implementation, the method for storing HASH index data comprises the following steps:

when the index record is inserted, when the index mark of the head index record of the linked list is 1, directly multiplexing the head index record; when the index mark of the head index record of the linked list is 0, the record pointed by the next pointer next of the head index record is taken out, if the record pointed by the next pointer next of the head index record is empty, no reusable record is indicated, and a new record is directly inserted into the head; if the record pointed to by the next pointer next of the head index record is not null, multiplexing the record pointed to by the next pointer next of the head index record.

In specific implementation, an "index flag" of 1 in the above index record indicates that the current index record status is in use, and an "index flag" of 0 in the index record indicates that the current index record status is deleted.

In specific implementation, the method for reading the HASH index data is to query and read the data according to the HASH value, the index mark and the index value; the specific process is as follows:

In specific implementation, the deletion method of HASH index data comprises the following steps:

when one index record is deleted, if the index record is not at the head of the linked list, the record pointed to by the next pointer next of the head record is pointed to the currently deleted index record, and the next deleted record pointer next2 of the currently deleted index record is pointed to the record pointed to by the next pointer next of the original linked list head record.

The method and the system for storing and reading the HASH index data avoid traversing the whole index linked list when inserting data, find the corresponding index file by a direct memory location mode, and improve the execution efficiency by 100 times when the repeatability is more than 100 ten thousand. The efficiency of the HASH index data is effectively improved, positive influence is generated on the application HASH index data, and the reliability and the practicability of the HASH index data are improved.

The above is only a preferred embodiment of the present invention, and is not intended to limit the present invention, but various modifications and variations can be made to the present invention by those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims

1. A method for storing and reading HASH index data is characterized in that an index record is stored in a memory value corresponding to the HASH index data;

the index data format is an index mark, an index value, a next pointer and a next deleted record pointer which are sequentially arranged;

the storage method of the HASH index data comprises the following steps:

when the index record is inserted, when the index mark of the head index record of the linked list is 1, directly multiplexing the head index record; when the index mark of the head index record of the linked list is 0, the record pointed by the next pointer of the head index record is taken out, if the record pointed by the next pointer of the head index record is empty, no reusable record is indicated, and a new record is directly inserted into the head; multiplexing the record pointed by the next pointer of the head index record if the record pointed by the next pointer of the head index record is not empty; wherein, an index flag of 1 of the index record indicates that the current index record state is in use, and an index flag of 0 of the index record indicates that the current index record state is deleted;

the method for reading the HASH index data is to query and read the data according to the HASH value, the index mark and the index value; the specific process is as follows:

firstly, indexing according to the HASH value, then inquiring in the index record corresponding to the HASH value, wherein the inquiring process is to read an index mark firstly, read the index value in the index record with the index mark of 1, and compare with the inquired index value until the corresponding index record is found;

the deletion method of the HASH index data comprises the following steps:

2. A HASH index data storage and reading system, at least comprising a processor and a memory, characterized in that: the memory stores therein an executable program of the method of claim 1; the processor running the executable program of the method of claim 1 to index the memory data.