CN108108402B - Data item paraphrasing updating method and device - Google Patents

Data item paraphrasing updating method and device Download PDF

Info

Publication number
CN108108402B
CN108108402B CN201711287741.2A CN201711287741A CN108108402B CN 108108402 B CN108108402 B CN 108108402B CN 201711287741 A CN201711287741 A CN 201711287741A CN 108108402 B CN108108402 B CN 108108402B
Authority
CN
China
Prior art keywords
data item
paraphrase
time interval
data items
current time
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201711287741.2A
Other languages
Chinese (zh)
Other versions
CN108108402A (en
Inventor
刘尚超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Taikang Insurance Group Co Ltd
Original Assignee
Taikang Insurance Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Taikang Insurance Group Co Ltd filed Critical Taikang Insurance Group Co Ltd
Priority to CN201711287741.2A priority Critical patent/CN108108402B/en
Publication of CN108108402A publication Critical patent/CN108108402A/en
Application granted granted Critical
Publication of CN108108402B publication Critical patent/CN108108402B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/23Updating
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462Approximate or statistical queries

Abstract

The invention provides a data item paraphrasing updating method and a data item paraphrasing updating device, wherein the method comprises the following steps: identifying idle data items from all the data items, acquiring a first paraphrase for replacing an original paraphrase of the idle data items, and replacing the original paraphrase with the first paraphrase to complete paraphrase updating of the idle data items. When a new data item needs to be added, the original paraphrase of the idle data item can be replaced by the first paraphrase, the aim of adding the data item is achieved by repeatedly using the idle data item, the data item does not need to be added by occupying new storage space, the data storage space is saved, the operation cost is reduced, and the problems that in the related technology, the number of data items is increased, a large amount of data storage space is occupied and the operation cost is increased due to the fact that the new data item is added by occupying the new storage space are solved.

Description

Data item paraphrasing updating method and device
Technical Field
The invention relates to the technical field of data processing, in particular to a method and a device for explaining and updating data items.
Background
The existing system follows the principle that one data item corresponds to one paraphrase when designing the data item architecture. With the development of the internet, the data volume and data types of various industries are more and more extensive, new data items are usually and continuously added in the related technology, and finally, the data items are more and more accumulated, so that a large amount of storage space is occupied, and the operation cost is increased.
Disclosure of Invention
The present invention is directed to solving, at least to some extent, one of the technical problems in the related art.
Therefore, a first objective of the present invention is to provide a method for updating definitions of data items, which replaces original definitions of idle data items with first definitions to reuse the idle data items, thereby saving data storage space and reducing operation cost.
The second objective of the present invention is to provide a data item paraphrasing updating device.
A third object of the invention is to propose a computer device.
A fourth object of the invention is to propose a computer program product.
A fifth object of the invention is to propose a non-transitory computer-readable storage medium.
To achieve the above object, an embodiment of a first aspect of the present invention provides a data item paraphrasing updating method, including:
identifying idle data items from all data items;
obtaining a first paraphrase that replaces an original paraphrase of the idle data item;
replacing the original paraphrase with the first paraphrase completes paraphrase updating of the idle data item.
As a possible implementation manner of the embodiment of the first aspect of the present invention, the identifying an idle data item from all data items includes:
acquiring the use information of each data item through a preset buried point probe of each data item; wherein the usage information includes at least a usage time per usage;
identifying idle data items from the data items based on the usage information for each data item.
As a possible implementation manner of the embodiment of the first aspect of the present invention, the identifying, according to the usage information of each data item, an idle data item from the data items includes:
acquiring a first data item in the current time period according to the use information of the candidate data item in the previous time period; the previous time interval is a time interval formed from a preset first moment to a current moment, and the current time interval is a time interval formed from a preset second moment to the current moment; the second moment is a moment after the first moment; the generation time of the candidate data item is in the previous period;
screening all first data items in the current time period according to the generation time of the first data items in the current time period to obtain candidate data items in the current time period;
if the current time interval is a target time interval, the candidate data item in the current time interval is taken as the idle data item.
As a possible implementation manner of the embodiment of the first aspect of the present invention, the method further includes:
if the current time interval is not the target time interval, the candidate data item of the current time interval is used as the candidate data item of the previous time interval, the use information of the candidate data item according to the previous time interval is returned to be executed again, and the first data item and the subsequent steps in the current time interval are obtained until the current time interval is the target time interval.
As a possible implementation manner of the embodiment of the first aspect of the present invention, the acquiring, according to the usage information of the candidate data item in the previous time interval, the first data item in the current time interval includes:
counting the use times of each candidate data item in the previous time period according to the use information of the candidate data item in the previous time period;
acquiring a first use frequency of each candidate data item according to the average use times of all candidate data items in the previous time period and the use times of each candidate data item;
and selecting the candidate data item of which the first use frequency does not exceed a preset threshold value as a first data item in the current time period.
As a possible implementation manner of the embodiment of the first aspect of the present invention, the screening all the first data items in the current time period according to the generation time of the first data item in the current time period to obtain the candidate data items in the current time period includes:
obtaining a second use frequency of each first data item according to the average use times of all the first data items in the current time period and the use times of each first data item;
selecting the first data item of which the second using frequency does not exceed the preset threshold value as a second data item in the current time period;
selecting the first data items in the second time period as third data items according to the generation time of each first data item; the second time interval is a time interval formed from a third moment to the current moment; the third moment is a moment after the second moment;
obtaining a third use frequency of the third data items according to the average use times of all the third data items and the use times of each third data item;
selecting the third data item of which the third using frequency does not exceed the preset threshold value as a fourth data item in the current time period;
and performing deduplication processing on the second data item and the fourth data item to obtain the candidate data item of the current time period.
As a possible implementation manner of the embodiment of the first aspect of the present invention, the identifying, according to the usage information of each data item, an idle data item from the data items includes:
counting the number of times of use of each data item in the current time period according to the use information of the data item; the current time interval is a time interval formed from a first moment to the current moment;
acquiring the use frequency of each data item according to the total use times of all the data items in the current time period and the use times of each data item;
taking the data items of which the use frequency does not exceed a preset threshold value as candidate data items;
acquiring the use time of the candidate data item when the candidate data item is used for the last time;
acquiring a time interval from the use time to the current time when the last use is carried out;
determining the candidate data item of which the time interval exceeds a preset time interval as the idle data item.
As a possible implementation manner of the embodiment of the first aspect of the present invention, before identifying an idle data item from all data items, the method further includes:
verifying the identity information of a user and determining that the user is a legal user;
before the obtaining a first paraphrase for replacing the original paraphrase of the idle data item, further comprising:
asking the user whether to paraphrase update the idle data item.
As a possible implementation manner of the embodiment of the first aspect of the present invention, after the replacing the original paraphrase with the first paraphrase and completing the paraphrase update of the idle data item, the method further includes:
presenting the first paraphrase of the idle data item to a user through a display interface.
The data item paraphrase updating method of the embodiment of the invention acquires the first paraphrase for replacing the original paraphrase of the idle data item by identifying the idle data item from all the data items, and completes paraphrase updating of the idle data item by replacing the original paraphrase with the first paraphrase. Therefore, when a new data item needs to be added, the original paraphrase of the idle data item can be replaced by the first paraphrase, the purpose of adding the data item is achieved by repeatedly utilizing the idle data item, the data item does not need to be added by occupying new storage space, the data storage space is saved, the operation cost is reduced, and the problems that in the related art, the number of data items is increased, a large amount of data storage space is occupied and the operation cost is increased due to the fact that the new data item is added by occupying the new storage space are solved.
To achieve the above object, a second aspect of the present invention provides a data item paraphrasing updating apparatus, including:
the identification module is used for identifying idle data items from all the data items;
an acquisition module for acquiring a first paraphrase for replacing an original paraphrase of the idle data item;
and the updating module is used for replacing the original paraphrase by the first paraphrase to complete paraphrase updating of the idle data item.
The data item paraphrase updating device of the embodiment of the invention acquires the first paraphrase for replacing the original paraphrase of the idle data item by identifying the idle data item from all the data items, and replaces the original paraphrase with the first paraphrase to complete paraphrase updating of the idle data item. Therefore, when a new data item needs to be added, the original paraphrase of the idle data item can be replaced by the first paraphrase, the purpose of adding the data item is achieved by repeatedly utilizing the idle data item, the data item does not need to be added by occupying new storage space, the data storage space is saved, the operation cost is reduced, and the problems that in the related art, the number of data items is increased, a large amount of data storage space is occupied and the operation cost is increased due to the fact that the new data item is added by occupying the new storage space are solved.
To achieve the above object, a third embodiment of the present invention provides a computer device, including a processor and a memory;
wherein the processor executes a program corresponding to the executable program code by reading the executable program code stored in the memory, so as to implement the data item paraphrasing updating method according to the embodiment of the first aspect.
In order to achieve the above object, a fourth aspect of the present invention provides a computer program product, wherein instructions of the computer program product, when executed by a processor, implement the data item paraphrasing update method according to the first aspect.
In order to achieve the above object, a fifth embodiment of the present invention proposes a non-transitory computer-readable storage medium, on which a computer program is stored, which when executed by a processor implements the data item paraphrasing updating method as described in the first embodiment.
Additional aspects and advantages of the invention will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the invention.
Drawings
The foregoing and/or additional aspects and advantages of the present invention will become apparent and readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:
FIG. 1 is a flow chart illustrating a method for updating definitions of data items according to an embodiment of the present invention;
FIG. 2 is a flow chart illustrating another data item paraphrase updating method according to an embodiment of the present invention;
FIG. 3 is a flowchart illustrating a method for identifying idle data items from data items according to usage information of each data item according to an embodiment of the present invention;
FIG. 4 is a flowchart illustrating a further method for updating definitions of data items according to an embodiment of the present invention;
FIG. 5 is a schematic structural diagram of a data item paraphrase updating apparatus according to an embodiment of the present invention;
FIG. 6 is a block diagram of an exemplary computer device suitable for use in implementing embodiments of the present application.
Detailed Description
Reference will now be made in detail to embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to the same or similar elements or elements having the same or similar function throughout. The embodiments described below with reference to the drawings are illustrative and intended to be illustrative of the invention and are not to be construed as limiting the invention.
A data item paraphrase updating method and apparatus of an embodiment of the present invention is described below with reference to the drawings.
Existing systems follow a data item to a paraphrase when designing the data item architecture. With the development of the internet, the data volume and data types of various industries are more and more extensive. For example, various products are continuously updated, new products appear, and old products are eliminated. In the related art, new data items are often added to meet the demand for new products. However, some data items occupied by the eliminated products cannot be used for new products, and the final data items are accumulated more and more, so that a large amount of storage space is occupied, and the operation cost is increased.
In view of the above problem, an embodiment of the present invention provides a method for updating definitions of data items, which replaces original definitions of idle data items with first definitions to reuse the idle data items, thereby saving data storage space, reducing operation cost, and solving the problems in the related art that the number of data items is increased due to continuous addition of new data items, a large amount of storage space is occupied, and operation cost is increased.
Fig. 1 is a flowchart illustrating a data item paraphrase updating method according to an embodiment of the present invention.
As shown in fig. 1, the data item paraphrase updating method includes:
in step 101, an idle data item is identified from all data items.
The idle data item refers to a data item that is not used at present, or a data item that has a last use time that exceeds a preset time length so far.
As a possible implementation form, whether the data item is an idle data item or not can be identified according to the use time of the data item. Specifically, for each data item, the time length of the last use time of each data item until now is calculated, and if the time length exceeds a preset time length, the data item can be identified as an idle data item.
At step 102, a first paraphrase is obtained that replaces an original paraphrase of an idle data item.
Where paraphrasing is to be understood as the interpretation of a data item. For example, the data item of certain product information includes a production place, a production date, a shelf life, and the like, and here, the name of the data item can be understood as a definition of the data item.
In this embodiment, the first paraphrase may be user-defined. When a new data item needs to be added, the user can enter the paraphrase of the data item that needs to be added, referred to herein as the first paraphrase, into the system that stores the data item, so that the first paraphrase can be obtained that replaces the original paraphrase of the idle data item.
Step 103, replacing the original paraphrase with the first paraphrase to complete paraphrase updating of the idle data item.
After the first paraphrase is obtained, the first paraphrase is used for replacing the original paraphrase of the idle data item so as to realize paraphrase updating of the idle data item, and the idle data item is recycled without occupying new storage space to increase the data item, so that the data storage space is saved, and the operation cost is reduced.
In order to more clearly illustrate the above embodiments, the data item paraphrase updating method proposed by the present invention is explained below with reference to fig. 2.
As shown in fig. 2, the data item paraphrase updating method includes:
step 201, acquiring the use information of the data items through a buried point probe preset by each data item; wherein the usage information includes at least a usage time for each usage.
Wherein the buried point probe is used to record usage information of the data item, such as the time of each usage. That is, when a data item is used, the buried point probe records the time at which the data item was used. Thus, the use information of each data item can be acquired through the preset buried point probe of each data item.
In step 202, idle data items are identified from the data items according to the usage information of each data item.
In this embodiment, whether the data item is in the idle state may be determined according to the usage information of each data item, such as the usage time. The embodiment of the invention provides two methods for identifying idle data items, which will be described in detail later, and thus will not be described herein again.
In step 203, a first paraphrase is obtained that replaces the original paraphrase of the idle data item.
In this embodiment, the first paraphrase may be user-defined. When a new data item needs to be added, the user can enter the paraphrase of the data item that needs to be added, referred to herein as the first paraphrase, into the system that stores the data item, so that the first paraphrase can be obtained that replaces the original paraphrase of the idle data item.
Step 204, replacing the original paraphrase with the first paraphrase to complete paraphrase updating of the idle data item.
After the first paraphrase is obtained, the first paraphrase is used for replacing the original paraphrase of the idle data item so as to realize paraphrase updating of the idle data item, and the idle data item is recycled without occupying new storage space to increase the data item, so that the data storage space is saved, and the operation cost is reduced.
Step 205, a first paraphrase of the idle data item is presented to the user through the display interface.
After the paraphrases of the idle data items are updated, in order to enable a user to confirm the updating state of the idle data items, the first paraphrases of the idle data items after being updated can be displayed on a display interface so as to visually display the updating condition of the idle data items.
To illustrate the above embodiments more clearly, with respect to step 202, the embodiment of the present invention provides two methods for identifying idle data items from the data items according to the usage information of each data item. Fig. 3 is a flowchart illustrating a method for identifying idle data items from data items according to usage information of each data item according to an embodiment of the present invention.
As shown in fig. 3, the method includes:
step 301, obtaining the first data item in the current time interval according to the use information of the candidate data item in the previous time interval.
The previous time interval is a time interval formed from a preset first moment to the current moment, the current time interval is a time interval formed from a preset second moment to the current moment, and the second moment is a moment after the first moment. It can be seen that the previous epoch contains the current epoch.
In the present embodiment, all data items on the line in the previous period are taken as candidate data items. It should be noted that if the previous time interval is the initial first time interval, all the data items in the previous time interval may be used as candidate data items.
As a possible implementation manner, the first data item in the current time interval is acquired according to the use frequency of each candidate data item in the previous time interval.
Specifically, the number of times of use of each candidate data item in the previous period is counted based on the use information, such as the use time, of each candidate data item in the previous period. Then, the number of times of use of each candidate data item is added, and the sum obtained after the addition is divided by the number of candidate data items, to obtain the average number of times of use of all candidate data items of the previous period. And then, the using times of each candidate data item is compared with the average using times of all the candidate data items to obtain a first using frequency of each data item.
And further comparing the first use frequency of each candidate data item with a preset threshold, wherein if the first use frequency does not exceed the preset threshold, the candidate data item in the previous time period is lower in use frequency and more prone to idle data items, and therefore the candidate data item of which the first use frequency does not exceed the preset threshold is the first data item in the current time period.
Step 302, according to the generation time of the first data items in the current time period, screening all the first data items in the current time period to obtain candidate data items in the current time period.
Since the closer the time of generation of the data item is to the current time, i.e. the later the data item is on line, the less frequently the data item is used in the previous period. Therefore, according to the frequency of use of the candidate data items in the previous time period, the screened-out data item in the current time period may have a data item which is not on line soon.
In order to improve the identification accuracy of the idle data items, in this embodiment, all the first data items in the current time period are screened according to the generation time of the first data items in the current time period, so as to obtain candidate data items in the current time period.
As a possible implementation manner, the first data item with high use frequency in the current period may be screened out according to the use frequency. Specifically, the number of times of use of each first data item in the current time period is counted according to the time of use of the first data item recorded by the buried point probe, and the average number of times of use of all the first data items is calculated, and further, the second frequency of use of the first data items in the current time period is calculated according to the number of times of use of the first data items and the average number of times of use of all the first data items.
And then, comparing the second use frequency of the first data item with a preset threshold, and if the second use frequency does not exceed the preset threshold, indicating that the second use frequency is low, taking the first data item as a candidate data item in the current time period. Therefore, all the first data items of which the second use frequency does not exceed the preset threshold value are selected as the second data items in the current time period.
In order to screen out the first data item with later line, a time is selected as a third time after the second time, and a time period formed from the third time value to the current time is used as a third time period, so as to screen out the data item with later generation time but higher use frequency.
Specifically, according to the generation time of the first data item, the first data item of which the generation time is in the second time period is selected as the third data item from the first data items in the current time period. Then, the number of times of use of each third data item in the second period is counted, and the average number of times of use of all the third data items is calculated. Then, the third frequency of use of the third data items is calculated by using the number of use of each third data item and the average number of use of all the third data items. And further comparing the third use frequency with a preset threshold value, and selecting a third data item of which the third use frequency does not exceed the preset threshold value as a fourth data item in the current time period.
And finally, performing deduplication processing on the second data item in the current time period and the fourth data item in the current time period, namely comparing the second data item in the current time period with the fourth data item in the current time period, deleting a data item which avoids repetition if the same second data item exists, and obtaining a candidate data item in the current time period after past reprocessing.
In the embodiment, the data items with later online are screened out according to the generation time of the first data item, so that the accuracy of identifying the idle data items is improved.
Step 303, if the current time interval is the target time interval, the candidate data items in the current time interval are taken as idle data items.
The target time period can be set as required, such as three months or one month or half, etc.
It can be understood that, if the current time interval is not the target time interval, the candidate data item of the current time interval is taken as the candidate data item of the previous time interval, and the use information of the candidate data item according to the previous time interval is returned to be executed again, and the first data item and the subsequent steps in the current time interval are acquired until the current time interval is the target time interval.
Taking the data item of a certain product as an example, all the data items in the last 3 years are selected, and the first data item in the last 2 years is selected according to the use frequency of each data item and a preset threshold value.
Then, all the first data items in the last 2 years are screened according to the generation time of the first data items, and specific steps can be referred to in the above embodiment, so as to obtain candidate data items in the last 2 years.
Further, the first data item of the last 1 year is acquired from the candidate data items of the last 2 years, and the candidate data items of the last 1 year are acquired according to the above method.
And then acquiring a first data item of the last half year according to the candidate data item of the last 1 year, and acquiring the candidate data item of the last half year according to the method.
And repeating the steps to obtain the candidate data item of the latest 1 month, and taking the candidate data item of the latest 1 month as an idle data item because the latest 1 month is the target time interval.
As an optional implementation manner, a method for acquiring idle data items according to the usage time interval of the data items is described below.
Specifically, the average number of times of use of all the data items is obtained by counting the number of times of use of each data item in the current period from the first time to the current time based on the use information, such as the use time, of the data items recorded by the buried point probe. Then, the frequency of use of each data item in the current period is calculated from the average number of use of all data items and the number of use of each data item.
And then, comparing the use frequency of each data item with the size of a preset threshold value, and taking the data item of which the use frequency does not exceed the preset threshold value as a candidate data item.
In order to improve the accuracy of identifying idle data items, candidate data items with longer use intervals are extracted from the candidate data items as idle data items. Specifically, the use time of the candidate data item in the last use in the current time period is acquired, the time interval from the use time in the last use to the current time is calculated, and the time interval is compared with the preset time interval. If the time interval exceeds the predetermined time interval, indicating that the candidate data item has not been used for a long time, the candidate data item may be regarded as an idle data item. Thus, all idle data items may be selected from the candidate data items.
The above two methods for identifying idle data items according to the usage information of each data item are provided in the embodiments of the present invention.
In order to more clearly illustrate the above embodiments, the data item definition updating method proposed by the embodiment of the present invention is further described below with reference to fig. 4.
As shown in fig. 4, the data item paraphrase updating method includes:
step 401, the identity information of the user is verified, and the user is determined to be a legal user.
In this embodiment, the data items are stored in a system, and the user can log in the system. The login can be performed in various ways, such as login through an account and a password obtained through registration, login through face recognition, or login through a mobile phone number, and the specific login way is not limited in the present invention.
After the user logs in, the system verifies the identity information of the user, and if the identity information input by the user during logging in is consistent with certain identity information prestored in the system, the user with the user rendezvous method can be determined. Otherwise, the user can be determined to be an illegal user, and the system displays prompt information of login failure.
It should be noted that the system supports all registered users to perform customization, and the customization can be performed between different organizations and different users.
Step 402, determine whether there is a user-defined paraphrase. If so, go to step 403; otherwise, step 404 is performed.
After the user successfully logs in the system, the system determines whether the paraphrase defined by the user exists, that is, whether a data item with updated paraphrase exists, wherein the data item may be an idle data item or an in-use data item.
Step 403, reading the custom paraphrases from the cache database and displaying the custom paraphrases to a user through a display interface.
And if the user-defined paraphrases exist, reading the user-defined paraphrases of the data items from the buffer database, and displaying the user-defined paraphrases to the user through a display interface.
In step 404, idle data items are identified from all data items.
If there is no definition defined by the user, the idle data item is identified from all the data items of the user, and the specific identification process can be referred to the above embodiment, which is not described herein again.
Step 405, ask the user whether to paraphrase update the idle data item.
The system may pop up a prompt on the display interface asking the user whether to paraphrase an idle data item for update.
Step 406, if the user does not update the idle data item, the original paraphrase is obtained and displayed through the display interface.
And when the user does not update the idle data item, acquiring the original paraphrase of the data item, and displaying the paraphrase on a display interface.
Step 407, if the user paraphrases and updates the idle data item, receiving a first paraphrase input by the user, and replacing the original paraphrase of the idle data item with the first paraphrase.
If the user paraphrases and updates the idle data item, the user can input a first paraphrase in the system, the system receives the first paraphrase input by the user, and the original paraphrase of the idle data item is replaced by the first paraphrase.
Step 408, a first paraphrase of the idle data item is presented to the user through the display interface.
After the paraphrases of the idle data items are updated, in order to enable a user to confirm the updating state of the idle data items, the first paraphrases of the idle data items after being updated can be displayed on a display interface so as to visually display the updating condition of the idle data items.
In the data item paraphrase updating method of the embodiment, by inquiring whether the user updates the paraphrases of the idle data items, when the user updates the paraphrases of the idle data items, the user can input the first paraphrases as required, obtain the first paraphrases input by the user and update the original paraphrases of the idle data items, so that the personalized requirements of the user are met.
The data item paraphrase updating method of the embodiment of the invention acquires the first paraphrase for replacing the original paraphrase of the idle data item by identifying the idle data item from all the data items, and completes paraphrase updating of the idle data item by replacing the original paraphrase with the first paraphrase. Therefore, when a new data item needs to be added, the original paraphrase of the idle data item can be replaced by the first paraphrase, the purpose of adding the data item is achieved by repeatedly utilizing the idle data item, the data item does not need to be added by occupying new storage space, the data storage space is saved, the operation cost is reduced, and the problems that in the related art, the number of data items is increased, a large amount of data storage space is occupied and the operation cost is increased due to the fact that the new data item is added by occupying the new storage space are solved.
In order to implement the above embodiments, the present invention further provides a data item updating apparatus. Fig. 5 is a schematic structural diagram of a data item updating apparatus according to an embodiment of the present invention.
As shown in fig. 5, the data item updating apparatus includes: an identification module 510, an acquisition module 520, and an update module 530.
The identifying module 510 is configured to identify an idle data item from all data items.
An obtaining module 520 is configured to obtain a first paraphrase that is used to replace an original paraphrase of an idle data item.
An update module 530 for replacing the original paraphrase with the first paraphrase to complete paraphrase updating of the idle data item.
As a possible implementation manner of this embodiment, the identifying module 510 may include:
the acquisition unit is used for acquiring the use information of the data items through a buried point probe preset by each data item; wherein the usage information at least comprises a usage time of each usage;
and the identification unit is used for identifying idle data items from the data items according to the use information of each data item.
As a possible implementation manner of this embodiment, the identification unit may be further configured to:
acquiring a first data item in the current time period according to the use information of the candidate data item in the previous time period; the previous time interval is a time interval formed from a preset first moment to the current moment, and the current time interval is a time interval formed from a preset second moment to the current moment; the second moment is a moment after the first moment; the generation time of the candidate data item is within the previous time period;
screening all first data items in the current time period according to the generation time of the first data items in the current time period to obtain candidate data items in the current time period;
and if the current time interval is the target time interval, taking the candidate data item in the current time interval as an idle data item.
As a possible implementation manner of this embodiment, the identification unit may be further configured to:
and if the current time interval is not the target time interval, taking the candidate data item of the current time interval as the candidate data item of the previous time interval, returning to execute the use information of the candidate data item according to the previous time interval again, and acquiring the first data item and the subsequent steps in the current time interval until the current time interval is the target time interval.
As a possible implementation manner of this embodiment, the identification unit may be further configured to:
counting the use times of each candidate data item in the previous time period according to the use information of the candidate data item in the previous time period;
acquiring a first use frequency of each candidate data item according to the average use times of all the candidate data items in the previous period and the use times of each candidate data item;
and selecting the candidate data item of which the first use frequency does not exceed the preset threshold value as the first data item in the current time period.
As a possible implementation manner of this embodiment, the identification unit may be further configured to:
obtaining a second use frequency of each first data item according to the average use times of all the first data items in the current time period and the use times of each first data item;
selecting a first data item of which the second use frequency does not exceed a preset threshold value as a second data item in the current time period;
selecting the first data items in the second time period as third data items according to the generation time of each first data item; the second time interval is a time interval formed from the third moment to the current moment; the third moment is a moment after the second moment;
obtaining a third using frequency of the third data items according to the average using times of all the third data items and the using times of each third data item;
selecting a third data item of which the third using frequency does not exceed a preset threshold value as a fourth data item in the current time period;
carrying out duplicate removal processing on the second data item and the fourth data item to obtain a candidate data item of the current time period
As a possible implementation manner of this embodiment, the identification unit may be further configured to:
counting the use times of the data items in the current time period according to the use information of each data item; the current time interval is a time interval formed from the first moment to the current moment;
acquiring the use frequency of each data item according to the total use times of all the data items and the use times of each data item in the current time period;
taking the data items of which the use frequency does not exceed a preset threshold value as candidate data items;
acquiring the use time of the candidate data item in the last use;
acquiring a time interval from the use time of the last use to the current time;
and determining candidate data items with the time interval exceeding the preset time interval as idle data items.
As a possible implementation manner of this embodiment, the apparatus may further include:
the determining module is used for verifying the identity information of the user before identifying the idle data items from all the data items and determining that the user is a legal user;
the query module is used for querying whether the user carries out paraphrase updating on the idle data item before acquiring a first paraphrase for replacing the original paraphrase of the idle data item.
As a possible implementation manner of this embodiment, the apparatus may further include:
and the display module is used for displaying the first paraphrase of the idle data item to a user through a display interface after the paraphrase updating of the idle data item is completed by replacing the original paraphrase with the first paraphrase.
It should be noted that the foregoing explanation of the embodiment of the data item definition updating method is also applicable to the data item definition updating apparatus of the embodiment, and is not repeated herein.
The data item paraphrase updating device of the embodiment of the invention acquires the first paraphrase for replacing the original paraphrase of the idle data item by identifying the idle data item from all the data items, and replaces the original paraphrase with the first paraphrase to complete paraphrase updating of the idle data item. Therefore, when a new data item needs to be added, the original paraphrase of the idle data item can be replaced by the first paraphrase, the purpose of adding the data item is achieved by repeatedly utilizing the idle data item, the data item does not need to be added by occupying new storage space, the data storage space is saved, the operation cost is reduced, and the problems that in the related art, the number of data items is increased, a large amount of data storage space is occupied and the operation cost is increased due to the fact that the new data item is added by occupying the new storage space are solved.
In order to implement the above embodiments, the present invention further provides a computer device, including a processor and a memory;
wherein, the processor executes the program corresponding to the executable program code by reading the executable program code stored in the memory, so as to realize the data item paraphrasing updating method according to any one of the foregoing embodiments.
FIG. 6 illustrates a block diagram of an exemplary computer device suitable for use to implement embodiments of the present application. The computer device 12 shown in fig. 6 is only an example and should not bring any limitation to the function and scope of use of the embodiments of the present application.
As shown in FIG. 6, computer device 12 is in the form of a general purpose computing device. The components of computer device 12 may include, but are not limited to: one or more processors or processing units 16, a system memory 28, and a bus 18 that couples various system components including the system memory 28 and the processing unit 16.
Bus 18 represents one or more of any of several types of bus structures, including a memory bus or memory controller, a peripheral bus, an accelerated graphics port, and a processor or local bus using any of a variety of bus architectures. These architectures include, but are not limited to, Industry Standard Architecture (ISA) bus, Micro Channel Architecture (MAC) bus, enhanced ISA bus, Video Electronics Standards Association (VESA) local bus, and Peripheral Component Interconnect (PCI) bus, to name a few.
Computer device 12 typically includes a variety of computer system readable media. Such media may be any available media that is accessible by computer device 12 and includes both volatile and nonvolatile media, removable and non-removable media.
Memory 28 may include computer system readable media in the form of volatile Memory, such as Random Access Memory (RAM) 30 and/or cache Memory 32. Computer device 12 may further include other removable/non-removable, volatile/nonvolatile computer system storage media. By way of example only, storage system 34 may be used to read from and write to non-removable, nonvolatile magnetic media (not shown in FIG. 6, and commonly referred to as a "hard drive"). Although not shown in FIG. 6, a disk drive for reading from and writing to a removable, nonvolatile magnetic disk (e.g., a "floppy disk") and an optical disk drive for reading from or writing to a removable, nonvolatile optical disk (e.g., a Compact disk Read Only memory (CD-ROM), a Digital versatile disk Read Only memory (DVD-ROM), or other optical media) may be provided. In these cases, each drive may be connected to bus 18 by one or more data media interfaces. Memory 28 may include at least one program product having a set (e.g., at least one) of program modules that are configured to carry out the functions of embodiments of the application.
A program/utility 40 having a set (at least one) of program modules 42 may be stored, for example, in memory 28, such program modules 42 including, but not limited to, an operating system, one or more application programs, other program modules, and program data, each of which examples or some combination thereof may comprise an implementation of a network environment. Program modules 42 generally perform the functions and/or methodologies of the embodiments described herein.
Computer device 12 may also communicate with one or more external devices 14 (e.g., keyboard, pointing device, display 24, etc.), and may also communicate with one or more devices that enable a user to interact with the computer device 12, and/or with any devices (e.g., Network card, modem, etc.) that enable the computer device 12 to communicate with one or more other computing devices, such communication may occur via input/output (I/O) interfaces 22. moreover, computer device 12 may also communicate with one or more networks (e.g., local Area Network (L Area Network; L AN) Wide Area Network (WAN; WAN) and/or a public Network such as the WAN) via Network adapter 20. As shown, Network adapter 20 communicates with other modules of computer device 12 via bus 18. it should be understood that, although not shown, other hardware and/or software modules may be used in connection with computer device 12, including, but not limited to, microcode, a microcode processing device, a redundant array drive, a disk drive system, a RAID storage system, and the like.
The processing unit 16 executes various functional applications and data processing, for example, implementing the methods mentioned in the foregoing embodiments, by executing programs stored in the system memory 28.
In order to implement the above embodiments, the present invention further provides a computer program product, wherein when the instructions in the computer program product are executed by a processor, the data item paraphrasing updating method according to any one of the foregoing embodiments is implemented.
In order to implement the above embodiments, the present invention also proposes a non-transitory computer-readable storage medium, on which a computer program is stored, which when executed by a processor implements the data item paraphrasing update method as described in any of the preceding embodiments.
In the description herein, references to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above are not necessarily intended to refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. Furthermore, various embodiments or examples and features of different embodiments or examples described in this specification can be combined and combined by one skilled in the art without contradiction.
Furthermore, the terms "first", "second" and "first" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include at least one such feature. In the description of the present invention, "a plurality" means at least two, e.g., two, three, etc., unless specifically limited otherwise.
Any process or method descriptions in flow charts or otherwise described herein may be understood as representing modules, segments, or portions of code which include one or more executable instructions for implementing steps of a custom logic function or process, and alternate implementations are included within the scope of the preferred embodiment of the present invention in which functions may be executed out of order from that shown or discussed, including substantially concurrently or in reverse order, depending on the functionality involved, as would be understood by those reasonably skilled in the art of the present invention.
The logic and/or steps represented in the flowcharts or otherwise described herein, e.g., an ordered listing of executable instructions that can be considered to implement logical functions, can be embodied in any computer-readable medium for use by or in connection with an instruction execution system, apparatus, or device, such as a computer-based system, processor-containing system, or other system that can fetch the instructions from the instruction execution system, apparatus, or device and execute the instructions. For the purposes of this description, a "computer-readable medium" can be any means that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device. More specific examples (a non-exhaustive list) of the computer-readable medium would include the following: an electrical connection (electronic device) having one or more wires, a portable computer diskette (magnetic device), a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber device, and a portable compact disc read-only memory (CDROM). Additionally, the computer-readable medium could even be paper or another suitable medium upon which the program is printed, as the program can be electronically captured, via for instance optical scanning of the paper or other medium, then compiled, interpreted or otherwise processed in a suitable manner if necessary, and then stored in a computer memory.
It should be understood that portions of the present invention may be implemented in hardware, software, firmware, or a combination thereof. In the above embodiments, the various steps or methods may be implemented in software or firmware stored in memory and executed by a suitable instruction execution system. If implemented in hardware, as in another embodiment, any one or combination of the following techniques, which are known in the art, may be used: a discrete logic circuit having a logic gate circuit for implementing a logic function on a data signal, an application specific integrated circuit having an appropriate combinational logic gate circuit, a Programmable Gate Array (PGA), a Field Programmable Gate Array (FPGA), or the like.
It will be understood by those skilled in the art that all or part of the steps carried by the method for implementing the above embodiments may be implemented by hardware related to instructions of a program, which may be stored in a computer readable storage medium, and when the program is executed, the program includes one or a combination of the steps of the method embodiments.
In addition, functional units in the embodiments of the present invention may be integrated into one processing module, or each unit may exist alone physically, or two or more units are integrated into one module. The integrated module can be realized in a hardware mode, and can also be realized in a software functional module mode. The integrated module, if implemented in the form of a software functional module and sold or used as a stand-alone product, may also be stored in a computer readable storage medium.
The storage medium mentioned above may be a read-only memory, a magnetic or optical disk, etc. Although embodiments of the present invention have been shown and described above, it is understood that the above embodiments are exemplary and should not be construed as limiting the present invention, and that variations, modifications, substitutions and alterations can be made to the above embodiments by those of ordinary skill in the art within the scope of the present invention.

Claims (10)

1. A method for updating paraphrase of a data item, comprising:
identifying idle data items from all data items;
obtaining a first paraphrase that replaces an original paraphrase of the idle data item;
replacing the original paraphrase with the first paraphrase to complete paraphrase updating of the idle data item;
the identifying of the idle data item from all the data items comprises:
acquiring the use information of each data item through a preset buried point probe of each data item; wherein the usage information includes at least a usage time per usage;
identifying idle data items from the data items according to the usage information of each data item;
the identifying idle data items from the data items according to the usage information of each data item includes:
acquiring a first data item in the current time period according to the use information of the candidate data item in the previous time period; the previous time interval is a time interval formed from a preset first moment to a current moment, and the current time interval is a time interval formed from a preset second moment to the current moment; the second moment is a moment after the first moment; the generation time of the candidate data item is within the previous time period;
screening all first data items in the current time period according to the generation time of the first data items in the current time period to obtain candidate data items in the current time period;
if the current time interval is a target time interval, the candidate data item in the current time interval is taken as the idle data item.
2. The method of claim 1, further comprising:
if the current time interval is not the target time interval, the candidate data item of the current time interval is used as the candidate data item of the previous time interval, the use information of the candidate data item according to the previous time interval is returned to be executed again, and the first data item and the subsequent steps in the current time interval are obtained until the current time interval is the target time interval.
3. The method of claim 1, wherein obtaining the first data item in the current time interval according to the usage information of the candidate data items in the previous time interval comprises:
counting the use times of each candidate data item in the previous time period according to the use information of the candidate data item in the previous time period;
acquiring a first use frequency of each candidate data item according to the average use times of all candidate data items in the previous time period and the use times of each candidate data item;
and selecting the candidate data item of which the first use frequency does not exceed a preset threshold value as a first data item in the current time period.
4. The method of claim 3, wherein the screening all the first data items in the current time period according to the generation time of the first data items in the current time period to obtain candidate data items in the current time period comprises:
obtaining a second use frequency of each first data item according to the average use times of all the first data items in the current time period and the use times of each first data item;
selecting the first data item of which the second using frequency does not exceed the preset threshold value as a second data item in the current time period;
selecting the first data items in the second time period as third data items according to the generation time of each first data item; the second time interval is a time interval formed from a third moment to the current moment; the third moment is a moment after the second moment;
obtaining a third use frequency of the third data items according to the average use times of all the third data items and the use times of each third data item;
selecting the third data item of which the third using frequency does not exceed the preset threshold value as a fourth data item in the current time period;
and performing deduplication processing on the second data item and the fourth data item to obtain the candidate data item of the current time period.
5. The method of claim 1, wherein the identifying idle data items from the data items according to the usage information of each data item comprises:
counting the number of times of use of each data item in the current time period according to the use information of the data item; the current time interval is a time interval formed from a first moment to the current moment;
acquiring the use frequency of each data item according to the total use times of all the data items in the current time period and the use times of each data item;
taking the data items of which the use frequency does not exceed a preset threshold value as candidate data items;
acquiring the use time of the candidate data item when the candidate data item is used for the last time;
acquiring a time interval from the use time to the current time when the last use is carried out;
determining the candidate data item of which the time interval exceeds a preset time interval as the idle data item.
6. The method of any of claims 1-5, wherein before identifying idle data items from all data items, further comprising:
verifying the identity information of a user and determining that the user is a legal user;
before the obtaining a first paraphrase for replacing the original paraphrase of the idle data item, further comprising:
asking the user whether to paraphrase update the idle data item.
7. The method of any of claims 1-5, wherein said replacing said original paraphrase with said first paraphrase further comprises, after completion of a paraphrase update on said idle data item:
presenting the first paraphrase of the idle data item to a user through a display interface.
8. A data item paraphrasing update apparatus, comprising:
the identification module is used for identifying idle data items from all the data items;
an acquisition module for acquiring a first paraphrase for replacing an original paraphrase of the idle data item;
an update module for replacing the original paraphrase with the first paraphrase to complete paraphrase updating of the idle data item;
the identification module comprises:
the acquisition unit is used for acquiring the use information of each data item through a preset buried point probe of the data item; wherein the usage information includes at least a usage time per usage;
an identifying unit configured to identify an idle data item from the data items according to the usage information of each data item;
the identification unit is further configured to:
acquiring a first data item in the current time period according to the use information of the candidate data item in the previous time period; the previous time interval is a time interval formed from a preset first moment to a current moment, and the current time interval is a time interval formed from a preset second moment to the current moment; the second moment is a moment after the first moment; the generation time of the candidate data item is within the previous time period;
screening all first data items in the current time period according to the generation time of the first data items in the current time period to obtain candidate data items in the current time period;
if the current time interval is a target time interval, the candidate data item in the current time interval is taken as the idle data item.
9. A computer device comprising a processor and a memory;
wherein the processor executes a program corresponding to the executable program code by reading the executable program code stored in the memory for implementing the data item paraphrasing update method as recited in any one of claims 1 to 7.
10. A non-transitory computer-readable storage medium, on which a computer program is stored, which, when executed by a processor, implements a data item paraphrasing update method as claimed in any one of claims 1 to 7.
CN201711287741.2A 2017-12-07 2017-12-07 Data item paraphrasing updating method and device Active CN108108402B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711287741.2A CN108108402B (en) 2017-12-07 2017-12-07 Data item paraphrasing updating method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711287741.2A CN108108402B (en) 2017-12-07 2017-12-07 Data item paraphrasing updating method and device

Publications (2)

Publication Number Publication Date
CN108108402A CN108108402A (en) 2018-06-01
CN108108402B true CN108108402B (en) 2020-07-17

Family

ID=62209402

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711287741.2A Active CN108108402B (en) 2017-12-07 2017-12-07 Data item paraphrasing updating method and device

Country Status (1)

Country Link
CN (1) CN108108402B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103049392A (en) * 2012-10-17 2013-04-17 华为技术有限公司 Method and device for achieving cache catalogue
CN103108047A (en) * 2013-02-06 2013-05-15 浪潮电子信息产业股份有限公司 Optimization method for object storage system metadata cache
CN105162622A (en) * 2015-08-03 2015-12-16 瑞斯康达科技发展股份有限公司 Storage method and system

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6307866B1 (en) * 1997-01-15 2001-10-23 Roke Manor Research Limited Broadband telecommunications system
WO2013124896A1 (en) * 2012-02-23 2013-08-29 Hitachi, Ltd. Chunk- based deduplication system and method thereof
CN103313121A (en) * 2013-06-28 2013-09-18 深圳Tcl新技术有限公司 Method and device for recording by utilizing free memories of television set

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103049392A (en) * 2012-10-17 2013-04-17 华为技术有限公司 Method and device for achieving cache catalogue
CN103108047A (en) * 2013-02-06 2013-05-15 浪潮电子信息产业股份有限公司 Optimization method for object storage system metadata cache
CN105162622A (en) * 2015-08-03 2015-12-16 瑞斯康达科技发展股份有限公司 Storage method and system

Also Published As

Publication number Publication date
CN108108402A (en) 2018-06-01

Similar Documents

Publication Publication Date Title
CN108073519B (en) Test case generation method and device
CN108038500B (en) Clustering method, apparatus, computer device, storage medium, and program product
CN109657213B (en) Text similarity detection method and device and electronic equipment
CN110377704B (en) Data consistency detection method and device and computer equipment
US20140317495A1 (en) Retroactive word correction
CN108415917B (en) Visual report generation method and visual report generation device
CN104850318A (en) Method and apparatus for transient message display control
CN109669798A (en) Crash analysis method, crash analysis device, electronic equipment and storage medium
CN110737680A (en) Cache data management method and device, storage medium and electronic equipment
CN104267872A (en) Application program APP information display method and device and mobile terminal
CN105117489A (en) Database management method and device and electronic equipment
CN112783749A (en) Static code scanning optimization method and device, electronic equipment and storage medium
CN114461691A (en) Control method and device of state machine, electronic equipment and storage medium
CN106897224B (en) Method and device for determining software testing range
CN108108402B (en) Data item paraphrasing updating method and device
CN109828902B (en) Interface parameter determining method and device, electronic equipment and storage medium
CN109710626B (en) Data warehousing management method and device, electronic equipment and storage medium
CN105868191B (en) Data processing method and device
CN114639056A (en) Live content identification method and device, computer equipment and storage medium
CN112783751A (en) Incremental code scanning method and device, electronic equipment and storage medium
CN109299294B (en) Resource searching method and device in application, computer equipment and storage medium
CN113157671A (en) Data monitoring method and device
CN105843715B (en) Information processing method and electronic equipment
CN109814897B (en) Application program file management method and system
CN110602737B (en) Regional removal identification method and device, computer equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant