CN116974468A - Equipment data storage management method and system based on big data - Google Patents

Equipment data storage management method and system based on big data Download PDF

Info

Publication number
CN116974468A
CN116974468A CN202310878824.8A CN202310878824A CN116974468A CN 116974468 A CN116974468 A CN 116974468A CN 202310878824 A CN202310878824 A CN 202310878824A CN 116974468 A CN116974468 A CN 116974468A
Authority
CN
China
Prior art keywords
data
storage
equipment
storage position
migration
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202310878824.8A
Other languages
Chinese (zh)
Other versions
CN116974468B (en
Inventor
林海棠
刘楠
唐燕芬
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Huayuan Network Engineering Co ltd
Original Assignee
Guangdong Huayuan Network Engineering Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Huayuan Network Engineering Co ltd filed Critical Guangdong Huayuan Network Engineering Co ltd
Priority to CN202310878824.8A priority Critical patent/CN116974468B/en
Publication of CN116974468A publication Critical patent/CN116974468A/en
Application granted granted Critical
Publication of CN116974468B publication Critical patent/CN116974468B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0602Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
    • G06F3/0604Improving or facilitating administration, e.g. storage management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0602Interfaces specially adapted for storage systems specifically adapted to achieve a particular effect
    • G06F3/062Securing storage systems
    • G06F3/0623Securing storage systems in relation to content
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0646Horizontal data movement in storage systems, i.e. moving data in between storage devices or systems
    • G06F3/0647Migration mechanisms
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0628Interfaces specially adapted for storage systems making use of a particular technique
    • G06F3/0653Monitoring storage devices or systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0668Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/0671In-line storage system
    • G06F3/0683Plurality of storage devices

Abstract

The invention discloses a device data storage management method and system based on big data, which relate to the technical field of device data storage management and comprise the following steps: s1: for the equipment in the access equipment data storage management system, equipment numbers are allocated, and uploading data of each equipment is analyzed in a mode of reading configuration files; s2: monitoring storage management operation of equipment data, acquiring storage management operation of corresponding data of each equipment in an equipment data storage system, and integrating the acquired storage management operation; s3: updating the actual storage capacity of each storage position, acquiring the data access operation of each storage position, extracting the data storage time of the equipment data in each storage position, and judging whether the stored data in each storage position is migrated or not; s4: and managing the data transmission channels among the storage positions according to the judging result of whether the stored data in each storage position is migrated or not.

Description

Equipment data storage management method and system based on big data
Technical Field
The invention relates to the technical field of equipment data storage management, in particular to an equipment data storage management method and system based on big data.
Background
With the development of big data technology, the size and complexity of device data are also increasing. Because the equipment data has the characteristics of mass, continuity, diversity, isomerism and the like, the data needs to be collected and processed in real time, and the data flow is filtered and screened in real time so as to extract key information and provide efficient and reliable storage and management.
The efficiency of storage management systems is critical to data storage management, which currently mainly takes into account the performance of the storage system to cope with ever-increasing amounts of data and access requests. However, life cycle management on data use conditions in data storage management is not perfect, and currently, judgment on a device data storage position by selecting a data type or a history storage position of each device data in history storage data is mainly considered for automatic data storage, but the judgment mode of the storage position is too one-sided and cannot meet the actual requirement of a data storage management system. When the storage position of the data of the device is not reasonable enough, the data management difficulty or the system performance reduction may be caused, so that the device frequently reads and writes the data, the load and the loss of the device are increased, and even the data access efficiency of the system may be affected.
Therefore, in order to solve the above problems or part of the problems, the present invention provides a device data storage management method and system based on big data.
Disclosure of Invention
The invention aims to provide a device data storage management method and system based on big data, so as to solve the problems in the background technology.
In order to solve the technical problems, the invention provides the following technical scheme: a device data storage management method based on big data comprises the following steps:
s1: for the equipment in the access equipment data storage management system, equipment numbers are allocated, and uploading data of each equipment is analyzed in a mode of reading configuration files;
s2: monitoring storage management operation of equipment data, acquiring storage management operation of corresponding data of each equipment in an equipment data storage system, and integrating the acquired storage management operation;
s3: updating the actual storage capacity of each storage position, acquiring the data access operation of each storage position, extracting the data storage time of the equipment data in each storage position, and judging whether the stored data in each storage position is migrated or not;
s4: and managing the data transmission channels among the storage positions according to the judging result of whether the stored data in each storage position is migrated or not so as to ensure the smooth transmission and storage of the data.
Further, the step S1 includes:
step S1-1: respectively allocating different equipment numbers to each equipment of the connection system based on corresponding working contents;
step S1-2: and transmitting the equipment data to a processing platform, and analyzing the equipment data according to a predefined configuration file.
Further, the step S2 includes:
step S2-1: monitoring the data set uploaded by each device, and acquiring all storage management operations of the data uploaded by each numbering device in a device storage management system to respectively acquire a storage operation set of each data storage corresponding to the data uploaded by each numbering device;
step S2-2: integrating the storage operation sets of the devices in each device data set to obtain { C } 1 ,C 2 ,...,C n The storage operation set with the highest repeated occurrence number in each numbering device is regarded as a first storage operation set corresponding to the distribution of each device data set, and each storage operation in the first storage operation set is set as one storage operation node corresponding to each device data set;
step S2-3: integrating storage positions corresponding to the first storage operation set of each device data set to obtain a storage operation node C at any one i Storage information Z relating to its device data set Ci The method comprises the steps of carrying out a first treatment on the surface of the Wherein C is i Representing a store operation of an ith operation node, Z Ci Storage location information corresponding to a storage operation of the i-th operation node;
step S2-4: for each device data setAcquiring all operation nodes of the corresponding storage positions, and integrating the storage position data corresponding to each acquired device data set to obtain storage operation information of each device data set for each storage position in the device data storage management system; the storage operation information includes a storage operation path R of each device data at each storage location, r=z C1 →...→Z Cj →...→Z Cm Wherein Z is C1 、...、Z Cj 、...、Z Cm Storage location information corresponding to storage operations of the 1 st, i., j, m operation nodes are respectively indicated. The integrated storage location information may be used for subsequent data management operations.
Further, the step S3 includes:
step S3-1: when the data storage operation of the data of each device is finished, the storage capacity change in each storage position is updated in real time;
by analyzing the storage capacity of the storage locations, it is convenient to determine whether data cleaning, archiving, or migration is required. Therefore, the use of the storage position can be effectively managed, and data overflow or resource waste is avoided.
Step S3-2: setting the data storage time of any device data S corresponding to a data storage position P to be longer than D, collecting access operation generated in each data storage process of the device data S, and sequencing according to the storage operation paths of the device data in each storage position to obtain an accessed operation set of the device data S in a device data storage management system about each storage position;
step S3-3: setting a storage location in a device data storage management system to p= { P 1 ,P 2 ,...,P h Y which is less than a preset threshold value for the number of operations to be accessed 1 The device data storage locations of (1) are recorded as a first storage location set, and the number of accessed operations exceeds y of a preset threshold value 2 Is noted as a second set of storage locations;
step S3-4: judging the equipment data of which the accessed operation times do not belong to the preset threshold range, and analyzing whether the data migration is performed comprises the following steps:
step S3-4-1: analyzing the storage time of the equipment data in the current storage position, if the data storage time of the current equipment data exceeds a preset threshold T, executing the next analysis, otherwise, temporarily not performing data migration;
step S3-4-2: analyzing the current storage position of the equipment data, and judging whether the current data storage position can meet the access frequency of the current equipment data;
if the storage position of the current device data belongs to the first storage position set, executing the next analysis when the storage position of the device data is a high-speed storage medium; otherwise, not carrying out data migration;
if the storage position of the current device data belongs to the second storage position set, executing the next analysis when the storage position of the device data is a low-speed storage medium; otherwise, not carrying out data migration;
step S3-4-3: analyzing the change rule of the accessed times of the device data in any time period of the current storage position according to the following formula:
A(t)=∑[η(i)*A(t-i)]+∑[μ(j)*λ(t-v)]+c;
wherein A (t) represents the number of times the device data is accessed in the current storage position within time t, eta (i) represents the weight of the previous i time points, mu (j) represents the weight of the jth error term, lambda (t-v) is the jth error term, and c represents a constant which is preset by a related technician;
monitoring the predicted value of the accessed frequency of the equipment data in the current storage position, and according to the variation trend of the accessed frequency of the equipment data in the current storage position, when A (t) is in the completion time of completing data migration, reaching a preset threshold range y 1 Or y 2 If the data is not migrated, the data is not migrated; otherwise, data migration is carried out, and the next step is executed;
preferably, the amount of resources required for migration of device data may also be analyzed.
Further, the step S4 includes:
step S4-1: marking the front storage according to the storage operation path of the device data, if any storage position in the storage operation path of the device data, which is in front of the current storage position, meets the data migration condition of the device data, wherein the position has priority in the storage position selection of the data migration;
if a front storage position in a storage operation path of the device data meets the data migration requirement and the backup of the current device data exists in the position, the device data is not migrated, only the device data in the current storage position is deleted, and an access address of the device data is modified to the front storage position so as to reduce the workload of a system;
step S4-2: if the storage position which does not meet the data migration of the device data is in front of the current storage position in the storage operation path of the device data, screening the storage position types meeting the current accessed and storage time requirements of the device data in the device data storage management system;
step S4-3: analyzing the available storage capacity of the storage positions obtained by screening in the step S4-2, and judging whether the available storage capacity is larger than the storage occupation amount required by the equipment data or not so that the actual storage capacity of each storage position can meet the storage requirement; managing data transmission among the storage positions according to the judgment result; and managing the data transmission channels among the storage positions to ensure smooth transmission and storage of data. In addition, operations such as data backup, data migration, data replication and the like may be involved, and security and reliability of device data need to be ensured.
Step S4-4: and feeding back the data storage position selection of the automatic data storage according to the data migration result, and judging the influence of the addition of the historical data migration on the selection of the automatic data storage position of the equipment number corresponding to the equipment data in the equipment data storage management system.
By combining the influence of data migration in the history storage management data during the automated data storage, the reliability of the automated data storage management can be further improved.
A big data based device data storage management system, the system comprising: the device comprises a device account management module, a data storage operation acquisition module, a data migration discrimination module and a storage position management module;
the device account management module is used for recording the devices of each access system and distributing corresponding device numbers based on the device types or service contents corresponding to the devices;
the data storage operation acquisition module is used for monitoring the storage management operation of the uploading data of each device and acquiring the storage operation data in the storage process;
the data migration judging module is used for replacing storage related information of each device data and judging migration of stored device data in each storage position;
the storage position management module is used for monitoring the storage capacity in each storage position in real time and managing the storage and transmission of the equipment data according to the real-time storage capacity condition in each storage position and the migration judgment result of the stored data in each storage position.
Further, the data storage operation acquisition module comprises an equipment data identification unit, a storage operation monitoring unit and a storage operation integration unit;
the device data identification unit is used for identifying each device accessed to the device data storage management system, and receiving the data uploaded by the device with the system authority through identifying the device interface or the device number;
the storage operation monitoring unit is used for monitoring the storage operation of the data uploaded by each device in the device data storage management system and acquiring the storage operation corresponding to each device data;
the storage operation integration unit is used for integrating the storage operation of the acquired device data with the storage information corresponding to each operation.
Further, the data migration discriminating module comprises a stored information analyzing unit and a data migration discriminating unit;
the storage information analysis unit is used for acquiring the equipment data related storage information required by data migration discrimination and respectively analyzing the storage information affecting the data migration discrimination;
the data migration judging unit is used for judging whether the equipment data of the current storage position are migrated according to the storage information which influences the data migration judgment, and transmitting the judging result to the storage position management module.
Further, the storage position management module comprises a storage position monitoring unit, a data transmission management unit and a storage data feedback unit;
the storage position monitoring unit is used for monitoring each storage position in the equipment data storage management system and updating the real-time storage capacity of each storage position;
the data transmission management unit is used for managing data transmission among all storage positions according to the data migration discrimination result;
the data storage feedback unit is used for feeding back the data storage position selection of the automatic data storage according to the data migration judging result, and further, when the automatic data storage is carried out, the influence on the data migration in the historical storage management data is required to be combined.
Compared with the prior art, the invention has the following beneficial effects:
the invention records the equipment of each access system through the equipment account management module, and allocates corresponding equipment numbers based on the equipment types or service contents corresponding to the equipment; so as to facilitate the management and tracking of each device and determine the attribution of the device data;
monitoring the storage management operation of the uploading data of each device through a data storage operation acquisition module, and acquiring the storage operation data in the storage process; the method is convenient for knowing the data storage condition of each device in real time, thereby effectively monitoring the storage operation of the uploaded data and finding out the storage abnormality in time;
replacing storage related information of each device data through a data migration judging module, and judging migration of stored device data in each storage position; analyzing the accessed condition of the stored data in the system, analyzing the rationality of the device data in the current storage position by combining with the influence factors such as a storage operation path, adjusting the storage position of the device data in time, and migrating the device data can ensure the control of the data storage cost, reduce the resource waste of the storage space and improve the utilization rate of storage resources;
the storage capacity in each storage position is monitored in real time through the storage position management module, and the storage and transmission of the equipment data are managed according to the real-time storage capacity condition in each storage position and the migration judgment result of the stored data in each storage position. The influence of data migration on the data storage of the equipment is considered, the reliability of automatic data storage management is further improved, the actual requirement of a data storage management system is met, and the data access efficiency of the system is improved.
Drawings
The accompanying drawings are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate the invention and together with the embodiments of the invention, serve to explain the invention. In the drawings:
FIG. 1 is a schematic block diagram of a big data based device data storage management system according to the present invention;
fig. 2 is a flow chart of a device data storage management method based on big data according to the present invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
The invention is further described with reference to fig. 1, 2 and embodiments.
Example 1: as shown in fig. 1, the present embodiment provides a device data storage management system based on big data, the system including: the device comprises a device account management module, a data storage operation acquisition module, a data migration discrimination module and a storage position management module;
the device account management module is used for recording the devices of each access system and distributing corresponding device numbers based on the device types or service contents corresponding to the devices;
the data storage operation acquisition module is used for monitoring the storage management operation of the uploading data of each device and acquiring the storage operation data in the storage process; the data storage operation acquisition module comprises an equipment data identification unit, a storage operation monitoring unit and a storage operation integration unit;
the device data identification unit is used for identifying each device accessed into the device data storage management system, and receiving the data uploaded by the device with the system authority through identifying the device interface or the device number;
the storage operation monitoring unit is used for monitoring the storage operation of the data uploaded by each device in the device data storage management system and acquiring the storage operation corresponding to each device data;
the storage operation integration unit is used for integrating the storage operation of the acquired device data with the storage information corresponding to each operation.
The data migration judging module is used for replacing storage related information of each device data and judging migration of stored device data in each storage position; the data migration discriminating module comprises a stored information analyzing unit and a data migration discriminating unit;
the storage information analysis unit is used for acquiring the equipment data related storage information required by data migration discrimination and respectively analyzing the storage information affecting the data migration discrimination;
the data migration discriminating unit is used for discriminating whether the equipment data of the current storage position is migrated or not according to the storage information which influences the data migration discrimination, and transmitting the discrimination result to the storage position management module.
The storage position management module is used for monitoring the storage capacity in each storage position in real time and managing the storage and transmission of the equipment data according to the real-time storage capacity condition in each storage position and the migration judgment result of the stored data in each storage position; the storage position management module comprises a storage position monitoring unit, a data transmission management unit and a storage data feedback unit;
the storage position monitoring unit is used for monitoring each storage position in the equipment data storage management system and updating the real-time storage capacity of each storage position;
the data transmission management unit is used for managing data transmission among the storage positions according to the data migration discrimination result;
and the stored data feedback unit is used for feeding back the data storage position selection of the automatic data storage according to the data migration discrimination result.
Example 2: as shown in fig. 2, the present embodiment provides a device data storage management method based on big data, which is implemented based on a device data storage management system based on big data in the embodiment, and specifically includes the following steps:
s1: for the equipment in the access equipment data storage management system, equipment numbers are allocated, and uploading data of each equipment is analyzed in a mode of reading configuration files;
step S1-1: respectively allocating different equipment numbers to each equipment of the connection system based on corresponding working contents;
step S1-2: and transmitting the equipment data to a processing platform, and analyzing the equipment data according to a predefined configuration file.
S2: monitoring storage management operation of equipment data, acquiring storage management operation of corresponding data of each equipment in an equipment data storage system, and integrating the acquired storage management operation;
step S2-1: monitoring the data set uploaded by each device, and acquiring all storage management operations of the data uploaded by each numbering device in a device storage management system to respectively acquire a storage operation set of each data storage corresponding to the data uploaded by each numbering device;
the storage management operation of the equipment data can be obtained by monitoring the methods such as system logs, API calls, data auditing or custom setting event triggers and the like;
step S2-2: integrating the storage operation sets of the devices in each device data set to obtain { C } 1 ,C 2 ,...,C n The storage operation set with the highest repeated occurrence number in each numbering device is regarded as a first storage operation set corresponding to the distribution of each device data set, and each storage operation in the first storage operation set is set as one storage operation node corresponding to each device data set;
step S2-3: integrating storage positions corresponding to the first storage operation set of each device data set to obtain a storage operation node C at any one i Storage information Z relating to its device data set Ci The method comprises the steps of carrying out a first treatment on the surface of the Wherein C is i Representing a store operation of an ith operation node, Z Ci Storage location information corresponding to a storage operation of the i-th operation node;
step S2-4: acquiring all operation nodes of storage positions corresponding to the equipment data sets, and integrating the acquired storage position data corresponding to the equipment data sets to obtain storage operation information of the equipment data sets for the storage positions in the equipment data storage management system; the storage operation information includes a storage operation path R of each device data at each storage location, r=z C1 →...→Z Cj →...→Z Cm Wherein Z is C1 、...、Z Cj 、...、Z Cm Storage location information corresponding to storage operations of the 1 st, i., j, m operation nodes are respectively indicated.
The integrated storage location information may be used for subsequent data management operations such as backup and restore, data migration, or optimizing storage resource allocation.
S3: updating the actual storage capacity of each storage position, acquiring the data access operation of each storage position, extracting the data storage time of the equipment data in each storage position, and judging whether the stored data in each storage position is migrated or not;
step S3-1: when the data storage operation of the data of each device is finished, the storage capacity change in each storage position is updated in real time;
by analyzing the storage capacity of a storage location, such as a hard disk, database, etc., it is convenient to determine whether data cleaning, archiving, or migration is required. Therefore, the use of the storage position can be effectively managed, and data overflow or resource waste is avoided.
Step S3-2: setting the data storage time of any device data S corresponding to a data storage position P to be longer than D, collecting access operation generated in each data storage process of the device data S, and sequencing according to the storage operation paths of the device data in each storage position to obtain an accessed operation set of the device data S in a device data storage management system about each storage position;
step S3-3: setting a storage location in a device data storage management system to p= { P 1 ,P 2 ,...,P h Y which is less than a preset threshold value for the number of operations to be accessed 1 The device data storage locations of (1) are recorded as a first storage location set, and the number of accessed operations exceeds y of a preset threshold value 2 Is noted as a second set of storage locations; if there is device data S Pk Storage position P k The number of accesses exceeding a preset threshold y 2 Wherein k is [1, h ]]The method comprises the steps of carrying out a first treatment on the surface of the For the equipment data S Pk Marking a previous storage position of a storage operation path R of each storage position;
step S3-4: judging the equipment data of which the accessed operation times do not belong to the preset threshold range, and analyzing whether the data migration is performed comprises the following steps:
step S3-4-1: analyzing the storage time of the equipment data in the current storage position, if the data storage time of the current equipment data exceeds a preset threshold T, executing the next analysis, otherwise, temporarily not performing data migration;
step S3-4-2: analyzing the current storage position of the equipment data, and judging whether the current data storage position can meet the access frequency of the current equipment data;
if the storage position of the current device data belongs to the first storage position set, executing the next analysis when the storage position of the device data is a high-speed storage medium; otherwise, not carrying out data migration;
if the storage position of the current device data belongs to the second storage position set, executing the next analysis when the storage position of the device data is a low-speed storage medium; otherwise, not carrying out data migration;
for example, for y, the number of accessed operations exceeds a preset threshold 2 The device data storage location of (2) is recorded as a second storage location set, and if the current storage location is a high-speed storage medium, for example, a Solid State Disk (SSD) or a memory database which can provide lower access delay and faster data retrieval speed is not migrated; if the current storage location is a low-speed storage medium, such as a tape library or cold storage with lower storage cost, performing the next analysis;
step S3-4-3: analyzing the change rule of the accessed times of the device data in any time period of the current storage position according to the following formula:
A(t)=∑[η(i)*A(t-i)]+∑[μ(j)*λ(t-v)]+c;
wherein A (t) represents the number of times the device data is accessed in the current storage position within time t, eta (i) represents the weight of the previous i time points, mu (j) represents the weight of the jth error term, lambda (t-v) is the jth error term, and c represents a constant which is preset by a related technician;
monitoring the predicted value of the accessed frequency of the equipment data in the current storage position, and according to the variation trend of the accessed frequency of the equipment data in the current storage position, when A (t) is in the completion time of completing data migration, reaching a preset threshold range y 1 Or y 2 If the data is not migrated, the data is not migrated; whether or notThen, data migration is carried out, and the next step is executed;
s4: and managing the data transmission channels among the storage positions according to the judging result of whether the stored data in each storage position is migrated or not.
Step S4-1: marking the front storage according to the storage operation path of the device data, if any storage position in the storage operation path of the device data, which is in front of the current storage position, meets the data migration condition of the device data, wherein the position has priority in the storage position selection of the data migration;
if a front storage position in a storage operation path of the device data meets the data migration requirement and the backup of the current device data exists in the position, the device data is not migrated, only the device data in the current storage position is deleted, and an access address of the device data is modified to the front storage position so as to reduce the workload of a system;
step S4-2: if the storage position which does not meet the data migration of the device data is in front of the current storage position in the storage operation path of the device data, screening the storage position types meeting the current accessed and storage time requirements of the device data in the device data storage management system;
step S4-3: analyzing the available storage capacity of the storage positions obtained by screening in the step S4-2, and judging whether the available storage capacity is larger than the storage occupation amount required by the equipment data or not so that the actual storage capacity of each storage position can meet the storage requirement; managing data transmission among the storage positions according to the judgment result; and managing the data transmission channels among the storage positions to ensure smooth transmission and storage of data. In addition, operations such as data backup, data migration, data replication and the like may be involved, and security and reliability of device data need to be ensured.
Step S4-4: and feeding back the data storage position selection of the automatic data storage according to the data migration result, and judging the influence of the addition of the historical data migration on the selection of the automatic data storage position of the equipment number corresponding to the equipment data in the equipment data storage management system.
It is noted that relational terms such as first and second, and the like are used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Moreover, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus.
Finally, it should be noted that: the foregoing description is only a preferred embodiment of the present invention, and the present invention is not limited thereto, but it is to be understood that modifications and equivalents of some of the technical features described in the foregoing embodiments may be made by those skilled in the art, although the present invention has been described in detail with reference to the foregoing embodiments. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (9)

1. A device data storage management method based on big data is characterized in that: the method comprises the following steps:
s1: for the equipment in the access equipment data storage management system, equipment numbers are allocated, and uploading data of each equipment is analyzed in a mode of reading configuration files;
s2: monitoring storage management operation of equipment data, acquiring storage management operation of corresponding data of each equipment in an equipment data storage system, and integrating the acquired storage management operation;
s3: updating the actual storage capacity of each storage position, acquiring the data access operation of each storage position, extracting the data storage time of the equipment data in each storage position, and judging whether the stored data in each storage position is migrated or not;
s4: and managing the data transmission channels among the storage positions according to the judging result of whether the stored data in each storage position is migrated or not.
2. The big data based device data storage management method of claim 1, wherein: the S1 comprises the following steps:
step S1-1: respectively allocating different equipment numbers to each equipment of the connection system based on corresponding working contents;
step S1-2: and transmitting the equipment data to a processing platform, and analyzing the equipment data according to a predefined configuration file.
3. The big data based device data storage management method of claim 1, wherein: the step S2 comprises the following steps:
step S2-1: monitoring the data set uploaded by each device, and acquiring all storage management operations of the data uploaded by each numbering device in a device storage management system to respectively acquire a storage operation set of each data storage corresponding to the data uploaded by each numbering device;
step S2-2: integrating the storage operation sets of the devices in each device data set to obtain { C } 1 ,C 2 ,...,C n The storage operation set with the highest repeated occurrence number in each numbering device is regarded as a first storage operation set corresponding to the distribution of each device data set, and each storage operation in the first storage operation set is set as one storage operation node corresponding to each device data set;
step S2-3: integrating storage positions corresponding to the first storage operation set of each device data set to obtain a storage operation node C at any one i Storage information Z relating to its device data set Ci The method comprises the steps of carrying out a first treatment on the surface of the Wherein C is i Representing a store operation of an ith operation node, Z Ci Storage location information corresponding to a storage operation of the i-th operation node;
step S2-4: acquiring all operation nodes of storage positions corresponding to the equipment data sets, and integrating the acquired storage position data corresponding to the equipment data sets to obtain storage operation information of the equipment data sets for the storage positions in the equipment data storage management system; the storage operation information includes a storage operation path R of each device data at each storage location, r=z C1 →...→Z Cj →...→Z Cm Wherein Z is C1 、...、Z Cj 、...、Z Cm Storage location information corresponding to storage operations of the 1 st, i., j, m operation nodes are respectively indicated.
4. The big data based device data storage management method of claim 1, wherein: the step S3 comprises the following steps:
step S3-1: when the data storage operation of the data of each device is finished, the storage capacity change in each storage position is updated in real time;
step S3-2: setting the data storage time of any device data S corresponding to a data storage position P to be longer than D, collecting access operation generated in each data storage process of the device data S, and sequencing according to the storage operation paths of the device data in each storage position to obtain an accessed operation set of the device data S in a device data storage management system about each storage position;
step S3-3: setting a storage location in a device data storage management system to p= { P 1 ,P 2 ,...,P h Y which is less than a preset threshold value for the number of operations to be accessed 1 The device data storage locations of (1) are recorded as a first storage location set, and the number of accessed operations exceeds y of a preset threshold value 2 Is noted as a second set of storage locations;
step S3-4: judging the equipment data of which the accessed operation times do not belong to the preset threshold range, and analyzing whether the data migration is performed comprises the following steps:
step S3-4-1: analyzing the storage time of the equipment data in the current storage position, if the data storage time of the current equipment data exceeds a preset threshold T, executing the next analysis, otherwise, temporarily not performing data migration;
step S3-4-2: analyzing the current storage position of the equipment data, and judging whether the current data storage position can meet the access frequency of the current equipment data;
if the storage position of the current device data belongs to the first storage position set, executing the next analysis when the storage position of the device data is a high-speed storage medium; otherwise, not carrying out data migration;
if the storage position of the current device data belongs to the second storage position set, executing the next analysis when the storage position of the device data is a low-speed storage medium; otherwise, not carrying out data migration;
step S3-4-3: analyzing the change rule of the accessed times of the device data in any time period of the current storage position according to the following formula:
A(t)=∑[η(i)*A(t-i)]+∑[μ(j)*λ(t-v)]+c;
wherein A (t) represents the number of times the device data is accessed in the current storage position within time t, eta (i) represents the weight of the previous i time points, mu (j) represents the weight of the jth error term, lambda (t-v) is the jth error term, and c represents a constant which is preset by a related technician;
monitoring the predicted value of the accessed frequency of the equipment data in the current storage position, and according to the variation trend of the accessed frequency of the equipment data in the current storage position, when A (t) is in the completion time of completing data migration, reaching a preset threshold range y 1 Or y 2 If the data is not migrated, the data is not migrated; otherwise, data migration is carried out, and the next step is executed.
5. The big data based device data storage management method of claim 1, wherein: the step S4 comprises the following steps:
step S4-1: marking the front storage according to the storage operation path of the device data, if any storage position in the storage operation path of the device data, which is in front of the current storage position, meets the data migration condition of the device data, wherein the position has priority in the storage position selection of the data migration;
step S4-2: if the storage position which does not meet the data migration of the device data is in front of the current storage position in the storage operation path of the device data, screening the storage position types meeting the current accessed and storage time requirements of the device data in the device data storage management system;
step S4-3: analyzing the available storage capacity of the storage position obtained by screening in the step S4-2, and judging whether the available storage capacity is larger than the required storage occupation amount of the equipment data; managing data transmission among the storage positions according to the judgment result;
step S4-4: and feeding back the data storage position selection of the automatic data storage according to the data migration result, and judging the influence of the addition of the historical data migration on the selection of the automatic data storage position of the equipment number corresponding to the equipment data in the equipment data storage management system.
6. A big data based device data storage management system, characterized in that: the system comprises: the device comprises a device account management module, a data storage operation acquisition module, a data migration discrimination module and a storage position management module;
the device account management module is used for recording the devices of each access system and distributing corresponding device numbers based on the device types or service contents corresponding to the devices;
the data storage operation acquisition module is used for monitoring the storage management operation of the uploading data of each device and acquiring the storage operation data in the storage process;
the data migration judging module is used for replacing storage related information of each device data and judging migration of stored device data in each storage position;
the storage position management module is used for monitoring the storage capacity in each storage position in real time and managing the storage and transmission of the equipment data according to the real-time storage capacity condition in each storage position and the migration judgment result of the stored data in each storage position.
7. The big data based device data storage management system of claim 6, wherein: the data storage operation acquisition module comprises an equipment data identification unit, a storage operation monitoring unit and a storage operation integration unit;
the device data identification unit is used for identifying each device accessed to the device data storage management system, and receiving the data uploaded by the device with the system authority through identifying the device interface or the device number;
the storage operation monitoring unit is used for monitoring the storage operation of the data uploaded by each device in the device data storage management system and acquiring the storage operation corresponding to each device data;
the storage operation integration unit is used for integrating the storage operation of the acquired device data with the storage information corresponding to each operation.
8. The big data based device data storage management system of claim 6, wherein: the data migration discriminating module comprises a stored information analyzing unit and a data migration discriminating unit;
the storage information analysis unit is used for acquiring the equipment data related storage information required by data migration discrimination and respectively analyzing the storage information affecting the data migration discrimination;
the data migration judging unit is used for judging whether the equipment data of the current storage position are migrated according to the storage information which influences the data migration judgment, and transmitting the judging result to the storage position management module.
9. The big data based device data storage management system of claim 6, wherein: the storage position management module comprises a storage position monitoring unit, a data transmission management unit and a storage data feedback unit;
the storage position monitoring unit is used for monitoring each storage position in the equipment data storage management system and updating the real-time storage capacity of each storage position;
the data transmission management unit is used for managing data transmission among all storage positions according to the data migration discrimination result;
and the stored data feedback unit is used for feeding back the data storage position selection of the automatic data storage according to the data migration discrimination result.
CN202310878824.8A 2023-07-18 2023-07-18 Equipment data storage management method and system based on big data Active CN116974468B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310878824.8A CN116974468B (en) 2023-07-18 2023-07-18 Equipment data storage management method and system based on big data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310878824.8A CN116974468B (en) 2023-07-18 2023-07-18 Equipment data storage management method and system based on big data

Publications (2)

Publication Number Publication Date
CN116974468A true CN116974468A (en) 2023-10-31
CN116974468B CN116974468B (en) 2024-02-20

Family

ID=88480800

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310878824.8A Active CN116974468B (en) 2023-07-18 2023-07-18 Equipment data storage management method and system based on big data

Country Status (1)

Country Link
CN (1) CN116974468B (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140189196A1 (en) * 2013-01-02 2014-07-03 International Business Machines Corporation Determining weight values for storage devices in a storage tier to use to select one of the storage devices to use as a target storage to which data from a source storage is migrated
US20160034197A1 (en) * 2014-08-04 2016-02-04 Fujitsu Limited Data migration method and data migration device
WO2017092480A1 (en) * 2015-12-04 2017-06-08 华为技术有限公司 Data migration method and device
CN107340975A (en) * 2017-07-13 2017-11-10 郑州云海信息技术有限公司 A kind of method and device of file storage
US20180088870A1 (en) * 2016-09-23 2018-03-29 EMC IP Holding Company LLC Method and device for storage management
CN111367469A (en) * 2020-02-16 2020-07-03 苏州浪潮智能科技有限公司 Layered storage data migration method and system
CN115562870A (en) * 2022-10-25 2023-01-03 北京京航计算通讯研究所 Method for constructing task node resources of cluster
CN115826877A (en) * 2023-01-20 2023-03-21 中国华能集团清洁能源技术研究院有限公司 Data object migration method and device in big data environment

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140189196A1 (en) * 2013-01-02 2014-07-03 International Business Machines Corporation Determining weight values for storage devices in a storage tier to use to select one of the storage devices to use as a target storage to which data from a source storage is migrated
US20160034197A1 (en) * 2014-08-04 2016-02-04 Fujitsu Limited Data migration method and data migration device
WO2017092480A1 (en) * 2015-12-04 2017-06-08 华为技术有限公司 Data migration method and device
US20180088870A1 (en) * 2016-09-23 2018-03-29 EMC IP Holding Company LLC Method and device for storage management
CN107340975A (en) * 2017-07-13 2017-11-10 郑州云海信息技术有限公司 A kind of method and device of file storage
CN111367469A (en) * 2020-02-16 2020-07-03 苏州浪潮智能科技有限公司 Layered storage data migration method and system
CN115562870A (en) * 2022-10-25 2023-01-03 北京京航计算通讯研究所 Method for constructing task node resources of cluster
CN115826877A (en) * 2023-01-20 2023-03-21 中国华能集团清洁能源技术研究院有限公司 Data object migration method and device in big data environment

Also Published As

Publication number Publication date
CN116974468B (en) 2024-02-20

Similar Documents

Publication Publication Date Title
US5325505A (en) Intelligent storage manager for data storage apparatus having simulation capability
CN101615143B (en) Method and device for diagnosing memory leak
EP1471441B1 (en) Performance management of a computer system with a database management system
US20090228669A1 (en) Storage Device Optimization Using File Characteristics
CN101673192B (en) Method for time-sequence data processing, device and system therefor
CN101443761A (en) QOS-enabled lifecycle management for file systems
CN109918448A (en) A kind of cloud storage data classification method based on user behavior
US20100199058A1 (en) Data Set Size Tracking and Management
CN110175100B (en) Storage disk fault prediction method and prediction system
CN106598501A (en) Data migration device and method for storage automatic hierarchy
CN110175070B (en) Distributed database management method, device, system, medium and electronic equipment
CN111125171A (en) Monitoring data access method, device, equipment and readable storage medium
CN111008107A (en) Big data cluster log storage method, device, equipment and storage medium
CN116974468B (en) Equipment data storage management method and system based on big data
CN106899436A (en) A kind of cloud platform failure predication diagnostic system
CN113778964B (en) Recording device for storing multiple temporary storage files and management method of temporary storage files
CN116091175B (en) Transaction information data management system and method based on big data
EP0352462B1 (en) Method and apparatus for calculating disk-access footprints
CN116661685A (en) Hierarchical storage method and system for object storage metadata of business behavior awareness
CN115374065B (en) File cleaning method and system based on cloud platform log record monitoring
CN116244085A (en) Kubernetes cluster container group scheduling method, device and medium
CN114675956B (en) Method for configuration and scheduling of Pod between clusters based on Kubernetes
CN110932935A (en) Resource control method, device, equipment and computer storage medium
CN115993932A (en) Data processing method, device, storage medium and electronic equipment
CN109828718B (en) Disk storage load balancing method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant