CN116841830A - Data center operation and maintenance method, device, equipment and medium - Google Patents

Data center operation and maintenance method, device, equipment and medium Download PDF

Info

Publication number
CN116841830A
CN116841830A CN202310820391.0A CN202310820391A CN116841830A CN 116841830 A CN116841830 A CN 116841830A CN 202310820391 A CN202310820391 A CN 202310820391A CN 116841830 A CN116841830 A CN 116841830A
Authority
CN
China
Prior art keywords
data
data center
maintenance
monitoring
service
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310820391.0A
Other languages
Chinese (zh)
Inventor
张东文
单震
张帆
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chaozhou Zhuoshu Big Data Industry Development Co Ltd
Original Assignee
Chaozhou Zhuoshu Big Data Industry Development Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chaozhou Zhuoshu Big Data Industry Development Co Ltd filed Critical Chaozhou Zhuoshu Big Data Industry Development Co Ltd
Priority to CN202310820391.0A priority Critical patent/CN116841830A/en
Publication of CN116841830A publication Critical patent/CN116841830A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3065Monitoring arrangements determined by the means or processing involved in reporting the monitored data
    • G06F11/3072Monitoring arrangements determined by the means or processing involved in reporting the monitored data where the reporting involves data filtering, e.g. pattern matching, time or event triggered, adaptive or policy-based reporting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3003Monitoring arrangements specially adapted to the computing system or computing system component being monitored

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Quality & Reliability (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The application discloses a method, a device, equipment and a medium for operating and maintaining a data center, wherein the method comprises the following steps: collecting operation and maintenance data corresponding to a data center; the operation data comprises operation data corresponding to a plurality of monitoring items, and at least comprises monitoring data, log data and safety equipment data; acquiring user behavior data corresponding to the data center, and performing cross-validation on the user behavior data to generate a corresponding user operation portrait; based on the user operation portrait, determining a life cycle corresponding to the data center station and each service operation node contained in the life cycle; and determining the operation and maintenance dimensions corresponding to the data center, analyzing the operation and maintenance data under each service operation node according to different operation and maintenance dimensions to obtain a corresponding analysis result, and operating and maintaining the data center according to the analysis result.

Description

Data center operation and maintenance method, device, equipment and medium
Technical Field
The present application relates to the field of data center operation and maintenance technologies, and in particular, to a data center operation and maintenance method, apparatus, device, and medium.
Background
In the big data processing process, the operation and maintenance of the data center platform is an important link, and the management efficiency of data resources and the data safety can be ensured. However, because the data processed by different users are different and cannot be associated with the existing business, the common data center operation and maintenance method does not relate to the construction of the data life cycle in the digital construction process of enterprises, and the data operation and maintenance of the whole flow cannot be realized.
Disclosure of Invention
In order to solve the above problems, the present application provides a data center operation and maintenance method, including: collecting operation and maintenance data corresponding to a data center; the operation data comprises operation data corresponding to a plurality of monitoring items, and at least comprises monitoring data, log data and safety equipment data;
acquiring user behavior data corresponding to the data center, and performing cross-validation on the user behavior data to generate a corresponding user operation portrait;
based on the user operation portrait, determining a life cycle corresponding to the data center station and each service operation node contained in the life cycle;
and determining the operation and maintenance dimensions corresponding to the data center, analyzing the operation and maintenance data under each service operation node according to different operation and maintenance dimensions to obtain a corresponding analysis result, and operating and maintaining the data center according to the analysis result.
In one implementation of the present application, before collecting the operation data corresponding to the data center, the method further includes:
determining data flow logic corresponding to a data center station and a plurality of data flow nodes contained in the data flow logic, performing embedded point processing on the data flow nodes so as to respond to data operation corresponding to the data flow nodes at embedded points and collect corresponding log data;
and carrying out embedded point processing on the service system corresponding to the data center station so as to acquire corresponding monitoring data and safety equipment data at the embedded point.
In one implementation manner of the present application, for different operation dimensions, operation dimensions under each service operation node are analyzed to obtain a corresponding analysis result, which specifically includes:
based on log operation dimension, analyzing log data corresponding to each service operation node to obtain a corresponding log data analysis report;
analyzing the monitoring data corresponding to each service operation node based on the monitoring operation dimension to determine whether a security event exists in the data center, a server corresponding to the data center and an application process;
and analyzing the safety equipment data corresponding to each service operation node based on the equipment operation dimension so as to determine whether a safety event exists in the safety equipment.
In one implementation manner of the present application, according to the analysis result, performing operation maintenance on the data center station specifically includes:
positioning an abnormal unit with an abnormality in the data center station according to the log data analysis report, and alarming the abnormal unit;
tracing the security event to determine the appointed monitoring item of the security event; the specified monitoring items comprise a server, an application process and a safety device.
In one implementation of the present application, the method further includes:
and determining the operation and maintenance requirements required by the data center, generating a new added monitoring item matched with the operation and maintenance requirements through a preset script, and adding the new added monitoring item into the monitoring item corresponding to the data center.
In one implementation of the present application, before analyzing the operation data under each service operation node, the method further includes:
analyzing historical operation data of the service operation nodes aiming at different service operation nodes to determine a monitoring item threshold parameter range corresponding to the service operation nodes according to the historical operation data;
recommending a specified monitoring item threshold parameter for analyzing the operation and maintenance data according to the monitoring item threshold parameter range; wherein the specified monitoring item threshold parameter meets the monitoring item threshold parameter range.
In one implementation of the present application, after obtaining the corresponding analysis result, the method further includes:
and generating a monitoring interface corresponding to the data center table, and displaying the analysis result in the monitoring interface.
The embodiment of the application provides a data center station operation and maintenance device, which is characterized by comprising the following components:
the acquisition unit is used for acquiring operation and maintenance data corresponding to the data center station; the operation data comprises operation data corresponding to a plurality of monitoring items, and at least comprises monitoring data, log data and safety equipment data;
the user operation portrait generation unit is used for acquiring user behavior data corresponding to the data center and carrying out cross verification on the user behavior data so as to generate a corresponding user operation portrait;
the life cycle determining unit is used for determining the life cycle corresponding to the data center station and each service operation node contained in the life cycle based on the user operation portrait;
the analysis unit is used for determining the operation and maintenance dimensions corresponding to the data center, analyzing the operation and maintenance dimensions under each service operation node aiming at different operation and maintenance dimensions to obtain a corresponding analysis result, and operating and maintaining the data center according to the analysis result.
The embodiment of the application provides a data center station operation and maintenance device, which is characterized by comprising:
at least one processor; the method comprises the steps of,
a memory communicatively coupled to the at least one processor; wherein,,
the memory stores instructions executable by the at least one processor to enable the at least one processor to:
collecting operation and maintenance data corresponding to a data center; the operation data comprises operation data corresponding to a plurality of monitoring items, and at least comprises monitoring data, log data and safety equipment data;
acquiring user behavior data corresponding to the data center, and performing cross-validation on the user behavior data to generate a corresponding user operation portrait;
based on the user operation portrait, determining a life cycle corresponding to the data center station and each service operation node contained in the life cycle;
and determining the operation and maintenance dimensions corresponding to the data center, analyzing the operation and maintenance data under each service operation node according to different operation and maintenance dimensions to obtain a corresponding analysis result, and operating and maintaining the data center according to the analysis result.
An embodiment of the present application provides a nonvolatile computer storage medium storing computer executable instructions, wherein the computer executable instructions are configured to:
collecting operation and maintenance data corresponding to a data center; the operation data comprises operation data corresponding to a plurality of monitoring items, and at least comprises monitoring data, log data and safety equipment data;
acquiring user behavior data corresponding to the data center, and performing cross-validation on the user behavior data to generate a corresponding user operation portrait;
based on the user operation portrait, determining a life cycle corresponding to the data center station and each service operation node contained in the life cycle;
and determining the operation and maintenance dimensions corresponding to the data center, analyzing the operation and maintenance data under each service operation node according to different operation and maintenance dimensions to obtain a corresponding analysis result, and operating and maintaining the data center according to the analysis result.
The data center operation and maintenance method provided by the application has the following beneficial effects:
based on the user operation portraits, the operation data under each service operation node included in the life cycle is analyzed, and then the operation and maintenance of the data center platform are realized based on the analysis result, the life cycle of the data circulated when different users execute the service operation can be reflected by the user portraits, the monitoring data of the whole flow of the data life cycle such as data generation, aggregation, processing, use, destruction and the like can be monitored, and the operation and maintenance, safety and service operation states of the data center platform are supported.
Drawings
The accompanying drawings, which are included to provide a further understanding of the application and are incorporated in and constitute a part of this specification, illustrate embodiments of the application and together with the description serve to explain the application and do not constitute a limitation on the application. In the drawings:
fig. 1 is a schematic flow chart of a method for operating and maintaining a data center station according to an embodiment of the present application;
fig. 2 is a schematic diagram of an architecture of a data center console operation and maintenance system according to an embodiment of the present application;
fig. 3 is a schematic structural diagram of an operation and maintenance device for a data center station according to an embodiment of the present application;
fig. 4 is a schematic structural diagram of a data center station operation and maintenance device according to an embodiment of the present application.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the technical solutions of the present application will be clearly and completely described below with reference to specific embodiments of the present application and corresponding drawings. It will be apparent that the described embodiments are only some, but not all, embodiments of the application. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to be within the scope of the application.
The following describes in detail the technical solutions provided by the embodiments of the present application with reference to the accompanying drawings.
As shown in fig. 1, a method for operating and maintaining a data center station according to an embodiment of the present application includes:
s101: collecting operation and maintenance data corresponding to a data center; the operation data comprise operation data corresponding to a plurality of monitoring items, and at least comprise monitoring data, log data and safety equipment data.
The data center platform is a platform which is connected with big data computing storage capacity by technology and connected with data application scene capacity by service, and can collect, calculate, store and process mass data by data technology, and unify standard and caliber.
In the embodiment of the application, the server can collect the operation and maintenance data corresponding to the data center platform in a page embedded point or interface embedded point mode, and after the operation and maintenance data are collected, the operation and maintenance data are uniformly stored in the data centralized storage platform to wait for subsequent call analysis. The operation data comprises operation data corresponding to a plurality of monitoring items, and at least comprises monitoring data, log data and safety equipment data. It should be noted that, the embodiment of the present application supports customized operation and maintenance requirements, after determining the operation and maintenance requirements, the server may also generate a new added monitoring item matched with the operation and maintenance requirements through a preset script, and add the new added monitoring item to the monitoring item corresponding to the data center, so as to satisfy the actual application scenario and improve the application range.
Specifically, the data flow logic corresponding to the data center station and a plurality of data flow nodes contained in the data flow logic are determined, point burying processing is carried out on the data flow nodes, and therefore corresponding log data can be collected at the point burying position in response to data operation corresponding to the data flow nodes. In addition, the server can also perform embedded point processing on the service system corresponding to the data center station, so that corresponding monitoring data and safety equipment data are collected at the embedded point. It should be noted that, the service burial point log related to the security of the data center is collected through the burial point, and in addition, the service production log may be directly collected for subsequent analysis. The monitoring data comprises data center monitoring data, server monitoring data and application process data. The security device data includes security related information such as VPN, firewall, IPS, WAP, etc.
S102: and acquiring user behavior data corresponding to the data center, and performing cross-validation on the user behavior data to generate a corresponding user operation portrait.
When different users perform service operations, the data generated by the data center station are different, and the common data center station operation and maintenance mode can only perform operation and maintenance on the center station data under single service operation, but cannot be specific to the specific service operation performed by the different users, so that the whole flow of data circulation is considered. Therefore, the server needs to acquire the user behavior data of the data center, and generates corresponding user operation portraits by cross-verifying and deep mining the user behavior data. The user operation portraits can reflect the life cycle of data circulated when different users execute business operations, can monitor the monitoring data of the whole flow of the life cycle of the data, such as data generation, aggregation, processing, use, destruction and the like, and are used for supporting the operation and maintenance, safety and business operation state of the data center.
S103: and determining the life cycle corresponding to the station in the data and each service operation node contained in the life cycle based on the user operation portrait.
The user operation portrait generated by the process can determine the life cycle of the data in the data center station and each service operation node contained in the life cycle, so that the operation and maintenance of the data center station in the whole process can be realized based on different service operation nodes in the life cycle.
S104: and determining the operation and maintenance dimensions corresponding to the data center, analyzing the operation and maintenance data under each service operation node according to different operation and maintenance dimensions to obtain a corresponding analysis result, and operating and maintaining the data center according to the analysis result.
When the data center station is operated and maintained, the embodiment of the application is also provided with a plurality of operation and maintenance dimensions which are log operation and maintenance dimensions, monitoring operation and maintenance dimensions and security equipment operation and maintenance dimensions respectively, which are suitable for the dimension of operation and maintenance data. The operation and maintenance data under each service operation node are analyzed through a plurality of dimensions to obtain corresponding analysis results, so that the operation and maintenance requirements of various scenes of the data center can be met, the operation state of the data center can be comprehensively monitored, the accurate positioning alarm of abnormal problems can be realized, and the operation and maintenance efficiency is improved.
Specifically, based on the log operation dimension, the log data corresponding to each service operation node is analyzed, and a corresponding log data analysis report is obtained. Based on the monitoring operation dimension, the monitoring data corresponding to each service operation node is analyzed to determine whether a security event exists in the data center, the server corresponding to the data center and the application process. And analyzing the safety equipment data corresponding to each service operation node based on the equipment operation dimension to determine whether a safety event exists in the safety equipment. If a security event exists, corresponding alarm processing is needed to ensure the security operation of related equipment such as a data center station, a server and the like.
Wherein, when judging whether the part of the monitoring items have the security event, the method can determine by comparing the current monitoring item data with the threshold value parameter of the monitoring items. The embodiment of the application not only supports the self-defined monitoring item threshold parameter, but also can recommend the specific monitoring item threshold parameter so as to improve the operation and maintenance efficiency and accuracy. For different service operation nodes, the server analyzes the historical operation data of the service operation nodes to determine a monitoring item threshold parameter range corresponding to the service operation nodes according to the historical operation data, and then recommends a designated monitoring item threshold parameter for analyzing the operation data according to the monitoring item threshold parameter range. The specified monitoring item threshold parameter meets the monitoring item threshold parameter range. By recommending corresponding monitoring item threshold parameters or carrying out the self-definition of the monitoring item threshold parameters according to actual operation and maintenance requirements, user experience can be greatly improved, and operation and maintenance efficiency is improved.
After the analysis results corresponding to different operation and maintenance dimensions are obtained, the operation and maintenance of the data center station can be carried out according to the analysis results. The server needs to analyze and report according to the log data, locate the abnormal unit with the abnormality in the data center station, and alarm the abnormal unit. And the security event is traced to determine the designated monitoring item of the security event. The specified monitoring items comprise a server, an application process and a safety device.
The server can generate a monitoring interface corresponding to the data center platform, and can visualize analysis results through the monitoring interface, so that intuitiveness is improved. Besides the abnormal monitoring module, the monitoring interface can display information such as data assets, service calls, data resources, interface calls and the like, so that management and monitoring of operation and maintenance personnel are facilitated.
Fig. 2 is a hierarchical structure diagram of a data center console operation and maintenance system according to an embodiment of the present application. As shown in fig. 2, the embodiment of the present application may further provide a data center operation and maintenance system, where the system includes a data convergence layer, a data storage calculation layer, a secure data analysis layer, and a data service layer. The data convergence layer is mainly used for realizing multidimensional collection of operation and maintenance data, and collecting monitoring data, log data and safety equipment data of a data center station from multiple dimensions. The data centralized storage platform is used for storing the operation and maintenance data acquired by the data convergence layer. The safety data analysis layer mainly generates user operation portraits of users through cross verification and deep mining, and analyzes the operation data through general big data analysis methods such as analysis decision, knowledge reasoning, mode analysis, mathematical model, safety footprint and the like to obtain corresponding analysis results. The data service layer is mainly used for summarizing and sorting analysis results. And the analysis result, data such as service call, interface call and the like generated by the data center station, data asset and data resource can be subjected to data visualization through the monitoring interface, so that the data observability is improved. The application alarm platform is mainly used for carrying out abnormal alarm under the condition that the analysis result is abnormal.
The above is a method embodiment of the present application. Based on the same thought, some embodiments of the present application further provide an apparatus, a device, and a non-volatile computer storage medium corresponding to the above method.
Fig. 3 is a schematic structural diagram of an operation and maintenance device for a data center station according to an embodiment of the present application, where, as shown in fig. 3, the device includes:
the acquisition unit 301 is configured to acquire operation and maintenance data corresponding to the middle station of the data; the operation data comprises operation data corresponding to a plurality of monitoring items, and at least comprises monitoring data, log data and safety equipment data;
a user operation portrait generation unit 302, configured to acquire user behavior data corresponding to the data center, and cross-verify the user behavior data to generate a corresponding user operation portrait;
a life cycle determining unit 303, configured to determine, based on the user operation representation, a life cycle corresponding to the station in the data and each service operation node included in the life cycle;
the analysis unit 304 is configured to determine operation dimensions corresponding to the data center, analyze the operation dimensions under each service operation node according to different operation dimensions, obtain a corresponding analysis result, and perform operation maintenance on the data center according to the analysis result.
Fig. 4 is a schematic structural diagram of a data center station operation and maintenance device according to an embodiment of the present application. As shown in fig. 4, includes:
at least one processor; the method comprises the steps of,
at least one processor in communication with the memory; wherein,,
the memory stores instructions executable by the at least one processor to enable the at least one processor to:
collecting operation and maintenance data corresponding to a data center; the operation data comprises operation data corresponding to a plurality of monitoring items, and at least comprises monitoring data, log data and safety equipment data;
acquiring user behavior data corresponding to the data center, and performing cross-validation on the user behavior data to generate a corresponding user operation portrait;
based on the user operation portrait, determining a life cycle corresponding to the data center station and each service operation node contained in the life cycle;
and determining the operation and maintenance dimensions corresponding to the data center, analyzing the operation and maintenance data under each service operation node according to different operation and maintenance dimensions to obtain a corresponding analysis result, and operating and maintaining the data center according to the analysis result.
The embodiment of the application provides a nonvolatile computer storage medium, which stores computer executable instructions, wherein the computer executable instructions are configured to:
collecting operation and maintenance data corresponding to a data center; the operation data comprises operation data corresponding to a plurality of monitoring items, and at least comprises monitoring data, log data and safety equipment data;
acquiring user behavior data corresponding to the data center, and performing cross-validation on the user behavior data to generate a corresponding user operation portrait;
based on the user operation portrait, determining a life cycle corresponding to the data center station and each service operation node contained in the life cycle;
and determining the operation and maintenance dimensions corresponding to the data center, analyzing the operation and maintenance data under each service operation node according to different operation and maintenance dimensions to obtain a corresponding analysis result, and operating and maintaining the data center according to the analysis result.
The embodiments of the present application are described in a progressive manner, and the same and similar parts of the embodiments are all referred to each other, and each embodiment is mainly described in the differences from the other embodiments. In particular, for the apparatus and medium embodiments, the description is relatively simple, as it is substantially similar to the method embodiments, with reference to the section of the method embodiments being relevant.
The devices and media provided in the embodiments of the present application are in one-to-one correspondence with the methods, so that the devices and media also have similar beneficial technical effects as the corresponding methods, and since the beneficial technical effects of the methods have been described in detail above, the beneficial technical effects of the devices and media are not repeated here.
It will be appreciated by those skilled in the art that embodiments of the present application may be provided as a method, system, or computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the application. It will be understood that each flow and/or block of the flowchart illustrations and/or block diagrams, and combinations of flows and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
In one typical configuration, a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
The memory may include volatile memory in a computer-readable medium, random Access Memory (RAM) and/or nonvolatile memory, such as Read Only Memory (ROM) or flash memory (flash RAM). Memory is an example of computer-readable media.
Computer readable media, including both non-transitory and non-transitory, removable and non-removable media, may implement information storage by any method or technology. The information may be computer readable instructions, data structures, modules of a program, or other data. Examples of storage media for a computer include, but are not limited to, phase change memory (PRAM), static Random Access Memory (SRAM), dynamic Random Access Memory (DRAM), other types of Random Access Memory (RAM), read Only Memory (ROM), electrically Erasable Programmable Read Only Memory (EEPROM), flash memory or other memory technology, compact disc read only memory (CD-ROM), digital Versatile Discs (DVD) or other optical storage, magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices, or any other non-transmission medium, which can be used to store information that can be accessed by a computing device. Computer-readable media, as defined herein, does not include transitory computer-readable media (transmission media), such as modulated data signals and carrier waves.
It should also be noted that the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising one … …" does not exclude the presence of other like elements in a process, method, article or apparatus that comprises the element.
The foregoing is merely exemplary of the present application and is not intended to limit the present application. Various modifications and variations of the present application will be apparent to those skilled in the art. Any modification, equivalent replacement, improvement, etc. which come within the spirit and principles of the application are to be included in the scope of the claims of the present application.

Claims (10)

1. A method of operation and maintenance of a data center, the method comprising:
collecting operation and maintenance data corresponding to a data center; the operation data comprises operation data corresponding to a plurality of monitoring items, and at least comprises monitoring data, log data and safety equipment data;
acquiring user behavior data corresponding to the data center, and performing cross-validation on the user behavior data to generate a corresponding user operation portrait;
based on the user operation portrait, determining a life cycle corresponding to the data center station and each service operation node contained in the life cycle;
and determining the operation and maintenance dimensions corresponding to the data center, analyzing the operation and maintenance data under each service operation node according to different operation and maintenance dimensions to obtain a corresponding analysis result, and operating and maintaining the data center according to the analysis result.
2. The method of claim 1, wherein prior to collecting the operation data corresponding to the data center, the method further comprises:
determining data flow logic corresponding to a data center station and a plurality of data flow nodes contained in the data flow logic, performing embedded point processing on the data flow nodes so as to respond to data operation corresponding to the data flow nodes at embedded points and collect corresponding log data;
and carrying out embedded point processing on the service system corresponding to the data center station so as to acquire corresponding monitoring data and safety equipment data at the embedded point.
3. The method for operation and maintenance of a data center station according to claim 1, wherein the analyzing operation and maintenance data under each service operation node for different operation and maintenance dimensions to obtain a corresponding analysis result specifically comprises:
based on log operation dimension, analyzing log data corresponding to each service operation node to obtain a corresponding log data analysis report;
analyzing the monitoring data corresponding to each service operation node based on the monitoring operation dimension to determine whether a security event exists in the data center, a server corresponding to the data center and an application process;
and analyzing the safety equipment data corresponding to each service operation node based on the equipment operation dimension so as to determine whether a safety event exists in the safety equipment.
4. A method for operating and maintaining a data center according to claim 3, wherein the operating and maintaining the data center according to the analysis result specifically includes:
positioning an abnormal unit with an abnormality in the data center station according to the log data analysis report, and alarming the abnormal unit;
tracing the security event to determine the appointed monitoring item of the security event; the specified monitoring items comprise a server, an application process and a safety device.
5. The method of claim 1, further comprising:
and determining the operation and maintenance requirements required by the data center, generating a new added monitoring item matched with the operation and maintenance requirements through a preset script, and adding the new added monitoring item into the monitoring item corresponding to the data center.
6. The method of claim 1, wherein prior to analyzing the operation data under each of the service operation nodes, the method further comprises:
analyzing historical operation data of the service operation nodes aiming at different service operation nodes to determine a monitoring item threshold parameter range corresponding to the service operation nodes according to the historical operation data;
recommending a specified monitoring item threshold parameter for analyzing the operation and maintenance data according to the monitoring item threshold parameter range; wherein the specified monitoring item threshold parameter meets the monitoring item threshold parameter range.
7. The method of claim 1, wherein after obtaining the corresponding analysis result, the method further comprises:
and generating a monitoring interface corresponding to the data center table, and displaying the analysis result in the monitoring interface.
8. A data center operation and maintenance device, the device comprising:
the acquisition unit is used for acquiring operation and maintenance data corresponding to the data center station; the operation data comprises operation data corresponding to a plurality of monitoring items, and at least comprises monitoring data, log data and safety equipment data;
the user operation portrait generation unit is used for acquiring user behavior data corresponding to the data center and carrying out cross verification on the user behavior data so as to generate a corresponding user operation portrait;
the life cycle determining unit is used for determining the life cycle corresponding to the data center station and each service operation node contained in the life cycle based on the user operation portrait;
the analysis unit is used for determining the operation and maintenance dimensions corresponding to the data center, analyzing the operation and maintenance dimensions under each service operation node aiming at different operation and maintenance dimensions to obtain a corresponding analysis result, and operating and maintaining the data center according to the analysis result.
9. A data center operation and maintenance device, the device comprising:
at least one processor; the method comprises the steps of,
a memory communicatively coupled to the at least one processor; wherein,,
the memory stores instructions executable by the at least one processor to enable the at least one processor to:
collecting operation and maintenance data corresponding to a data center; the operation data comprises operation data corresponding to a plurality of monitoring items, and at least comprises monitoring data, log data and safety equipment data;
acquiring user behavior data corresponding to the data center, and performing cross-validation on the user behavior data to generate a corresponding user operation portrait;
based on the user operation portrait, determining a life cycle corresponding to the data center station and each service operation node contained in the life cycle;
and determining the operation and maintenance dimensions corresponding to the data center, analyzing the operation and maintenance data under each service operation node according to different operation and maintenance dimensions to obtain a corresponding analysis result, and operating and maintaining the data center according to the analysis result.
10. A non-transitory computer storage medium storing computer-executable instructions, the computer-executable instructions configured to:
collecting operation and maintenance data corresponding to a data center; the operation data comprises operation data corresponding to a plurality of monitoring items, and at least comprises monitoring data, log data and safety equipment data;
acquiring user behavior data corresponding to the data center, and performing cross-validation on the user behavior data to generate a corresponding user operation portrait;
based on the user operation portrait, determining a life cycle corresponding to the data center station and each service operation node contained in the life cycle;
and determining the operation and maintenance dimensions corresponding to the data center, analyzing the operation and maintenance data under each service operation node according to different operation and maintenance dimensions to obtain a corresponding analysis result, and operating and maintaining the data center according to the analysis result.
CN202310820391.0A 2023-07-05 2023-07-05 Data center operation and maintenance method, device, equipment and medium Pending CN116841830A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310820391.0A CN116841830A (en) 2023-07-05 2023-07-05 Data center operation and maintenance method, device, equipment and medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310820391.0A CN116841830A (en) 2023-07-05 2023-07-05 Data center operation and maintenance method, device, equipment and medium

Publications (1)

Publication Number Publication Date
CN116841830A true CN116841830A (en) 2023-10-03

Family

ID=88170258

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310820391.0A Pending CN116841830A (en) 2023-07-05 2023-07-05 Data center operation and maintenance method, device, equipment and medium

Country Status (1)

Country Link
CN (1) CN116841830A (en)

Similar Documents

Publication Publication Date Title
CN112416645B (en) Fault root cause deducing and positioning method and device based on artificial intelligence
US20180365085A1 (en) Method and apparatus for monitoring client applications
US20190227860A1 (en) Root cause analysis for protection storage devices using causal graphs
CN111478889B (en) Alarm method and device
CN111897705B (en) Service state processing and model training method, device, equipment and storage medium
CN112529528B (en) Workflow monitoring and warning method, device and system based on big data flow calculation
CN113641526B (en) Alarm root cause positioning method and device, electronic equipment and computer storage medium
CN109062769B (en) Method, device and equipment for predicting IT system performance risk trend
CN110363381B (en) Information processing method and device
CN109951326B (en) Data processing method, device, storage medium and processor
CN116149926A (en) Abnormality monitoring method, device, equipment and storage medium for business index
CN112769615B (en) Anomaly analysis method and device
CN113411209A (en) Distributed password service full-link detection system and method
CN117234916A (en) Workflow application testing method and device, electronic equipment and storage medium
CN114625611B (en) Quantum computing environment monitoring method, device and storage medium
CN116841830A (en) Data center operation and maintenance method, device, equipment and medium
CN116527553A (en) Processing method, system and storage medium for automatic test report of switch
CN115225470A (en) Business abnormity monitoring method and device, electronic equipment and storage medium
CN114490413A (en) Test data preparation method and device, storage medium and electronic equipment
CN109426559B (en) Command issuing method and device, storage medium and processor
US20190238400A1 (en) Network element operational status ranking
CN113656245B (en) Data inspection method and device, storage medium and processor
CN112100661B (en) Data processing method and device
CN117194091A (en) Abnormal service detection method and device, storage medium and electronic equipment
CN107291613B (en) Functional node abnormity simulation method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination