WO2023103592A1 - Device risk prediction method, electronic device and computer-readable storage medium - Google Patents

Device risk prediction method, electronic device and computer-readable storage medium Download PDF

Info

Publication number
WO2023103592A1
WO2023103592A1 PCT/CN2022/125757 CN2022125757W WO2023103592A1 WO 2023103592 A1 WO2023103592 A1 WO 2023103592A1 CN 2022125757 W CN2022125757 W CN 2022125757W WO 2023103592 A1 WO2023103592 A1 WO 2023103592A1
Authority
WO
WIPO (PCT)
Prior art keywords
gradient
updated
risk prediction
statistical algorithm
algorithm model
Prior art date
Application number
PCT/CN2022/125757
Other languages
French (fr)
Chinese (zh)
Inventor
李君�
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2023103592A1 publication Critical patent/WO2023103592A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0635Risk analysis of enterprise or organisation activities
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management

Definitions

  • the embodiments of the present application relate to the field of information processing, and in particular to a device risk prediction method, an electronic device, and a computer-readable storage medium.
  • the device temperature can be detected for prevention.
  • the temperature data of a large number of devices is collected to detect the overheating of the device environment, but the algorithm used for detection requires a large amount of data.
  • the algorithm used for detection requires a large amount of data.
  • the amount of data at a single site is small and single, which cannot meet the operating conditions required by the algorithm. Therefore, there is a problem of insufficient accuracy in the process of use.
  • the main purpose of the embodiments of the present application is to provide a device risk prediction method, an electronic device, and a computer-readable storage medium, so as to improve the accuracy of device risk prediction.
  • an embodiment of the present application provides a device risk prediction method, which is applied to a sub-computing node, including: respectively acquiring characteristic data of multiple devices; obtaining a statistical algorithm for predicting device risk according to the characteristic data Gradient of the model; encrypt and upload the gradient of the statistical algorithm model to the public server; receive the updated gradient issued by the public server, and the updated gradient is uploaded by the public server according to N sub-computing nodes
  • the gradient is obtained by aggregation; the statistical algorithm model is updated according to the updated gradient; and the result of the equipment risk prediction is output when the updated statistical algorithm model converges.
  • the embodiment of the present application also provides a device risk prediction method, which is applied to the public server, including: receiving the gradient of the statistical algorithm model uploaded by the N sub-computing nodes; The attributes of the equipment all meet the preset public conditions, and the N is a positive integer greater than 1; the gradients of the statistical algorithm models encrypted and uploaded by the N sub-computing nodes are aggregated to obtain an updated gradient; The computing node sends the updated gradient.
  • an embodiment of the present application further provides an electronic device, including: at least one processor; and a memory connected to the at least one processor in communication; wherein, the memory stores information that can be used by the at least one processor Instructions executed by a processor, the instructions are executed by the at least one processor, so that the at least one processor can execute the above-mentioned equipment risk prediction method applied to sub-computing nodes, or can execute the above-mentioned method applied to public A device risk prediction approach for managers.
  • the embodiment of the present application also provides a computer-readable storage medium, which stores a computer program, and when the computer program is executed by a processor, the above-mentioned device risk prediction method applied to a sub-computing node is implemented, or The above-mentioned equipment risk prediction method applied to the public manager.
  • the equipment it is predicted whether the equipment will be overheated, so as to prevent the equipment from being overheated due to the installation environment or other reasons, thereby affecting business execution and equipment life.
  • the statistical algorithm model is used to predict the overheating situation, and the encrypted data transmission is carried out with the public server, and the optimization data issued by the public server is updated and adjusted to increase participation.
  • the amount of calculated data makes the prediction result of the current sub-computing node more accurate and improves the prediction efficiency of the sub-computing node.
  • FIG. 1 is a flow chart of a method for predicting equipment risk according to an embodiment of the present application
  • Fig. 2 is a schematic diagram of an equipment risk prediction method provided according to an embodiment of the present application.
  • Fig. 3 is a flow chart of a method for predicting equipment risk according to another embodiment of the present application.
  • Fig. 4 is a schematic diagram of an equipment risk prediction method provided according to another embodiment of the present application.
  • Fig. 5 is a schematic structural diagram of an electronic device provided according to an embodiment of the present application.
  • the device If the device has reported an overtemperature alarm and then handles it, it will seriously affect the normal operation of the device, for example, causing service interruption. There may be many reasons why the equipment is prone to overheating, such as model problems, high power due to heavy traffic, and closed installation environment. If the installation environment of the equipment can be detected, it can effectively predict the overheating of the equipment caused by poor heat dissipation in the installation environment, and reduce the impact and interruption of the business by measures such as equipment damage and power amplifier derating after an alarm occurs.
  • the current detection of device temperature includes monitoring and estimating the current temperature of the device, and there are few methods for predicting the device temperature.
  • there is an algorithm for detecting the overheating of the equipment environment by collecting temperature data of a large number of equipment but the algorithm is based on the statistics of a large amount of data, and the data volume of a single site as the execution subject is small and single, which is accurate and applicable The problem of lack of sex and optimization efficiency.
  • An embodiment of the present application relates to a device risk prediction method, which is applied to sub-computing nodes.
  • the specific process of the equipment risk prediction method in this embodiment can be shown in Figure 1, including:
  • Step 101 acquiring feature data of multiple devices respectively
  • Step 102 obtaining the gradient of the statistical algorithm model for predicting equipment risk according to the feature data
  • Step 103 uploading the gradient encryption of the statistical algorithm model to the public server
  • Step 104 receiving the updated gradient issued by the public server, the updated gradient is obtained by the public server through aggregation according to the gradients uploaded by N sub-computing nodes;
  • Step 105 updating the statistical algorithm model according to the updated gradient
  • Step 106 when the updated statistical algorithm model converges, output the result of equipment risk prediction.
  • step 101 feature data of multiple devices are acquired respectively.
  • the sub-computing node may be an office point, and the office point obtains characteristic data of devices within the current office point collected by the base station.
  • the attributes of the multiple devices meet the preset conditions; the preset conditions include: the models of the multiple devices are the same, the difference between the latitude coordinates of the multiple devices is within the first preset range, and the respective The temperature difference in the region is within the second preset range. That is, in a single sub-computing node or site, the attributes of the devices are similar, so that the sub-computing node can perform risk prediction on the device with the current attributes according to the acquired feature data.
  • the device includes a remote radio unit (Remote Radio Unit, RRU); in the case of the device being an RRU, the feature data of multiple devices includes each RRU: power, optical module temperature, baseband temperature, intermediate frequency temperature, Power supply temperature, board temperature, transceiver temperature, location, air temperature.
  • RRU Remote Radio Unit
  • the power is such as the average power of the power supply
  • the temperature of the optical module is such as the average temperature of the optical/electrical module transceiver.
  • Board temperature such as the average temperature of the RRU board
  • transceiver temperature such as the average RRU transmit (transport, TX)/receive (receive, RX) temperature or power amplifier temperature
  • location such as latitude and longitude
  • air temperature such as the local air temperature where the device is located.
  • the device can be an electronic device whose position does not change in a large range, and is not limited to RRU, for example, it can also be an indoor baseband processing unit (Building Base band Unit, BBU). predict.
  • BBU Building Base band Unit
  • step 102 the gradient of the statistical algorithm model used to predict equipment risk is obtained according to the feature data; that is, the acquired feature data is input into the statistical algorithm model for calculation and prediction, and the gradient in the calculation process is obtained, the gradient
  • the mean value and deviation range in the set of characteristic data can be characterized, and the mean value and deviation range can be used to measure the operation results of the current statistical algorithm model. For example, if the ratio of the part of the current characteristic data within the deviation range of the mean value to the total number of the current characteristic data is less than the threshold value of the preset ratio, the calculation result of the current statistical algorithm model is not optimal and needs to be adjusted. That is, the gradient needs to be adjusted, that is, the model has not converged.
  • the gradient of the statistical algorithm model for predicting equipment risk according to the feature data includes: performing data cleaning on the feature data of multiple devices. That is, when collecting characteristic data, it can be obtained through active detection or reporting of receiving equipment, and the format of characteristic data is not screened during the collection process, which can ensure the number of samples obtained.
  • data cleaning includes one of the following or any combination thereof: unifying the data format of feature data; deleting abnormal data in feature data; integrating feature data according to time granularity.
  • the data format of the characteristic data is unified, that is, since there are no restrictions on the data during the collection process, and there is no restriction on the way to collect the characteristic data, the received characteristic data may have inconsistent data formats.
  • the format of the acquired feature data is unified. Delete the abnormal data in the characteristic data. For computing nodes, there may be cases where the data format that can be processed is limited. If the obtained characteristic data is in a format that cannot be processed by the current computing node, it will be judged as abnormal data.
  • the preset range interval corresponding to the range interval and the minimum value can also be defined as abnormal data, which is deleted before processing. Integrate feature data according to time granularity. Since the reporting time granularity of various types of data is different, it is necessary to integrate all acquired feature data into one data stream, for example: add location data to each piece of collected temperature data, that is, Supplement the location data with coarser granularity of collection time to the temperature data with finer granularity of collection time.
  • the granularity of feature data a is one piece of data every 15 minutes
  • the granularity of feature data b is one piece of data every hour.
  • one data stream every 15 minutes contains feature data a and feature data b. , containing a is collected this time, and b is collected last time within one hour.
  • step 103 the encrypted gradient of the statistical algorithm model is uploaded to the public server. That is to say, there is a preset encryption process in the sub-computing nodes. After the gradient obtained after the statistical algorithm model is executed, the obtained gradient is encrypted, and the encrypted gradient is uploaded to the public server.
  • the mean value and offset range are obtained according to the gradient, and the data of the characteristic parameters acquired by the current sub-computing node in the offset range of the mean value are judged to account for the characteristic parameters obtained by the current sub-computing node.
  • the ratio of the total number of parameters if the ratio is greater than the threshold of the preset ratio, it means that the current calculation result is optimal and meets the requirements, and the calculation result is output; if the ratio is not greater than the threshold of the preset ratio, it means the current calculation result It is not optimal and requires gradient adjustment, so perform the steps of uploading gradient encryption to the public server.
  • the process of uploading the gradient to the public server can refer to the federated learning algorithm, that is, refer to the federated learning algorithm to encrypt and transmit the gradient data between the sub-computing nodes and the public server to ensure the security of the gradient data. It can be understood that there is a decryption algorithm corresponding to the encryption algorithm in the current child node in the public server.
  • step 104 the updated gradient issued by the public server is received, and the updated gradient is obtained by the public server through aggregation according to the gradients uploaded by the N sub-computing nodes. That is, the received updated gradient issued by the public server is obtained by integrating the gradients uploaded by the sub-computing nodes within the public server's own scope. Since there are multiple sub-computing nodes within the scope of the public server, and each sub-computing node also corresponds to multiple devices, for a public server, the received gradient is a value obtained through a large amount of sample data. Due to the abundant sample size, the aggregated value of the received gradient is closer to the optimal choice of the gradient of the statistical algorithm model of the current child node, and the original gradient value is updated with the aggregated gradient.
  • the public server when the public server sends the updated gradient to the sub-computing node, it can also be encrypted, and the sub-computing node decrypts it after the sub-computing node receives it, further improving the security of data transmission.
  • step 105 the statistical algorithm model is updated according to the updated gradient. That is, according to the updated gradient sent by the public server, the statistical algorithm model of the current computing node is updated, so that the output of the statistical algorithm model of the current sub-computing node is closer to the actual situation, wherein the statistical algorithm model such as statistical clustering
  • the algorithm model constructs relatively close groups into a large group by continuously merging similar samples.
  • the gradient uploaded by each sub-computing node received in the public server is to obtain the mean value and offset range uploaded by each sub-computing node, aggregate each mean value and offset range, and obtain the updated mean value and offset range; update After the gradient is sent to the current sub-computing node, the current sub-computing node also gets the updated mean value and offset range from the public server; the statistical algorithm model of the current computing node is updated using the gradient delivered by the public server.
  • the gradient issued by the public server is used to update the statistical algorithm model of the current computing node, for example: according to the gradient issued by the public server, the parameters in the statistical algorithm model are updated, including updating various judgment criteria, such as the mean value and offset range , where the mean value and offset range include the relative mean value and offset range of temperature, latitude and longitude, power, etc. After the parameters are updated, it can be judged whether the current statistical algorithm model is converged. In addition, the criteria for judging abnormal data can also be updated.
  • step 106 when the updated statistical algorithm model converges, the result of equipment risk prediction is output.
  • the statistical algorithm model of the current sub-computing node is updated according to the gradient issued by the public server, the calculation is performed again according to the characteristic data, and the gradient during this calculation process is obtained, and the current calculation is judged according to the mean value and offset range corresponding to the gradient. Whether the statistical algorithm model converges, and if it converges, the calculation result of the statistical algorithm model is the final equipment risk prediction result.
  • after updating the statistical algorithm model according to the updated gradient it also includes: in the case that the updated statistical algorithm model does not converge, encrypting and uploading the updated statistical algorithm model according to the gradient obtained from the feature data To the public server; receive the updated gradient issued by the public server again, and update the statistical algorithm model again according to the updated gradient issued by the public server again until the updated statistical algorithm model converges, as shown in Figure 2.
  • the equipment risk prediction results of the statistical algorithm model include: normal, general overheating, high risk overheating, and severe overheating.
  • Normal means that the device installation environment corresponding to the characteristic data obtained by the current sub-computing node is normal; general overheating means that there is an overheating risk. It is recommended to check the heat dissipation of the installation environment to determine the cause of the adverse effects of heat dissipation, and Renovate the installation environment when necessary to eliminate the impact of poor heat dissipation on equipment performance and hardware life; high risk of overheating indicates a high risk of overheating in the installation environment.
  • the installation environment of the aforementioned equipment for example, the installation environment of the RRU.
  • a site is located in a high temperature zone, and RRU overheating alarms often occur, which is suspected to be caused by poor heat dissipation in the installation environment. It is necessary to check the installation environment of the RRUs, find out which RRUs are prone to overheating due to poor heat dissipation in the installation environment, and then rectify the installation environment.
  • RRUs there are many types of RRUs in this site and the number of each type of RRU is very small, and the result obtained for each type of RRU is not accurate enough. Therefore, the following process is used for detection:
  • the gradient calculated by the site according to the statistical algorithm model is encrypted and uploaded to the public server.
  • the public server aggregates the data of multiple sites to calculate a new gradient and distributes it to each site. Among them, multiple sites under the public server The devices corresponding to the points have the same model and similar properties. Update the model according to the new gradient issued by the public server, and repeat until the model converges, and the output result when the model converges is the final prediction result.
  • the business volume of a certain site is small, but the business needs to be expanded in summer.
  • the daily operating power of this site is small, and the amount of data used for prediction is insufficient. Therefore, we choose to use the data of other sites with similar attributes to this site but with high operating power to perform model training together through the public server. Predict the RRUs that are prone to overheating at this site and make rectifications.
  • the current service volume of RRU1 in the site is small.
  • the temperature of RRU1 at different operating powers can be predicted to determine whether it is prone to overheating when the power increases. This method needs to collect historical data for a long time.
  • the public server obtains a new gradient by integrating the data of each site. Send the new gradient to each site.
  • Each site continuously updates the statistical clustering algorithm model according to the new gradient and uploads the updated new gradient until the model converges. When the model converges, the predicted result of the model is the final predicted result.
  • the equipment it is predicted whether the equipment will be overheated, so as to prevent the equipment from being overheated due to the installation environment or other reasons, thereby affecting service execution and equipment life.
  • the statistical algorithm model is used to predict the overheating situation, and the encrypted data transmission is carried out with the public server, and the optimization data issued by the public server is updated and adjusted to increase participation.
  • the amount of calculated data makes the prediction result of the current sub-computing node more accurate and improves the prediction efficiency of the sub-computing node.
  • FIG. 3 Another embodiment of the present application relates to a device risk prediction method, as shown in Figure 3, which is applied to a public server.
  • the implementation details of the device risk prediction method in this embodiment will be described in detail below, and the following content is provided for convenience of understanding The implementation details are not necessary to implement this solution.
  • Step 201 receiving gradients of statistical algorithm models uploaded by N sub-computing nodes; multiple devices acquired by N sub-computing nodes all meet preset common conditions, and N is a positive integer greater than 1. That is, the public server corresponds to N sub-computing nodes, and the attributes of all devices acquired by the N sub-computing nodes all meet preset public conditions.
  • the preset common condition is that the models of the multiple devices are the same, the difference between the latitude coordinates of the multiple devices is within the third preset range, and the temperature difference between the regions where the multiple devices are located is within the fourth preset range. within the set range.
  • the third preset range is not smaller than the first preset range
  • the fourth preset range is not smaller than the second preset range. That is to say, the properties of multiple devices acquired in the same public server are similar, and the number of parameters in the sub-computing nodes can be expanded.
  • the attribute range of multiple devices obtained by the sub-computing node in the public server can be consistent with the attribute range of the corresponding device in a sub-computing node, or can be appropriately expanded to expand the amount of data used by the sub-computing node for calculation .
  • the number of devices required by the public server has a lower limit
  • the number of devices acquired by the sub-computing nodes under the public server needs to be greater than the lower limit.
  • M is the lower limit value
  • NA to NP respectively represent the number of devices whose attributes selected by each sub-computing node meet the preset conditions.
  • sub-computing nodes with similar attributes of corresponding devices are selected as sub-computing nodes of the same system.
  • the sub-computing node as the site and the equipment as the RRU as an example, that is, the attributes of the RRU in the site A are similar to the attributes of the RRU in the site B, and although the number of RRUs in the site C is large, the If the RRU attributes of RRUs are different, then A and B are more suitable as a computing system, corresponding to the same public server. It is necessary to find other sites that have similar RRU attributes to site C, and site C corresponds to the same public server.
  • the relationship between Site A and Site B and the public server is shown in Figure 4.
  • the sub-computing nodes corresponding to the public servers corresponding to Sites A and B are not limited to only Sites A and B.
  • the public server may be a northbound server.
  • step 202 the gradients of the statistical algorithm models encrypted and uploaded by the N sub-computing nodes are aggregated to obtain updated gradients. That is, the N sub-computing nodes respectively collect the characteristic parameters of their corresponding devices, perform data cleaning, calculation, etc., and upload the calculated gradients to the public server.
  • the public server aggregates the gradients uploaded by the sub-computing nodes obtained by the current public server, that is, performs gradient calculations based on all devices obtained by the sub-computing nodes under the current public server to increase accuracy.
  • Step 203 sending the updated gradients to the N sub-computing nodes, and sending the updated gradients to the sub-computing nodes, so that the sub-computing nodes can update the statistical algorithm model and obtain prediction results.
  • the prediction result generated when the statistical algorithm model converges is the final prediction result.
  • a public server is provided to participate in the prediction process of the sub-computing nodes, obtain the gradients of each sub-computing node, and distribute them after integration to update the statistical algorithm model of the sub-computing nodes and expand the participation of each sub-computing node in the calculation
  • the amount of data can improve the prediction accuracy of each sub-computing node.
  • FIG. 5 Another embodiment of the present application relates to an electronic device, as shown in FIG. 5 , including: at least one processor 301; and a memory 302 communicatively connected to the at least one processor 301; wherein, the memory 302 stores Instructions that can be executed by the at least one processor 301, the instructions are executed by the at least one processor 301, so that the at least one processor 301 can execute the devices applied to sub-computing nodes in the above embodiments A risk prediction method, or a device risk prediction method applied to a public server in the various examples above.
  • the memory and the processor are connected by a bus
  • the bus may include any number of interconnected buses and bridges, and the bus connects one or more processors and various circuits of the memory together.
  • the bus may also connect together various other circuits such as peripherals, voltage regulators, and power management circuits, all of which are well known in the art and therefore will not be further described herein.
  • the bus interface provides an interface between the bus and the transceivers.
  • a transceiver may be a single element or multiple elements, such as multiple receivers and transmitters, providing means for communicating with various other devices over a transmission medium.
  • the data processed by the processor is transmitted on the wireless medium through the antenna, further, the antenna also receives the data and transmits the data to the processor.
  • the processor is responsible for managing the bus and general processing, and can also provide various functions, including timing, peripheral interface, voltage regulation, power management, and other control functions. Instead, memory can be used to store data that the processor uses when performing operations.
  • Another embodiment of the present application relates to a computer-readable storage medium storing a computer program.
  • the above method embodiments are implemented when the computer program is executed by the processor.
  • a storage medium includes several instructions to make a device ( It may be a single-chip microcomputer, a chip, etc.) or a processor (processor) to execute all or part of the steps of the methods described in the various embodiments of the present application.
  • the aforementioned storage media include: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disk or optical disc, etc., which can store program codes. .

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • Economics (AREA)
  • Strategic Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Tourism & Hospitality (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Marketing (AREA)
  • Game Theory and Decision Science (AREA)
  • General Business, Economics & Management (AREA)
  • Development Economics (AREA)
  • Educational Administration (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The embodiments of the present application relate to the field of information processing, in particular to a device risk prediction method, an electronic device and a computer-readable storage medium. The device risk prediction method comprises: respectively acquiring feature data of a plurality of devices; acquiring, according to the feature data, a gradient of a statistical algorithm model for predicting a device risk; encrypting the gradient of the statistical algorithm model and uploading same to a public server; receiving an updated gradient, which is issued by the public server, wherein the updated gradient is obtained by means of the public server performing aggregation according to gradients which are uploaded by N computing sub-nodes; updating the statistical algorithm model according to the updated gradient; and when the updated statistical algorithm model converges, outputting a result of device risk prediction.

Description

设备风险预测方法、电子设备和计算机可读存储介质Device risk prediction method, electronic device and computer readable storage medium
相关申请related application
本申请要求于2021年12月6号申请的、申请号为202111479950.3的中国专利申请的优先权。This application claims the priority of the Chinese patent application with application number 202111479950.3 filed on December 6, 2021.
技术领域technical field
本申请实施例涉及信息处理领域,特别涉及一种设备风险预测方法、电子设备和计算机可读存储介质。The embodiments of the present application relate to the field of information processing, and in particular to a device risk prediction method, an electronic device, and a computer-readable storage medium.
背景技术Background technique
随着第五代移动通信技术(5th Generation Mobile Communication Technology,5G)商用逐步推进,通信设备出现过温的情况更频繁。如果设备已上报过温告警再做处理,会严重影响设备正常工作,例如导致业务中断。With the commercialization of the fifth generation mobile communication technology (5th Generation Mobile Communication Technology, 5G) gradually advancing, communication equipment overheating occurs more frequently. If the device has reported an overtemperature alarm and then handles it, it will seriously affect the normal operation of the device, for example, causing service interruption.
为了避免业务中断影响业务的执行进程,可以对设备温度进行检测,用于预防。目前通过收集大量设备的温度数据,来进行设备环境易过温检测的,但用于检测的算法所需要的数据量较大,对于作为执行主体的单个子计算节点(例如局点)来说,单个局点数据量少且单一,不能够满足算法所需的运算条件。所以在使用过程中存在准确性不足的问题。In order to prevent service interruption from affecting the service execution process, the device temperature can be detected for prevention. At present, the temperature data of a large number of devices is collected to detect the overheating of the device environment, but the algorithm used for detection requires a large amount of data. For a single sub-computing node (such as an office point) as the execution subject, The amount of data at a single site is small and single, which cannot meet the operating conditions required by the algorithm. Therefore, there is a problem of insufficient accuracy in the process of use.
发明内容Contents of the invention
本申请实施例的主要目的在于提出一种设备风险预测方法、电子设备和计算机可读存储介质,用于提高设备风险预测的准确性。The main purpose of the embodiments of the present application is to provide a device risk prediction method, an electronic device, and a computer-readable storage medium, so as to improve the accuracy of device risk prediction.
为实现上述目的,本申请实施例提供了一种设备风险预测方法,应用于子计算节点,包括:分别获取多个设备的特征数据;根据所述特征数据获取用于预测设备风险的统计学算法模型的梯度;将所述统计学算法模型的梯度加密上传至公共服务器;接收所述公共服务器下发的更新后的梯度,所述更新后的梯度为所述公共服务器根据N个子计算节点上传的梯度进行聚合得到;根据所述更新后的梯度更新所述统计学算法模型;在更新后的统计学算法模型收敛的情况下,输出所述设备风险预测的结果。In order to achieve the above purpose, an embodiment of the present application provides a device risk prediction method, which is applied to a sub-computing node, including: respectively acquiring characteristic data of multiple devices; obtaining a statistical algorithm for predicting device risk according to the characteristic data Gradient of the model; encrypt and upload the gradient of the statistical algorithm model to the public server; receive the updated gradient issued by the public server, and the updated gradient is uploaded by the public server according to N sub-computing nodes The gradient is obtained by aggregation; the statistical algorithm model is updated according to the updated gradient; and the result of the equipment risk prediction is output when the updated statistical algorithm model converges.
为实现上述目的,本申请实施例还提供一种设备风险预测方法,应用于公共服务器,包括:接收N个子计算节点上传的统计学算法模型的梯度;所述N个子计算节点所获取的多个设备的属性均满足预设公共条件,所述N为大于1的正整数;将所述N个子计算节点加密上传的统计学算法模型的梯度进行聚合,得到更新后的梯度;向所述N个子计算节点发送所述更新后的梯度。In order to achieve the above purpose, the embodiment of the present application also provides a device risk prediction method, which is applied to the public server, including: receiving the gradient of the statistical algorithm model uploaded by the N sub-computing nodes; The attributes of the equipment all meet the preset public conditions, and the N is a positive integer greater than 1; the gradients of the statistical algorithm models encrypted and uploaded by the N sub-computing nodes are aggregated to obtain an updated gradient; The computing node sends the updated gradient.
为实现上述目的,本申请实施例还提供了一种电子设备,包括:至少一个处理器;以及,与所述至少一个处理器通信连接的存储器;其中,所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行上述的应用于子计算节点的设备风险预测方法,或能够执行上述的应用于公共管理器的设备风险预测方法。To achieve the above purpose, an embodiment of the present application further provides an electronic device, including: at least one processor; and a memory connected to the at least one processor in communication; wherein, the memory stores information that can be used by the at least one processor Instructions executed by a processor, the instructions are executed by the at least one processor, so that the at least one processor can execute the above-mentioned equipment risk prediction method applied to sub-computing nodes, or can execute the above-mentioned method applied to public A device risk prediction approach for managers.
为实现上述目的,本申请实施例还提供了一种计算机可读存储介质,存储有计算机程序,所述计算机程序被处理器执行时实现上述的应用于子计算节点的设备风险预测方法,或实现上述的应用于公共管理器的设备风险预测方法。In order to achieve the above purpose, the embodiment of the present application also provides a computer-readable storage medium, which stores a computer program, and when the computer program is executed by a processor, the above-mentioned device risk prediction method applied to a sub-computing node is implemented, or The above-mentioned equipment risk prediction method applied to the public manager.
在本申请的实施方式中,对设备是否会产生过温情况进行预测,防止由于安装环境或其他原因导致的设备产生过温情况,从而影响业务执行和设备寿命。在子计算节点用于计算的参数少的情况下,通过统计学算法模型对易过温情况进行预测,并与公共服务器进行加密数据传输,结合公共服务器下发的优化数据进行更新调整,增加参与计算的数据量,使得当前子计算节点的预测结果更准确,提高子计算节点的预测效率。In the embodiment of the present application, it is predicted whether the equipment will be overheated, so as to prevent the equipment from being overheated due to the installation environment or other reasons, thereby affecting business execution and equipment life. In the case that the sub-computing nodes have few parameters for calculation, the statistical algorithm model is used to predict the overheating situation, and the encrypted data transmission is carried out with the public server, and the optimization data issued by the public server is updated and adjusted to increase participation. The amount of calculated data makes the prediction result of the current sub-computing node more accurate and improves the prediction efficiency of the sub-computing node.
附图说明Description of drawings
图1是根据本申请一个实施例所提供的设备风险预测方法的流程图;FIG. 1 is a flow chart of a method for predicting equipment risk according to an embodiment of the present application;
图2是根据本申请一个实施例所提供的设备风险预测方法的示意图;Fig. 2 is a schematic diagram of an equipment risk prediction method provided according to an embodiment of the present application;
图3是根据本申请另一个实施例所提供的设备风险预测方法的流程图;Fig. 3 is a flow chart of a method for predicting equipment risk according to another embodiment of the present application;
图4是根据本申请另一个实施例所提供的设备风险预测方法的示意图;Fig. 4 is a schematic diagram of an equipment risk prediction method provided according to another embodiment of the present application;
图5是根据本申请一个实施例所提供的的电子设备的结构示意图。Fig. 5 is a schematic structural diagram of an electronic device provided according to an embodiment of the present application.
具体实施方式Detailed ways
为使本申请实施例的目的、技术方案和优点更加清楚,下面将结合附图对本申请的各实施例进行详细的阐述。然而,本领域的普通技术人员可以理解,在本申请各实施例中,为了使读者更好地理解本申请而提出了许多技术细节。但是,即使没有这些技术细节和基于以下各实施例的种种变化和修改,也可以实现本申请所要求保护的技术方案。以下各个实施例的划分是为了描述方便,不应对本申请的具体实现方式构成任何限定,各个实施例在不矛盾的前提下可以相互结合相互引用。In order to make the purpose, technical solutions and advantages of the embodiments of the present application clearer, the embodiments of the present application will be described in detail below with reference to the accompanying drawings. However, those of ordinary skill in the art can understand that in each embodiment of the application, many technical details are provided for readers to better understand the application. However, even without these technical details and various changes and modifications based on the following embodiments, the technical solutions claimed in this application can also be realized. The division of the following embodiments is for the convenience of description, and should not constitute any limitation to the specific implementation of the present application, and the embodiments can be combined and referred to each other on the premise of no contradiction.
随着5G商用逐步推进,通信设备过温情况也逐步上升,而设备不同程度的过温的处理过程如下:轻微—做功率降额处理,降额后告警消失尝试功率回升,告警消失则功率恢复到原值,如果告警还在,则继续降额处理。如果降额到门限(一般3dB)还有告警,则如实将告警上报;严重—直接做关功率放大器处理,先尝试关闭部分功放。如果告警消失尝试打开关闭的功放,如果告警还在,继续关功放处理。如果全部通道的功放关完还有告警,则如实将告警上报。With the gradual advancement of 5G commercial use, the overheating of communication equipment is also gradually increasing, and the processing process for different degrees of overheating of equipment is as follows: Minor—do power derating processing, after derating, the alarm disappears and try to power up, and the power recovers when the alarm disappears To the original value, if the alarm is still there, continue to derate processing. If the derating reaches the threshold (generally 3dB) and there is an alarm, report the alarm truthfully; if it is serious, turn off the power amplifier directly, and try to turn off some power amplifiers first. If the alarm disappears, try to turn on the power amplifier that is turned off. If the alarm is still there, continue to turn off the power amplifier. If there is still an alarm after the power amplifiers of all channels are turned off, report the alarm truthfully.
如果设备已上报过温告警再做处理,会严重影响设备正常工作,例如导致业务中断。设备易过温的原因可能有很多,例如型号问题、业务量大导致功率过高、安装环境封闭等。如果可以检测设备安装环境,就可以有效预测由于安装环境散热差导致的设备过温,减少设备损害及告警发生后降额关功放等措施对业务的影响和中断。If the device has reported an overtemperature alarm and then handles it, it will seriously affect the normal operation of the device, for example, causing service interruption. There may be many reasons why the equipment is prone to overheating, such as model problems, high power due to heavy traffic, and closed installation environment. If the installation environment of the equipment can be detected, it can effectively predict the overheating of the equipment caused by poor heat dissipation in the installation environment, and reduce the impact and interruption of the business by measures such as equipment damage and power amplifier derating after an alarm occurs.
当前对设备温度的检测包括监控和估算设备当前温度,对设备温度的预测方法较少。目前具有通过收集大量设备的温度数据,来进行设备环境易过温检测的算法,但该算法基于对大量数据的统计,而作为执行主体的单个局点数据量少且单一,存在准确性、适用性和优化效率不足的问题。The current detection of device temperature includes monitoring and estimating the current temperature of the device, and there are few methods for predicting the device temperature. At present, there is an algorithm for detecting the overheating of the equipment environment by collecting temperature data of a large number of equipment, but the algorithm is based on the statistics of a large amount of data, and the data volume of a single site as the execution subject is small and single, which is accurate and applicable The problem of lack of sex and optimization efficiency.
本申请的一个实施例涉及一种设备风险预测方法,应用于子计算节点。本实施例的设备风险预测方法的具体流程可以如图1所示,包括:An embodiment of the present application relates to a device risk prediction method, which is applied to sub-computing nodes. The specific process of the equipment risk prediction method in this embodiment can be shown in Figure 1, including:
步骤101,分别获取多个设备的特征数据; Step 101, acquiring feature data of multiple devices respectively;
步骤102,根据特征数据获取用于预测设备风险的统计学算法模型的梯度; Step 102, obtaining the gradient of the statistical algorithm model for predicting equipment risk according to the feature data;
步骤103,将统计学算法模型的梯度加密上传至公共服务器; Step 103, uploading the gradient encryption of the statistical algorithm model to the public server;
步骤104,接收公共服务器下发的更新后的梯度,更新后的梯度为公共服务器根据N个子计算节点上传的梯度进行聚合得到; Step 104, receiving the updated gradient issued by the public server, the updated gradient is obtained by the public server through aggregation according to the gradients uploaded by N sub-computing nodes;
步骤105,根据更新后的梯度更新统计学算法模型; Step 105, updating the statistical algorithm model according to the updated gradient;
步骤106,在更新后的统计学算法模型收敛的情况下,输出设备风险预测的结果。 Step 106, when the updated statistical algorithm model converges, output the result of equipment risk prediction.
下面对本实施例的设备风险预测方法的实现细节进行具体的说明,以下内容仅为方便理解提供的实现细节,并非实施本方案的必须。The implementation details of the equipment risk prediction method of this embodiment are described in detail below, and the following content is only implementation details provided for easy understanding, and is not necessary for implementing this solution.
在步骤101中,分别获取多个设备的特征数据。其中,子计算节点可以是局点,该局点获取基站收集的当前局点范围内的设备的特征数据。In step 101, feature data of multiple devices are acquired respectively. Wherein, the sub-computing node may be an office point, and the office point obtains characteristic data of devices within the current office point collected by the base station.
在一个例子中,多个设备的属性满足预设条件;预设条件包括:多个设备的型号相同,多个设备的纬度坐标的差值在第一预设范围内,并且多个设备各自所处地区的温度差值在第二预设范围内。即,单个子计算节点或局点中,设备的属性相近,使得该子计算节点能够根据所获取的特征数据对当前属性的设备进行风险预测。In one example, the attributes of the multiple devices meet the preset conditions; the preset conditions include: the models of the multiple devices are the same, the difference between the latitude coordinates of the multiple devices is within the first preset range, and the respective The temperature difference in the region is within the second preset range. That is, in a single sub-computing node or site, the attributes of the devices are similar, so that the sub-computing node can perform risk prediction on the device with the current attributes according to the acquired feature data.
在一个例子中,设备包括射频拉远单元(Remote Radio Unit,RRU);在设备为RRU的情况下,多个设备的特征数据包括各RRU的:功率、光模块温度、基带温度、中频温度、电源温度、单板温度、收发信机温度、位置、气温。在一个具体实现中,功率例如电源平均功率,光模块温度例如光/电模块收发器平均温度,基带温度例如RRU基带平均温度,中频温度例如RRU中频平均温度,电源温度例如RRU电源平均温度,单板温度例如RRU单板平均温度,收发信机温度例如RRU的发送(transport,TX)/接收(receive,RX)平均温度或功放温度平均值,位置例如经纬度,气温例如设备所处的当地气温。In one example, the device includes a remote radio unit (Remote Radio Unit, RRU); in the case of the device being an RRU, the feature data of multiple devices includes each RRU: power, optical module temperature, baseband temperature, intermediate frequency temperature, Power supply temperature, board temperature, transceiver temperature, location, air temperature. In a specific implementation, the power is such as the average power of the power supply, and the temperature of the optical module is such as the average temperature of the optical/electrical module transceiver. Board temperature, such as the average temperature of the RRU board, transceiver temperature, such as the average RRU transmit (transport, TX)/receive (receive, RX) temperature or power amplifier temperature, location, such as latitude and longitude, and air temperature, such as the local air temperature where the device is located.
另外,设备为位置不会发生大范围变动的电子设备即可,并不仅限于RRU,例如还可以是室内基带处理单元(Building Base band Unit,BBU),即本实施方式还可以对于BBU的风险进行预测。In addition, the device can be an electronic device whose position does not change in a large range, and is not limited to RRU, for example, it can also be an indoor baseband processing unit (Building Base band Unit, BBU). predict.
在步骤102中,根据特征数据获取用于预测设备风险的统计学算法模型的梯度;即,将所获取的特征数据输入统计学算法模型进行计算预测,并得到计算过程中的梯度,所述梯度能够表征该组特征数据中的均值和偏移范围,所述均值和偏移范围可以用于对当前统计学算法模型的运算结果进行衡量。例如,当前特征数据在均值的偏移范围内的部分,占当前特征数据的总数的比值小于预设比值的阈值,则当前统计学算法模型的运算结果不是最优,需要进行调整。也就是梯度需要调整,即模型未收敛。In step 102, the gradient of the statistical algorithm model used to predict equipment risk is obtained according to the feature data; that is, the acquired feature data is input into the statistical algorithm model for calculation and prediction, and the gradient in the calculation process is obtained, the gradient The mean value and deviation range in the set of characteristic data can be characterized, and the mean value and deviation range can be used to measure the operation results of the current statistical algorithm model. For example, if the ratio of the part of the current characteristic data within the deviation range of the mean value to the total number of the current characteristic data is less than the threshold value of the preset ratio, the calculation result of the current statistical algorithm model is not optimal and needs to be adjusted. That is, the gradient needs to be adjusted, that is, the model has not converged.
在一个例子中,根据特征数据获取用于预测设备风险的统计学算法模型的梯度之前,包括:对多个设备的特征数据进行数据清洗。即,采集特征数据时可通过主动检测或接收设备的上报途径获取,采集过程中并不对特征数据的格式等进行筛选,能够保证获取的样本数量。In an example, before obtaining the gradient of the statistical algorithm model for predicting equipment risk according to the feature data, it includes: performing data cleaning on the feature data of multiple devices. That is, when collecting characteristic data, it can be obtained through active detection or reporting of receiving equipment, and the format of characteristic data is not screened during the collection process, which can ensure the number of samples obtained.
在一个例子中,数据清洗,包括以下之一或其任意组合:统一特征数据的数据格式;删除特征数据中的异常数据;按照时间粒度整合特征数据。具体地,统一特征数据的数据格式,即由于采集过程中对于数据不设限,且对于采集特征数据的途径也不进行限制,所以所收到的特征数据可能会产生数据格式不一致的情况,为了便于计算,首先统一所获取的特征数据的格式。删除特征数据中的异常数据,对于计算节点,可能存在所能处理的数据格式有限的 情况,在所获取的特征数据为当前计算节点不能处理的格式的情况下,将其判定为异常数据,可以选择对其进行删除,避免格式问题影响计算过程,也可以先选择进行格式统一,无法实现统一时进行删除,保障计算过程中的数据量;此外,最大值、最小值以及最大值对应的预设范围区间和最小值对应的预设范围区间也可以定义为异常数据,在处理前进行删除。按照时间粒度整合特征数据,由于各类数据的上报时间粒度不同,需要将所有获取到的特征数据整合到一条数据流中,例如:将位置数据补充到每条采集到的温度数据中,即,将采集时间粒度较粗的位置数据补充到采集时间粒度更细的温度数据中。在一个具体执行过程中,特征数据a的粒度是15分钟一条数据,特征数据b是1小时一条数据,则进行数据整合后的每15分钟的一条数据流中,包含特征数据a和特征数据b,包含的a是本次采集的,b则是一小时内上一次采集的。In one example, data cleaning includes one of the following or any combination thereof: unifying the data format of feature data; deleting abnormal data in feature data; integrating feature data according to time granularity. Specifically, the data format of the characteristic data is unified, that is, since there are no restrictions on the data during the collection process, and there is no restriction on the way to collect the characteristic data, the received characteristic data may have inconsistent data formats. To facilitate calculation, firstly, the format of the acquired feature data is unified. Delete the abnormal data in the characteristic data. For computing nodes, there may be cases where the data format that can be processed is limited. If the obtained characteristic data is in a format that cannot be processed by the current computing node, it will be judged as abnormal data. You can Choose to delete it to avoid format problems affecting the calculation process. You can also choose to unify the format first, and delete it when unification cannot be achieved to ensure the amount of data in the calculation process; The preset range interval corresponding to the range interval and the minimum value can also be defined as abnormal data, which is deleted before processing. Integrate feature data according to time granularity. Since the reporting time granularity of various types of data is different, it is necessary to integrate all acquired feature data into one data stream, for example: add location data to each piece of collected temperature data, that is, Supplement the location data with coarser granularity of collection time to the temperature data with finer granularity of collection time. In a specific execution process, the granularity of feature data a is one piece of data every 15 minutes, and the granularity of feature data b is one piece of data every hour. After data integration, one data stream every 15 minutes contains feature data a and feature data b. , containing a is collected this time, and b is collected last time within one hour.
在步骤103中,将统计学算法模型的梯度加密上传至公共服务器。即在子计算节点存在预设的加密进程,在统计学算法模型执行后得到的梯度后,对获得的梯度进行加密,将加密后的梯度上传至公共服务器。In step 103, the encrypted gradient of the statistical algorithm model is uploaded to the public server. That is to say, there is a preset encryption process in the sub-computing nodes. After the gradient obtained after the statistical algorithm model is executed, the obtained gradient is encrypted, and the encrypted gradient is uploaded to the public server.
在一个具体实现中,在获取梯度之后,根据梯度得到均值及偏移范围,判断当前子计算节点获取的特征参数在均值的偏移范围中的的数据,占当前子计算节点获取的特征参数特征参数总数的比值,若该比值大于预设比值的阈值,则表示当前的计算结果是最优,满足需求,输出该计算结果;若该比值不大于预设比值的阈值,则表示当前的计算结果不是最优,需要进行梯度的调整,所以执行将梯度加密上传至公共服务器的步骤。In a specific implementation, after the gradient is obtained, the mean value and offset range are obtained according to the gradient, and the data of the characteristic parameters acquired by the current sub-computing node in the offset range of the mean value are judged to account for the characteristic parameters obtained by the current sub-computing node. The ratio of the total number of parameters, if the ratio is greater than the threshold of the preset ratio, it means that the current calculation result is optimal and meets the requirements, and the calculation result is output; if the ratio is not greater than the threshold of the preset ratio, it means the current calculation result It is not optimal and requires gradient adjustment, so perform the steps of uploading gradient encryption to the public server.
其中,将梯度上传至公共服务器的过程可以参考联邦学习算法,即参考联邦学习算法,对梯度数据在子计算节点与公共服务器之间进行加密传输,保证梯度数据的安全性。可以理解的是,公共服务器中存在与当前子节点中加密算法对应的解密算法。Among them, the process of uploading the gradient to the public server can refer to the federated learning algorithm, that is, refer to the federated learning algorithm to encrypt and transmit the gradient data between the sub-computing nodes and the public server to ensure the security of the gradient data. It can be understood that there is a decryption algorithm corresponding to the encryption algorithm in the current child node in the public server.
在步骤104中,接收公共服务器下发的更新后的梯度,更新后的梯度为公共服务器根据N个子计算节点上传的梯度进行聚合得到。即,所接收到的公共服务器下发的更新后的梯度,是公共服务器整合自身范围内的子计算节点上传的梯度得到的。由于公共服务器范围内存在多个子计算节点,各子计算节点也分别对应多个设备,所以对于一个公共服务器来说,接收到的梯度是经过大量的样本数据得到的值。由于样本量丰富,所以将接收到的梯度进行聚合后的值更接近当前子节点的统计学算法模型的梯度的最优选择,用聚合后的梯度更新原梯度值。In step 104, the updated gradient issued by the public server is received, and the updated gradient is obtained by the public server through aggregation according to the gradients uploaded by the N sub-computing nodes. That is, the received updated gradient issued by the public server is obtained by integrating the gradients uploaded by the sub-computing nodes within the public server's own scope. Since there are multiple sub-computing nodes within the scope of the public server, and each sub-computing node also corresponds to multiple devices, for a public server, the received gradient is a value obtained through a large amount of sample data. Due to the abundant sample size, the aggregated value of the received gradient is closer to the optimal choice of the gradient of the statistical algorithm model of the current child node, and the original gradient value is updated with the aggregated gradient.
此外,在公共服务器向子计算节点下发更新后的梯度时,也可以进行加密,在子计算节点收到后子计算节点进行解密,进一步提高数据传输的安全性。In addition, when the public server sends the updated gradient to the sub-computing node, it can also be encrypted, and the sub-computing node decrypts it after the sub-computing node receives it, further improving the security of data transmission.
在步骤105中,根据更新后的梯度更新统计学算法模型。即,根据公共服务器发送的更新后的梯度,更新当前计算节点的统计学算法模型,使得当前子计算节点的统计学算法模型的输出更接近实际情况,其中,统计学算法模型例如统计学聚类算法模型,通过对相似样本不断进行合并,将相对近的群组构造成一个大的群组。例如,公共服务器中接收到的各子计算节点上传的梯度,即得到各子计算节点上传的均值及偏移范围,聚合各均值及偏移范围,得到更新后的均值及偏移范围;将更新后的梯度下发至当前子计算节点,则当前子计算节点也得到了公共服务器更新后的均值及偏移范围;采用公共服务器下发的梯度更新当前计算节点的统计学算法模型。In step 105, the statistical algorithm model is updated according to the updated gradient. That is, according to the updated gradient sent by the public server, the statistical algorithm model of the current computing node is updated, so that the output of the statistical algorithm model of the current sub-computing node is closer to the actual situation, wherein the statistical algorithm model such as statistical clustering The algorithm model constructs relatively close groups into a large group by continuously merging similar samples. For example, the gradient uploaded by each sub-computing node received in the public server is to obtain the mean value and offset range uploaded by each sub-computing node, aggregate each mean value and offset range, and obtain the updated mean value and offset range; update After the gradient is sent to the current sub-computing node, the current sub-computing node also gets the updated mean value and offset range from the public server; the statistical algorithm model of the current computing node is updated using the gradient delivered by the public server.
其中,采用公共服务器下发的梯度更新当前计算节点的统计学算法模型,例如:根据公 共服务器下发的梯度,更新统计学算法模型中的参数,包括更新各判定标准,例如均值及偏移范围,其中,均值及偏移范围包括温度、经纬度、功率等的相关均值及偏移范围,参数更新后可以判断当前统计学算法模型是否收敛。此外,还可以更新异常数据的判定标准。Among them, the gradient issued by the public server is used to update the statistical algorithm model of the current computing node, for example: according to the gradient issued by the public server, the parameters in the statistical algorithm model are updated, including updating various judgment criteria, such as the mean value and offset range , where the mean value and offset range include the relative mean value and offset range of temperature, latitude and longitude, power, etc. After the parameters are updated, it can be judged whether the current statistical algorithm model is converged. In addition, the criteria for judging abnormal data can also be updated.
在步骤106中,在更新后的统计学算法模型收敛的情况下,输出设备风险预测的结果。在当前子计算节点的统计学算法模型根据公共服务器下发的梯度更新之后,再次根据特征数据进行运算,并获取本次运算过程中的梯度,根据该梯度对应的均值及偏移范围判断本次统计学算法模型是否收敛,若收敛,则本次统计学算法模型的计算结果即为最终的设备风险预测结果。In step 106, when the updated statistical algorithm model converges, the result of equipment risk prediction is output. After the statistical algorithm model of the current sub-computing node is updated according to the gradient issued by the public server, the calculation is performed again according to the characteristic data, and the gradient during this calculation process is obtained, and the current calculation is judged according to the mean value and offset range corresponding to the gradient. Whether the statistical algorithm model converges, and if it converges, the calculation result of the statistical algorithm model is the final equipment risk prediction result.
在一个例子中,根据更新后的梯度更新统计学算法模型之后,还包括:在更新后的统计学算法模型不收敛的情况下,将更新后的统计学算法模型根据特征数据得到的梯度加密上传至公共服务器;接收公共服务器再次下发的更新后的梯度,根据公共服务器再次下发的更新后的梯度再次更新统计学算法模型,直到更新后的统计学算法模型收敛,如图2所示。即,若根据本次计算过程中的梯度得到的均值和偏移范围进行判断,获取的特征数据位于均值的偏移范围内的比例仍不大于预设比值的阈值,则本次计算过程中的统计学算法模型仍然不收敛,需要继续与公共服务器进行信息交互,根据更新后的均值及偏移范围等判定标准重新计算梯度,上传至公共服务器;直到更新后的统计学算法模型收敛,输出收敛后的设备风险预测的结果。In an example, after updating the statistical algorithm model according to the updated gradient, it also includes: in the case that the updated statistical algorithm model does not converge, encrypting and uploading the updated statistical algorithm model according to the gradient obtained from the feature data To the public server; receive the updated gradient issued by the public server again, and update the statistical algorithm model again according to the updated gradient issued by the public server again until the updated statistical algorithm model converges, as shown in Figure 2. That is, if it is judged according to the average value and offset range obtained by the gradient in this calculation process, and the proportion of the acquired feature data within the offset range of the average value is still not greater than the threshold value of the preset ratio, then the calculation process in this calculation process The statistical algorithm model still does not converge, and it is necessary to continue to exchange information with the public server, recalculate the gradient according to the updated mean and offset range and other criteria, and upload it to the public server; until the updated statistical algorithm model converges, the output converges The results of the final equipment risk prediction.
其中,统计学算法模型的设备风险预测的结果包括:正常、一般易过温、高风险易过温、严重易过温。正常则表示当前子计算节点所获取的特征数据对应的设备安装环境正常;一般易过温则表示存在过温风险,建议对安装环境的散热情况进行检查,确定引起散热的不良影响的原因,并在必要时对安装环境进行改造,消除散热不良对设备性能、硬件寿命等方面的影响;高风险易过温表示安装环境较高的过温风险,建议必要时对环境进行改造,消除散热不良对设备性能、硬件寿命等方面的影响;严重易过温,表示当前设备的安装环境极易产生过温情况,建议进行整改。前述设备的安装环境,例如RRU的安装环境。Among them, the equipment risk prediction results of the statistical algorithm model include: normal, general overheating, high risk overheating, and severe overheating. Normal means that the device installation environment corresponding to the characteristic data obtained by the current sub-computing node is normal; general overheating means that there is an overheating risk. It is recommended to check the heat dissipation of the installation environment to determine the cause of the adverse effects of heat dissipation, and Renovate the installation environment when necessary to eliminate the impact of poor heat dissipation on equipment performance and hardware life; high risk of overheating indicates a high risk of overheating in the installation environment. It is recommended to renovate the environment when necessary to eliminate the impact of poor heat dissipation The impact of equipment performance, hardware life, etc.; serious overheating, indicating that the current installation environment of the equipment is very prone to overheating, and it is recommended to rectify it. The installation environment of the aforementioned equipment, for example, the installation environment of the RRU.
为使得上述实施方式便于理解,以下结合执行场景举例说明部分实施过程。可以理解的是,仅为举例说明,并不对实施例的相关内容进行限制。In order to facilitate the understanding of the foregoing implementation manner, a part of the implementation process is described below in combination with an execution scenario. It can be understood that it is only for illustration, and does not limit the relevant content of the embodiment.
在一个例子中,某局点处于温度较高地带,经常发生RRU过温告警,怀疑是安装环境散热不好导致。需要对RRU的安装环境进行检查,找到哪些RRU是由于安装环境散热不良导致的易过温,进而对安装环境进行整改。但是这一局点的RRU类型多且每一种RRU的数量很少,得到的针对每种类型RRU的结果不够准确,因此通过以下流程进行检测:In one example, a site is located in a high temperature zone, and RRU overheating alarms often occur, which is suspected to be caused by poor heat dissipation in the installation environment. It is necessary to check the installation environment of the RRUs, find out which RRUs are prone to overheating due to poor heat dissipation in the installation environment, and then rectify the installation environment. However, there are many types of RRUs in this site and the number of each type of RRU is very small, and the result obtained for each type of RRU is not accurate enough. Therefore, the following process is used for detection:
首先选择纬度坐标的差值在第一预设范围内、所处地区的温度差值在第二预设范围内,且RRU型号相同的当前局点内的设备,获取相应设备中的特征数据;其次,将该局点根据统计学算法模型计算得到梯度加密上传至公共服务器,公共服务器聚合多个局点的数据计算得到新的梯度,下发给各局点,其中,公共服务器下的多个局点所对应的设备的型号相同,属性相近。根据公共服务器下发的新的梯度更新模型,不断重复直至模型收敛,模型收敛时所输出的结果即为最终预测结果。Firstly, select the equipment in the current site whose latitude coordinate difference is within the first preset range, the temperature difference of the region is within the second preset range, and the RRU model is the same, and obtain the characteristic data in the corresponding equipment; Secondly, the gradient calculated by the site according to the statistical algorithm model is encrypted and uploaded to the public server. The public server aggregates the data of multiple sites to calculate a new gradient and distributes it to each site. Among them, multiple sites under the public server The devices corresponding to the points have the same model and similar properties. Update the model according to the new gradient issued by the public server, and repeat until the model converges, and the output result when the model converges is the final prediction result.
通过上述过程得到RRU是否处于易过温的安装环境,以及安装环境的易过温程度,可以进一步检查严重易过温的RRU是否安装了美化罩且美化罩无散热孔、安装在密闭的狭小封闭空间等,并对有问题的安装环境进行整改,避免RRU因为安装环境问题产生运行中断或设备 损坏等。Through the above process, whether the RRU is in an installation environment prone to overheating and the degree of overheating in the installation environment can be further checked whether the RRU that is seriously prone to overheating is installed with a beautification cover and the beautification cover has no cooling holes and is installed in a small, airtight enclosure. Space, etc., and rectify the problematic installation environment to avoid RRU operation interruption or equipment damage due to installation environment problems.
在另一个例子中,某局点业务量较小,但夏季要扩展业务,为避免安装环境散热不良在RRU功率增加时带来的过温问题,希望预测RRU的安装环境的易过温程度,提前整改或者确定在哪些RRU上进行扩容。但该局点日常运行功率较小,用于进行预测的数据量不足,所以选择通过将其他局点中与该局点属性相近但运行功率较大的局点数据通过公共服务器一起进行模型训练,预测该局点易过温风险较高的RRU,并进行整改。例如局点中RRU1当前业务量较小,通过整合其他局点的不同运行功率下的RRU1的数据,可以预测RRU1在不同运行功率下的温度,用以判断功率增加时是否易过温。该方法需要采集较长时间的历史数据。In another example, the business volume of a certain site is small, but the business needs to be expanded in summer. In order to avoid the overheating problem caused by the poor heat dissipation of the installation environment when the RRU power increases, it is hoped to predict the overheating degree of the RRU installation environment. Rectify in advance or determine which RRUs to expand. However, the daily operating power of this site is small, and the amount of data used for prediction is insufficient. Therefore, we choose to use the data of other sites with similar attributes to this site but with high operating power to perform model training together through the public server. Predict the RRUs that are prone to overheating at this site and make rectifications. For example, the current service volume of RRU1 in the site is small. By integrating the data of RRU1 at different operating powers in other sites, the temperature of RRU1 at different operating powers can be predicted to determine whether it is prone to overheating when the power increases. This method needs to collect historical data for a long time.
首先选择容量更大(例如小区更多),且其中与该RRU1型号相同的RRU数量多的局点,与当前RRU1所在的局点对应同一公共服务器,组成联邦学习系统。在该公共服务器下的各局点创建采集和计算任务,将梯度加密上传给公共服务器,公共服务器通过整合各局点数据,得到新梯度。将新梯度下发到各局点。各局点根据新梯度不断更新统计学聚类算法模型并上传更新后新梯度,直至模型收敛。模型收敛时模型预测的结果即为最终预测结果。First, select a site with a larger capacity (for example, more cells) and a large number of RRUs of the same model as the RRU1, which corresponds to the same public server as the site where the current RRU1 is located to form a federated learning system. Create acquisition and calculation tasks at each site under the public server, and upload the encrypted gradient to the public server. The public server obtains a new gradient by integrating the data of each site. Send the new gradient to each site. Each site continuously updates the statistical clustering algorithm model according to the new gradient and uploads the updated new gradient until the model converges. When the model converges, the predicted result of the model is the final predicted result.
可以预测当前局点中哪些RRU在运行功率较大时易过温,或者需要对安装环境进行整改。例如结果显示RRU1易过温,如果功率继续上升会上报过温告警,可以选择优先对RRU1的安装环境进行检查和整改,整改后再扩大业务。或者在其他安装环境正常的RRU增大业务量。It can predict which RRUs in the current site are prone to overheating when the operating power is high, or the installation environment needs to be rectified. For example, the results show that RRU1 is prone to overheating. If the power continues to rise, an overtemperature alarm will be reported. You can choose to check and rectify the installation environment of RRU1 first, and expand business after rectification. Or the normal RRU increases the service volume in other installation environments.
在本实施例中,对设备是否会产生过温情况进行预测,防止由于安装环境或其他原因导致的设备产生过温情况,从而影响业务执行和设备寿命。在子计算节点用于计算的参数少的情况下,通过统计学算法模型对易过温情况进行预测,并与公共服务器进行加密数据传输,结合公共服务器下发的优化数据进行更新调整,增加参与计算的数据量,使得当前子计算节点的预测结果更准确,提高子计算节点的预测效率。In this embodiment, it is predicted whether the equipment will be overheated, so as to prevent the equipment from being overheated due to the installation environment or other reasons, thereby affecting service execution and equipment life. In the case that the sub-computing nodes have few parameters for calculation, the statistical algorithm model is used to predict the overheating situation, and the encrypted data transmission is carried out with the public server, and the optimization data issued by the public server is updated and adjusted to increase participation. The amount of calculated data makes the prediction result of the current sub-computing node more accurate and improves the prediction efficiency of the sub-computing node.
本申请的另一个实施例涉及一种设备风险预测方法,如图3所示,应用于公共服务器,下面对本实施例的设备风险预测方法的实现细节进行具体的说明,以下内容仅为方便理解提供的实现细节,并非实施本方案的必须。Another embodiment of the present application relates to a device risk prediction method, as shown in Figure 3, which is applied to a public server. The implementation details of the device risk prediction method in this embodiment will be described in detail below, and the following content is provided for convenience of understanding The implementation details are not necessary to implement this solution.
步骤201,接收N个子计算节点上传的统计学算法模型的梯度;N个子计算节点所获取的多个设备均满足预设公共条件,N为大于1的正整数。即,公共服务器对应N个子计算节点,该N个子计算节点所获取的所有设备的属性均满足预设公共条件。Step 201 , receiving gradients of statistical algorithm models uploaded by N sub-computing nodes; multiple devices acquired by N sub-computing nodes all meet preset common conditions, and N is a positive integer greater than 1. That is, the public server corresponds to N sub-computing nodes, and the attributes of all devices acquired by the N sub-computing nodes all meet preset public conditions.
在一个例子中,预设公共条件为多个设备的型号相同,多个设备的纬度坐标的差值在第三预设范围内,并且多个设备各自所处地区的温度差值在第四预设范围内。其中,所述第三预设范围不小于上述第一预设范围,所述第四预设范围不小于上述第二预设范围。也就是说同一公共服务器中所获取到的多个设备的属性相近,能够扩展子计算节点中的参数的数量。公共服务器中子计算节点所获取的多个设备的属性的范围,可以与某一子计算节点中对应的设备的属性范围一致,或能够进行适当扩展,以扩展子计算节点用于计算的数据量。In one example, the preset common condition is that the models of the multiple devices are the same, the difference between the latitude coordinates of the multiple devices is within the third preset range, and the temperature difference between the regions where the multiple devices are located is within the fourth preset range. within the set range. Wherein, the third preset range is not smaller than the first preset range, and the fourth preset range is not smaller than the second preset range. That is to say, the properties of multiple devices acquired in the same public server are similar, and the number of parameters in the sub-computing nodes can be expanded. The attribute range of multiple devices obtained by the sub-computing node in the public server can be consistent with the attribute range of the corresponding device in a sub-computing node, or can be appropriately expanded to expand the amount of data used by the sub-computing node for calculation .
具体地,公共服务器所需的设备数存在下限值,需要公共服务器下的子计算节点所获取的设备的数量大于该下限值。例如,参考NA+NB+NC……NP>M公式,其中M为下限值,NA到NP分别表示每个子计算节点选取的属性满足预设条件的设备数量。Specifically, the number of devices required by the public server has a lower limit, and the number of devices acquired by the sub-computing nodes under the public server needs to be greater than the lower limit. For example, refer to the NA+NB+NC...NP>M formula, where M is the lower limit value, and NA to NP respectively represent the number of devices whose attributes selected by each sub-computing node meet the preset conditions.
为了保证属性满足预设公共条件的设备的数量足够多,选取对应的设备的属性相近的子计算节点作为同一个系统的子计算节点。其中,以子计算节点为局点,设备为RRU为例,即 局点A中RRU的属性和局点B中RRU的属性相近,而局点C中虽然RRU数量多,但于A、B中的RRU属性都不同,则A、B更适合作为一个计算系统,对应同一个公共服务器。需要寻找其他与局点C中RRU属性相近的局点与局点C对应同一个公共服务器。局点A和局点B与公共服务器的关系如图4所示,此外,局点A、B所对应的公共服务器所对应的子计算节点也并不限于仅包含局点A和B。In order to ensure that the number of devices whose attributes meet the preset public conditions is sufficient, sub-computing nodes with similar attributes of corresponding devices are selected as sub-computing nodes of the same system. Among them, taking the sub-computing node as the site and the equipment as the RRU as an example, that is, the attributes of the RRU in the site A are similar to the attributes of the RRU in the site B, and although the number of RRUs in the site C is large, the If the RRU attributes of RRUs are different, then A and B are more suitable as a computing system, corresponding to the same public server. It is necessary to find other sites that have similar RRU attributes to site C, and site C corresponds to the same public server. The relationship between Site A and Site B and the public server is shown in Figure 4. In addition, the sub-computing nodes corresponding to the public servers corresponding to Sites A and B are not limited to only Sites A and B.
此外,所述公共服务器可以为北向服务器。In addition, the public server may be a northbound server.
步骤202,将N个子计算节点加密上传的统计学算法模型的梯度进行聚合,得到更新后的梯度。即,N个子计算节点分别采集自己所对应的设备的特征参数,进行数据清理,计算等,并将计算得到的梯度上传公共服务器。公共服务器根据当前公共服务器获取的子计算节点上传的梯度进行聚合,也就是根据当前公共服务器下的子计算节点获取的所有设备进行梯度计算,增加准确度。In step 202, the gradients of the statistical algorithm models encrypted and uploaded by the N sub-computing nodes are aggregated to obtain updated gradients. That is, the N sub-computing nodes respectively collect the characteristic parameters of their corresponding devices, perform data cleaning, calculation, etc., and upload the calculated gradients to the public server. The public server aggregates the gradients uploaded by the sub-computing nodes obtained by the current public server, that is, performs gradient calculations based on all devices obtained by the sub-computing nodes under the current public server to increase accuracy.
步骤203,向N个子计算节点发送更新后的梯度,将更新后的梯度下发至子计算节点,供子计算节点更新统计学算法模型,进而得到预测结果。当统计学算法模型收敛时所产生的预测结果,即为最终预测结果。 Step 203 , sending the updated gradients to the N sub-computing nodes, and sending the updated gradients to the sub-computing nodes, so that the sub-computing nodes can update the statistical algorithm model and obtain prediction results. The prediction result generated when the statistical algorithm model converges is the final prediction result.
在本实施方式中,提供公共服务器参与子计算节点的预测过程,获取各子计算节点的梯度,进行整合后下发,用于更新子计算节点的统计学算法模型,扩展各子计算节点参与计算的数据量,提升各子计算节点预测的准确性。In this embodiment, a public server is provided to participate in the prediction process of the sub-computing nodes, obtain the gradients of each sub-computing node, and distribute them after integration to update the statistical algorithm model of the sub-computing nodes and expand the participation of each sub-computing node in the calculation The amount of data can improve the prediction accuracy of each sub-computing node.
本申请另一个实施例涉及一种电子设备,如图5所示,包括:至少一个处理器301;以及,与所述至少一个处理器301通信连接的存储器302;其中,所述存储器302存储有可被所述至少一个处理器301执行的指令,所述指令被所述至少一个处理器301执行,以使所述至少一个处理器301能够执行上述各实施例中的应用于子计算节点的设备风险预测方法,或能够执行上述各式实例中的应用于公共服务器的设备风险预测方法。Another embodiment of the present application relates to an electronic device, as shown in FIG. 5 , including: at least one processor 301; and a memory 302 communicatively connected to the at least one processor 301; wherein, the memory 302 stores Instructions that can be executed by the at least one processor 301, the instructions are executed by the at least one processor 301, so that the at least one processor 301 can execute the devices applied to sub-computing nodes in the above embodiments A risk prediction method, or a device risk prediction method applied to a public server in the various examples above.
其中,存储器和处理器采用总线方式连接,总线可以包括任意数量的互联的总线和桥,总线将一个或多个处理器和存储器的各种电路连接在一起。总线还可以将诸如外围设备、稳压器和功率管理电路等之类的各种其他电路连接在一起,这些都是本领域所公知的,因此,本文不再对其进行进一步描述。总线接口在总线和收发机之间提供接口。收发机可以是一个元件,也可以是多个元件,比如多个接收器和发送器,提供用于在传输介质上与各种其他装置通信的单元。经处理器处理的数据通过天线在无线介质上进行传输,进一步,天线还接收数据并将数据传送给处理器。Wherein, the memory and the processor are connected by a bus, and the bus may include any number of interconnected buses and bridges, and the bus connects one or more processors and various circuits of the memory together. The bus may also connect together various other circuits such as peripherals, voltage regulators, and power management circuits, all of which are well known in the art and therefore will not be further described herein. The bus interface provides an interface between the bus and the transceivers. A transceiver may be a single element or multiple elements, such as multiple receivers and transmitters, providing means for communicating with various other devices over a transmission medium. The data processed by the processor is transmitted on the wireless medium through the antenna, further, the antenna also receives the data and transmits the data to the processor.
处理器负责管理总线和通常的处理,还可以提供各种功能,包括定时,外围接口,电压调节、电源管理以及其他控制功能。而存储器可以被用于存储处理器在执行操作时所使用的数据。The processor is responsible for managing the bus and general processing, and can also provide various functions, including timing, peripheral interface, voltage regulation, power management, and other control functions. Instead, memory can be used to store data that the processor uses when performing operations.
本申请另一个实施例涉及一种计算机可读存储介质,存储有计算机程序。计算机程序被处理器执行时实现上述方法实施例。Another embodiment of the present application relates to a computer-readable storage medium storing a computer program. The above method embodiments are implemented when the computer program is executed by the processor.
即,本领域技术人员可以理解,实现上述实施例方法中的全部或部分步骤是可以通过程序来指令相关的硬件来完成,该程序存储在一个存储介质中,包括若干指令用以使得一个设备(可以是单片机,芯片等)或处理器(processor)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等各种可以 存储程序代码的介质。That is, those skilled in the art can understand that all or part of the steps in the method of the above-mentioned embodiments can be completed by instructing related hardware through a program, the program is stored in a storage medium, and includes several instructions to make a device ( It may be a single-chip microcomputer, a chip, etc.) or a processor (processor) to execute all or part of the steps of the methods described in the various embodiments of the present application. The aforementioned storage media include: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disk or optical disc, etc., which can store program codes. .
本领域的普通技术人员可以理解,上述各实施方式是实现本申请的具体实施例,而在实际应用中,可以在形式上和细节上对其作各种改变,而不偏离本申请的精神和范围。Those of ordinary skill in the art can understand that the above-mentioned implementation modes are specific examples for realizing the present application, and in practical applications, various changes can be made to it in form and details without departing from the spirit and spirit of the present application. scope.

Claims (10)

  1. 一种设备风险预测方法,应用于子计算节点,包括:A device risk prediction method applied to sub-computing nodes, including:
    分别获取多个设备的特征数据;Obtain feature data of multiple devices separately;
    根据所述特征数据获取用于预测设备风险的统计学算法模型的梯度;Obtaining the gradient of a statistical algorithm model for predicting equipment risk according to the characteristic data;
    将所述统计学算法模型的梯度加密上传至公共服务器;Upload the gradient encryption of the statistical algorithm model to the public server;
    接收所述公共服务器下发的更新后的梯度,所述更新后的梯度为所述公共服务器根据N个子计算节点上传的梯度进行聚合得到;receiving the updated gradient issued by the public server, where the updated gradient is obtained by aggregation of the public server according to the gradients uploaded by N sub-computing nodes;
    根据所述更新后的梯度更新所述统计学算法模型;updating the statistical algorithm model according to the updated gradient;
    在更新后的统计学算法模型收敛的情况下,输出所述设备风险预测的结果。When the updated statistical algorithm model converges, output the result of the equipment risk prediction.
  2. 根据权利要求1所述的设备风险预测方法,其中,所述根据所述更新后的梯度更新所述统计学算法模型之后,还包括:The equipment risk prediction method according to claim 1, wherein, after updating the statistical algorithm model according to the updated gradient, further comprising:
    在所述更新后的统计学算法模型不收敛的情况下,将所述更新后的统计学算法模型根据所述特征数据得到的梯度加密上传至公共服务器;In the case that the updated statistical algorithm model does not converge, uploading the gradient encryption obtained by the updated statistical algorithm model according to the characteristic data to the public server;
    接收所述公共服务器再次下发的更新后的梯度,根据所述公共服务器再次下发的更新后的梯度再次更新所述统计学算法模型,直到更新后的统计学算法模型收敛。receiving the updated gradient sent again by the public server, and updating the statistical algorithm model again according to the updated gradient sent again by the public server until the updated statistical algorithm model converges.
  3. 根据权利要求1所述的设备风险预测方法,其中,所述根据所述特征数据获取用于预测设备风险的统计学算法模型的梯度之前,包括:The equipment risk prediction method according to claim 1, wherein, before obtaining the gradient of the statistical algorithm model for predicting equipment risk according to the characteristic data, comprising:
    对所述多个设备的特征数据进行数据清洗。Data cleaning is performed on the feature data of the plurality of devices.
  4. 根据权利要求3所述的设备风险预测方法,其中,所述数据清洗,包括以下之一或其任意组合:The equipment risk prediction method according to claim 3, wherein the data cleaning includes one of the following or any combination thereof:
    统一所述特征数据的数据格式;Unify the data format of the feature data;
    删除所述特征数据中的异常数据;deleting abnormal data in the characteristic data;
    按照时间粒度整合所述特征数据。The feature data is integrated according to time granularity.
  5. 根据权利要求1所述的设备风险预测方法,其中,所述多个设备的属性满足预设条件;所述预设条件包括:所述多个设备的型号相同,所述多个设备的纬度坐标的差值在第一预设范围内,并且所述多个设备各自所处地区的温度差值在第二预设范围内。The device risk prediction method according to claim 1, wherein the attributes of the multiple devices meet preset conditions; the preset conditions include: the models of the multiple devices are the same, and the latitude coordinates of the multiple devices The temperature difference is within the first preset range, and the temperature difference of the areas where the plurality of devices are located is within the second preset range.
  6. 根据权利要求1至5中任一项所述的设备风险预测方法,其中,所述设备包括射频拉远单元RRU;The equipment risk prediction method according to any one of claims 1 to 5, wherein the equipment includes a remote radio unit (RRU);
    在所述设备为RRU的情况下,所述多个设备的特征数据包括各RRU的:功率、光模块温度、基带温度、中频温度、电源温度、单板温度、收发信机温度、位置、气温。In the case where the device is an RRU, the feature data of the plurality of devices includes each RRU: power, optical module temperature, baseband temperature, intermediate frequency temperature, power supply temperature, single board temperature, transceiver temperature, position, air temperature .
  7. 一种设备风险预测方法,应用于公共服务器,包括:A device risk prediction method applied to public servers, including:
    接收N个子计算节点上传的统计学算法模型的梯度;所述N个子计算节点所获取的多个 设备的属性均满足预设公共条件,所述N为大于1的正整数;Receive the gradient of the statistical algorithm model uploaded by N sub-computing nodes; the attributes of multiple devices acquired by the N sub-computing nodes all meet the preset public conditions, and the N is a positive integer greater than 1;
    将所述N个子计算节点加密上传的统计学算法模型的梯度进行聚合,得到更新后的梯度;Aggregating gradients of statistical algorithm models encrypted and uploaded by the N sub-computing nodes to obtain updated gradients;
    向所述N个子计算节点发送所述更新后的梯度。Sending the updated gradients to the N sub-computing nodes.
  8. 根据权利要求7所述的设备风险预测方法,其中,所述预设公共条件,包括:The equipment risk prediction method according to claim 7, wherein the preset public conditions include:
    所述多个设备的型号相同,所述多个设备的纬度坐标的差值在第三预设范围内,并且所述多个设备各自所处地区的温度差值在第四预设范围内。The models of the multiple devices are the same, the difference between the latitude coordinates of the multiple devices is within a third preset range, and the temperature difference between the regions where the multiple devices are located is within a fourth preset range.
  9. 一种电子设备,包括:An electronic device comprising:
    至少一个处理器;以及,at least one processor; and,
    与所述至少一个处理器通信连接的存储器;其中,a memory communicatively coupled to the at least one processor; wherein,
    所述存储器存储有可被所述至少一个处理器执行的指令,所述指令被所述至少一个处理器执行,以使所述至少一个处理器能够执行如权利要求1至6中任一项所述的设备风险预测方法或能够执行如权利要求7至8中任一项所述的设备风险预测方法。The memory stores instructions executable by the at least one processor, the instructions are executed by the at least one processor, so that the at least one processor can perform the operation described in any one of claims 1 to 6 The equipment risk prediction method described above or the equipment risk prediction method described in any one of claims 7 to 8 can be implemented.
  10. 一种计算机可读存储介质,存储有计算机程序,其中,所述计算机程序被处理器执行时实现权利要求1至6中任一项所述的设备风险预测方法或实现如权利要求7至8中任一项所述的设备风险预测方法。A computer-readable storage medium storing a computer program, wherein, when the computer program is executed by a processor, the device risk prediction method according to any one of claims 1 to 6 is implemented or the device risk prediction method according to any one of claims 7 to 8 is implemented. The equipment risk prediction method described in any one.
PCT/CN2022/125757 2021-12-06 2022-10-17 Device risk prediction method, electronic device and computer-readable storage medium WO2023103592A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111479950.3A CN116307347A (en) 2021-12-06 2021-12-06 Device risk prediction method, electronic device, and computer-readable storage medium
CN202111479950.3 2021-12-06

Publications (1)

Publication Number Publication Date
WO2023103592A1 true WO2023103592A1 (en) 2023-06-15

Family

ID=86729580

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/125757 WO2023103592A1 (en) 2021-12-06 2022-10-17 Device risk prediction method, electronic device and computer-readable storage medium

Country Status (2)

Country Link
CN (1) CN116307347A (en)
WO (1) WO2023103592A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110134636A (en) * 2018-02-09 2019-08-16 中兴通讯股份有限公司 Model training method, server and computer readable storage medium
CN111143308A (en) * 2019-12-26 2020-05-12 许昌中科森尼瑞技术有限公司 Federal learning-based high-low voltage motor data processing method, system and device
US20210042645A1 (en) * 2019-08-06 2021-02-11 doc.ai, Inc. Tensor Exchange for Federated Cloud Learning
CN113258995A (en) * 2020-09-27 2021-08-13 中兴通讯股份有限公司 Fault prediction method, device and computer readable storage medium
CN113255953A (en) * 2020-11-27 2021-08-13 中兴通讯股份有限公司 RRU undervoltage risk prediction method, device, system, equipment and medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110134636A (en) * 2018-02-09 2019-08-16 中兴通讯股份有限公司 Model training method, server and computer readable storage medium
US20210042645A1 (en) * 2019-08-06 2021-02-11 doc.ai, Inc. Tensor Exchange for Federated Cloud Learning
CN111143308A (en) * 2019-12-26 2020-05-12 许昌中科森尼瑞技术有限公司 Federal learning-based high-low voltage motor data processing method, system and device
CN113258995A (en) * 2020-09-27 2021-08-13 中兴通讯股份有限公司 Fault prediction method, device and computer readable storage medium
CN113255953A (en) * 2020-11-27 2021-08-13 中兴通讯股份有限公司 RRU undervoltage risk prediction method, device, system, equipment and medium

Also Published As

Publication number Publication date
CN116307347A (en) 2023-06-23

Similar Documents

Publication Publication Date Title
US11469949B2 (en) Hierarchical configuration of networked communication devices
WO2019170070A1 (en) Electronic device, method and computer-readable storage medium used for wireless communication
US8442515B2 (en) Management system for monitoring and controlling remote sites and equipment
WO2014101441A1 (en) Method and system for cognitive cooperative relaying/forwarding
WO2022111068A1 (en) Rru undervoltage risk prediction method, apparatus, and system, device, and medium
CN111511003A (en) Energy-saving method for distributed base station
US20190280966A1 (en) Routing data through distributed communications network
US20230206095A1 (en) Transfer learning of machine learning model in distributed network
US11917546B2 (en) Techniques for preserving battery life in poor signal conditions using cellular protocol information
WO2023103592A1 (en) Device risk prediction method, electronic device and computer-readable storage medium
US20210409994A1 (en) Methods, Circuits, Devices, Systems and Associated Computer Executable Code for Operating a Wireless Communication Network
US20200260405A1 (en) Network access node and methods thereof
Hu et al. Caps: Energy-efficient processing of continuous aggregate queries in sensor networks
US20220303906A1 (en) Battery life-time prediction for low power devices
WO2021227807A1 (en) Method for adjusting air conditioner temperature and temperature adjusting apparatus
CN109885116B (en) Internet of things platform monitoring system and method based on cloud computing
Lei A novel WSNs routing algorithm based on game theory and energy consumption balance
US20240029078A1 (en) System and method for adaptively reducing carbon footprint for an information handling system
US11950342B2 (en) Systems and methods for dynamic control and monitoring of heterogeneous smart lighting systems
WO2023072023A1 (en) Intention execution method and apparatus
US11991639B2 (en) Battery life-time optimization for low power devices
US20230009067A1 (en) Techniques for private and secure collection and processing of data for generalized analysis
WO2022141516A1 (en) Model verification method and device
CN109862320B (en) Image collection and processing system and method thereof
US20240049020A1 (en) Data acquisition in a communication network

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22903016

Country of ref document: EP

Kind code of ref document: A1