CN114356442B - Distributed system inspection system and method for nuclear power plant software operation and maintenance - Google Patents

Distributed system inspection system and method for nuclear power plant software operation and maintenance Download PDF

Info

Publication number
CN114356442B
CN114356442B CN202111577392.4A CN202111577392A CN114356442B CN 114356442 B CN114356442 B CN 114356442B CN 202111577392 A CN202111577392 A CN 202111577392A CN 114356442 B CN114356442 B CN 114356442B
Authority
CN
China
Prior art keywords
inspection
software
configuration
state
program
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202111577392.4A
Other languages
Chinese (zh)
Other versions
CN114356442A (en
Inventor
侯雪燕
魏巍
冉晓隆
曲铭
谢政权
谢明亮
戴浩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Nuclear Power Operation Technology Corp Ltd
Original Assignee
China Nuclear Power Operation Technology Corp Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Nuclear Power Operation Technology Corp Ltd filed Critical China Nuclear Power Operation Technology Corp Ltd
Priority to CN202111577392.4A priority Critical patent/CN114356442B/en
Publication of CN114356442A publication Critical patent/CN114356442A/en
Application granted granted Critical
Publication of CN114356442B publication Critical patent/CN114356442B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Testing And Monitoring For Control Systems (AREA)

Abstract

The invention provides a distributed system inspection system for nuclear power plant software operation and maintenance, which comprises: the inspection service module is used for reading the configuration file and analyzing the configuration item; the inspection client module starts a timing task according to the inspection period; and the inspection front-end user man-machine interface module displays the state of each software to be monitored on each machine in real time in a graphic mode and provides an entry for an inspection engineer to restart the system. The invention also provides a routing inspection method of the distributed system for the operation and maintenance of the nuclear power plant software. The inspection system and the inspection method for the distributed system facing the operation and maintenance of the nuclear power plant software can make more accurate countermeasures, greatly improve the daily working efficiency and the working quality and minimize the negative influence.

Description

Distributed system inspection system and method for nuclear power plant software operation and maintenance
Technical Field
The invention relates to the technical field of distributed system monitoring, in particular to a system and a method for inspecting a distributed system oriented to the operation and maintenance of nuclear power plant software.
Background
In large nuclear power bases such as Fuqing nuclear power, qinshan nuclear power and the like, informationized software is numerous, a nuclear emergency management platform which is deployed in each nuclear power base is taken as an example, some information systems are used for carrying out information communication with an upper department, related assessment indexes of units are affected if abnormality occurs, some systems are used for processing task flows in the units, and daily work is affected if abnormality occurs.
The power plant information management department personnel are limited, and usually, one system operation and maintenance personnel needs to be responsible for daily inspection of more than ten software systems, one software information system comprises a plurality of physical servers or virtual servers, and a plurality of application programs or services are usually operated on each machine. When the system runs for a long time, the state of the system can be gradually changed, so that the system has certain uncertainty in running. The operation and maintenance personnel can carry out timing inspection for a plurality of times every day so as to carry out related treatment at the first time after the abnormality is found, and the loss is minimized, but the current inspection mode is a purely manual mode, the work is trivial, the repeatability is high, and the error is easy.
Disclosure of Invention
The invention aims to overcome the defects in the prior art, so as to provide a system and a method for inspecting a distributed system facing the operation and maintenance of nuclear power plant software, which can make more accurate countermeasures, greatly improve the daily working efficiency and the working quality and minimize the negative influence.
In order to achieve the above object, the present invention provides the following technical solutions:
a distributed system inspection system for software operation and maintenance of a nuclear power plant, comprising:
the inspection service module is used for reading the configuration file and analyzing the configuration item;
the inspection client module starts a timing task according to the inspection period;
and the inspection front-end user man-machine interface module displays the state of each software to be monitored on each machine in real time in a graphic mode and provides an entry for an inspection engineer to restart the system.
Further, the system also comprises an interface API which is provided for a program to be patrolled and examined, and if the program to be monitored is called in the program, the heartbeat packet is sent to the configuration port at regular time.
Further, the patrol service module can be deployed on any one of the server nodes in the network.
Furthermore, the inspection front-end user man-machine interface module can be opened by a browser on any one of the server nodes in the network.
Further, the configuration item comprises host information, a patrol period and information to be patrol contained in each host node.
Further, the host information comprises the IP, the physical location, the number and the machine name of the host to be patrolled and examined.
Further, in the inspection service module, whether the network link is normal is judged by the following two modes:
actively acquiring a network state by sending a PING command; or alternatively
And judging the network state according to the time stamp and the number of heartbeats by receiving the heartbeat packet.
Further, in the inspection client module, the software node state is judged in the following two ways:
accessing a task manager through a WINDOWS API to obtain a program running state; or alternatively
And correspondingly calling a program of an interface API based on a heartbeat mechanism, and acquiring the state information fed back by the application program through the monitoring port.
Furthermore, the inspection front-end user man-machine interface module provides a user with a configuration entry, so that the user can modify and view the configuration through the GUI, and control buttons are provided, so that the user can control the system to start or stop.
A method for inspecting a distributed system oriented to the operation and maintenance of nuclear power plant software comprises the following steps:
step S1: reading a configuration file and analyzing a configuration item;
step S2: starting a timing task according to the inspection period;
step S3: displaying the state of each piece of software to be monitored on each machine in a graphical mode in real time, and providing a configuration entry for a user so that the user can modify and view the configuration through a GUI;
step S4: the method is provided for a program to be patrolled and examined, and if the program to be monitored is called in the program, the heartbeat packet is sent to the configuration port at regular time.
Compared with the prior art, the inspection system and method of the distributed system for the operation and maintenance of the nuclear power plant software have the following beneficial effects:
the inspection system and the inspection method for the distributed system facing the software operation and maintenance of the nuclear power plant can automatically acquire the information of the monitoring nodes, and provide the information for operation and maintenance personnel of the system in a chart and report mode, so that the operation and maintenance personnel can acquire data more quickly and know the state of the system more comprehensively, more accurate countermeasures can be taken, on one hand, the daily workload of the operation and maintenance of the software system can be reduced, on the other hand, human error can be reduced, and the working quality can be improved.
The operation and maintenance personnel can see the inspection information clearly through the browser accessing page, so that the system state is more comprehensively known, more accurate countermeasures can be taken, and the daily work efficiency and the work quality are greatly improved. When the system is abnormal, through short messages and mail reminding, operation and maintenance personnel can know the problem at the first time and process the problem in time, so that the negative influence is minimized.
Drawings
In order to more clearly illustrate the embodiments of the invention or the technical solutions in the prior art, the drawings that are required in the embodiments or the description of the prior art will be briefly described, it being obvious that the drawings in the following description are only some embodiments of the invention, and that other drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a block diagram of a module according to an embodiment of the present invention;
FIG. 2 is a flowchart of the operation provided by an embodiment of the present invention;
FIG. 3 is a schematic diagram of one HMI page provided by an embodiment of the present invention, in which the use of hardware information is shown;
fig. 4 is a schematic diagram of another HMI page according to an embodiment of the present invention, where daily inspection information is shown.
Detailed Description
While the inspection system and method of the present invention for a distributed system of nuclear power plant software operations may be implemented in a number of different ways, it is not intended to limit the scope of the invention to the exemplary embodiments. Accordingly, the drawings and description of the embodiments are to be regarded as illustrative in nature, and not as restrictive. Further details are provided below with reference to the specific embodiments.
As shown in fig. 1 to 4, the invention provides a patrol system of a distributed system for the operation and maintenance of nuclear power plant software, which comprises the following components:
(1) The inspection service module (SAutoInstrotor) and the configuration file based on the XML format can be deployed on any server node in the network.
(2) Patrol client module (CAutoInstrotor) and text format configuration file information, patrol
(3) Front-end user human-machine interface module AutoInsprectrHMI
And displaying the state of each software to be monitored on each machine in a graphical mode in real time, providing an entry for a patrol engineer to restart the system, and opening the system by using a browser on any server node in a network.
(4) Heartbeat mechanism based interface API
The method comprises the steps that if the program to be monitored is called in the program, a heartbeat packet is sent to a configuration port at regular time, wherein the heartbeat packet comprises machine information, program name, running state and last update time, so that the SAutoInsector can acquire the information of the monitored node.
Specifically, the SAutoInstrotor is deployed on the inspection host, and after the SAutoInstrotor is started, a configuration file is read first, and a configuration item is analyzed. The configuration items comprise host information (such as a host IP to be inspected, a physical position, a number, a machine name and the like), an inspection period and information to be inspected contained in each host node, wherein the information to be inspected comprises an application program or service, directory information, file information, threshold setting and overrun processing modes.
The sautoinstructor will initiate a timed task according to the inspection cycle. Whether the network link is normal is judged in two ways:
(1) Proactively acquiring network state by sending PING commands
(2) By receiving the heartbeat packet, the network state is judged according to the time stamp and the number of the heartbeats
The SAutoInstrotor receives a heartbeat packet sent by the CAutoInstrotor, acquires the state of the inspected software node, compares the state with a threshold value in a configuration file, triggers a set disposal mode if overrun, sends an alarm to a set account or sends a short message to a set mobile phone number, and records a log.
The SAutoInsector receives a request sent by a user through the AutoInsector HMI, returns request data, receives a system shutdown and system startup command sent by the AutoInsector HMI, sends a command to the CAutoInsector to control software nodes of all machines to restart in sequence according to a configuration sequence, and can also independently start a certain software node.
The CAutoInstrotor is deployed on each server to be inspected, and after the server is started, the server is firstly registered with the SAutoInstrotor through interface call to acquire the inspection period of the server, and the information to be inspected comprises the running state of an application program or service, the number of files in a catalog, the existence of the files and the size of the files.
The CAutoInstrotor starts a timing task according to the inspection period, and judges the state of the software node in two ways.
(1) Accessing a task manager through a WINDOWS API to obtain a program running state;
(2) Program corresponding to calling interface API based on heartbeat mechanism and obtaining application through monitoring port
Status information fed back by the program;
the CAutoInstrotor acquires the number of files in the directory to be inspected, the existence of the files and the sizes of the files through a WINDOWS API.
And the CAutoInstrotor receives a starting command sent by the SAutoInstrotor, sequentially invokes a starting script according to the starting sequence configuration of the local software nodes, and feeds back the started state of each software node. And receiving all closing commands, sequentially calling a closing script according to configuration, then emptying a corresponding log, a cache or a memory area, closing a designated software node if the command is a command for closing a single node, and feeding back the closing state of each software node after operation.
The AutoInjector HMI is a page opened by a browser, and displays the state of each software to be monitored on each machine in real time in a graphical manner. The configuration entry is provided for the user, so that the user can modify and view the configuration through the GUI, and the control buttons are provided, so that the user can control the system to start or stop.
In addition, the invention also provides a method for inspecting the distributed system oriented to the operation and maintenance of the nuclear power plant software, which comprises the following steps:
step S1: reading a configuration file and analyzing a configuration item;
step S2: starting a timing task according to the inspection period;
step S3: displaying the state of each piece of software to be monitored on each machine in a graphical mode in real time, and providing a configuration entry for a user so that the user can modify and view the configuration through a GUI;
step S4: the method is provided for a program to be patrolled and examined, and if the program to be monitored is called in the program, the heartbeat packet is sent to the configuration port at regular time.
The foregoing is merely illustrative of the present invention, and the present invention is not limited thereto, and any changes or substitutions easily contemplated by those skilled in the art within the scope of the present invention should be included in the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (4)

1. The utility model provides a distributed system's that nuclear power plant software fortune dimension inspection system, its characterized in that includes:
the inspection service module is used for reading the configuration file, analyzing the configuration item, wherein the configuration item comprises host information, inspection period and information to be inspected contained in each host node, and the information comprises application programs or services, directory information, file information, threshold value setting and overrun processing modes; the inspection service module starts a timing task according to the inspection period, and judges whether the network link is normal or not in two ways:
(1) Actively acquiring a network state by sending a PING command;
(2) Judging the network state according to the time stamp and the number of heartbeats by receiving the heartbeat packet;
the inspection client module starts a timing task according to an inspection period, and judges the state of the software node in two ways:
(1) Accessing a task manager through a WINDOWS API to obtain a program running state;
(2) The method comprises the steps of correspondingly calling a program of an interface API based on a heartbeat mechanism, and acquiring state information fed back by an application program through a monitoring port;
the inspection front-end user man-machine interface module displays the state of each piece of software to be monitored on each machine in a graphical mode in real time and provides an entry for an inspection engineer to restart the system; the inspection front-end user man-machine interface module can be opened by a browser on any one server node in the network; the inspection front-end user man-machine interface module provides a user with a configuration inlet, so that the user can modify and check the configuration through the GUI, and a control button is provided, so that the user can control the system to start or stop;
providing an interface API based on a heartbeat mechanism for a program to be patrolled and examined, wherein if the program to be monitored is called in the program, a heartbeat packet is sent to a configuration port at regular time;
the inspection service module receives the heartbeat packet sent by the inspection client module, acquires the state of the inspected software node, compares the state with the threshold value in the configuration file, triggers the set disposal mode if the state exceeds the threshold value, sends an email alarm to the set account or sends a short message to the set mobile phone number, and records a log.
2. The inspection system of a distributed system for software operation and maintenance of a nuclear power plant according to claim 1, wherein the inspection service module can be deployed on any one of the server nodes in the network.
3. The inspection system of a distributed system for software operations and maintenance of a nuclear power plant according to claim 1, wherein the host information includes an IP, a physical location, a number, and a machine name of a host to be inspected.
4. The inspection method of the distributed system facing the operation and maintenance of the nuclear power plant software is characterized by being applied to the inspection system of the distributed system facing the operation and maintenance of the nuclear power plant software in claim 1, and comprises the following steps:
step S1: reading a configuration file and analyzing a configuration item;
step S2: starting a timing task according to the inspection period;
step S3: displaying the state of each piece of software to be monitored on each machine in a graphical mode in real time, and providing a configuration entry for a user so that the user can modify and view the configuration through a GUI;
step S4: the method is provided for a program to be patrolled and examined, and if the program to be monitored is called in the program, the heartbeat packet is sent to the configuration port at regular time.
CN202111577392.4A 2021-12-22 2021-12-22 Distributed system inspection system and method for nuclear power plant software operation and maintenance Active CN114356442B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111577392.4A CN114356442B (en) 2021-12-22 2021-12-22 Distributed system inspection system and method for nuclear power plant software operation and maintenance

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111577392.4A CN114356442B (en) 2021-12-22 2021-12-22 Distributed system inspection system and method for nuclear power plant software operation and maintenance

Publications (2)

Publication Number Publication Date
CN114356442A CN114356442A (en) 2022-04-15
CN114356442B true CN114356442B (en) 2024-02-13

Family

ID=81101281

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111577392.4A Active CN114356442B (en) 2021-12-22 2021-12-22 Distributed system inspection system and method for nuclear power plant software operation and maintenance

Country Status (1)

Country Link
CN (1) CN114356442B (en)

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0939516A2 (en) * 1998-01-19 1999-09-01 Michael Robinson User communication and monitoring system for computer networks
CN104518903A (en) * 2013-09-30 2015-04-15 腾讯科技(北京)有限公司 Method, system and device for patrolling online business
CN104539053A (en) * 2014-12-31 2015-04-22 国家电网公司 Power dispatching automation polling robot and method based on reptile technology
CN104639910A (en) * 2015-02-10 2015-05-20 国家电网公司 Scene automatic tour inspection system of electric system on the basis of operating state of transformer substation
CN105872068A (en) * 2016-04-28 2016-08-17 国网浙江省电力公司信息通信分公司 Cloud platform and automatic operation check method based on same
CN106161088A (en) * 2016-07-01 2016-11-23 成都视达科信息技术有限公司 A kind of server automatic detecting method and system based on multithreading
CN109690570A (en) * 2018-03-15 2019-04-26 深圳市中联创新自控系统有限公司 A kind of computer lab management method and rotating ring system
CN111245937A (en) * 2020-01-10 2020-06-05 成都地铁运营有限公司 Comprehensive monitoring remote inspection system
CN111835582A (en) * 2020-06-19 2020-10-27 深圳奇迹智慧网络有限公司 Configuration method and device of Internet of things inspection equipment and computer equipment
CN212810867U (en) * 2020-08-11 2021-03-26 天津轨道交通运营集团有限公司 Multipoint wireless temperature measuring device of drawer type switch cabinet
CN112787895A (en) * 2021-03-05 2021-05-11 上海中通吉网络技术有限公司 Network inspection method, device and equipment

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7501941B2 (en) * 2006-01-13 2009-03-10 Lites Out, Llc Managing advertising devices
EP3185124A1 (en) * 2015-12-22 2017-06-28 Tata Consultancy Services Limited System and method for monitoring, deploying, and tracking autonomous software robots

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0939516A2 (en) * 1998-01-19 1999-09-01 Michael Robinson User communication and monitoring system for computer networks
CN104518903A (en) * 2013-09-30 2015-04-15 腾讯科技(北京)有限公司 Method, system and device for patrolling online business
CN104539053A (en) * 2014-12-31 2015-04-22 国家电网公司 Power dispatching automation polling robot and method based on reptile technology
CN104639910A (en) * 2015-02-10 2015-05-20 国家电网公司 Scene automatic tour inspection system of electric system on the basis of operating state of transformer substation
CN105872068A (en) * 2016-04-28 2016-08-17 国网浙江省电力公司信息通信分公司 Cloud platform and automatic operation check method based on same
CN106161088A (en) * 2016-07-01 2016-11-23 成都视达科信息技术有限公司 A kind of server automatic detecting method and system based on multithreading
CN109690570A (en) * 2018-03-15 2019-04-26 深圳市中联创新自控系统有限公司 A kind of computer lab management method and rotating ring system
WO2019174009A1 (en) * 2018-03-15 2019-09-19 深圳市中联创新自控系统有限公司 Machine room management method and dynamic environment system
CN111245937A (en) * 2020-01-10 2020-06-05 成都地铁运营有限公司 Comprehensive monitoring remote inspection system
CN111835582A (en) * 2020-06-19 2020-10-27 深圳奇迹智慧网络有限公司 Configuration method and device of Internet of things inspection equipment and computer equipment
CN212810867U (en) * 2020-08-11 2021-03-26 天津轨道交通运营集团有限公司 Multipoint wireless temperature measuring device of drawer type switch cabinet
CN112787895A (en) * 2021-03-05 2021-05-11 上海中通吉网络技术有限公司 Network inspection method, device and equipment

Also Published As

Publication number Publication date
CN114356442A (en) 2022-04-15

Similar Documents

Publication Publication Date Title
CN110224858B (en) Log-based alarm method and related device
CN113238913B (en) Intelligent pushing method, device, equipment and storage medium for server faults
CN112631913B (en) Method, device, equipment and storage medium for monitoring operation faults of application program
WO2021218178A1 (en) Automatic report generation method, device, computer apparatus, and storage medium
CN103067230A (en) Method for achieving hyper text transport protocol (http) service monitoring through embedding monitoring code
CN108039956A (en) Using monitoring method, system and computer-readable recording medium
CN114500343A (en) Business chain operation monitoring method and device, equipment, medium and product thereof
CN110569140A (en) operation and maintenance method and device
US8341650B1 (en) High thread count analyzer for web application server environment
CN114356442B (en) Distributed system inspection system and method for nuclear power plant software operation and maintenance
CN113806187A (en) Intelligent one-stop operation and maintenance service method and platform
CN105025179A (en) Method and system for monitoring service agents of call center
CN114697232A (en) Skywalking probe index data acquisition system and method and electronic equipment
CN107612755A (en) The management method and its device of a kind of cloud resource
CN108154343B (en) Emergency processing method and system for enterprise-level information system
CN114387123B (en) Data acquisition management method
CN112765188B (en) Configuration information processing method, configuration management system, electronic device and storage medium
CN115525392A (en) Container monitoring method and device, electronic equipment and storage medium
CN112835794B (en) Method and system for positioning and monitoring code execution problem based on Swoole
CN113570347A (en) RPA operation and maintenance method for micro-service architecture system
CN112069027A (en) Interface data processing method and device, electronic equipment and storage medium
CN114257534A (en) Test result processing method, device and system and storage medium
CN111666178A (en) Safety monitoring method and system
CN116340690B (en) Page abnormality detection method and device and computer equipment
CN109783306A (en) Respond the processing method of operating and system of alarm

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant