KR101877904B1

KR101877904B1 - Apparatus for monitoring error of server and method

Info

Publication number: KR101877904B1
Application number: KR1020170152986A
Authority: KR
Inventors: 김민수; 장인경
Original assignee: (주)웨일소프트
Priority date: 2017-11-16
Filing date: 2017-11-16
Publication date: 2018-07-12

Abstract

Disclosed are a device and a method for monitoring the malfunction of a server. The device for monitoring the malfunction of a server of the present invention includes: a switching control part connected to a LAN switch to control a switching state of the LAN switch; a guide page output part outputting a malfunction guide page through a user terminal; and a monitoring part monitoring at least one among the network and server loads, and a traffic load of a LAN switch, determining whether the server or network malfunctions or not in accordance with the monitoring result and disconnecting communication with the server through the switching control part in accordance with the determination result, and outputting a malfunction guide page through the user terminal by controlling the guide page output part. As such, the present invention is capable of enabling a manager to instantly handle the malfunction of a server.

Description

[0001] APPARATUS FOR MONITORING ERROR OF SERVER AND METHOD [0002]

본 발명은 서버 장애 모니터링 장치 및 방법에 관한 것으로서, 보다 상세하게는 서버를 모니터링하여 장애를 분석 및 안내하는 서버 장애 모니터링 장치 및 방법에 관한 것이다.The present invention relates to an apparatus and method for monitoring a server failure, and more particularly, to a server failure monitoring apparatus and method for monitoring a server to analyze and guide a failure.

인터넷의 발달로 다양한 종류의 인터넷 서비스와 이러한 인터넷 서비스를 제공하는 인터넷 서버들이 구축되어 사용되고 있다.Due to the development of the Internet, various types of Internet services and Internet servers providing such Internet services have been constructed and used.

인터넷 서비스의 이용은 사용자가 서버에 접속한 후 서비스를 요청하는 단순한 동작만으로 가능하며, 인터넷 서비스의 제공 또한 서버가 접속한 사용자 단말과 세션(session)을 연결한 후 연결된 세션을 통해 해당 서비스 컨텐츠를 제공하는 것으로 달성된다.The use of the internet service is possible only by a simple operation of requesting a service after the user accesses the server. In addition, providing the Internet service also connects the session with the user terminal connected to the server, .

서버는 세션을 연결하기 위해서 자신의 자원(resource) 중 일부를 사용하게 된다. 그러므로, 접속자 수가 많을수록 서버의 성능은 떨어지고, 심지어 서비스를 유저에게 제공하지 못하는 장애가 발생하게 된다. The server uses some of its resources to connect sessions. Therefore, the greater the number of users, the lower the performance of the server, and even a failure to provide the service to the user occurs.

이런 이유로 인터넷 서비스 사업자(ISP: Internet Service Provider)는 많은 유저의 접속에 따른 장애를 방지하기 위하여 많은 자원을 가진 고가의 서버를 구축하고 있거나, 서버 관리를 위한 전문 인력을 배치하고 있다. For this reason, an Internet Service Provider (ISP) is constructing an expensive server having a large amount of resources in order to prevent a failure due to the connection of many users, or arranging a professional manpower for server management.

그러나, 종래에는 서버에 장애가 발생할 경우 웹페이지에는 에러 페이지가 표시되는데, 이 경우에는 사용자가 사용자 단말 또는 웹페이지의 에러로 오인식하여 불안감을 갖게 되는 문제점이 있었다. However, in the related art, when an error occurs in a server, an error page is displayed on a web page. In this case, there is a problem that a user is perceived as an error of a user terminal or a web page, which causes anxiety.

본 발명의 배경기술은 대한민국 공개특허공보 10-2015-0000987호(2015.01.06)의 '연계 서버에서 연계 서비스의 지능형 장애 예측 방법'에 개시되어 있다.Background Art [0002] The background art of the present invention is disclosed in Korean Patent Laid-Open Publication No. 10-2015-0000987 (May 2015.01.06) entitled " Intelligent Fault Prediction Method of Linkage Service in Link Server ".

본 발명은 전술한 문제점을 개선하기 위해 창안된 것으로서, 본 발명의 일 측면에 따른 목적은 서버와 네트워크 장애를 분석하여 분석 결과에 따라 사용자 단말 또는 관리자 단말에 안내하는 서버 장애 모니터링 장치 및 방법을 제공하는 것이다. SUMMARY OF THE INVENTION The present invention has been made to solve the above-mentioned problems, and an object of the present invention is to provide a server fault monitoring apparatus and method for analyzing a server and a network fault and guiding the fault to a user terminal or an administrator terminal .

본 발명의 다른 측면에 따른 목적은 서버 장애 발생시 사용자 단말을 통해 서버 장애가 발생중임을 안내하여 사용자의 불안감을 감소시키고, 관리자가 서버 장애에 즉각적으로 대처할 수 있도록 한 서버 장애 모니터링 장치 및 방법을 제공하는 것이다. According to another aspect of the present invention, there is provided an apparatus and method for monitoring a server failure by informing that a server failure is occurring through a user terminal when a server failure occurs, thereby reducing anxiety of a user and allowing an administrator to immediately respond to a server failure will be.

본 발명의 일 측면에 따른 서버 장애 모니터링 장치는 랜 스위치에 접속되어 상기 랜 스위치의 스위칭 상태를 제어하는 스위칭 제어부; 사용자 단말을 통해 장애 안내 페이지를 출력하는 안내 페이지 출력부; 및 네트워크의 부하, 서버의 부하 및 상기 랜 스위치의 트래픽 부하 중 적어도 하나를 모니터링하고, 모니터링 결과에 따라 서버 또는 네트워크의 장애 발생 여부를 판단하여 판단 결과에 따라 상기 스위칭 제어부를 통해 상기 서버와의 통신을 차단하고, 상기 안내 페이지 출력부를 제어하여 상기 사용자 단말을 통해 장애 안내 페이지를 출력하는 모니터링부를 포함하는 것을 특징으로 한다.According to an aspect of the present invention, there is provided a server fault monitoring apparatus comprising: a switching controller connected to a LAN switch to control a switching state of the LAN switch; A guidance page output unit outputting a failure guidance page through a user terminal; And monitoring at least one of a load of the network, a load of the server, and a traffic load of the LAN switch, determining whether a failure has occurred in the server or the network according to the monitoring result, communicating with the server through the switching control unit And outputting a failure guidance page through the user terminal by controlling the guide page output unit.

본 발명의 상기 랜 스위치는 L4 이상의 스위치인 것을 특징으로 한다.The LAN switch of the present invention is characterized by being a switch of L4 or more.

본 발명은 상기 모니터링부의 모니터링 상태를 실시간으로 표시하는 상태 출력부를 더 포함하는 것을 특징으로 한다.The present invention is further characterized by a status output unit for displaying the monitoring status of the monitoring unit in real time.

본 발명은 상기 모니터링부의 모니터링 결과 장애가 발생된 것으로 판단되면, 관리자 단말로 장애 발생 정보를 전달하여 상기 관리자 단말을 통해 장애 발생 정보를 출력하도록 하는 장애 안내부를 더 포함하는 것을 특징으로 한다.The present invention is characterized by further comprising a failure notification unit for transmitting failure occurrence information to the administrator terminal and outputting failure occurrence information through the administrator terminal if it is determined that the monitoring result of the monitoring unit is a failure.

본 발명은 상기 모니터링부의 모니터링 결과 장애가 발생된 것으로 판단되면, 장애 발생 내역을 저장하는 로그 관리부를 더 포함하는 것을 특징으로 한다.The present invention is characterized by further comprising a log management unit for storing a fault occurrence history if it is determined that a fault has occurred as a monitoring result of the monitoring unit.

본 발명의 상기 모니터링부는 상기 랜 스위치의 트래픽량이 기 설정된 트래픽 임계값 이상이면 장애가 발생된 것으로 판단하는 것을 특징으로 한다.The monitoring unit of the present invention determines that a fault has occurred if the traffic volume of the LAN switch is greater than or equal to a predetermined traffic threshold value.

본 발명의 상기 모니터링부는 상기 서버의 세션값이 기 설정된 세션값 임계치 이상이면 장애가 발생한 것으로 판단하는 것을 특징으로 한다.The monitoring unit of the present invention determines that a failure has occurred if the session value of the server is equal to or greater than a preset session value threshold.

본 발명의 상기 모니터링부는 상기 서버의 CPU 사용율이 기 설정된 CPU 사용율 임계치 이상이면 장애가 발생한 것으로 판단하는 것을 특징으로 한다.The monitoring unit of the present invention determines that a failure has occurred if the CPU usage rate of the server is equal to or greater than a predetermined CPU utilization threshold.

본 발명의 상기 모니터링부는 상기 서버의 메모리 사용율이 기 설정된 메모리 사용율 임계치 이상이면 장애가 발생한 것으로 판단하는 것을 특징으로 한다.The monitoring unit of the present invention determines that a failure has occurred if the memory usage rate of the server is equal to or greater than a preset memory usage threshold.

본 발명의 상기 모니터링부는 상기 서버의 디스크 사용율이 기 설정된 디스크 사용율 임계치 이상이면 장애가 발생한 것으로 판단하는 것을 특징으로 한다.The monitoring unit of the present invention determines that a failure has occurred if the disk usage rate of the server is greater than or equal to a preset disk usage threshold.

본 발명의 일 측면에 따른 서버 장애 모니터링 방법은 모니터링부가 서버의 부하 또는 랜 스위치의 트래픽 부하를 모니터링하는 단계; 상기 모니터링부가 모니터링 결과에 따라 서버 또는 네트워크의 장애 발생 여부를 판단하는 단계; 상기 판단 결과에 따라, 상기 모니터링부가 스위칭 제어부를 제어하여 상기 서버와의 통신을 차단하는 단계; 및 상기 모니터링부가 상기 사용자 단말을 통해 장애 안내 페이지를 출력하는 단계를 포함하는 것을 특징으로 한다.According to an aspect of the present invention, there is provided a server fault monitoring method comprising: monitoring a load of a server or a traffic load of a LAN switch; Determining whether a failure of a server or a network occurs according to a monitoring result of the monitoring unit; Controlling the monitoring unit to block communication with the server according to a result of the determination; And the monitoring unit outputting a failure guidance page through the user terminal.

본 발명은 장애 안내부가 상기 모니터링부의 모니터링 결과 장애로 판단되면, 관리자 단말로 장애 발생 정보를 전달하여 상기 관리자 단말을 통해 장애 발생 정보를 출력하도록 하는 단계를 더 포함하는 것을 특징으로 한다.The present invention is characterized by further comprising the step of transmitting failure occurrence information to an administrator terminal and outputting failure occurrence information through the administrator terminal if the failure guidance section determines that the monitoring result is a monitoring result failure.

본 발명은 로그 관리부가 상기 모니터링부의 모니터링 결과 장애가 발생된 것으로 판단되면, 장애 발생 내역을 저장하는 단계를 더 포함하는 것을 특징으로 한다.The present invention is characterized in that the log management unit further includes a step of storing a fault occurrence history if it is determined that the monitoring result of the monitoring unit has failed.

본 발명의 상기 모니터링부는 상기 서버의 디스크 사용율이 기 설정된 디스크 사용율 임계치 이상이면 장애가 발생한 것으로 판단하는 것을 특징으로 한다. The monitoring unit of the present invention determines that a failure has occurred if the disk usage rate of the server is greater than or equal to a preset disk usage threshold.

본 발명의 일 측면에 따른 서버 장애 모니터링 장치 및 방법은 서버와 네트워크 장애를 분석하여 분석 결과에 따라 사용자 단말 또는 관리자 단말에 안내한다.An apparatus and method for monitoring a server failure according to an aspect of the present invention analyzes a server and a network failure and provides guidance to a user terminal or an administrator terminal according to the analysis result.

본 발명의 다른 측면에 따른 서버 장애 모니터링 장치 및 방법은 서버 장애 발생시 사용자 단말을 통해 서버 장애가 발생중임을 안내하여 사용자의 불안감을 감소시키고, 관리자가 서버 장애에 즉각적으로 대처할 수 있도록 한다. According to another aspect of the present invention, there is provided an apparatus and method for monitoring a server failure. The server monitoring system monitors a server failure occurring when a server failure occurs, thereby reducing a user's anxiety and allowing an administrator to immediately respond to a server failure.

도 1 은 본 발명의 일 실시예에 따른 서버 장애 모니터링 장치의 블럭 구성도이다.
도 2 는 본 발명의 일 실시예에 따른 모니터링 서버의 블럭 구성도이다.
도 3 은 본 발명의 일 실시예에 따른 서버 장애 모니터링 방법의 순서도이다.1 is a block diagram of a server fault monitoring apparatus according to an embodiment of the present invention.
2 is a block diagram of a monitoring server according to an embodiment of the present invention.
3 is a flowchart of a server fault monitoring method according to an embodiment of the present invention.

이하에서는 본 발명의 일 실시예에 따른 서버 장애 모니터링 장치 및 방법을 첨부된 도면들을 참조하여 상세하게 설명한다. 이러한 과정에서 도면에 도시된 선들의 두께나 구성요소의 크기 등은 설명의 명료성과 편의상 과장되게 도시되어 있을 수 있다. 또한 후술되는 용어들은 본 발명에서의 기능을 고려하여 정의된 용어들로서, 이는 이용자, 운용자의 의도 또는 관례에 따라 달라질 수 있다. 그러므로 이러한 용어들에 대한 정의는 본 명세서 전반에 걸친 내용을 토대로 내려져야 할 것이다. Hereinafter, an apparatus and method for monitoring a server failure according to an embodiment of the present invention will be described in detail with reference to the accompanying drawings. In this process, the thicknesses of the lines and the sizes of the components shown in the drawings may be exaggerated for clarity and convenience of explanation. Further, the terms described below are defined in consideration of the functions of the present invention, which may vary depending on the user, the intention or custom of the operator. Therefore, definitions of these terms should be made based on the contents throughout this specification.

도 1 은 본 발명의 일 실시예에 따른 서버 장애 모니터링 장치의 블럭 구성도이고, 도 2 는 본 발명의 일 실시예에 따른 모니터링 서버의 블럭 구성도이다.FIG. 1 is a block diagram of a server fault monitoring apparatus according to an embodiment of the present invention, and FIG. 2 is a block diagram of a monitoring server according to an embodiment of the present invention.

도 1 을 참조하면, 본 발명의 일 실시예에 따른 서버 장애 모니터링 장치는 모니터링 서버(40), 사용자 단말(30) 및 관리자 단말(50)을 포함한다. Referring to FIG. 1, a server failure monitoring apparatus according to an embodiment of the present invention includes a monitoring server 40, a user terminal 30, and an administrator terminal 50.

모니터링 서버(40)는 백본망(20)에 연결되어 네트워크나 서버(10)의 부하 및 랜 스위치(24)의 트래픽 부하 중 적어도 하나를 모니터링하고, 모니터링 결과에 따라 장애 발생 여부를 판단한다. 판단 결과 장애가 발생한 것으로 판단되면, 모니터링 서버(40)는 해당 장애 발생에 따라 기 설정된 동작을 수행한다. 모니터링 서버(40)의 동작에 대해서는 후술한다. The monitoring server 40 is connected to the backbone network 20 to monitor at least one of the load of the network 10 or the server 10 and the traffic load of the LAN switch 24, If it is determined that a failure has occurred, the monitoring server 40 performs a predetermined operation according to the occurrence of the failure. The operation of the monitoring server 40 will be described later.

백본망(20)에는 백본 스위치(21), IPS(Intrusion Prevention System)(22), 방화벽(23), 및 랜 스위치(24)가 포함될 수 있다. The backbone network 20 may include a backbone switch 21, an Intrusion Prevention System (IPS) 22, a firewall 23, and a LAN switch 24.

IPS(22)는 침입방지시스템으로 인터넷 웜 등의 악성코드 및 해킹 등에 기인한 유해 트래픽을 차단한다. 방화벽(23)은 포트 차단 등 외부 공격으로부터의 접근을 통제한다. 랜 스위치(24)는 부하 분산 기능을 수행한다. 본 실시예에서는 랜 스위치(24)로 L4 스위치를 예시로 설명한다.The IPS 22 is an intrusion prevention system and blocks malicious codes such as an Internet worm and harmful traffic caused by a hacking or the like. The firewall 23 controls access from external attacks such as port blocking. The LAN switch 24 performs a load balancing function. In the present embodiment, the L4 switch is exemplified by the LAN switch 24.

L4 스위치는 웹서버(11) 또는 방화벽(23) 등이 통신대역폭에 비하여 그 처리 성능이 충분하지 못하기 때문에 그 부하분산을 기능을 수행하기 위해 탑재된다. L4 이상의 스위치는 부하를 모든 서버(10)에 균등하게 배분하여 하나의 서버(10)에 부하가 집중되지 않도록 한다. The L4 switch is mounted to perform the load balancing function because the web server 11 or the firewall 23 has insufficient processing performance in comparison with the communication bandwidth. The switches L4 and L4 distribute the loads equally to all the servers 10 so that the load is not concentrated on one server 10. [

특히, L4 이상의 스위치는 인터넷망으로부터의 부하를 분산하는데, 예를 들어, 복수의 서버(10) 중 동일한 역할을 수행하는 서버(10)들을 가상 IP(Virtual IP; VIP)를 통해 관리하며, 서버(10)로 향하는 부하 또는 트래픽을 분산(로드 밸런싱)할 수 있다. 이러한 부하 분산 방법으로는, 라운드 로빈(Round Robin), 해싱(hashing), 리스트 커넥션(least connection) 등 다양한 방법이 사용될 수 있다. In particular, the switches L4 and L4 distribute loads from the Internet network. For example, servers 10 performing the same role among a plurality of servers 10 are managed through a virtual IP (VIP) (Load balancing) the load or the traffic toward the base station 10. As such a load balancing method, various methods such as round robin, hashing, and least connection can be used.

따라서, 모니터링 서버(40)는 복수의 서버(10) 및 점검 대상 장비를 모니터링 하여 장애나 네트워크 장애를 판단하고, L4 이상의 스위치를 통해 안내페이지를 사용자에게 제공하고 관리자에게 장애내역을 공지한다. Therefore, the monitoring server 40 monitors the plurality of servers 10 and the target equipment to determine a failure or a network failure, provides the guidance page through the switch L4 or more, and informs the manager of the failure details.

이러한 L4 이상의 스위치는 모니터링 서버(40)와 독립적으로 설치될 수 있으나, 모니터링 서버(40)와 일체로 설치될 수도 있다. The L4 or more switches may be installed independently of the monitoring server 40, but may be installed integrally with the monitoring server 40. [

참고로, 본 실시예에서는 L4 스위치를 예시로 설명하였으나, 본 발명의 기술적 범위는 이에 한정되는 것은 아니며, L4 이상의 스위치도 모두 포함될 수 있다. For reference, the L4 switch has been described as an example in the present embodiment, but the technical scope of the present invention is not limited thereto, and all switches of L4 and above may be included.

서버(10)에는 사용자 단말(30)로부터의 요청에 따라 웹페이지를 제공하는 웹서버(11) 및 웹페이지에 접속한 사용자 단말(30)의 요청에 따라 각종 애플리케이션을 제공하는 WAS서버(Web Application Server)(12)가 포함될 수 있으며, 일반 TCP/IP 통신에 사용되어지는 모든 서버가 포함될 수 있다.The server 10 is provided with a Web server 11 that provides a web page in response to a request from the user terminal 30 and a WAS server that provides various applications upon request of the user terminal 30 connected to the web page. Server) 12, and may include all servers used for general TCP / IP communication.

통상, 서버(10)나 네트워크 등에 장애가 발생할 경우, 해당 서버(10)를 통해 웹페이지에 접속하는 사용자 단말(30)은 '웹페이지를 표시할 수 없습니다'라는 에러 메시지를 표시하는데, 이 경우 사용자는 컴퓨터에 문제가 있거나 해당 웹페이지를 운영하는 기업이나 기관 또는 단체에 문제가 있거나, 또는 웹페이지에 에러가 발생한 것으로 인지하여 불안감을 갖게 된다.In general, when a failure occurs in the server 10 or the network, the user terminal 30 accessing the web page through the server 10 displays an error message 'unable to display the web page'. In this case, May feel uneasy because of a computer problem or a problem with a company, organization, or organization that operates the web page, or because an error has occurred on the web page.

이에, 모니터링 서버(40)는 사용자 단말(30)을 통해 서버(10)나 네트워크에 장애가 발생 중임을 안내하는 안내 메시지를 출력함으로써, 사용자의 불안감을 감소시킨다. Accordingly, the monitoring server 40 outputs a guidance message to inform the server 10 or the network that a failure has occurred through the user terminal 30, thereby reducing the user's anxiety.

아울러, 모니터링 서버(40)는 장애 발생 정보를 관리자 단말(50)로 전달하여 관리자 단말(50)을 통해 장애 발생 정보를 출력할 수 있도록 한다. 이에 관리자는 서버(10) 또는 네트워크의 장애 발생 사실을 인지하고, 장애에 즉각 대처할 수 있게 된다. In addition, the monitoring server 40 transmits the failure occurrence information to the administrator terminal 50 and outputs the failure occurrence information through the administrator terminal 50. Thus, the administrator can recognize the failure of the server 10 or the network and can deal with the failure immediately.

사용자 단말(30)은 서버(10)에 접속하여 웹페이지를 출력한다. 서버(10) 또는 네트워크에 장애가 발생할 경우, 사용자 단말(30)은 현재 장애가 발생중임을 안내하는 안내 메시지를 출력하여 사용자의 불안감이 감소될 수 있도록 한다.The user terminal 30 accesses the server 10 and outputs a web page. If a failure occurs in the server 10 or the network, the user terminal 30 outputs a guidance message informing that the current failure is occurring, thereby reducing the user's anxiety.

여기서, 사용자 단말(30)은 PC(Personal Computer), 스마트 단말 및 랩탑 컴퓨터 등이 포함될 수 있으며, 네트워크를 통해 서버(10)에 접속하여 웹페이지를 출력하고 각종 애플리케이션을 이용할 수 있는 것이라면 특별히 한정되지 않는다. The user terminal 30 may include a personal computer (PC), a smart terminal, a laptop computer, and the like, and is not particularly limited as long as it can access a server 10 through a network and output a web page and use various applications Do not.

관리자 단말(50)은 모니터링 서버(40)로부터 장애 발생 정보를 전달받아 출력한다. 또한 관리자 단말(50)은 모니터링 서버(40)의 운영 및 동작을 위한 각종 제어명령을 모니터링 서버(40)에 전달한다. The administrator terminal 50 receives the failure occurrence information from the monitoring server 40 and outputs the failure occurrence information. In addition, the administrator terminal 50 transmits various control commands for the operation and operation of the monitoring server 40 to the monitoring server 40.

예를 들어, 관리자 단말(50)은 모니터링 서버(40)에 관리자 성명, 관리자 단말(50) 정보, 및 모니터링 서버(40)의 운영 및 서버 모니터링을 위한 각종 정보를 입력한다. For example, the administrator terminal 50 inputs the administrator name, the administrator terminal 50 information, and various information for monitoring and monitoring the monitoring server 40 to the monitoring server 40.

관리자 단말(50)은 PC, 스마트 단말 및 랩탑 컴퓨터 등이 포함될 수 있다. The administrator terminal 50 may include a PC, a smart terminal, a laptop computer, and the like.

서버 모니터링을 위한 정보에는 랜 스위치(24)의 트래픽 임계치, 서버(10)의 세션값 임계치, 서버(10)의 CPU 사용율 임계치, 서버(10)의 메모리 사용율 임계치 및 서버(10)의 디스크 사용율 임계치 등이 포함될 수 있다. The server monitoring information includes a traffic threshold of the LAN switch 24, a session value threshold of the server 10, a CPU utilization threshold of the server 10, a memory utilization threshold of the server 10, And the like.

여기서, 서버 모니터링을 위한 정보를 통해 서버(10)나 네트워크의 장애를 모니터링하는 방법에 대해서는 후술한다. Hereinafter, a method for monitoring the failure of the server 10 or the network through information for monitoring the server will be described later.

도 2 를 참조하면, 모니터링 서버(40)는 네트워크의 부하, 서버(10)의 부하 및 랜 스위치(24)의 트래픽 부하 중 적어도 하나를 모니터링하고, 모니터링 결과에 따라 장애 발생 여부를 판단하여 판단 결과에 따라 기 설정된 동작을 수행하는 것으로써, 모니터링부(41), 스위칭 제어부(42), 안내 페이지 출력부(43), 장애 안내부(44), 로그 관리부(45) 및 상태 출력부(46)를 포함한다. 2, the monitoring server 40 monitors at least one of a load on the network, a load on the server 10, and a traffic load on the LAN switch 24, determines whether a failure has occurred according to the monitoring result, The switching control unit 42, the guidance page output unit 43, the failure guidance unit 44, the log management unit 45, and the status output unit 46. The monitoring unit 41, the switching control unit 42, the guidance page output unit 43, .

스위칭 제어부(42)는 랜 스위치(24)에 접속되어 랜 스위치(24)의 스위칭 상태를 제어한다. 이 경우, 서버(10)나 네트워크에 장애 발생시, 스위칭 제어부(42)는 모니터링부(41)의 제어에 따라 랜 스위치(24)를 제어하여 사용자 단말(30)과 서버(10) 간의 연결을 차단한다. The switching control unit 42 is connected to the LAN switch 24 to control the switching state of the LAN switch 24. [ In this case, when a failure occurs in the server 10 or the network, the switching control unit 42 controls the LAN switch 24 under the control of the monitoring unit 41 to block the connection between the user terminal 30 and the server 10 do.

안내 페이지 출력부(43)는 사용자 단말(30)을 통해 장애 안내 페이지를 출력한다. The guidance page output unit 43 outputs a failure guidance page through the user terminal 30. [

통상적으로, 장애나 네트워크 등에 장애가 발생할 경우, 웹페이지에 접속한 사용자 단말(30)에는 에러 메시지가 되어 사용자가 불안감을 가질 수 있다. 이에, 안내 페이지 출력부(43)는 사용자 단말(30)을 통해 서버(10)나 네트워크의 장애가 발생 중임을 안내하는 안내 메시지를 출력함으로써, 사용자의 불안감이 감소될 수 있도록 한다.Generally, when a failure occurs in a network or the like, an error message is sent to the user terminal 30 connected to the web page, and the user may feel uneasy. Accordingly, the guidance page output unit 43 outputs a guidance message informing that the failure of the server 10 or the network is occurring through the user terminal 30, thereby reducing anxiety of the user.

장애 안내부(44)는 모니터링부(41)의 모니터링 결과 서버 장애로 판단되면, 관리자 단말(50)로 장애 발생 정보를 전달하여 관리자 단말(50)을 통해 장애 발생 정보를 출력하도록 한다. 이에 따라, 관리자는 서버(10)나 네트워크 장애를 인지할 수 있고 이러한 장애에 대처할 수 있게 된다. When the monitoring unit 41 determines that the server has a failure, the failure guide unit 44 transmits the failure occurrence information to the administrator terminal 50 and outputs the failure occurrence information through the administrator terminal 50. Accordingly, the administrator can recognize the server 10 or the network failure and cope with such a failure.

로그 관리부(45)는 모니터링부(41)의 모니터링 결과 서버 장애로 판단되면, 장애 발생 내역을 저장한다. 즉, 로그 관리부(45)는 모니터링부(41)의 모니터링 결과 서버 장애로 판단되면, 해당 장애가 발생된 시간 및 장애 상황에 대한 상세 정보를 저장함으로써, 장애 발생 내역을 이력으로 관리할 수 있도록 한다. 관리자 단말(50)은 모니터링 서버(40)에 장애 발생 내역을 요청하여 출력할 수 있다. If the log management unit 45 determines that the monitoring result of the monitoring unit 41 is a server failure, the log management unit 45 stores the failure occurrence details. That is, when it is determined that the monitoring result of the monitoring unit 41 is a server failure, the log management unit 45 stores detailed information about the time and the failure status of the failure so that the log management unit 45 can manage the failure occurrence history by history. The administrator terminal 50 can request the monitoring server 40 for the failure occurrence history and output the request.

상태 출력부(46)는 모니터링부(41)의 모니터링 상태를 실시간으로 표시한다. 즉, 상태 출력부(46)는 모니터링부(41)의 모니터링 상태, 예를 들어 서버(10)의 세션값, 서버(10)의 CPU 사용율, 서버(10)의 메모리 사용율, 서버(10)의 디스크 사용율 중 적어도 하나를 표시함과 더불어 이들을 기반으로 한 장애 판단 결과를 실시간으로 출력한다. 따라서, 관리자는 자신의 관리자 단말(50)뿐만 아니라, 상태 출력부(46)를 통해 현장에서 서버(10)나 네트워크 상태를 인지할 수 있게 된다. The status output unit 46 displays the monitoring status of the monitoring unit 41 in real time. That is, the status output unit 46 stores the monitoring status of the monitoring unit 41, for example, the session value of the server 10, the CPU utilization rate of the server 10, the memory utilization rate of the server 10, And the disk usage rate, and outputs a failure determination result based on the at least one of them in real time. Accordingly, the administrator can recognize the state of the server 10 or the network in the field through the status output unit 46 as well as the manager terminal 50 of the administrator.

모니터링부(41)는 백본망(20)에 연결되어 네트워크의 부하, 서버(10)의 부하 및 랜 스위치(24)의 트래픽 부하 중 적어도 하나를 모니터링하고, 모니터링 결과에 따라 장애 발생 여부를 판단한다. The monitoring unit 41 is connected to the backbone network 20 and monitors at least one of a load on the network 10, a load on the server 10 and a traffic load on the LAN switch 24, .

여기서, 모니터링부(41)는 백본망(20)의 랜 스위치(24)에 연결되어 네트워크의 부하, 서버(10)의 부하 및 랜 스위치(24)의 트래픽 부하를 각각 모니터링하여 모니터링 결과에 따라 서버(10)나 네트워크 등의 장애 발생 여부를 판단한다. The monitoring unit 41 is connected to the LAN switch 24 of the backbone network 20 to monitor the load of the network, the load of the server 10 and the traffic load of the LAN switch 24, (10) or a network or the like.

이 경우, 모니터링부(41)는 상기한 바와 같이 서버(10)의 세션값, 서버(10)의 CPU 사용율, 서버(10)의 메모리 사용율, 서버(10)의 디스크 사용율을 기반으로 장애 발생 여부를 판단한다. In this case, the monitoring unit 41 determines whether or not a failure has occurred based on the session value of the server 10, the CPU utilization rate of the server 10, the memory utilization rate of the server 10, .

모니터링부(41)는 랜 스위치(24)의 트래픽량이 트래픽 임계치 이상이거나, 서버(10)의 세션값이 기 설정된 세션값 임계치 이상이거나, CPU 사용율이 기 설정된 CPU 사용율 임계치 이상이거나, 서버(10)의 메모리 사용율이 기 설정된 메모리 사용율 이상이거나, 서버(10)의 디스크 사용율이 기 설정된 디스크 사용율 임계치 이상이면 서버(10)나 네트워크에 장애가 발생한 것으로 판단한다. The monitoring unit 41 monitors whether or not the traffic amount of the LAN switch 24 is equal to or greater than the traffic threshold, the session value of the server 10 is equal to or greater than the predetermined session value threshold, the CPU utilization rate is equal to or greater than the predetermined CPU utilization threshold, It is determined that a failure has occurred in the server 10 or the network if the memory usage rate of the server 10 is equal to or greater than a preset memory usage rate or if the disk usage rate of the server 10 is equal to or greater than a predetermined disk usage threshold.

여기서, 서버(10)의 세션값, 서버(10)의 CPU 사용율, 서버(10)의 메모리 사용율, 서버(10)의 디스크 사용율에 각각 설정된 임계치는 서버(10)나 네트워크 등에 장애가 발생한 것으로 판단할 수 있는 기준이 되는 값으로써, 관리자 단말(50)을 통해 설정될 수 있으며, 서버(10)의 개수나 백본망(20) 등에 따라 조절될 수 있다. Here, the thresholds respectively set in the session value of the server 10, the CPU usage rate of the server 10, the memory usage rate of the server 10, and the disk usage rate of the server 10 determine that a failure has occurred in the server 10 or the network And can be adjusted according to the number of the servers 10 or the backbone 20 or the like.

모니터링부(41)는 서버(10)의 세션값, 서버(10)의 CPU 사용율, 서버(10)의 메모리 사용율, 서버(10)의 디스크 사용율 중 적어도 하나를 통해 장애나 네트워크 등에 장애가 발생된 것으로 판단되면, 스위칭 제어부(42), 안내 페이지 출력부(43), 장애 안내부(44), 로그 관리부(45) 및 상태 출력부(46)를 제어한다. The monitoring unit 41 has a failure such as a failure or a network through at least one of the session value of the server 10, the CPU utilization rate of the server 10, the memory utilization rate of the server 10, and the disk utilization rate of the server 10 And controls the switching control section 42, the guidance page output section 43, the failure guidance section 44, the log management section 45, and the status output section 46.

즉, 모니터링부(41)는 상태 출력부(46)를 통해 모니터링부(41)의 모니터링 상태, 예를 들어 서버(10)의 세션값, 서버(10)의 CPU 사용율, 서버(10)의 메모리 사용율, 서버(10)의 디스크 사용율 중 적어도 하나를 표시함과 더불어 이들을 기반으로 한 장애 판단 결과를 실시간으로 출력한다. That is, the monitoring unit 41 monitors the monitoring status of the monitoring unit 41, for example, the session value of the server 10, the CPU usage rate of the server 10, the memory of the server 10, The usage rate of the server 10, and the disk usage rate of the server 10, and outputs a failure determination result based on the at least one of them in real time.

이 과정에서, 서버(10)나 네트워크에 장애가 발생된 것으로 판단되면, 모니터링부(41)는 스위칭 제어부(42)를 통해 랜 스위치(24)를 제어하여 사용자 단말(30)과 서버(10) 간의 연결을 차단하고, 안내 페이지 출력부(43)를 제어하여 사용자 단말(30)을 통해 장애 안내 페이지를 출력한다. The monitoring unit 41 controls the LAN switch 24 through the switching control unit 42 so that the user terminal 30 and the server 10 can communicate with each other And outputs the failure guidance page through the user terminal 30 by controlling the guidance page output unit 43. [

또한, 모니터링부(41)는 관리자 단말(50)로 장애 발생 정보를 전달하여 관리자 단말(50)을 통해 장애 발생 정보를 출력함으로써, 관리자가 서버(10)나 네트워크 장애를 인지하고 대처할 수 있도록 한다. The monitoring unit 41 transmits the failure occurrence information to the administrator terminal 50 and outputs the failure occurrence information through the administrator terminal 50 so that the administrator can recognize and cope with the server 10 or the network failure .

게다가, 로그 관리부(45)는 장애 발생 내역, 예를 들어 장애가 발생된 시간 및 장애 상황에 대한 상세 정보를 저장한다. In addition, the log management unit 45 stores details of the occurrence of the failure, for example, the time at which the failure occurred and the details of the failure.

이하 본 발명의 일 실시예에 따른 서버 장애 모니터링 장치를 도 3 을 참조하여 상세하게 설명한다. Hereinafter, a server failure monitoring apparatus according to an embodiment of the present invention will be described in detail with reference to FIG.

도 3 은 본 발명의 일 실시예에 따른 서버 장애 모니터링 방법의 순서도이다.3 is a flowchart of a server fault monitoring method according to an embodiment of the present invention.

도 3 을 참조하면, 모니터링부(41)는 백본망(20)에 연결되어 네트워크의 부하, 서버(10)의 부하 및 랜 스위치(24)의 트래픽 부하 중 적어도 하나를 모니터링하고(S10), 모니터링 결과에 따라 장애 발생 여부를 판단한다(S20).3, the monitoring unit 41 is connected to the backbone network 20 to monitor at least one of a network load, a load of the server 10 and a traffic load of the LAN switch 24 (S10) It is determined whether a fault has occurred according to the result (S20).

이 경우, 모니터링부(41)는 네트워크의 부하, 서버(10)의 부하 및 랜 스위치(24)의 트래픽 부하를 각각 모니터링하는데, 서버(10)의 세션값, 서버(10)의 CPU 사용율, 서버(10)의 메모리 사용율, 서버(10)의 디스크 사용율을 기반으로 장애 발생 여부를 판단한다. In this case, the monitoring unit 41 monitors the load of the network, the load of the server 10 and the traffic load of the LAN switch 24, respectively. The monitoring unit 41 measures the session value of the server 10, The memory usage rate of the server 10, and the disk usage rate of the server 10.

즉, 모니터링부(41)는 랜 스위치(24)의 트래픽량이 트래픽 임계치 이상이거나, 서버(10)의 세션값이 세션값 임계치 이상이거나, CPU 사용율이 CPU 사용율 임계치 이상이거나, 서버(10)의 메모리 사용율이 메모리 사용율 이상이거나, 서버(10)의 디스크 사용율이 디스크 사용율 임계치 이상이면 서버(10)나 네트워크에 장애가 발생한 것으로 판단한다. That is, the monitoring unit 41 determines whether the traffic amount of the LAN switch 24 is equal to or greater than the traffic threshold, the session value of the server 10 is equal to or greater than the session value threshold, the CPU utilization rate is equal to or greater than the CPU utilization threshold, It is determined that a failure has occurred in the server 10 or the network if the usage rate is equal to or higher than the memory usage rate or the disk usage rate of the server 10 is equal to or greater than the disk usage ratio threshold.

단계(S20)에서의 판단 결과 서버(10)나 네트워크에 장애가 발생한 것으로 판단되면, 모니터링부(41)는 스위칭 제어부(42)를 통해 랜 스위치(24)를 제어(S30)하여 사용자 단말(30)과 서버(10) 간의 연결을 차단하고, 안내 페이지 출력부(43)를 제어하여 사용자 단말(30)을 통해 장애 안내 페이지를 출력한다(S40). 이 경우, 웹페이지에 접속한 사용자 단말(30)로는 현재 서버(10)나 네트워크에 장애가 발생 중임을 안내하는 안내 메시지가 출력되게 된다. If it is determined in step S20 that the server 10 or the network has failed, the monitoring unit 41 controls the LAN switch 24 through the switching controller 42 (S30) And outputs the trouble guidance page through the user terminal 30 (S40) by controlling the guide page output unit 43. In this case, In this case, the user terminal 30 connected to the web page outputs a guidance message informing that the failure of the current server 10 or the network is occurring.

또한, 모니터링부(41)는 로그 관리부(45)를 제어하여 장애 발생 내역, 예를 들어 장애가 발생된 시간 및 장애 상황에 대한 상세 정보를 저장한다(S50),In addition, the monitoring unit 41 controls the log management unit 45 to store detailed information about the occurrence of the failure, for example, the time and the failure occurrence time (S50)

게다가, 모니터링부(41)는 관리자 단말(50)로 장애 발생 정보를 전달(S60)함으로써, 관리자 단말(50)을 통해 장애 발생 정보를 출력하도록 한다. 이에 따라 관리자는 서버(10)나 네트워크 장애를 인지하고 대처할 수 있게 된다. In addition, the monitoring unit 41 transmits the failure occurrence information to the administrator terminal 50 (S60), and outputs the failure occurrence information through the administrator terminal 50. [ Accordingly, the administrator can recognize and cope with the server 10 or the network failure.

한편, 상기한 과정에서 모니터링부(41)는 상태 출력부(46)를 통해 모니터링부(41)의 모니터링 상태, 예를 들어 서버(10)의 세션값, 서버(10)의 CPU 사용율, 서버(10)의 메모리 사용율, 서버(10)의 디스크 사용율 중 적어도 하나를 표시하고, 이들을 기반으로 한 장애 판단 결과를 실시간으로 출력할 수 있다. In the above process, the monitoring unit 41 monitors the monitoring status of the monitoring unit 41, for example, the session value of the server 10, the CPU usage rate of the server 10, the server 10, and the disk usage rate of the server 10, and output a failure determination result based on the at least one of the memory usage rate and the disk usage rate of the server 10 in real time.

이와 같이, 본 발명의 일 실시예에 따른 서버 장애 모니터링 장치 및 방법은 서버(10)와 네트워크 장애를 분석하여 분석 결과에 따라 사용자 단말(30) 또는 관리자 단말(50)에 안내한다.As described above, the server fault monitoring apparatus and method according to an embodiment of the present invention analyzes the network fault with the server 10 and guides the user terminal 30 or the administrator terminal 50 according to the analysis result.

또한, 본 발명의 일 실시예에 따른 서버 장애 모니터링 장치 및 방법은 서버 장애 발생시 사용자 단말(30)을 통해 서버 장애가 발생중임을 안내하여 사용자의 불안감을 감소시키고, 관리자가 서버 장애에 즉각적으로 대처할 수 있도록 한다. In addition, the apparatus and method for monitoring a server failure according to an embodiment of the present invention can reduce anxiety of a user by informing that a server failure is occurring through a user terminal 30 when a server failure occurs and allow the administrator to immediately respond to a server failure .

본 발명은 도면에 도시된 실시예를 참고로 하여 설명되었으나, 이는 예시적인 것에 불과하며 당해 기술이 속하는 기술분야에서 통상의 지식을 가진 자라면 이로부터 다양한 변형 및 균등한 타 실시예가 가능하다는 점을 이해할 것이다. 따라서, 본 발명의 진정한 기술적 보호범위는 아래의 특허청구범위에 의하여 정해져야할 것이다.While the present invention has been particularly shown and described with reference to exemplary embodiments thereof, it is clearly understood that the same is by way of illustration and example only and is not to be taken by way of limitation, I will understand. Accordingly, the true scope of the present invention should be determined by the following claims.

10: 서버 11: 웹서버
12: WAS서버 20: 백본망
21: 백본 스위치 22: IPS
23: 방화벽 24: 랜 스위치
30: 사용자 단말 40: 모니터링 서버
41: 모니터링부 42: 스위칭 제어부
43: 안내 페이지 출력부 44: 장애 안내부
45: 로그 관리부 46: 상태 출력부
50: 관리자 단말10: Server 11: Web server
12: WAS server 20: backbone network
21: backbone switch 22: IPS
23: Firewall 24: LAN switch
30: user terminal 40: monitoring server
41: monitoring unit 42: switching control unit
43: guide page output unit 44: trouble guide unit
45: log management unit 46: status output unit
50:

Claims

A switching controller connected to the LAN switch for controlling the switching state of the LAN switch;
A guidance page output unit outputting a failure guidance page through a user terminal; And
The network monitoring unit monitors at least one of a load of the network, a load of the server, and a traffic load of the LAN switch connected to the LAN switch of the backbone network, determines whether a failure of the server or the network occurs according to the monitoring result, And a monitoring unit for blocking communication with the server through the control unit and controlling the output unit to output a failure guidance page through the user terminal,
A status output unit displaying a session value, a CPU usage rate, a memory usage rate, and a disk usage rate of the server monitored by the monitoring unit and outputting a failure determination result in real time;
A failure notification unit for transmitting failure occurrence information to the administrator terminal and outputting failure occurrence information through the administrator terminal if it is determined that the monitoring result of the monitoring unit is a failure; And
And a log management unit for storing detailed information on the time and the failure status of the failure when the monitoring result of the monitoring unit indicates that the failure has occurred and outputting the failure occurrence details according to the request of the monitoring unit by managing the failure occurrence history by the history &Lt; / RTI &
The monitoring unit may monitor whether the traffic amount of the LAN switch is greater than or equal to a predetermined traffic threshold, the session value of the server is equal to or greater than a predetermined session value threshold, the CPU usage rate of the server is equal to or greater than a predetermined CPU usage threshold, Determines that a failure has occurred when the disk usage rate of the server is equal to or greater than a preset threshold of the memory usage rate,
The threshold value set for the session value, the CPU usage rate, the memory usage rate, and the disk usage rate of the server is a value that can be used as a criterion for determining that a failure has occurred in the server or the network, Lt; / RTI >
The monitoring server is connected to the backbone network to monitor a network, a load of the server, and a traffic load of the LAN switch, and judges whether a failure has occurred according to a monitoring result. A predetermined operation is performed according to occurrence of a fault,
The backbone network includes a backbone switch, an Intrusion Prevention System (IPS), a firewall, and a LAN switch. The IPS is an intrusion prevention system that blocks harmful traffic caused by malicious code and hacking. The LAN switch controls the access from the external attack and distributes the load of the web server or the firewall equally to apply the switch of L4 or more so that the load is not concentrated on one server,
The LAN switch manages servers that perform the same role among a plurality of servers through a virtual IP (VIP) when the load from the Internet network is distributed, and the load or traffic Is dispersed by any one of Round Robin, hashing, and least connection,
The monitoring server monitors a plurality of servers and inspection target devices to determine a failure or a network failure, provides a guidance page to the user through the LAN switch, informs the administrator of the failure details, and is installed independently of the LAN switch Or may be integrally installed,
The server includes a web server providing a web page in response to a request from a user terminal, and a WAS server providing a variety of applications according to a request of a user terminal connected to the web page,
The monitoring server outputs a guidance message to inform the manager that the failure has occurred in the server or the network through the user terminal, and transmits the failure occurrence information to the administrator terminal to output the failure occurrence information through the administrator terminal.
The user terminal is a personal computer (PC), a smart terminal, and a laptop computer that access the server through a network to output a web page,
The administrator terminal receives the fault occurrence information from the monitoring server and outputs the fault occurrence information to the monitoring server. The administrator terminal includes a PC, a smart terminal, and a laptop for inputting manager name, manager terminal information, and various information for monitoring the server, Wherein the server includes a computer.

delete

Monitoring the network load, the load of the server, and the traffic load of the LAN switch connected to the LAN switch of L4 or more of the backbone network;
Determining whether a failure of a server or a network occurs according to a monitoring result of the monitoring unit;
Controlling the monitoring unit to block communication with the server according to a result of the determination; And
The monitoring unit outputting a failure guidance page through a user terminal,
Transmitting the failure occurrence information to the administrator terminal and outputting failure occurrence information through the administrator terminal if the failure guidance section determines that the monitoring result is a failure result of monitoring by the monitoring section; And
When the log management unit determines that a fault has occurred as a result of the monitoring by the monitoring unit, storing detailed information on the time and the failure status of the failure and managing the failure occurrence history as a history,
The step of determining whether or not the failure has occurred may include determining that the monitoring unit determines that the traffic amount of the LAN switch is greater than or equal to a preset traffic threshold value, the server session value is equal to or greater than a predetermined session value threshold, Determining that a failure has occurred if the usage rate threshold is equal to or greater than a predetermined usage threshold, the memory usage rate of the server is greater than or equal to a predetermined memory usage threshold, or the disk usage rate of the server is equal to or greater than a predetermined disk usage threshold,
Wherein the monitoring unit displays the session value, the CPU usage rate, the memory usage rate, and the disk usage rate of the server monitored by the monitoring unit, and outputs the failure determination result through the status output unit in real time,
The threshold value set for the session value, the CPU usage rate, the memory usage rate, and the disk usage rate of the server is a value that can be used as a criterion for determining that a failure has occurred in the server or the network, Lt; / RTI >
The monitoring server is connected to the backbone network to monitor a network, a load of the server, and a traffic load of the LAN switch, and judges whether a failure has occurred according to a monitoring result. A predetermined operation is performed according to occurrence of a fault,
The backbone network includes a backbone switch, an Intrusion Prevention System (IPS), a firewall, and a LAN switch. The IPS is an intrusion prevention system that blocks harmful traffic caused by malicious code and hacking. The LAN switch controls the access from the external attack and distributes the load of the web server or the firewall equally to apply the switch of L4 or more so that the load is not concentrated on one server,
The LAN switch manages servers that perform the same role among a plurality of servers through a virtual IP (VIP) when the load from the Internet network is distributed, and the load or traffic Is dispersed by any one of Round Robin, hashing, and least connection,
The monitoring server monitors a plurality of servers and inspection target devices to determine a failure or a network failure, provides a guidance page to the user through the LAN switch, informs the administrator of the failure details, and is installed independently of the LAN switch Or may be integrally installed,
The server includes a web server providing a web page in response to a request from a user terminal, and a WAS server providing a variety of applications according to a request of a user terminal connected to the web page,
The monitoring server outputs a guidance message to inform the manager that the failure has occurred in the server or the network through the user terminal, and transmits the failure occurrence information to the administrator terminal to output the failure occurrence information through the administrator terminal.
The user terminal is a personal computer (PC), a smart terminal, and a laptop computer that access the server through a network to output a web page,
The administrator terminal receives the fault occurrence information from the monitoring server and outputs the fault occurrence information to the monitoring server. The administrator terminal includes a PC, a smart terminal, and a laptop for inputting manager name, manager terminal information, and various information for monitoring the server, Wherein the server includes a computer.

delete