KR20180029666A

KR20180029666A - Integrated monitoring method

Info

Publication number: KR20180029666A
Application number: KR1020160118040A
Authority: KR
Inventors: 김경수; 엄광례; 이영주; 김학진
Original assignee: 농협은행(주)
Priority date: 2016-09-13
Filing date: 2016-09-13
Publication date: 2018-03-21
Also published as: KR101953385B1

Abstract

The present invention relates to an integrated control method which comprises the following steps: an integrated control server collects control information of a work system; the integrated control server generates a pattern on the control information by analyzing the control information; the integrated control server generates a policy on a symptom expression method by combining one or more patterns; and the integrated control server expresses a symptom through a dashboard module according to policy on the symptom expression method.

Description

INTEGRATED MONITORING METHOD

본 발명은 통합관제 방법에 관한 것으로서, 보다 상세하게는 업무 시스템의 장애와 성능을 실시간으로 모니터링하는 통합관제 방법에 관한 것이다.The present invention relates to an integrated control method, and more particularly, to an integrated control method for monitoring failure and performance of a business system in real time.

일반적으로, 전자금융이란 전자화된 매체에 의하여 금융서비스를 제공하는 것이다. 전자금융은 정보통신기술과 정보처리기법을 접목하여 금융거래정보의 전자화, 전자적 응답방법, 전자적 행위 및 전자적 결제방법을 채택함으로써, 고객과 금융기관의 금융거래 및 결제업무 등을 전자적인 방법으로 행하는 금융서비스를 말한다. In general, electronic finance is the provision of financial services by means of electronic media. Electronic finance combines information and communication technology with information processing techniques, and adopts electronicization of financial transaction information, electronic response method, electronic behavior and electronic payment method, and conducts financial transaction and settlement business of customer and financial institution electronically Financial services.

통상의 금융형태는 지점 중심의 금융형태로서 가장 보편적이면서 가장 통속적인 금융업무 형태에 해당되는데, 직원과 고객간의 직접 대면에 의해 각종 금융거래가 처리되고 지점(출장소 포함) 및 창구의 확장, 대고객 서비스의 개선이 고객유치의 전략적 핵심요소로 작용하였다. The usual financial form is branch-oriented financial form, which is the most common and most prevalent form of financial business. It is a system in which financial transactions are processed by face-to-face meetings between employees and customers, and the branches (including branch offices) Improvement has become a strategic key factor in attracting customers.

최근 들어서는 인터넷을 이용한 인터넷 뱅킹이나 스마트폰 등을 이용한 스마마 뱅킹 서비스 등이 개시되면서 금융시스템은 좀 더 복잡해지고 다양화되는 실정이다. 이에 따라, 금융서비스를 위한 전산자원의 장애 등에 대한 운영 및 관리에 대한 필요성이 더욱 증가하게 되었다. Recently, the financial system has become more complicated and diversified due to the introduction of Internet banking using the Internet and smart banking service using smart phones. As a result, the necessity of operation and management of disruption of computer resources for financial services has increased.

이에 최근에는 통합관제시스템이 도입되었다. 통합관제시스템은 IT의 주요 장비, 예를 들어 서버, 네트워크, 보안, 통신, 백업 등의 장애와 성능을 실시간으로 모니터링한다. Recently, integrated control system has been introduced. The integrated control system monitors real-time failure and performance of major IT equipment such as servers, networks, security, communications, and backup.

본 발명의 배경기술은 대한민국 공개특허공보 10-2000-0021396호(2000.04.25)의 '금융 전산 네트웍의 지점 장애 관리 방법'에 개시되어 있다.BACKGROUND ART [0002] The background art of the present invention is disclosed in Korean Patent Laid-Open Publication No. 10-2000-0021396 (Apr. 25, 2000) entitled " Branch fault management method of financial computing network ".

종래의 통합관제시스템은 상기한 주요 장비와 연계된 연계 시스템으로부터 데이터를 수집하고, 수집된 데이터를 가공하고 처리하여 다수의 통보 시스템으로 통보하며 화면에 가시화한다. 특히 대시 보드는 상기한 바와 같은 장애 이벤트를 화면상으로 전시하여 관리자가 현재의 장애 이벤트를 인지할 수 있도록 하였다. A conventional integrated control system collects data from a linkage system associated with the above-mentioned main equipment, processes and processes the collected data, notifies to a plurality of notification systems and visualizes them on a screen. In particular, the dashboard displays the above-described failure event on the screen so that the administrator can recognize the current failure event.

그러나, 종래에는 관리자가 중요 장애가 발생할 수 있는지 여부를 사전에 예측하는 데에는 부족한 실정이었으며, 장애의 원인을 분석하는데에도 어려운 실정이었다. 게다가, 종래에는 최고 등급의 장애 이벤트를 중심으로 대시보드 모듈을 통해 표출하여 조치하도록 하는 바, 장애 이벤트 발생 후 조치하는 사후 조치 성격이 강한 문제점이 있었다. However, in the past, it was not enough for the manager to predict in advance whether or not a major failure could occur, and it was also difficult to analyze the cause of the failure. In addition, conventionally, a dashboard module is used to display and deal with a fault event of the highest grade, and there is a strong problem of the nature of the after-action to take action after the occurrence of the fault event.

본 발명은 전술한 문제점을 해결하기 위해 창안된 것으로서, 본 발명의 일 측면에 따른 목적은 업무 시스템의 장애 이벤트 및 성능 수치를 분석하여 이상징후를 검출하고 검출된 이상징후를 시각적으로 표출하는 통합 관제 방법을 제공하는 것이다. SUMMARY OF THE INVENTION The present invention has been made to solve the above problems, and it is an object of one aspect of the present invention to provide a system and method for analyzing a failure event and a performance value of a business system to detect an abnormal symptom and visually express the detected abnormal symptom Method.

본 발명의 다른 목적은 업무 시스템의 이상징후를 시각화하여 대시모드를 통해 표출함으로써, 관리자가 장애의 영향도를 파악하고 장애의 원인을 손쉽게 분석할 수 있도록 한 통합 관제 방법을 제공하는 것이다.Another object of the present invention is to provide an integrated control method that enables an administrator to grasp the influence of a failure and easily analyze the cause of the failure by visualizing an abnormal symptom of a business system and expressing it through a dash mode.

본 발명의 또 다른 목적은 과거 이벤트 조회와 성능 수치 조회 및 시뮬레이션 기능을 이용하여 이상징후 표출 방식에 대한 정책을 수립할 수 있도록 한 통합 관제 방법을 제공하는 것이다. Yet another object of the present invention is to provide an integrated control method for establishing a policy for anomaly indication using past event inquiry, performance numerical value inquiry, and simulation function.

본 발명의 또 다른 목적은 업무 시스템의 장애 이벤트 및 성능 수치의 조건을 토대로 이상징후를 검출하고 대시보드를 통해 표출함으로써 관리자가 업무 시스템의 장애에 더욱 능동적으로 대처할 수 있도록 한 통합 관제 방법을 제공하는 것이다.Yet another object of the present invention is to provide an integrated control method that enables an administrator to more actively cope with a failure of a business system by detecting abnormality signs and expressing them through a dashboard based on failure event and performance numerical conditions of the business system will be.

본 발명의 또 다른 목적은 업무 시스템 중 장애가 발생한 업무 시스템이 소속된 그룹 전체의 서버자원 현황, DB(DataBase)자원 현황, WAS(Web Application Server)자원 현황, TP(Transaction Processing)자원 현황, 시스템작업 현황, 프로그램작업 현황, 업무 시스템 구성도, 토폴로지, 이벤트 이력 및 장애 이력 등을 통합하여 화면을 총체적으로 구성함으로써, 관리자가 장애의 원인을 더욱 용이하게 해결할 수 있도록 하는 통합 관제 방법을 제공하는 것이다. Another object of the present invention is to provide a system and a method for managing a fault in a business system, including a server resource status, a database (DB) resource status, a WAS (Web Application Server) resource status, a TP The present invention provides an integrated control method that enables an administrator to more easily solve the cause of a trouble by integrating the status, the program operation status, the task system configuration diagram, the topology, the event history, and the failure history.

본 발명의 일 측면에 따른 통합관제 방법은 통합관제 서버가 업무 시스템의 관제 정보를 수집하는 단계; 상기 통합관제 서버가 상기 관제 정보를 분석하여 상기 관제 정보에 대한 패턴을 생성하는 단계; 상기 통합관제 서버가 적어도 하나의 패턴을 조합하여 이상징후 표출 방식에 대한 정책을 생성하는 단계; 및 상기 통합관제 서버가 이상징후 표출 방식에 대한 정책에 따라 이상징후를 대시보드 모듈을 통해 표출하는 단계를 포함하는 것을 특징으로 한다. According to an aspect of the present invention, an integrated control method includes: a step in which an integrated control server collects control information of a business system; Analyzing the control information by the integrated control server to generate a pattern of the control information; The integrated control server combining at least one pattern to generate a policy for an anomaly indication mode; And displaying the abnormal symptom through the dashboard module according to the policy for the abnormal symptom display method.

본 발명에서, 상기 관제 정보는 상기 업무 시스템의 장애 상황에 대한 장애 이벤트 및 상기 업무 시스템의 성능에 대한 성능 정보를 포함하는 것을 특징으로 한다. In the present invention, the management information includes a failure event for the failure condition of the business system and performance information on the performance of the business system.

본 발명의 상기 관제 정보에 대한 패턴을 생성하는 단계에서, 상기 통합관제 서버는 상기 관제 정보가 장애 이벤트이면, 장애 이벤트의 조건을 선별하여 선별된 조건에 따라 패턴을 생성하고, 기 설정된 과거 기간 동안 발생된 장애 이벤트에 대해 패턴을 시뮬레이션하여 과거 기간 동안에 발생된 장애 이벤트가 검색되는지를 확인하며, 확인 결과에 따라 패턴을 최종 생성하는 것을 특징으로 한다. In the step of generating the pattern for the control information according to the present invention, when the control information is a fault event, the integrated control server selects a condition of the fault event, generates a pattern according to the selected condition, A pattern is simulated for the generated fault event to check whether a fault event occurred during the past period is searched, and a pattern is finally generated according to the result of the check.

본 발명에서, 상기 장애 이벤트의 조건은 장애 이벤트의 등급, 반복 횟수, 애플리케이션 종류, 메시지의 그룹, 메시지의 오브젝트, 이벤트가 발생된 발생 시각, 호스트 네임 및 장애 이벤트가 발생한 기간 중 적어도 하나 이상을 포함하는 것을 특징으로 한다. In the present invention, the condition of the fault event includes at least one of a class of a fault event, a repetition count, an application type, a group of messages, an object of a message, an occurrence time at which an event was generated, .

본 발명의 상기 이상징후 표출 방식에 대한 정책을 결정하는 단계에서, 상기 통합관제 서버는 적어도 하나 이상의 패턴을 조합하여 정책을 정의하고, 정의된 정책을 기 설정된 과거 기간 동안 장애 이벤트에 대해 시뮬레이션하여 정책의 정확도를 검출한 후, 검출된 정확도를 바탕으로 정책을 최종 결정하는 것을 특징으로 한다. In the step of determining the policy for the abnormality symptom display method of the present invention, the integrated control server defines a policy by combining at least one pattern, simulates the defined policy for a fault event for a predetermined period, And the policy is finally determined on the basis of the detected accuracy.

본 발명의 상기 관제 정보에 대한 패턴을 생성하는 단계에서, 상기 통합관제 서버는 상기 관제 정보가 성능 정보이면, 성능정보에 적합한 조건을 선별하고 선별된 조건을 이용하여 적어도 하나 이상의 패턴을 정의하고, 기 설정된 과거 기간 동안의 성능 정보에 대해 패턴을 시뮬레이션하여 과거 기간 동안의 성능 정보가 검색되는지를 확인한 후, 확인 결과에 따라 패턴을 최종 생성하는 것을 특징으로 한다. In the step of generating a pattern for the control information according to the present invention, if the control information is performance information, the integrated control server selects at least one condition suitable for the performance information, defines at least one pattern using the selected condition, A pattern is simulated with respect to performance information for a predetermined past period to confirm whether performance information for a past period is searched for, and a pattern is finally generated according to the check result.

본 발명에서, 상기 통합관제 서버는 성능정보의 성능 수치를 체크하고, 체크된 성능정보의 성능 수치가 상기 업무 시스템에 기 설정된 임계범위를 벗어나면 성능 정보에 대응되는 패턴을 생성하는 것을 특징으로 한다. In the present invention, the integrated control server is configured to check the performance value of the performance information and to generate a pattern corresponding to the performance information when the performance value of the checked performance information is out of a predetermined threshold range in the business system .

본 발명의 상기 이상징후 표출 방식에 대한 정책을 결정하는 단계에서, 상기 통합관제 서버는 적어도 하나 이상의 패턴을 조합하여 정책을 생성하고, 생성한 정책을 기 설정된 과거 기간 동안의 성능 정보에 대해 시뮬레이션하여 정책의 정확도를 검출한 후, 검출된 정확도를 바탕으로 정책을 최종 결정하는 것을 특징으로 한다. In the step of determining the policy for the abnormal symptom display method of the present invention, the integrated control server creates a policy by combining at least one pattern, simulates the generated policy against performance information for a predetermined past period And the policy is finally determined based on the detected accuracy after detecting the accuracy of the policy.

본 발명에서, 상기 정책을 결정하는 단계에서, 상기 통합관제 서버는 패턴의 개수에 따라 적어도 하나의 그룹정책 또는 단일정책을 생성하는 것을 특징으로 한다. In the present invention, in the step of determining the policy, the integrated control server generates at least one group policy or a single policy according to the number of patterns.

본 발명의 상기 이상징후를 대시보드 모듈을 통해 표출하는 단계에서, 상기 대시보드 모듈은 이상징후 검출 타임 라인, 이상징후에 대한 검출 결과, 토폴로지 선택 버튼, 구성도 선택 버튼, 작업계획서 선택 버튼, 이벤트 이력 선택 버튼, 장애 이력 선택 버튼 및 성능정보를 하나의 화면으로 표출하는 것을 특징으로 한다. In the step of exposing the abnormal symptom of the present invention through the dashboard module, the dashboard module may include an abnormality symptom detection timeline, a detection result of an abnormality symptom, a topology selection button, a configuration diagram selection button, a task plan selection button, A history selection button, a failure history selection button, and performance information on a single screen.

본 발명의 일 측면에 따른 통합 관제 방법은 업무 시스템의 장애 이벤트 및 성능 수치를 분석하여 이상징후를 검출하고 검출된 이상징후를 대시보드 모듈을 통해 시각적으로 표출한다. An integrated control method according to an aspect of the present invention analyzes abnormality events and performance values of a business system to detect an abnormal symptom and visually express the detected abnormal symptom through a dashboard module.

본 발명의 일 측면에 따른 통합 관제 방법은 업무 시스템의 이상징후를 시각화하여 대시모드 모듈을 통해 표출함으로써, 관리자가 장애의 영향도를 파악하고 장애의 원인을 손쉽게 분석할 수 있도록 한다.The integrated control method according to one aspect of the present invention visualizes an abnormal symptom of a business system and expresses the abnormal symptom through a dash mode module so that an administrator can grasp the influence of the obstacle and easily analyze the cause of the obstacle.

본 발명의 일 측면에 따른 통합 관제 방법은 업무 시스템의 장애 이벤트에서 장애 이벤트의 조건을 토대로 이상징후 정보를 검출하고 성능 수치에서는 성능 이상징후를 검출하여 대시보드를 통해 표출함으로써 관리자가 업무 시스템의 장애에 더욱 능동적으로 대처할 수 있도록 한다.The integrated control method according to one aspect of the present invention detects abnormality symptom information based on a condition of a fault event in a fault event of a business system, detects a performance abnormality symptom in a performance value, and displays it through a dashboard, So that it can cope more actively.

본 발명의 일 측면에 따른 통합 관제 방법은 업무 시스템 중 장애가 발생한 업무 시스템이 소속된 그룹 전체의 서버자원 현황, DB(DataBase)자원 현황, WAS(Web Application Server)자원 현황, TP(Transaction Processing)자원 현황, 시스템작업 현황, 프로그램작업 현황, 업무 시스템 구성도, 토폴로지, 이벤트 이력 및 장애 이력 등을 통합하여 화면을 총체적으로 구성함으로써, 관리자가 장애의 원인을 더욱 용이하게 해결할 수 있도록 한다.The integrated control method according to one aspect of the present invention is a method of managing an integrated control system according to one aspect of the present invention, including a server resource status, a database (DB) resource status, a WAS (Web Application Server) resource status, a TP By integrating the status, system operation status, program operation status, business system configuration diagram, topology, event history, and failure history, the screen is configured as a whole so that the administrator can more easily solve the cause of the failure.

도 1 은 본 발명의 일 실시예에 따른 통합 관제 장치의 블럭 구성도이다.
도 2 는 본 발명의 일 실시예에 따른 통합관제 서버의 블럭 구성도이다.
도 3 은 본 발명의 일 실시예에 따른 장애 이벤트의 조건을 분석하여 이상징후를 표출하는 과정을 개념적으로 나타낸 도면이다.
도 4 는 본 발명의 일 실시예에 따른 성능 정보의 조건을 분석하여 이상징후를 표출하는 과정을 개념적으로 나타낸 도면이다.
도 5 는 본 발명의 일 실시예에 따른 이상징후 및 연관 정보를 표시하는 화면 예를 나타낸 도면이다.
도 6 은 본 발명의 일 실시예에 따른 연관 정보에 대한 토폴로지 구성을 나타낸 도면이다.
도 7 은 본 발명의 일 실시예에 따른 과거 시점의 이상징후 및 연관 정보를 표시하는 화면 예를 나타낸 도면이다.
도 8 은 본 발명의 일 실시예에 따른 통합 관제 방법을 도시한 순서도이다.
도 9 는 본 발명의 일 실시예에 따른 장애 이벤트 기반의 정책 생성 과정을 나타낸 순서도이다.
도 10 은 본 발명의 일 실시예에 따른 성능 정보 기반의 정책 생성 과정을 나타낸 순서도이다.1 is a block diagram of an integrated control apparatus according to an embodiment of the present invention.
2 is a block diagram of an integrated control server according to an embodiment of the present invention.
FIG. 3 is a conceptual view illustrating a process of analyzing a condition of a fault event according to an embodiment of the present invention to display an abnormal symptom. Referring to FIG.
4 is a conceptual diagram illustrating a process of analyzing conditions of performance information according to an embodiment of the present invention to display an abnormal symptom.
FIG. 5 is a diagram illustrating an example of a screen displaying an abnormal symptom and associated information according to an embodiment of the present invention. Referring to FIG.
FIG. 6 is a diagram illustrating a topology configuration of association information according to an exemplary embodiment of the present invention. Referring to FIG.
FIG. 7 is a view showing an example of a screen displaying an abnormal symptom and related information in the past according to an embodiment of the present invention.
8 is a flowchart illustrating an integrated control method according to an embodiment of the present invention.
FIG. 9 is a flowchart illustrating a fault generation process based on a fault event according to an embodiment of the present invention. Referring to FIG.
10 is a flowchart illustrating a process of generating a performance information based policy according to an embodiment of the present invention.

이하에서는 본 발명의 일 실시예에 따른 통합관제 방법을 첨부된 도면들을 참조하여 상세하게 설명한다. 이러한 과정에서 도면에 도시된 선들의 두께나 구성요소의 크기 등은 설명의 명료성과 편의상 과장되게 도시되어 있을 수 있다. 또한 후술되는 용어들은 본 발명에서의 기능을 고려하여 생성된 용어들로서, 이는 이용자, 운용자의 의도 또는 관례에 따라 달라질 수 있다. 그러므로 이러한 용어들에 대한 정의는 본 명세서 전반에 걸친 내용을 토대로 내려져야할 것이다. Hereinafter, an integrated control method according to an embodiment of the present invention will be described in detail with reference to the accompanying drawings. In this process, the thicknesses of the lines and the sizes of the components shown in the drawings may be exaggerated for clarity and convenience of explanation. Further, terms to be described later are terms generated in consideration of functions in the present invention, which may vary depending on the user, the intention or custom of the operator. Therefore, definitions of these terms should be made based on the contents throughout this specification.

도 1 은 본 발명의 일 실시예에 따른 통합 관제 장치의 블럭 구성도이고, 도 2 는 본 발명의 일 실시예에 따른 통합관제 서버의 블럭 구성도이며, 도 3 은 본 발명의 일 실시예에 따른 장애 이벤트의 조건을 분석하여 이상징후 정보를 표출하는 과정을 개념적으로 나타낸 도면이며, 도 4 는 본 발명의 일 실시예에 따른 성능 정보의 조건을 분석하여 이상징후를 표출하는 과정을 개념적으로 나타낸 도면이며, 도 5 는 본 발명의 일 실시예에 따른 이상징후 및 연관 정보를 표시하는 화면 예를 나타낸 도면이며, 도 6 은 본 발명의 일 실시예에 따른 연관 정보에 대한 토폴로지 구성을 나타낸 도면이며, 도 7 은 본 발명의 일 실시예에 따른 과거 시점의 이상징후 및 연관 정보를 표시하는 화면 예를 나타낸 도면이다. FIG. 1 is a block diagram of an integrated control apparatus according to an embodiment of the present invention. FIG. 2 is a block diagram of a combined control server according to an embodiment of the present invention. FIG. FIG. 4 is a conceptual diagram illustrating a process of analyzing conditions of performance information according to an exemplary embodiment of the present invention to display abnormality symptoms. FIG. FIG. 5 is a diagram illustrating an example of a screen displaying abnormality indications and related information according to an embodiment of the present invention, FIG. 6 is a diagram illustrating a topology configuration of association information according to an embodiment of the present invention And FIG. 7 is a view showing an example of a screen displaying an abnormal symptom and related information in the past according to an embodiment of the present invention.

도 1 을 참조하면, 본 발명의 일 실시예에 따른 통합 관제 장치는 통합관제 서버(20) 및 대시보드 모듈(30)을 포함한다.Referring to FIG. 1, an integrated control apparatus according to an exemplary embodiment of the present invention includes an integrated control server 20 and a dashboard module 30.

대시보드 모듈(30)은 통합관제 서버(20)로부터 전달된 정보를 표출한다. 대시보드 모듈(30)을 통해 표시되는 정보에는 업무 시스템 등에 발생된 장애 이벤트 또는 성능에 대한 이상징후 정보가 포함된다. The dashboard module 30 exposes the information transmitted from the integrated control server 20. The information displayed through the dashboard module 30 includes abnormality indication information on a failure event or performance occurring in a business system or the like.

이 경우, 대시보드 모듈(30)은 통합관제 서버(20)로부터 이상징후 정보가 전달되면 즉각적으로 화면을 전환시켜 주목성을 향상시킴으로써, 관리자가 장애 이벤트를 즉각적으로 인지하고 대처할 수 있도록 한다. In this case, the dashboard module 30 promptly changes the screen when the abnormal symptom information is transmitted from the integrated control server 20, thereby improving the attention, thereby allowing the administrator to immediately recognize and cope with the trouble event.

더욱이 대시보드 모듈(30)은 관리자가 장애 이벤트에 대한 원인을 찾아 해결할 수 있도록 하기 위해, 장애가 발생한 업무 시스템 그룹(10)에 대한 연관 정보를 표출하는데, 이러한 연관 정보에는 이상징후 검출 타임 라인, 이상징후에 대한 검출 결과, 토폴로지, 구성도, 작업계획서, 이벤트 이력, 장애 이력 및 성능정보가 포함될 수 있다. Furthermore, the dashboard module 30 displays association information for the failed business system group 10 in order to allow the administrator to find and solve the cause of the failure event. The association information includes an abnormality detection detection time line, A detection result for the symptom, a topology, a configuration diagram, a work plan, an event history, a failure history, and performance information.

여기서, 성능정보에는 자원 성능 정보, DB(DataBase) 성능 정보, WAS(Web Application Server) 성능 정보 및 TP(Transaction Processing) 성능 정보가 포함될 수 있다. Here, the performance information may include resource performance information, DB (DataBase) performance information, WAS (Web Application Server) performance information, and TP (Transaction Processing) performance information.

이에 관리자는 이들 연관 정보들을 참고하여 장애 이벤트의 원인을 찾고 장애 이벤트를 해결할 수 있게 된다. Accordingly, the manager can find the cause of the fault event by referring to the related information, and resolve the fault event.

즉, 대시보드 모듈(30)은 장애 이벤트 발생시, 장애 이벤트가 발생한 업무 시스템(11)의 장애 이벤트와 성능 정보에 관련된 연관 정보를 표출함으로써, 관리자가 장애 이벤트의 영향도를 파악하고 장애 이벤트의 원인을 분석하며 장애 이벤트를 해결할 수 있도록 한다. That is, when the failure event occurs, the dashboard module 30 displays association information related to the failure event and the performance information of the business system 11 in which the failure event has occurred, so that the administrator can grasp the influence degree of the failure event, So that fault events can be solved.

통합관제 서버(20)는 업무 시스템(11)에 대한 관제 정보를 수집한다. 관제 정보에는 업무 시스템(11)의 장애 상황에 대한 장애 이벤트 및 업무 시스템(11)의 성능에 대한 성능 정보가 포함된다.The integrated control server 20 collects control information for the business system 11. The control information includes a failure event for the failure situation of the business system 11 and performance information on the performance of the business system 11. [

통합관제 서버(20)는 상기한 바와 같이 수집한 관제 정보를 분석하여 패턴을 생성하고, 각각의 패턴을 조합하여 이상징후 표출 방식에 대한 정책을 생성한 후, 생성된 해당 이상징후를 대시보드 모듈을 통해 표출한다. The integrated control server 20 analyzes the collected control information to generate a pattern, combines the patterns to generate a policy for the anomaly indication system, and then transmits the generated anomaly notification to the dashboard module .

즉, 관제 정보가 장애 이벤트일 경우, 업무 시스템(11)에 발생한 장애 이벤트를 수집하고, 수집한 장애 이벤트의 조건을 조합하여 적어도 하나의 패턴을 생성한다. That is, when the management information is a failure event, the failure event occurring in the business system 11 is collected, and at least one pattern is generated by combining the collected failure event conditions.

이어 통합관제 서버(20)는 적어도 하나의 패턴을 조합하여 이상징후 표출 방식에 대한 정책을 생성하고, 생성된 정책에 따라 장애 이벤트 이상징후를 대시보드 모듈(30)을 통해 영상 및 음성 중 적어도 하나를 통해 표출한다. Then, the integrated control server 20 generates a policy for an abnormal symptom display mode by combining at least one pattern, and displays a symptom of a malfunction event abnormality according to the generated policy through at least one of video and audio through the dashboard module 30 .

또한, 관제 정보가 성능 정보일 경우, 통합관제 서버(20)는 업무 시스템(11)의 성능 정보를 수집하고, 수집한 성능 정보 각각의 패턴을 검출한다. 이어 통합관제 서버(20)는 적어도 하나의 패턴을 조합하여 표출 방식에 대한 정책을 생성하고, 생성된 정책에 따라 이벤트 이상징후 처리모듈 대시보드 모듈(30)을 통해 영상 및 음성 중 적어도 하나를 통해 표출한다. Also, when the control information is performance information, the integrated control server 20 collects performance information of the business system 11 and detects patterns of the collected performance information. Then, the integrated control server 20 generates a policy for the display mode by combining at least one pattern, and transmits the event abnormal symptom processing module 20 via at least one of video and audio through the dysfunction module 30 And

참고로, 금융 업무는 복수 개의 업무 그룹, 예를 들어 카드, 재무회계, 전자금융, 경영정보, 뱅킹 등으로 이루어지며, 각각의 업무 그룹의 업무는 해당 업무 시스템 그룹(10)에 의해 이루어진다.For reference, the financial service consists of a plurality of business groups, for example, a card, financial accounting, electronic finance, management information, banking, etc., and the tasks of the respective task groups are performed by the task system group 10.

또한 업무 시스템 그룹(10)은 복수 개의 업무 시스템(11)을 포함하고, 각각의 업무 시스템(11)은 다수의 호스트를 포함한다. 장애 이벤트 및 성능 정보는 호스트에서 발생될 수 있다. Also, the business system group 10 includes a plurality of business systems 11, and each business system 11 includes a plurality of hosts. Failure event and performance information may be generated at the host.

여기서, 장애 이벤트는 상기한 금융 업무를 수행하기 위해 마련된 서버, 네트워크, 보안, 통신, 백업 등에서 발생된 장애 사항이 모두 포함될 수 있다. Here, the failure event may include all of the faults generated in the server, the network, the security, the communication, the backup, and the like provided for performing the above-mentioned financial service.

또한, 성능정보는 호스트나 업무 시스템(11) 등의 성능에 대한 정보이다.The performance information is information on the performance of the host or the business system 11 and the like.

도 2 를 참조하면, 통합관제 서버(20)는 데이터 수집 모듈(21), 이벤트 이상징후 처리모듈(22) 및 이벤트 이상징후 처리모듈(23)을 포함한다.Referring to FIG. 2, the integrated control server 20 includes a data collection module 21, an event anomaly processing module 22, and an event anomaly processing module 23.

데이터 수집 모듈(21)은 서버 관리 시스템(SERVER MANANGEMENT SYSTEM;SMS)(미도시)과 연결되어 서버 관리 시스템으로부터 장애 이벤트를 수집한다. The data acquisition module 21 is connected to a server management system (SMS) (not shown) to collect fault events from the server management system.

또한 데이터 수집 모듈(21)은 통합관제 에이전트(미도시)로부터 단위 시스템, 예를 들어 호스트나 업무 시스템(11)으로부터 성능정보를 수집한다. The data collection module 21 also collects performance information from a unit system, for example, a host or business system 11, from an integrated control agent (not shown).

또한 데이터 수집 모듈(21)은 서버 관리 시스템으로부터 수집한 장애 이벤트를 이벤트 이상징후 처리모듈(22)에 입력하고, 통합관제 에이전트로부터 수집한 성능정보를 이벤트 이상징후 처리모듈 처리모듈(23)에 입력한다. The data acquisition module 21 also inputs the failure event collected from the server management system to the event anomaly notification processing module 22 and inputs the performance information collected from the integrated control agent to the event anomaly notification processing module processing module 23 do.

이벤트 이상징후 처리모듈(22)은 데이터 수집 모듈(21)로부터 입력된 장애 이벤트의 장애 이벤트의 조건을 조합하여 적어도 하나의 패턴을 생성하고, 이들 패턴 중 적어도 하나를 조합하여 이상징후 표출 방식에 대한 정책을 생성한 후, 이 정책에 따라 장애 이벤트 이상징후를 대시보드 모듈(30)을 통해 표출한다. The event abnormality symptom processing module 22 generates at least one pattern by combining the conditions of the fault event of the fault event input from the data collection module 21, and combines at least one of the patterns to generate an abnormality symptom After creating the policy, the dysfunction module 30 exposes a fault event abnormality according to this policy.

이벤트 이상징후 처리모듈(22)은 이벤트 패턴 생성부(221) 및 이벤트 정책 생성부(222)를 포함한다. The event abnormality symptom processing module 22 includes an event pattern generator 221 and an event policy generator 222.

도 3 을 참조하면, 이벤트 패턴 생성부(221)는 데이터 수집 모듈(21)로부터 입력된 장애 이벤트의 조건에 따라 적어도 하나의 패턴을 생성한다. Referring to FIG. 3, the event pattern generation unit 221 generates at least one pattern according to the condition of the fault event input from the data collection module 21. FIG.

먼저, 이벤트 패턴 생성부(221)는 장애 이벤트에 적합한 조건을 선별하고, 선별된 조건을 이용하여 적어도 하나 이상의 패턴을 정의한다. First, the event pattern generation unit 221 selects a condition suitable for a fault event, and defines at least one pattern using the selected condition.

장애 이벤트의 조건에는 장애 이벤트의 등급, 반복 횟수, 애플리케이션 종류, 메시지의 그룹, 메시지의 오브젝트, 이벤트가 발생된 발생 시각, 호스트 네임 및 장애 이벤트가 발생한 기간 중 적어도 하나 이상이 포함된다.The condition of the fault event includes at least one of the class of the fault event, the number of repetitions, the type of the application, the group of the message, the object of the message, the occurrence time at which the event occurred, the host name and the occurrence period of the fault event.

또한, 이벤트 패턴 생성부(221)는 기 설정된 과거 기간을 지정하고, 지정된 과거 기간 동안 발생된 장애 이벤트에 대해 해당 패턴을 시뮬레이션함으로써, 과거 기간 동안에 발생된 장애 이벤트가 검색되는지를 확인하고, 확인 결과에 따라 해당 패턴을 생성한다. In addition, the event pattern generator 221 designates a preset past period, and simulates the pattern for a fault event occurring during a specified past period to check whether a fault event occurred during the past period is searched, As shown in FIG.

이 경우, 이벤트 패턴 생성부(221)는 상기한 과정을 반복 수행하여 복수 개의 패턴을 생성할 수 있다. In this case, the event pattern generation unit 221 may repeat the above process to generate a plurality of patterns.

참고로, 본 실시예에서는 복수 개의 패턴을 생성하는 것을 예시로 설명하였으나, 본 발명의 기술적 범위는 1개의 패턴을 생성하는 것도 포함한다. For reference, in this embodiment, generation of a plurality of patterns has been described as an example, but the technical scope of the present invention also includes generating one pattern.

여기서, 과거 기간은 관리자에 의해 사전에 설정되거나 또는 현재 시점을 중심으로 자동으로 설정될 수 있다. Here, the past period may be preset by the administrator or automatically set around the current point of view.

예를 들어, 이벤트 패턴 생성부(221)는 설정 시간 동안 장애 이벤트가 반복된 횟수가 설정 횟수 이상이면 패턴1로 생성하고, 애플리케이션의 등급이 설정 등급 이상이면 패턴2로 생성하며, 오브젝트가 발생된 시각에 따라 패턴3으로 생성하며, 메시지의 호스트 네임에 따라 패턴4로 생성한다. 이외에도, 이벤트 패턴 생성부(221)는 2개의 이상의 장애 이벤트의 조건을 조합하여 하나의 패턴을 생성할 수도 있다. For example, the event pattern generator 221 generates a pattern 1 if the number of times the fault event is repeated over the set time is equal to or greater than the set number of times, generates the pattern 2 if the class of the application is equal to or higher than the set rating, Generates pattern 3 according to the time, and generates pattern 4 according to the host name of the message. In addition, the event pattern generation unit 221 may combine the conditions of two or more fault events to generate one pattern.

이벤트 정책 생성부(222)는 이벤트 패턴 생성부(221)에 의해 생성된 패턴을 조합하여 이상징후 표출 방식에 대한 정책을 생성한다. The event policy generating unit 222 combines the patterns generated by the event pattern generating unit 221 to generate a policy for an abnormal symptom display mode.

여기서, 정책은 이상징후를 표출하는 방식에 대한 정보로써, 조합된 패턴의 개수와 종류 등에 따라 다양하게 결정될 수 있다. 이 경우 패턴의 개수에 따라 그룹정책 또는 단일 정책으로 생성될 수 있다. Here, the policy is information on a method of expressing an abnormal symptom, and can be variously determined according to the number and type of combined patterns. In this case, it can be created as a group policy or a single policy depending on the number of patterns.

이에 따라 조합된 패턴의 개수 및 특성 등에 따라 장애 이벤트 이상징후가 다양한 형태로 제공될 수 있다. Accordingly, the abnormality event of the fault event can be provided in various forms depending on the number and characteristics of the combined patterns.

즉, 이벤트 정책 생성부(222)는 이벤트 패턴 생성부(221)에 의해 생성된 패턴들을 조합하여 정책을 정의한다. That is, the event policy generating unit 222 defines a policy by combining the patterns generated by the event pattern generating unit 221.

이어 이벤트 정책 생성부(222)는 정의된 정책을 상기한 과거 기간 동안 장애 이벤트에 대해 정책을 시뮬레이션하여 정책의 정확도를 검출하고, 이 정확도를 바탕으로 정책을 최종적으로 생성한다.Then, the event policy generation unit 222 detects the accuracy of the policy by simulating the policy for the failure event during the past period, and finally generates the policy based on the accuracy.

예를 들어, 이벤트 정책 생성부(222)는 패턴1과 패턴2 및 패턴3을 조합하여 그룹정책1을 생성하고, 패턴4와 패턴5를 조합하여 그룹정책2을 생성하며, 패턴3과 패턴5를 조합하여 그룹정책3을 생성할 수 있다. For example, the event policy generating unit 222 generates the group policy 1 by combining the pattern 1, the pattern 2, and the pattern 3, generates the group policy 2 by combining the pattern 4 and the pattern 5, Can be combined to create Group Policy 3.

이어 이벤트 정책 생성부(222)는 상기한 바와 같이 생성한 정책에 따라 이상징후를 대시보드 모듈(30)을 통해 다양한 방식으로 표출한다. 장애 이벤트 이상징후는 영상 및 음성으로 표출될 수 있다. The event policy creating unit 222 exposes an abnormal symptom in various ways through the dashboard module 30 according to the policy generated as described above. Failure event anomalies can be visualized and visualized.

대시보드 모듈(30)을 통해 표출되는 장애 이벤트 이상징후 정보는 장애 이벤트에 대한 해당 업무 시스템(11)의 연관 정보가 포함될 수 있다. The failure event abnormality indication information displayed through the dashboard module 30 may include association information of the corresponding business system 11 with respect to the failure event.

연관 정보에는 서버자원 현황, DB(DataBase)자원 현황, WAS(Web Application Server)자원 현황, TP(Transaction Processing)자원 현황, 시스템작업 현황, 프로그램작업 현황, 업무 시스템 구성도, 토폴로지, 이벤트 이력 및 장애 이력 중 적어도 하나가 포함될 수 있다. Related information includes server resource status, DB (database) resource status, WAS (Web Application Server) resource status, TP (Transaction Processing) resource status, system operation status, program operation status, business system configuration diagram, topology, At least one of history may be included.

즉, 이벤트 이상징후 처리모듈(22)은 데이터 수집 모듈(21)에 의해 상대적으로 장애 등급이 낮은 장애 이벤트가 수집되면, 수집된 장애 이벤트의 장애 이벤트의 조건을 통해 적어도 하나의 패턴을 생성한다. 이어 이벤트 이상징후 처리모듈(22)은 적어도 하나의 패턴을 이용하여 이상징후를 표시할 정책을 생성한 후, 이 정책에 따라 대시보드 모듈(30)을 통해 장애 이벤트 이상징후를 표출한다. 따라서, 상대적으로 낮은 장애 등급의 장애 이벤트가 발생하더라도 해당 연관 정보를 표출함으로써, 관리자가 장애 이벤트의 영향도 및 이벤트 장애의 원인을 분석하고 장애 이벤트에 대처할 수 있도록 한다. That is, when the fault event having a relatively low fault class is collected by the data collection module 21, the event fault symptom processing module 22 generates at least one pattern through the condition of the fault event of the collected fault event. The event abnormality symptom processing module 22 generates a policy for displaying an abnormality symptom using at least one pattern, and then displays a fault abnormality abnormality through the dashboard module 30 according to the policy. Accordingly, even if a failure event of a relatively low failure class occurs, the association information is displayed so that the administrator can analyze the cause of the failure event and the event failure and cope with the failure event.

성능 이상징후 처리모듈(23)은 단위 시스템 등으로부터 데이터 수집모듈을 통해 수집된 성능정보의 성능 수치를 체크하고, 체크된 성능정보의 성능 수치를 해당 단위 시스템에 기 설정된 임계 범위와 비교하여 비교 결과에 따라 성능정보의 조건을 조합하여 적어도 하나의 패턴을 생성하며, 이들 패턴 중 적어도 하나를 조합하여 성능 이상징후 처리모듈 표출 방식에 대한 정책을 생성한 후, 이 정책에 따라 성능 이상징후를 대시보드 모듈(30)을 통해 표출한다. The performance abnormality symptom processing module 23 checks the performance numbers of the performance information collected through the data collection module from the unit system and compares the performance numbers of the checked performance information with predetermined threshold ranges in the corresponding unit system, And generates at least one pattern by combining the conditions of the performance information according to the performance information. Then, at least one of the patterns is combined to generate a policy for the performance abnormality symptom processing module exposing method, Through the module (30).

아울러, 임계 범위는 여러 등급으로 구성할 수 있으며, 장애나 에러 등이 발생된 것으로 유추할 수 있는 값으로써, 상기한 업무 시스템별로 각각 설정된다. 따라서, 성능정보의 성능 수치가 임계범위를 벗어나면 해당 단위 시스템에 장애 등이 발생된 것으로 유추할 수 있다. In addition, the critical range can be composed of several grades, and it can be deduced that a fault or an error has occurred, and is set for each of the above-mentioned business systems. Therefore, if the performance value of the performance information deviates from the critical range, it can be deduced that the failure occurs in the unit system.

즉, 성능 이상징후 처리모듈(23)은 데이터 수집 모듈(21)로부터 입력된 성능정보의 성능 수치를 체크하고, 체크된 성능정보의 성능 수치가 기 설정된 임계범위를 벗어나면, 성능정보의 조건을 조합하여 적어도 하나의 패턴을 생성하며, 이들 패턴 중 적어도 하나를 조합하여 성능 이상징후 처리모듈 표출 방식에 대한 정책을 생성한 후, 이 정책에 따라 성능 이상징후를 대시보드 모듈(30)을 통해 표출한다. That is, the performance abnormality symptom processing module 23 checks the performance numerical value of the performance information inputted from the data acquisition module 21, and when the performance numerical value of the checked performance information is out of the predetermined threshold range, At least one of the patterns is combined to generate a policy for the performance abnormality symptom processing module display method by combining at least one of the patterns, and then a performance abnormality indication is displayed through the dashboard module 30 according to the policy do.

성능 이상징후 처리모듈(23)은 성능 신규패턴 생성부(231) 및 성능 정책 생성부(232)를 포함한다. The performance abnormality symptom processing module 23 includes a performance new pattern generator 231 and a performance policy generator 232.

성능 패턴 생성은 과거의 성능 정보를 기준으로 통계적 방법을 사용하여 기준 성능 정보를 기반으로 임계 범위가 설정되며 기준 임계 범위이 벗어나는 지 확인하는데 사용된다. 성능 그룹 정책은 다수의 성능 패턴을 조합하여 구성한다.The performance pattern generation is used to check whether the threshold range is set based on the reference performance information using the statistical method based on the past performance information and whether the reference threshold range is out of range. Performance group policies consist of a combination of multiple performance patterns.

성능 신규패턴 생성부(231)는 데이터 수집모듈을 통해 각 단위 시스템으로부터 성능정보가 수집될 때마다 각 단위 시스템별로 성능정보의 성능 수치를 체크한다. The performance new pattern generation unit 231 checks performance numbers of performance information for each unit system whenever performance information is collected from each unit system through the data collection module.

성능 신규패턴 생성부(231)는 성능정보의 성능 수치를 해당 단위 시스템에 기 설정된 임계범위와 비교하고, 비교 결과 성능정보의 성능 수치가 임계 범위를 벗어나면 성능정보의 조건을 조합하여 적어도 하나의 패턴을 정의한다. The performance new pattern generation unit 231 compares the performance value of the performance information with a preset threshold range in the corresponding unit system, and when the performance value of the performance information is out of the threshold range, Define a pattern.

여기서, 임계범위는 시간/요일 등을 기준으로 시간대(5분)별로 설정될 수 있으며, 과거의 성능 정보를 기준으로 한 통계적 방법을 사용하여 설정된다. 이 성능 수치가 임계범위를 벗어나면 해당 성능 정보를 기반으로 하나의 패턴이 정의될 수 있다.Here, the threshold range can be set for each time period (5 minutes) based on time / day, etc., and is set using a statistical method based on past performance information. If this performance value falls outside the critical range, one pattern can be defined based on the performance information.

즉, 도 4 에 도시된 바와 같이, 성능 신규패턴 생성부(231)는 각 단위 시스템별 성능정보의 성능 수치를 체크하고, 체크된 성능정보의 성능 수치가 해당 단위 시스템에 기 설정된 임계범위를 벗어나면 해당 성능 정보에 대응되는 패턴을 정의한다. 4, the performance new pattern generator 231 checks the performance numbers of the performance information for each unit system, and when the performance numbers of the checked performance information exceed the predetermined threshold range in the unit system A pattern corresponding to the performance information is defined.

이 경우, 성능 패턴 생성부(231)는 성능정보에 적합한 조건을 선별하고, 선별된 조건을 이용하여 적어도 하나 이상의 패턴을 정의한다. 이어 성능 패턴 생성부(231)는 기 설정된 과거 기간을 지정하고, 지정된 과거 기간 동안의 성능 정보에 대해 해당 패턴을 시뮬레이션함으로써, 과거 기간 동안의 성능 정보가 검색되는지를 확인하고, 확인 결과에 따라 해당 패턴을 생성한다. In this case, the performance pattern generator 231 selects a condition suitable for the performance information, and defines at least one pattern using the selected condition. Then, the performance pattern generator 231 specifies a preset past period, simulates the pattern for the performance information for the specified past period to check whether the performance information for the past period is searched, Create a pattern.

이 경우, 성능 패턴 생성부(231)는 상기한 과정을 반복 수행하여 복수 개의 패턴을 생성할 수 있다. 참고로, 본 실시예에서는 복수 개의 패턴을 생성하는 것을 예시로 설명하였으나, 본 발명의 기술적 범위는 1개의 패턴을 생성하는 것도 포함한다. In this case, the performance pattern generator 231 may repeat the above-described process to generate a plurality of patterns. For reference, in this embodiment, generation of a plurality of patterns has been described as an example, but the technical scope of the present invention also includes generating one pattern.

한편, 상기한 바와 같이 복수 개의 패턴이 생성되면, 성능 정책 생성부(232)는 상기한 바와 같이 성능 패턴 생성부(221)에 의해 생성된 패턴을 조합하여 성능 이상징후 표출 방식에 대한 정책을 생성한다. 즉, 성능 정책 생성부(232)는 생성한 정책을 상기한 과거 기간 동안의 성능 정보에 대해 해당 정책을 시뮬레이션하여 정책의 정확도를 검출하고, 이 정확도를 바탕으로 정책을 최종 생성한다. Meanwhile, when a plurality of patterns are generated as described above, the performance policy generator 232 combines the patterns generated by the performance pattern generator 221 as described above to generate a policy for the performance abnormality indication scheme do. That is, the performance policy generation unit 232 detects the accuracy of the policy by simulating the generated policy with respect to the performance information for the past period, and finally generates the policy based on the accuracy.

여기서, 정책은 성능 이상징후를 표출하는 방식을 생성한 것으로써, 조합된 패턴의 개수와 종류 등에 따라 다양하게 생성될 수 있다. 이 경우 패턴의 개수에 따라 그룹정책 또는 단일 정책으로 생성될 수 있다. Here, the policy is a method of expressing a performance abnormality indication, and can be variously generated according to the number and type of combined patterns. In this case, it can be created as a group policy or a single policy depending on the number of patterns.

이에 따라 조합된 패턴의 개수 및 특성 등에 따라 성능 이상징후가 다양한 형태로 제공될 수 있다. Accordingly, the performance abnormality indication may be provided in various forms depending on the number and characteristics of the combined patterns.

대시보드 모듈(30)은 통합관제 서버(20)로부터 전달된 정보를 표출한다. 즉, 대시보드 모듈(30)은 통합관제 서버(20)로부터 장애 이벤트 이상징후 또는 성능 이상징후가 전달되면, 장애 이벤트 이상징후 또는 성능 이상징후에 대한 화면으로 즉각적으로 전환한다. The dashboard module 30 exposes the information transmitted from the integrated control server 20. That is, the dashboard module 30 immediately switches from the integrated control server 20 to the screen for a fault event abnormality or a performance abnormality indication if a fault event abnormality or performance abnormality indication is transmitted.

이 경우, 대시보드 모듈(30)은 해당 관리자가 장애 이벤트 이상징후 또는 성능 이상징후에 대한 원인을 찾아 해결할 수 있도록 하기 위해, 장애가 발생한 업무 시스템(11)의 연관 정보를 표출한다. In this case, the dashboard module 30 displays association information of the failed business system 11 in order to allow the manager to find out the cause of the failure event abnormality or the performance abnormality symptom.

도 5 를 참조하면, 대시보드 모듈(30)은 이상징후 검출 타임 라인, 이상징후에 대한 검출 결과, 버튼(토폴로지 선택 버튼, 구성도 선택 버튼, 작업계획서 선택 버튼, 이벤트 이력 선택 버튼, 장애 이력 선택 버튼), 성능정보를 한 화면에서 즉시 확인할 수 있는 화면을 구성한다. Referring to FIG. 5, the dashboard module 30 includes an abnormality detection detection time line, a detection result of an abnormality symptom, a button (topology selection button, a configuration diagram selection button, a task plan selection button, an event history selection button, Button), and a screen for confirming performance information immediately on one screen.

즉, 대시보드 모듈(30)은 이상징후 발생 시 관련된 모든 연관 정보를 통합하여 볼 수 있는 화면을 제공함으로써, 이상징후가 발생된 업무시스템 중심으로 이벤트, 성능정보 및 주요정보 표현하고, 장애가 발생한 구성요소와 이에 영향을 받는 업무 시스템에 대한 정보, 담당자 및 구성정보(Configuration Information;CI) 등을 표시한다. In other words, the dashboard module 30 can display events, performance information, and key information based on the business system in which anomalous indications are generated by providing a screen in which all related information can be integrated when the anomalous indications occur, Information about the element and the affected business system, the person in charge and the configuration information (CI).

도 6 에는 주요 대한 토폴로지 정보가 도시되었는데, 대시보드 모듈은 상기한 도 5 에 도시된 토폴로지가 관리자에 의해 선택되면, 해당 연관 정보 중 토폴로지를 표출한다. FIG. 6 shows main topology information. When the topology shown in FIG. 5 is selected by the administrator, the dashboard module displays the topology among the related information.

도 6 을 참조하면, 대시보드 모듈(30)은 관리자에 의해 업무, 업무시스템, 담당자, 구성정보 중 어느 하나가 선택(더블 클릭)되면 해당 객체를 중심으로 새로운 토폴로지를 다시 유지하며, 이 경우 선택된 객체에 대한 상세 정보가 표출된다.Referring to FIG. 6, the dashboard module 30 maintains a new topology around a corresponding object when any one of a task, a task system, a contact person, and configuration information is selected (double clicked) by an administrator, Detailed information about the object is displayed.

이를 바탕으로 관리자는 이상징후 발생시 업무 영향도나 담당자 등을 확인할 수 있게 된다. Based on this, the manager can check the influence of the business or the person in charge when the abnormal symptom occurs.

토폴로지에는 상기한 바와 같이 업무시스템, 업무, 구성CI 및 업무 담당자가 포함된다. 업무 시스템은 해당 업무시스템 중심으로 한 업무과 구성CI를 포함하고, 업무는 해당 업무 중심으로 업무시스템과 업무담당자를 포함하며, 구성CI는 해당 구성CI를 중심으로 업무시스템과 구성CI 담당자를 포함되며, 업무담당자는 해당 업무 담당자 중심으로 업무정보를 포함한다. The topology includes the business systems, tasks, configuration CI, and personnel responsible for the tasks, as described above. The business system includes the task and configuration CI based on the business system, the business includes the business system and the person in charge of the task based on the business, and the configuration CI includes the business system and the constituent CI person, The person in charge of the task includes the task information mainly in charge of the task person in charge.

한편, 도 7 에는 관리자가 지정한 과거 시점에서의 이상징후에 대한 연관 정보가 도시되었다. 여기서, 과거 시점은 관리자가 임의로 선택할 수 있으며, 이에 관리자는 현재 시점과 과거 시점 각각에 대한 연관 정보를 비교할 수 있으므로, 현재의 이상징후에 대한 대처를 더욱 효과적으로 수행할 수 있게 된다. Meanwhile, FIG. 7 shows the association information of the abnormal symptom at the past time point designated by the administrator. Here, the past time point can be arbitrarily selected by the administrator, so that the manager can compare the association information for each of the current time point and the past time point, thereby making it possible to more effectively cope with the current abnormal point of view.

이하 본 발명의 일 실시예에 따른 통합 관제 방법을 내지 도 10 을 참조하여 상세하게 설명한다. Hereinafter, an integrated control method according to an embodiment of the present invention will be described in detail with reference to FIGS.

도 8 은 본 발명의 일 실시예에 따른 통합 관제 방법을 도시한 순서도이다.8 is a flowchart illustrating an integrated control method according to an embodiment of the present invention.

도 8 을 참조하면, 통합관제 서버(20)는 서버 관리 시스템으로부터 장애 이벤트를 수집한다(S10).Referring to FIG. 8, the integrated control server 20 collects fault events from the server management system (S10).

이어 통합관제 서버(20)는 장애 이벤트의 장애 이벤트의 조건에 따라 적어도 하나의 패턴을 정의한다(S20). 여기서, 장애 이벤트의 조건에는 장애 이벤트가 설정 기간 동안 반복된 횟수, 애플리케이션 종류, 메시지의 그룹, 메시지의 오브젝트, 이벤트의 등급, 이벤트가 발생된 시각, 호스트 네임 중 적어도 하나 이상이 포함될 수 있다. The integrated control server 20 then defines at least one pattern according to the condition of the fault event of the fault event (S20). Here, the condition of the fault event may include at least one of the number of times the fault event is repeated during the set period, the type of the application, the group of the message, the object of the message, the class of the event, the time when the event occurred, and the host name.

장애 이벤트의 조건을 토대로 패턴을 정의한 후에는, 통합관제 서버(20)는 패턴을 조합하여 이상징후 표출 방식에 대한 정책을 생성한다(S30). 이 경우 통합관제 서버(20)는 패턴의 개수에 따라 정책을 그룹정책 또는 단일정책 중 어느 하나로 생성한다. After defining the pattern based on the condition of the fault event, the integrated control server 20 combines the patterns to create a policy for the anomaly indication system (S30). In this case, the integrated control server 20 generates the policy as either a group policy or a single policy according to the number of patterns.

예를 들어, 통합관제 서버(20)는 복수 개의 패턴을 조합하여 정책을 그룹정책으로 생성하고, 하나의 패턴을 이용하여 정책을 단일정책으로 생성한다.For example, the integrated control server 20 creates a policy as a group policy by combining a plurality of patterns, and creates a policy as a single policy using one pattern.

이어 통합관제 서버(20)는 상기한 바와 같이 생성한 정책에 따라 이상징후를 대시보드 모듈(30)을 통해 영상 및 음성으로 표출한다(S40).In step S40, the integrated control server 20 exposes an abnormal symptom to the user through the dashboard module 30 in accordance with the policy created as described above.

이때, 대시보드 모듈(30)을 통해 표출되는 이상징후는 장애 이벤트 정보 및 연관 정보가 포함될 수 있다. At this time, the abnormal symptom displayed through the dashboard module 30 may include the failure event information and the association information.

장애 이벤트 정보에는 장애 이벤트가 발생한 호스트나 업무 시스템(11)의 상태나 또는 장애 이벤트 등급 등과 같이 장애 이벤트에 대한 다양한 정보가 포함될 수 있다. The failure event information may include various information about the failure event, such as the status of the host or business system 11 in which the failure event occurred, or the failure event class.

통합관제 서버(20)는 상대적으로 낮은 장애 등급의 장애 이벤트가 검출하더라도 이 장애 이벤트를 토대로 연관 정보를 표출함으로써, 관리자가 장애 이벤트의 영향도 및 이벤트 장애의 원인을 분석하고 장애 이벤트에 대처할 수 있도록 한다. Even if a failure event of a relatively low failure grade is detected, the integrated control server 20 displays the association information based on the failure event so that the administrator can analyze the cause of the failure event and the event failure, do.

또한 통합관제 서버(20)는 단위 시스템 각각으로부터 성능정보를 수집한다(S50).The integrated control server 20 also collects performance information from each of the unit systems (S50).

통합관제 서버(20)는 각 단위 시스템으로부터 성능정보가 수집될 때마다 각 단위 시스템별로 성능정보의 성능 수치를 체크하고, 체크된 성능정보의 성능 수치를 해당 단위 시스템에 기 설정된 임계범위와 비교한다.The integrated control server 20 checks performance numbers of performance information for each unit system whenever performance information is collected from each unit system and compares the performance numbers of the checked performance information with preset threshold ranges in the corresponding unit system .

이때, 체크된 성능정보의 성능 수치가 기 설정된 임계범위를 벗어나면, 통합관제 서버(20)는 성능정보의 조건을 조합하여 적어도 하나의 패턴을 생성한다(S60).At this time, if the performance value of the checked performance information is out of the predetermined threshold range, the integrated control server 20 generates at least one pattern by combining the conditions of the performance information (S60).

이어 통합관제 서버(20)는 이들 패턴 중 적어도 하나를 조합하여 성능 이상징후 표출 방식에 대한 정책을 결정(S70)한 후, 이 정책에 따라 성능 이상징후를 대시보드 모듈(30)을 통해 표출한다(S80). Then, the integrated control server 20 determines at least one of these patterns to determine the policy for the performance abnormality indication mode (S70), and then displays the performance abnormality indication through the dashboard module 30 according to this policy (S80).

이하, 통합관제 서버(20)가 장애 이벤트 및 성능 정보를 기반으로 각각의 정책을 생성하는 과정을 도 9 및 도 10 을 참조하여 구체적으로 설명한다. Hereinafter, a process in which the integrated control server 20 generates the respective policies based on the fault event and performance information will be described in detail with reference to FIGS. 9 and 10. FIG.

도 9 는 본 발명의 일 실시예에 따른 장애 이벤트 기반의 정책 생성 과정을 나타낸 순서도이다. FIG. 9 is a flowchart illustrating a fault generation process based on a fault event according to an embodiment of the present invention. Referring to FIG.

도 9 를 참조하면, 이벤트 패턴 생성부(221)는 장애 이벤트가 발생되면 해당 장애 이벤트에 적합한 조건을 선별(S110)하고, 선별된 조건을 이용하여 적어도 하나 이상의 패턴을 정의한다(S120). Referring to FIG. 9, when a fault event occurs, the event pattern generator 221 selects a condition suitable for the fault event (S110) and defines at least one pattern using the selected condition (S120).

이어 이벤트 패턴 생성부(221)는 기 설정된 과거 기간을 지정하고, 지정된 과거 기간 동안 발생된 장애 이벤트에 대해 해당 패턴을 시뮬레이션함(S130)으로써, 과거 기간 동안에 발생된 장애 이벤트가 검색되는지를 확인하고, 확인 결과에 따라 해당 패턴을 생성한다. Then, the event pattern generation unit 221 designates a predetermined past period, and simulates a pattern corresponding to a fault event occurring during a specified past period (S130), thereby checking whether a fault event occurred during the past period is searched , And generates the pattern according to the result of the check.

특히, 이벤트 패턴 생성부(221)는 상기한 바와 같이 패턴을 생성한 후에는 장애 이벤트가 발생된 시점의 장애 이벤트의 조건을 다시 선별하여 상기한 과정을 다시 수행하여 새로운 패턴을 생성하는 과정을 반복함으로써, 복수 개의 패턴을 생성한다(S140).In particular, after generating the pattern as described above, the event pattern generator 221 re-selects the condition of the fault event at the time of occurrence of the fault event and repeats the above process to generate a new pattern Thereby generating a plurality of patterns (S140).

한편, 상기한 바와 같이 복수 개의 패턴이 생성되면, 이벤트 정책 생성부(222)는 상기한 바와 같이 생성된 패턴들을 조합하여 정책을 정의한다(S150). Meanwhile, when a plurality of patterns are generated as described above, the event policy generating unit 222 defines the policies by combining the patterns generated as described above (S150).

이어 이벤트 정책 생성부(222)는 생성한 정책을 상기한 과거 기간 동안 장애 이벤트에 대해 정책을 시뮬레이션(S160)하여 정책의 정확도를 검출하고, 이를 바탕으로 정책을 최종 생성한다(S170).Then, the event policy generating unit 222 simulates the generated policy for the fault event during the past period (S160), detects the accuracy of the policy, and finally generates the policy based on the detected policy (S170).

도 10 은 본 발명의 일 실시예에 따른 성능 정보 기반의 정책 생성 과정을 나타낸 순서도이다.10 is a flowchart illustrating a process of generating a performance information based policy according to an embodiment of the present invention.

도 10 을 참조하면, 성능 패턴 생성부(231)는 성능정보에 적합한 조건을 선별(S210)하고, 선별된 조건을 이용하여 적어도 하나 이상의 패턴을 정의한다(S220). Referring to FIG. 10, the performance pattern generator 231 selects a condition suitable for performance information (S210) and defines at least one pattern using the selected condition (S220).

이어 성능 패턴 생성부(231)는 기 설정된 과거 기간을 지정하고, 지정된 과거 기간 동안의 성능 정보에 대해 해당 패턴을 시뮬레이션(S230)함으로써, 과거 기간 동안의 성능 정보가 검색되는지를 확인하고, 확인 결과에 따라 해당 패턴을 생성한다. Then, the performance pattern generation unit 231 designates a predetermined past period, and simulates the pattern for the performance information for the specified past period (S230), thereby confirming whether the performance information for the past period is searched, As shown in FIG.

특히, 성능 패턴 생성부(232)는 상기한 바와 같이 패턴을 생성한 후에는 성능 정보가 수집된 시점의 성능 정보의 조건을 다시 선별하여 상기한 과정을 다시 수행하여 새로운 패턴을 생성하는 과정을 반복함으로써, 복수 개의 패턴을 생성한다(S240). Particularly, after generating the pattern as described above, the performance pattern generator 232 re-selects the condition of the performance information at the time when the performance information is collected, and repeats the above process to generate a new pattern Thereby generating a plurality of patterns (S240).

한편, 상기한 바와 같이 복수 개의 패턴이 생성되면, 성능 정책 생성부(232)는 상기한 바와 같이 생성된 패턴들을 조합하여 정책을 정의한다(S250). Meanwhile, when a plurality of patterns are generated as described above, the performance policy generating unit 232 defines the policies by combining the patterns generated as described above (S250).

이어 성능 정책 생성부(232)는 생성한 정책을 상기한 과거 기간 동안의 성능 정보에 대해 해당 정책을 시뮬레이션(S260)하여 정책의 정확도를 검출하고, 이를 바탕으로 정책을 최종 생성한다(S270).Then, the performance policy generating unit 232 detects the policy accuracy by simulating the generated policy with performance information for the past period (S260), and finally generates a policy based on the detected policy accuracy (S270).

이와 같이 본 발명의 일 실시예에 따른 통합 관제 방법은 업무 시스템(11)의 장애 이벤트 정보를 분석하여 이상징후 정보를 검출하고 검출된 이상징후 정보를 대시보드 모듈(30)을 통해 실시간으로 표출함으로써, 관리자가 장애의 영향도를 파악하고 장애의 원인을 손쉽게 분석할 수 있도록 한다.As described above, the integrated control method according to an embodiment of the present invention analyzes failure event information of the business system 11 to detect anomalous symptom information and display the detected anomalous symptom information through the dashboard module 30 in real time , The administrator can identify the impact of the failure and easily analyze the cause of the failure.

또한 본 발명의 일 실시예에 따른 통합 관제 방법은 업무 시스템(11)의 장애 이벤트 정보에서 장애 이벤트의 조건을 토대로 이상징후 정보를 검출하고 대시보드 모듈(30)을 통해 표출함으로써 관리자가 업무 시스템(11)의 장애에 더욱 능동적으로 대처할 수 있도록 한다.In addition, the integrated control method according to an embodiment of the present invention detects abnormality symptom information based on the condition of the fault event in the fault event information of the business system 11 and expresses the abnormality symptom information through the dashboard module 30, 11) to be able to cope with the disorder more actively.

게다가 본 발명의 일 실시예에 따른 통합 관제 방법은 업무 시스템(11) 중 장애가 발생한 업무 시스템(11)이 소속된 그룹 전체의 서버자원 현황, DB자원 현황, WAS 자원 현황, TP 자원 현황, 시스템작업 현황, 프로그램작업 현황, 업무 시스템 구성도 등을 통합하여 화면을 총체적으로 구성함으로써, 관리자가 장애의 원인을 더욱 용이하게 해결할 수 있도록 한다.In addition, the integrated control method according to an embodiment of the present invention is a method for managing a total of a server resource status, a DB resource status, a WAS resource status, a TP resource status, and a system operation status of the entire group to which the failed business system 11 belongs among the business systems 11 By integrating the status, program operation status, and work system configuration diagram, the screen is configured as a whole so that the administrator can more easily solve the cause of the trouble.

본 발명은 도면에 도시된 실시예를 참고로 하여 설명되었으나, 이는 예시적인 것에 불과하며 당해 기술이 속하는 기술분야에서 통상의 지식을 가진 자라면 이로부터 다양한 변형 및 균등한 타 실시예가 가능하다는 점을 이해할 것이다. 따라서, 본 발명의 진정한 기술적 보호범위는 아래의 특허청구범위에 의하여 정해져야할 것이다.While the present invention has been particularly shown and described with reference to exemplary embodiments thereof, it is clearly understood that the same is by way of illustration and example only and is not to be taken by way of limitation, I will understand. Accordingly, the true scope of the present invention should be determined by the following claims.

10: 업무 시스템 그룹
11: 업무 시스템
20: 통합관제 서버
21: 데이터 수집 모듈
22: 이벤트 이상징후 처리모듈
221: 이벤트 패턴 생성부
222: 이벤트 정책 생성부
23: 성능 이상징후 처리모듈
231: 성능정보 카운터부
232: 성능 정책 생성부
30: 대시보드 모듈10: Business system group
11: Business system
20: Integrated control server
21: Data acquisition module
22: Event Notification Signaling Module
221: Event pattern generation unit
222: Event policy generation unit
23: Performance Error Signaling Module
231: Performance information counter section
232: Performance policy generation unit
30: Dashboard Module

Claims

Collecting control information of the business system by the integrated control server;
Analyzing the control information by the integrated control server to generate a pattern of the control information;
The integrated control server combining at least one pattern to generate a policy for an anomaly indication mode; And
And the integrated control server displays the abnormal symptom through the dashboard module according to the policy on the abnormal symptom display method.

The integrated control method according to claim 1, wherein the control information includes a failure event for a failure condition of the business system and performance information on the performance of the business system.

3. The method according to claim 2, wherein in the step of generating the pattern for the control information,
If the control information is a failure event, the integrated control server selects a condition of the fault event, defines a pattern according to the selected condition, simulates a pattern of a fault event generated during a predetermined past period, And a pattern is finally generated in accordance with the result of the check.

4. The method of claim 3, wherein the condition of the fault event includes at least one of a class of a fault event, a repetition frequency, an application type, a group of messages, an object of a message, a generation time at which an event is generated, And an integrated control method.

4. The method according to claim 3, wherein, in the step of generating the policy for the abnormality symptom display method,
The integrated control server defines a policy by combining at least one pattern, detects the accuracy of the policy by simulating the defined policy for a failure event for a predetermined past period, and finally generates a policy based on the detected accuracy And a control unit for controlling the integrated control unit.

3. The method according to claim 2, wherein in the step of generating the pattern for the control information,
If the management information is performance information, the integrated control server selects at least one pattern by selecting a condition suitable for the performance information, using the selected conditions, simulates a pattern for performance information for a predetermined past period, The performance information is searched for, and the pattern is finally generated according to the result of the verification.

The integrated control server according to claim 6, wherein the integrated control server checks a performance value of the performance information and generates a pattern corresponding to the performance information if the performance value of the checked performance information is out of a predetermined threshold range in the business system .

The method according to claim 6, wherein, in the step of generating the policy for the abnormality symptom display method,
The integrated control server defines a policy by combining at least one pattern, simulates the created policy against performance information for a predetermined past period, detects the accuracy of the policy, and then, based on the detected accuracy, Wherein the method comprises:

2. The method of claim 1, wherein in creating the policy,
Wherein the integrated control server generates at least one group policy or a single policy according to the number of patterns.

The method according to claim 1, wherein in the step of displaying the abnormal symptom through the dashboard module,
The dashboard module displays an abnormality symptom detection timeline, a detection result of an abnormality symptom, a topology selection button, a configuration diagram selection button, a work plan selection button, an event history selection button, a fault history selection button, And a control unit for controlling the integrated control unit.