KR101939370B1 - System for managing integrated failure and method thereof - Google Patents

System for managing integrated failure and method thereof Download PDF

Info

Publication number
KR101939370B1
KR101939370B1 KR1020150097351A KR20150097351A KR101939370B1 KR 101939370 B1 KR101939370 B1 KR 101939370B1 KR 1020150097351 A KR1020150097351 A KR 1020150097351A KR 20150097351 A KR20150097351 A KR 20150097351A KR 101939370 B1 KR101939370 B1 KR 101939370B1
Authority
KR
South Korea
Prior art keywords
event
propagation
fault
management unit
failure
Prior art date
Application number
KR1020150097351A
Other languages
Korean (ko)
Other versions
KR20170006552A (en
Inventor
성종규
김종우
이영우
Original Assignee
주식회사 케이티
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 주식회사 케이티 filed Critical 주식회사 케이티
Priority to KR1020150097351A priority Critical patent/KR101939370B1/en
Publication of KR20170006552A publication Critical patent/KR20170006552A/en
Application granted granted Critical
Publication of KR101939370B1 publication Critical patent/KR101939370B1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0631Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • H04L41/065Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis involving logical or physical relationship, e.g. grouping and hierarchies
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0654Management of faults, events, alarms or notifications using network fault recovery
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/069Management of faults, events, alarms or notifications using logs of notifications; Post-processing of notifications

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Telephonic Communication Services (AREA)

Abstract

An integrated fault management system and method for managing the same are disclosed.
The system includes a fault management unit for receiving fault information about a network including at least one network to analyze the cause of the fault and analyzing the influence of the fault, and generating and managing a fault event according to the analysis result. And a situation propagation manager for generating and managing a situation propagation event when a situation propagation is required by analyzing a fault event generated by the fault management unit. Here, the situation propagation management unit selects a group corresponding to the status propagation event among the status propagation groups, propagates the status propagation event, receives a fail-over result from the outgoing worker in the status propagation group, And terminates the process.

Description

BACKGROUND OF THE INVENTION 1. Field of the Invention [0001] The present invention relates to an integrated fault management system,

The present invention relates to an integrated fault management system and a management method thereof.

In the case of a network operator or a company with a single network, a network operator monitors network monitoring using a network management system (NMS). In addition, depending on the network, there are various NMSs such as wired / wireless depending on IP / transmission / exchange media. These NMSs are operated to manage the operation of the nationwide or regional area in the centralized center and regional operation rooms

In the area of operational management, fault handling is a task area directly related to the survival of the service, and priority is given to rapid processing. For this, the propagation of the situation from the perception of the disability to the concerned person is an important factor directly connected with the service disconnection time.

In the case of a fault, one person develops and uses an NMS to operate a large number of equipments. In this NMS, a fault alarm window or a topology map map for a network such as that shown in a traffic control center Express it visually. Alternatively, there is a method of notifying an alarm sound or an alarm history to a previously stored sound source.

However, for this purpose, the operator should always monitor the system in front of the system. However, there is a problem in that it is impossible to recognize a fault in a business place other than an outsider or a situation room.

In the case of the fault situation propagation and the worker demanding, after the fault is recognized, the fault handling work must be handled by the operator directly or the operator of the site should be arranged. At this time, rapid and accurate situation propagation is the main factor. To this end, many operation management systems call people concerned by telephone or SMS text messaging.

However, in this method, it takes time to grasp the related persons at the same time and to select actual workers, and there is also a difficulty in accurately conveying the contents. In addition, there is a problem that it is more difficult after the work hour such as at night and there is a problem that it is unknown to whom to select because the inside worker can not grasp the work progress of the worker.

In the case of precise recognition of the working environment, the worker is currently going to the site based on the information of the company or the customer information which the operator knows in advance. However, after the field worker is selected, the performance level of the work depends on the degree of mastery of the worker's facilities, but not everyone may be a skilled person and may memorize all facility information such as location and equipment model none. Further, there is a problem in that it is impossible to grasp the state of the worker whether the worker has arrived at the site or is moving.

Finally, in the case of recognition of work results and summary of work results, after the worker completes the work, the worker communicates with the employee by telephone and confirms whether the result of the work is reflected in the system such as NMS. .

However, in such a system, the immediacy of recognition of the outcome of the field work of the outsourcer is lowered, the oral communication may be insufficient, and the internal worker can check the result of the work action such as photographs There is a problem.

SUMMARY OF THE INVENTION The present invention provides an integrated fault management system and a method for managing the same, wherein a fault processing speed is increased due to wired / wireless situation propagation.

An integrated fault management system according to an aspect of the present invention includes:

A failure management unit for receiving failure information about a network including at least one network to analyze the effect of the failure cause and the service, and to generate and manage the failure event according to the analysis result; And a status propagation management unit for managing and managing a plurality of operators capable of fail-over by classifying them into a plurality of types of situation propagation groups, analyzing fault events generated by the fault management unit, Wherein the status propagation management unit selects a group corresponding to the status propagation event among the status propagation groups and propagates the status propagation event, receives a failover result from the outgoing worker in the status propagation group, And terminates the event.

Here, the situation propagation management unit propagates the status propagation event through the smartphone application of the dispatching worker, and the dispatching worker checks the status propagation event propagated through the smartphone application, And transmits it to the situation propagation management unit through the phone application.

An additional function management unit for managing authority of an administrator and an operator for each of the status propagation groups, performing management for propagating the status of a nationwide unit when a large failure occurs, and managing data related to the failure event and the status propagation event; And a job management unit for performing job creation and deletion management, performing schedule management for the job by each status propagation group, and performing job history and performance management.

The fault management unit may include an interface unit connected to the network management server and the customer center, the interface unit receiving the fault information received through the network management server and the customer center; An analysis unit for analyzing the cause of the failure and the influence information for each service using the failure information received through the interface unit; And generating and managing a fault event by using the fault information received through the interface unit and the analysis result performed by the analyzing unit, and classifying the fault event requiring work in the field among the fault events and transmitting the fault event to the situation propagation management unit And a failure event management unit.

Also, the situation propagation management unit may include a group management unit for classifying and managing a plurality of fail-capable workers into a plurality of types of situation propagation groups; A worker management unit for selecting and managing a dispatching worker to perform a failover in a situation propagation group managed by the group management unit; And a status propagation event management unit for analyzing a failure event generated by the failure management unit to generate and manage a situation propagation event to be transmitted to a dispatch worker who is a final worker when a situation propagation is required, A radio wave event management unit for selecting a group corresponding to a radio wave event to propagate the radio wave event, selecting a dispatching worker through the worker management unit, receiving a failover result from the dispatching worker, and terminating the status propagation event; And a situation management unit for delivering a failure event transmitted from the failure management unit to the propagation event management unit and performing a management function of a situation propagation event by the propagation event management unit and a history propagation result history management function.

The additional function management unit may further include: a rights management unit for performing a management function for the manager and the worker for each situation propagation group managed by the situation propagation management unit; A large disorder management unit for providing and managing a situation board for the situation propagation in a nationwide case when the fault event transmitted from the fault management unit is a large fault, and performing inquiry and management of faults and work process progresses on a nationwide basis; Storing information of a fault event managed by the fault management unit and a situation propagation event managed by the situation propagation management unit, storing selection result information of an operator for processing a fault event, A knowledge database to store and manage; And a facility management section that manages the network-related facility information database and performs the function of registering, changing, and deleting the information of the facility.

The failure information may be received from a network management server for monitoring a network failure or received from a customer through a customer center.

In addition, the plurality of types of situation propagation groups are state propagation groups of service, region, and organization type.

In addition, the situation propagation management section updates the location of the dispatching worker in accordance with the registration of the dispatching worker, and displays the state in which the dispatching worker is in trouble.

According to another aspect of the present invention,

A method for receiving and managing fault information on a network including at least one network, the method comprising: analyzing the cause of the fault and analyzing the effect of the fault and receiving the fault; Analyzing the generated failure event to generate a situation propagation event when a situation propagation is required; Selecting a group capable of failover from a plurality of context propagation groups and propagating the context propagation events to a selected context propagation group; And terminating the situation propagation event by receiving a fail-over result after the fail-over from the dispatching worker of the situation propagation group confirming the status propagation event.

Here, before generating the fault event, the method may further include classifying and managing a plurality of fail-capable operators into a plurality of types of situation propagation groups, and in the propagating condition, the plurality of types of situation propagation groups The status propagation group capable of fail-over is selected.

In addition, the step of terminating the status propagation event may include: checking the status propagation event through the app of the smartphone by the dispatch worker; Registering as a dispatch worker for fail-over based on the contents and position of the fault identified through the situation propagation event; And registering an action result of the failure through an app of the smartphone after the action of the obstacle.

Also, before the failure event is generated, the network management server is connected to the network management server and the customer center, receives the failure information received through the network management server and the customer center, or receives the failure information recognized through the inspection of the field worker .

In addition, the dispatch worker can input trouble occurrence information through the app of the smartphone.

According to the present invention, fault handling speed is increased due to wired / wireless situation propagation.

In addition, the network situation propagation to cope with a network failure, the dispatch time and the travel time by the number of the adjacent caller are shortened.

In addition, as a smartphone environment, work productivity is improved by inputting the work contents on the site at the office by inputting the failover history on the field, mobile settlement, and the like,

1 is a schematic block diagram of an integrated fault management system according to an embodiment of the present invention.
FIG. 2 is a diagram showing a specific configuration of the failure management unit shown in FIG. 1. FIG.
FIG. 3 is a diagram showing a specific configuration of the situation propagation management section shown in FIG. 1. FIG.
FIG. 4 is a diagram showing a specific configuration of the additional function management unit shown in FIG. 1. FIG.
5 is a diagram showing a specific configuration of the task management unit shown in FIG.
6 is a flowchart illustrating an integrated failure management for a failure received through the NMS server by the failure management system according to an embodiment of the present invention.
FIG. 7 is a flowchart illustrating an integrated fault management for a fault received by the fault management system according to an embodiment of the present invention.
8 is a flowchart of an integrated fault management method performed by the integrated fault management system according to an embodiment of the present invention.
9 is an internal configuration diagram of an integrated fault management system for performing an integrated fault management method according to an embodiment of the present invention.

Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings so that those skilled in the art can easily carry out the present invention. The present invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. In order to clearly illustrate the present invention, parts not related to the description are omitted, and similar parts are denoted by like reference characters throughout the specification.

Throughout the specification, when an element is referred to as " comprising ", it means that it can include other elements as well, without excluding other elements unless specifically stated otherwise. Also, the terms " part, " " module, " and " module ", etc. in the specification mean a unit for processing at least one function or operation and may be implemented by hardware or software or a combination of hardware and software have.

Hereinafter, an integrated failure management system according to an embodiment of the present invention will be described with reference to the drawings.

1 is a schematic block diagram of an integrated fault management system according to an embodiment of the present invention.

1, the integrated fault management system 100 according to the embodiment of the present invention includes a fault management unit 110, a situation propagation management unit 120, an additional function management unit 130, and a task management unit 140 do.

The failure management unit 110 interlocks with the NMS server 200 and the customer center 300 to identify the cause of the network failure of the communication service provider having various networks, processes it as a failure event, and manages the failure event.

More specifically, the failure management unit 110 receives failure occurrence information received through the NMS server 200 and failure occurrence information received by the customer's VOC (Voice of Consumer) through the customer center 300, And service, and manages the fault event according to the result of the analysis of the cause and effect of the fault.

The situation propagation management unit 120 manages a situation propagation group for each service, region, and organization associated with the organization chart, selects a worker 400 from a propagation group for a fault event transmitted from the fault management unit 110, And transmits it to the worker 400. The status propagation process includes the following steps.

The situation propagation management unit 120 receives the result of the work according to the propagation of the situation from the worker 400 and processes it so that the fault event processing can be completed through the situation propagation process.

The supplementary function management unit 130 manages authority of administrators and operators for each group, performs management for spreading the situation of a nationwide unit when a large-scale fault occurs, and manages data related to various fault events and event propagation events.

The task management unit 140 manages the tasks to be processed in the integrated failure management system 100. In other words, the task management unit 120 performs management during creation and deletion of tasks, and also performs schedule management for group-specific tasks including individuals and teams performing tasks. Then, the job management unit 120 performs job history and performance management.

FIG. 2 is a diagram illustrating a specific configuration of the failure management unit 110 shown in FIG.

2, the failure management unit 110 includes an interface unit 111, an analysis unit 113, and a failure failure event management unit 115. [

The interface unit 111 is connected to the NMS server 200 and the customer center 300. The interface unit 111 receives the failure occurrence information received through the NMS server 200 and the failure occurrence information received by the customer's VOC through the customer center 300 Information is received.

The interface unit 111 interlocks with the NMS server 200 according to the network type when interworking with the NMS server 200. For example, an interface for interworking with the NMS server 200, which is divided into an Internet network, a PSTN network, a transport network, and a wireless network, is provided.

The analysis unit 113 analyzes the failure cause and the influence information using the failure occurrence information received through the interface unit 111. [

The failure cause analysis will be described below.

First, the failure cause analysis distinguishes the fault source service network. For example, distinguish between a source network and an associated network. At this time, assuming that various networks such as a dedicated transport network, a PSTN, an IP network, a WiBro, and a wireless network (3G, LTE) are interlocked to provide a network service, there is a service failure in a single network , A network failure in one area may be a cause of failure of other services. Therefore, failure analysis is performed by referring to the origins and relevance of various networks.

In case of failure analysis, disturbances such as network line, equipment, port, and service server are distinguished.

In addition, when analyzing the fault, it identifies the locality where the fault occurred and the subscriber.

Next, the influence information analysis will be described.

The network is structured like a neural network and differs in scope and influence from the obstacles of the backbone network equipment acting as a hub and the access network equipment corresponding to the peripheral nerve. Therefore, the impact information analysis is performed in that the network has metrics information, and the failure of the upper metrics means a failure to all the networks below it.

This impact information analysis shows that the failure of a device or port with a higher local ID and network ID is indicated by a network failure of all the subordinate network topologies, where the failure event is handled as one backbone failure. The influence information includes the number of lower equipment, the number of ports, the number of subscribers, and the like. For example, L3 equipment can be estimated in apartment complexes, and L2 equipment can be estimated in units.

The failure failure event management unit 115 generates a failure event using the failure occurrence information received through the interface unit 111 and the analysis result performed by the analysis unit 113 and performs management of the generated failure event do.

Such management includes a fault monitoring function.

In the case of fault monitoring function, it includes fault information inquiry function such as inquiry of fault information by network type and inquiry of fault information by customer reception. In this case, the failure information for each type of network includes information such as a local ID, a service type, a network ID, an equipment ID, a port ID, a failure occurrence time, a processing step, an end time, However, in the case of failure information by customer reception, it includes information such as local ID, service type, customer ID, reception time, processing step, end time, and processing time.

The fault failure event management unit 115 classifies fault events that require work in the field among the fault events and transmits the fault events to the situation propagation management unit 120. [

3 is a diagram showing a specific configuration of the situation propagation management unit 120 shown in FIG.

3, the situation propagation management unit 120 includes a group management unit 121, a radio wave event management unit 123, a worker management unit 125, and a situation management unit 127. [

The group management unit 121 performs group setting for propagating a situation propagation event for propagating the fault event to the worker to the worker inside the worker and the worker in the worker.

To this end, the group management unit 121 classifies the workers 400 by service, region, and organization in association with the organization chart, and manages them as a group for carrying out the situation propagation.

The group management unit 121 performs creation, modification, and deletion of a group to be managed.

The propagation event management unit 123 generates a state propagation event to be transmitted to the worker who is the final worker, and manages the state of the generated state propagation event. The state of this situation propagation event includes the number of workers, the selection of the workers, the movement, the work, the result processing, and the work completion.

The worker management unit 125 selects an operator to handle the failure event.

The worker management unit 125 performs worker selection based on the group information managed by the group management unit 121. [

The worker management unit 125 performs dispatch management of the selected worker.

The situation management unit 127 transmits a failure event transmitted from the failure management unit 110 to the propagation event management unit 123 and controls the propagation event management function of the propagation event management unit 123 and the status propagation result history management function .

Meanwhile, the propagation event management unit 123 transmits a status propagation message and a work message for processing a situation propagation event to the smartphone of the worker selected by the worker management unit 125 through a push message.

FIG. 4 is a diagram showing a specific configuration of the additional function management unit 130 shown in FIG.

4, the additional function management unit 130 includes a rights management unit 131, a large failure management unit 133, a knowledge database (DB) 135, and a business location management unit 137. [

The authority management unit 131 performs a management function for each group manager and an operator managed by the situation propagation management unit 120. The management function includes an ID management and authentication function of the administrator and the worker.

The large failure management unit 133 provides and manages a status board for the situation propagation in units of a nation when a failure event transmitted from the failure management unit 110 is a large failure, and performs inquiry and management of failures and work process progresses on a nationwide basis.

The knowledge DB 135 stores situation propagation event information managed by the situation propagation management unit 120 and stores information on the type of the formalized work and processing result information such as storing the selection result information of the worker to process the fault event do.

The business management unit 137 manages the in-house facility and the customer's network-related facility information database, and performs business information registration, change, and deletion functions.

5 is a diagram showing a specific configuration of the task management unit 140 shown in FIG.

5, the job management unit 140 includes an information processing unit 141, a job approval unit 143, a job schedule unit 145, and a history management unit 147. [

The information processing unit 141 provides a function for creating, changing, deleting, and inquiring a job that requires action in the field.

The job approval section 143 processes the approval of the manager in charge of each job classification. This includes Web processing, Android, and iPhone OS (IOS) processing. In addition, the work includes AS activities such as construction (direct construction, external construction) including direct construction and external construction, fault handling, customer requirement processing, field trip inspection, inspection of vulnerable areas, emergency dispatch, .

The job scheduling unit 145 creates a schedule management for each individual to be assigned and assigned in the group as an operator to process the failure event, and manages the job schedule for each team.

The history management unit 147 provides a function of managing and viewing the history of classified jobs and a function of generating and managing statistics of classified results, teams, and individual job results.

Next, a method for performing integrated fault management using the integrated fault management system 100 will be described.

First, a case where a fault is received through the NMS server 200 will be described. Since the case of receiving a fault from the customer through the customer center 300 is also similar to the case where the fault is received through the NMS server 200, the reference is made.

6 is a flowchart illustrating an integrated failure management for a failure received through the NMS server 200 by the failure management system 100 according to an embodiment of the present invention.

Referring to FIG. 6, when a failure is detected through the wired failure monitoring in the NMS server 200 located on the central control center side, it is displayed on the wired failure monitoring screen and the failure information is transmitted to the integrated failure management system 100 (S100).

The integrated fault management system 100 receives the fault information transmitted from the NMS server 200 and processes the fault event, analyzes the fault, selects the propagation group when the fault handling is required, And transmits a status propagation event to a group selected to perform the status propagation process to perform integrated status propagation (S110).

The worker 400 checks the status propagation event received through his / her smartphone app and inquires of the failure status (S120). The worker 400 adjacent to the failure zone is connected to the integrated failure management system 100, (S130).

Thereafter, the worker 400 registered to be active registers the position of the obstacle displayed on the smartphone using the map, and arrives at the site through the automatically connected navigation (S140) As a result of the failover and the processing, the integrated fault management system 100 registers it through the smart phone (S150).

Accordingly, the integrated failure management system 100 transmits the failure processing details to the NMS server 200 according to the failure actions and the processing results reported from the dispatching worker 400, thereby completing the failure processing (S160).

Meanwhile, the integrated failure management system 100 stores the failures received from the NMS server 200 and the failure handling and processing result delivered from the dispatching worker 400 in the DB 135, It enables managers and workers as well as third parties to inquire about their contents.

Meanwhile, the integrated fault management system 100 according to the embodiment of the present invention can assign dispatch duties for fault handling by directly designating field dispatchers without performing the state propagation process.

Hereinafter, the outgoing work flow will be described.

FIG. 7 is a flowchart illustrating a fault management system 100 according to an embodiment of the present invention, which performs integrated fault management for a fault received through a field report.

Referring to FIG. 7, when a field worker (410) recognizes a fault situation of a facility on a site in a jurisdictional area such as an activity such as a circulation check, the integrated fault management The system 100 is notified of the failure (S200).

Accordingly, the integrated fault management system 100 processes the fault notification received from the field worker 410 and analyzes the fault, and when the fault is required, the field worker 410 (Step S210). In step S210, the dispatching worker 400 is checked.

Thereafter, the integrated fault management system 100 allocates a dispatch service for fault handling to the dispatched worker 400 through the smartphone application (S220).

Accordingly, the dispatching worker 400 confirms the fault handling task allocated through the application of the smartphone, and reports the result of the fault handling to the integrated fault handling system 100 (S230).

Likewise, the integrated fault management system 100 stores the fault received from the field worker 410 and the contents of the failover and processing result delivered from the dispatching worker 400 in the DB 135, It enables managers and workers as well as third parties to inquire about their contents.

Hereinafter, a specific integrated fault management method for handling a fault that the integrated fault management system 100 according to an embodiment of the present invention receives from outside will be described.

FIG. 8 is a flowchart of an integrated fault management method performed by the integrated fault management system 100 according to an embodiment of the present invention. FIG. 9 is a flowchart illustrating an integrated fault management method for performing an integrated fault management method according to an embodiment of the present invention. (100).

9 illustrates only the configuration elements used to perform the integrated fault management method according to the embodiment of the present invention. However, it is to be understood that other components can be used to perform the integrated fault management method according to the embodiment of the present invention It will be understood by those skilled in the art.

8 and 9, when the interface unit 111 of the failure management unit 110 receives the failure information collected from the network equipment at the NMS server 200 or receives the failure information received from the customer at the customer center 300 The fault information is received (S300).

The received fault is processed as a fault event, and the analysis unit 113 receives the fault event (S310) and analyzes the cause of the fault, and at the same time, analyzes the service influence of the fault event, (S320) and transmits the result to the failure event management unit 115 (S330).

The failure event management unit 115 receives a failure event from the interface unit 111 in step S310 and analyzes the correlation between the failure events and performs a root cause analysis using the analysis result transmitted from the analysis unit 113 Cause Analysis), and the remaining events are classified as sub-events (S340).

After that, the group management unit 121 inquires the propagation target to process the fault event, and the fault event is transmitted to the propagation event management unit 123 in order to propagate the fault condition together with the propagation target group (S350) (S360) . Here, the failure event management unit 115 registers the failure event in the knowledge DB 135 and stores it (S355).

Meanwhile, if the failure event management unit 115 determines that it is a large failure as a result of the failure analysis, the failure event management unit 115 transmits a failure event to the large failure management unit 133 so that the large failure can be registered as a large failure (S345).

Next, the propagation event management unit 123 of the situation propagation management unit 120 generates a situation propagation event that can be propagated to the final worker in response to the request from the failure event management unit 115 (S370) The fault event information is propagated to the smartphone of the user (S380).

Thereafter, the workers of the group of situation propagation candidates confirm the contents of the situation propagation through their smartphone apps, and the worker adjacent to the obstacle area registers himself / herself as the dispatch worker 400 (S390) and performs fail-over (S400). At this time, the dispatching worker 400 can check the model of the equipment, the type of the fault, and the standard work procedure by inquiring the workplace information and perform the fail-over.

Thereafter, the dispatching worker 400 completes the action corresponding to the obstacle and the task corresponding to the content of the situation propagation, and inputs the fail-over result through the smart phone (S410).

Accordingly, the propagation event management unit 123 stores the failover result transmitted from the dispatching worker 400 in the knowledge DB 135, and at the same time, transfers the failover result to the failure event management unit 115 to complete the received failure processing (S420, S430 ).

Meanwhile, the failure event stored in the knowledge DB 135 and the contents of the fail-over result can be inquired through the NMS server 200 or through the interface unit 111 by the operators of the customer center 300.

In addition, the work management unit 140 may inquire about the failure-related work or work.

Also, the history of the failure event and the result of the fail-over can be managed through the history management unit 157, and the history can be also inquired.

[Table 1] shows an example of the situation propagation management format used by the integrated fault management system 100 according to the embodiment of the present invention for fault event processing, and Table 2 shows an example of the group management format .

[Table 1]

Figure 112015066291666-pat00001

[Table 2]

Figure 112015066291666-pat00002

While the present invention has been particularly shown and described with reference to exemplary embodiments thereof, it is to be understood that the invention is not limited to the disclosed exemplary embodiments, It belongs to the scope of right.

Claims (15)

A failure management unit for receiving failure information about a network including at least one network to analyze the effect of the failure cause and the service, and to generate and manage the failure event according to the analysis result; And
A situation propagation management unit for classifying and managing a plurality of workers capable of fail-over into a plurality of types of situation propagation groups and analyzing fault events generated by the fault management unit to generate and manage a situation propagation event when a situation propagation is required Including,
The failure management unit analyzes the scope of the failure of the backbone network equipment and the access network equipment in the impact analysis and analyzes the event corresponding to the top event on the network as a failure event by the root cause analysis using the result of the impact analysis ≪ / RTI &
The situation propagation management unit selects a situation propagation group capable of processing the fault event among the situation propagation groups and propagates a situation propagation event corresponding to the fault event and receives a fail-over result from the dispatch worker in the selected situation propagation group And terminates the situation propagation event
Wherein said fault management system comprises:
The method according to claim 1,
The status propagation manager transmits the status propagation event through the smartphone application of the dispatching worker, and the dispatching worker confirms the status propagation event propagated through the smartphone application, To the situation propagation management unit
Wherein said fault management system comprises:
The method according to claim 1,
An additional function management unit which manages authority of an administrator and an operator for each of the status propagation groups, performs management for propagating a situation on a nationwide basis in case of a large failure, and manages data related to the failure event and the status propagation event; And
A work management unit for performing job creation and deletion management, performing schedule management for the job by each status propagation group, and performing job history and performance management
Further comprising:
The method according to claim 1,
The fault management unit,
An interface unit connected to the network management server and the customer center, the interface unit receiving the failure information received through the network management server and the customer center;
An analysis unit for analyzing the cause of the failure and the influence information for each service using the failure information received through the interface unit; And
Generates and manages a fault event using the fault information received through the interface unit and the analysis result performed by the analyzing unit, and classifies the fault event requiring work in the field among the fault events and transmits the fault event to the situation propagation management unit Fault event management unit
And an integrated fault management system.
The method according to claim 1,
The situation propagation management unit,
A group management unit for classifying and managing a plurality of fail-capable workers in a plurality of types of situation propagation groups;
A worker management unit for selecting and managing a dispatching worker to perform a failover in a situation propagation group managed by the group management unit;
And a status propagation event management unit for analyzing a failure event generated by the failure management unit to generate and manage a situation propagation event to be transmitted to a dispatch worker who is a final worker when a situation propagation is required, A radio wave event management unit for selecting a group corresponding to a radio wave event to propagate the radio wave event, selecting a dispatching worker through the worker management unit, receiving a failover result from the dispatching worker, and terminating the status propagation event; And
A situation management unit for delivering a failure event transmitted from the failure management unit to the propagation event management unit and performing a management function of a situation propagation event by the propagation event management unit and a history propagation result history management function,
And an integrated fault management system.
The method of claim 3,
The additional-
A rights management unit for performing a management function for a manager and a worker for each situation propagation group managed by the situation propagation management unit;
A large disorder management unit for providing and managing a situation board for the situation propagation in a nationwide case when the fault event transmitted from the fault management unit is a large fault, and performing inquiry and management of faults and work process progresses on a nationwide basis;
Storing information of a fault event managed by the fault management unit and a situation propagation event managed by the situation propagation management unit, storing selection result information of an operator for processing a fault event, A knowledge database to store and manage; And
A business management department that manages the network-related facility information database and performs the business information registration, change and deletion functions
And an integrated fault management system.
7. The method according to any one of claims 1 to 6,
Wherein the failure information is received from a network management server for monitoring a network failure or received from a customer through a customer center.
7. The method according to any one of claims 1 to 6,
Wherein the plurality of status propagation groups are status propagation groups of service, region, and organization types.
7. The method according to any one of claims 1 to 6,
Wherein the status propagation management unit updates the position of the dispatching worker according to the registration of the dispatching worker and displays the status in which the dispatching worker is in trouble.
A method for receiving and managing fault information on a network consisting of at least one network,
Analyzing the cause of the received failure and the impact on the service, and generating a failure event according to the analysis result;
Analyzing the generated failure event to generate a situation propagation event when a situation propagation is required;
Selecting a group capable of failover from a plurality of context propagation groups and propagating the context propagation events to a selected context propagation group; And
Receiving a failover result after the failover from the dispatching worker of the situation propagation group confirming the status propagation event and ending the status propagation event
/ RTI >
Wherein the step of generating the fault event comprises the steps of: analyzing a scope of a failure of the backbone network equipment and the access network equipment at the time of the impact analysis; Generates an event as the fault event,
The condition propagation step selects a situation propagation group capable of processing the fault event among the plurality of condition propagation groups and propagates a situation propagation event corresponding to the fault event
Integrated fault management method.
11. The method of claim 10,
Before the step of generating the fault event,
Further comprising the step of classifying and managing a plurality of fail-capable workers into a plurality of types of situation propagation groups,
In the situation propagating step, the plurality of types of situation propagation groups are selected from the group of status propagation groups capable of failover
Wherein said fault management method comprises the steps of:
11. The method of claim 10,
The step of terminating the situation propagation event comprises:
Confirming the situation propagation event through the app of the smartphone;
Registering as a dispatch worker for fail-over based on the contents and position of the fault identified through the situation propagation event; And
Registering an action result of the failure through an app of the smartphone after the action of the obstacle
The method comprising the steps of:
11. The method of claim 10,
Before the step of generating the fault event,
Receiving the fault information received through the network management server and the customer center connected to the network management server and the customer center or receiving the fault information recognized through the inspection of the field worker.
13. The method of claim 12,
Wherein the dispatcher is capable of inputting failure occurrence information through an app of the smartphone.
12. The method of claim 11,
Wherein the plurality of status propagation groups are status propagation groups of service, region, and organization types.
KR1020150097351A 2015-07-08 2015-07-08 System for managing integrated failure and method thereof KR101939370B1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
KR1020150097351A KR101939370B1 (en) 2015-07-08 2015-07-08 System for managing integrated failure and method thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020150097351A KR101939370B1 (en) 2015-07-08 2015-07-08 System for managing integrated failure and method thereof

Publications (2)

Publication Number Publication Date
KR20170006552A KR20170006552A (en) 2017-01-18
KR101939370B1 true KR101939370B1 (en) 2019-01-16

Family

ID=57992060

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020150097351A KR101939370B1 (en) 2015-07-08 2015-07-08 System for managing integrated failure and method thereof

Country Status (1)

Country Link
KR (1) KR101939370B1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR102149930B1 (en) * 2018-02-19 2020-08-31 주식회사 케이티 System and method for handling network failure
KR102616136B1 (en) * 2023-06-29 2023-12-20 주식회사 서경산업 Smart maintenance integrated system and method performing thereof

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100456719B1 (en) * 2001-08-17 2004-11-10 엘지전자 주식회사 System And The Method For Integration Surveillance And Test Of Wireless Network
KR100908131B1 (en) * 2007-11-02 2009-07-16 주식회사 케이티프리텔 Fault detection device and method using log filtering and fault detection system using the device
KR20100131181A (en) * 2009-06-05 2010-12-15 주식회사 케이티 Trouble recovery system using standard operating procedure and method thereof

Also Published As

Publication number Publication date
KR20170006552A (en) 2017-01-18

Similar Documents

Publication Publication Date Title
JP6263668B1 (en) Chat type task management system
US7898410B2 (en) Firefighter response system
US8417553B2 (en) Incident communication system
WO2013144787A1 (en) Work and quality management system, device and method
US20120084108A1 (en) Systems and methods for a work flow management application suite for mobile communications devices
CN105621180A (en) Obtaining method and device for elevator information
US20200160464A1 (en) System for collaborative infrastructure management and incident response
KR20160130120A (en) Historical Maintenance System with NFC or QR via the Web
JP2023143944A (en) Network system, monitoring system, work management system, and work management method
CN112288903A (en) Card punching system, method and equipment
KR101620605B1 (en) Apparatus and method for safety diagnosis report
KR101939370B1 (en) System for managing integrated failure and method thereof
CN112085225A (en) Operation and maintenance system and method for user side power equipment
WO2016180083A1 (en) Airport operation command system
JP6344755B2 (en) Support request system that can log
JP6022002B1 (en) Maintenance management system
KR101179359B1 (en) Work system and Method for managing mobile communication system
JP2007161431A (en) Patrol schedule management system of maintenance worker
JP2012033200A (en) Facility management system, facility management method and facility management program
JP2018180801A (en) Monitoring system and information terminal device in monitoring system
CN116402315A (en) Event reporting task response method and device
KR101166920B1 (en) Building management service providing method using smart device
JP2010218324A (en) Task allocation device, task allocation method and task allocation program
JP4507471B2 (en) Communication command system
KR102533361B1 (en) A method of operating messenger linked to HR information

Legal Events

Date Code Title Description
A201 Request for examination
E902 Notification of reason for refusal
E701 Decision to grant or registration of patent right
GRNT Written decision to grant