KR20040026934A

KR20040026934A - Apparatus and Method for Processing Obstacle of Switch

Info

Publication number: KR20040026934A
Application number: KR1020020058572A
Authority: KR
Inventors: 최우영
Original assignee: 엘지전자 주식회사
Priority date: 2002-09-26
Filing date: 2002-09-26
Publication date: 2004-04-01

Abstract

PURPOSE: A failure handling method and apparatus of an exchange are provided to report and handle a failure occurring at a processor board by an automatic procedure, without intervention of an operator. CONSTITUTION: When a base station (110) detects a failure, it stores the corresponding failure information and restarts the processor. The base station (110) transmits failure history information to a control station (120). The control station (120) stores the failure history information per base station and transmits the stored failure history information to a BSM (Base Station Manager) (130). The BSM (130) transmits the failure history information to a previously designated operator terminal (140) at a predetermined period or at the operator's request. The operator analyzes the cause of the failure and can respond to it quickly.

Description

Apparatus and Method for Processing Obstacle of Switch

The present invention relates to a failure handling method and apparatus for an exchange. When a failure occurs in a processor board used in a mobile communication exchange system, the failure information of that board is stored in a nonvolatile memory, and the failure history information is transmitted at the request of an operator.

Unlike a general-purpose computer system, the control system of an exchange requires a high degree of fault tolerance and real-time processing, and each exchange adopts one of various architectures, depending on its conditions, to meet these requirements.

Despite these measures against failures, however, a processor board mounted in the system can still experience various failures during operation. The most problematic case is one in which a processor board has been restarted because of a failure but the cause of that failure cannot be determined.

In this case it is difficult for the operator to capture the exact moment at which the failure occurs, so efforts such as monitoring the system for a long time are needed to capture it.

Moreover, when such failures occur on an unspecified number of processors, it is difficult for the operator even to attempt to monitor them.

In particular, a mobile communication exchange consists of a control station and base stations. Most base stations are located at remote sites that the operator cannot control directly, so the operator cannot easily monitor the actual moment at which a failure occurs and, as noted above, can only infer from the restart of a processor that a failure has occurred. In this state it is difficult to respond actively to processor failures, and identifying the cause is out of the question, which poses a critical risk to the stability of the entire system.

Hereinafter, a conventional failure handling method and apparatus of an exchange will be described with reference to the drawings.

FIG. 1 is a block diagram schematically showing a conventional failure handling apparatus of an exchange, and FIG. 2 is a block diagram schematically showing the configuration of its failure processing unit.

Referring to FIGS. 1 and 2, the operator monitors the state of the exchange 100 by connecting monitoring equipment 110, such as a computer, to the exchange 100 by wire.

When a failure is detected, the exchange 100 transmits failure information to the monitoring equipment 110 and is restarted.

The exchange 100 includes a processor 102 that controls the system, a failure processing unit 104 that detects a failure and generates failure detection alarm information, and a reset unit 106 that performs a restart when a failure is detected.

The failure processing unit 104 includes a failure detection unit 200 that detects a failure and a failure reporting unit 210 that transmits the failure detection alarm information to the monitoring equipment 110.

That is, the exchange 100 is restarted by the reset unit 106 when a hardware or software defect causes a problem that is fatal to normal operation.

Because the restart reinitializes all functions of the processor 102, the cause of the failure is temporarily removed and the processor operates normally again.

However, if the cause of the failure remains latent, the failure of the processor 102 generally recurs, although the interval between occurrences may vary. To take any action in such a case, it is necessary to distinguish whether the failure is a hardware failure or a software failure and to determine whether similar failures occur throughout the system.

In most cases, operation has relied on the normal recovery obtained through the reinitialization that accompanies a processor restart, and more often than not no active response was made. When such failures occur unpredictably across the whole system at intervals, their cause needs to be identified and addressed. Because a restart generally also initializes the memory of the processor, however, the operator has no means of identifying the cause and is left with the inefficient task of responding by guesswork.

As described above, the conventional approach leaves no record of a failure, so the operator has no means of responding after the failure occurs.

In addition, although failures that occur throughout a system generally show similar failure types, no method is provided for collecting and analyzing the failure records by system unit.

In addition, analysis by failure type was difficult, and for failures of remotely located base station processors the operator had to perform monitoring for a long time, which was inconvenient.

Accordingly, an object of the present invention is to provide a failure handling method and apparatus for an exchange in which failures are reported and handled by an automatic procedure without operator intervention.

FIG. 1 is a block diagram schematically showing a conventional failure handling apparatus of an exchange.

FIG. 2 is a block diagram schematically showing the configuration of a conventional failure processing unit.

FIG. 3 is a diagram showing a failure handling method in a mobile communication exchange system according to a preferred embodiment of the present invention.

FIG. 4 is a block diagram schematically showing the configuration of a failure processing unit according to a preferred embodiment of the present invention.

FIG. 5 is a diagram showing a failure history database according to a preferred embodiment of the present invention.

FIG. 6 is a flowchart showing a failure handling method according to a preferred embodiment of the present invention.

FIG. 7 is a flowchart showing a failure handling method of a base station according to a preferred embodiment of the present invention.

<Explanation of reference numerals for the main parts of the drawings>

100: exchange
102: processor
104: failure processing unit
106: reset unit
110, 140: operator terminal
200, 400: failure detection unit
210, 430: failure reporting unit
110: base station
120: control station
130: BSM
410: failure management unit
420: failure storage unit

To achieve the above object, according to one aspect of the present invention, there is provided a failure handling method of an exchange in which, when a failure occurs in a base station, the base station classifies and stores the failure, takes action according to the classified failure type, and transmits failure history information to a control station/BSM, and the control station/BSM statistically processes the failure history information transmitted from the base station and transmits the result to an operator.

The failure history information includes at least one of the IP address of the base station in which the failure occurred, the failure type, the failure occurrence time, the program counter, the register contents, and the memory at the failure location.

If the failure is fatal, it is determined whether the failed processor provides a switchover function. If it does, the switchover is performed and the failure history information is then transmitted to the control station/BSM.

If the failed processor does not provide a switchover function, the processor is restarted and the failure history information is then transmitted to the control station.

If the failure is not fatal, a failure handling routine is run to handle the failure and the failure history information is then transmitted to the control station.

It is determined whether a failure history information request command for the statistically processed failure history information has been received from the operator. If it has, the statistically processed failure history information is transmitted to the operator.

If no failure history information request command has been received from the operator, the failure history information is transmitted to the operator at a predetermined period.

The failure history information is transmitted to the operator in the form of at least one of an e-mail, a text message, a voice message, and a file.

According to another aspect of the present invention, there is provided a failure handling apparatus of an exchange, comprising: a failure detection unit that detects the occurrence of a failure; a failure management unit that classifies the failure and takes action on it; a failure storage unit that stores the failure information transmitted from the failure management unit; and a failure reporting unit that periodically transmits the failure history information stored in the failure storage unit.

The failure storage unit is a nonvolatile memory.

Hereinafter, preferred embodiments of the present invention will be described in detail with reference to the accompanying drawings.

FIG. 3 is a diagram showing a failure handling method in a mobile communication exchange system according to a preferred embodiment of the present invention.

Referring to FIG. 3, when the base station 110 detects a failure, it stores the failure information and then restarts the processor.

The base station 110 then transmits the failure history information to the control station 120.

The control station 120 stores the failure history information transmitted from the base station 110 on a per-base-station basis and transmits the stored failure history information to the BSM 130.

The BSM 130 transmits the failure history information to a predesignated operator terminal 140 at a predetermined period or at the operator's request. The failure history information is transmitted to the operator in the form of an e-mail or a message.

That is, if the BSM 130 can interwork with an external Internet network through a TCP/IP network, the stored failure history information on the cause and type of the failure is provided directly to the operator by e-mail.

A failure stored in SRAM or flash memory that the RTOS of the processor treats as part of its file system can be handled as a kind of file, and most mobile communication systems from 3G onward are connected by networks that support the TCP/IP protocol. Through interworking with a TCP/IP-based Internet network, the details of a failure are therefore transmitted in real time, as a file transfer or an e-mail, to the person responsible for the hardware or software of the failed processor board.
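As an illustration of delivering a stored failure file by e-mail, the following minimal sketch builds (but does not send) a message with the failure history attached, using Python's standard `email.message` module. The function name `build_failure_email`, the addresses, the file name, and the JSON payload are assumptions made for this example; the patent does not specify a message format.

```python
from email.message import EmailMessage


def build_failure_email(sender, recipient, station_ip, history_bytes):
    """Build an e-mail carrying a failure history file as an attachment.

    All names and the message layout are hypothetical; the source only
    says the history is sent as a file transfer or an e-mail.
    """
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = recipient
    msg["Subject"] = f"Failure history for base station {station_ip}"
    msg.set_content("Failure history file attached.")
    # Attach the stored failure history as an opaque binary file.
    msg.add_attachment(
        history_bytes,
        maintype="application",
        subtype="octet-stream",
        filename="failure_history.log",
    )
    return msg


msg = build_failure_email(
    "bsm@example.com",
    "operator@example.com",
    "10.0.0.7",
    b'{"failure_type": "critical"}\n',
)
attachment = next(msg.iter_attachments())
```

Actual delivery would depend on the mail infrastructure reachable from the BSM, which the source leaves open.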

The operator can then analyze the cause of the failure and respond to it quickly.

FIG. 4 is a block diagram schematically showing the configuration of a failure processing unit according to a preferred embodiment of the present invention.

Referring to FIG. 4, the failure processing unit includes a failure detection unit 400 that detects the occurrence of a failure, a failure management unit 410 that classifies the failure and takes action on it, a failure storage unit 420 that stores the failure history, and a failure reporting unit 430 that reports the failure history at the operator's request or periodically.

When failure detection information is received from the failure detection unit 400, the failure management unit 410 classifies the failure as critical, major, or minor.

If the classified failure is critical, the failure management unit 410 determines whether the processor in which the failure occurred provides a switchover function.

If the processor provides a switchover function, the failure management unit 410 performs the switchover; if it does not, the failure management unit 410 takes an action such as hardware replacement.

If the classified failure is major or minor, the failure management unit 410 handles the failure using a failure handling routine.

The failure storage unit 420 stores the failure information transmitted from the failure management unit 410 and may be a nonvolatile memory such as SRAM or flash memory.

Accordingly, the failure information stored in the failure storage unit 420 is not erased even when power is removed. The stored failure information lets the operator view its contents directly after failure recovery, and in the case of a base station processor, the failure history held in the processor's storage is also reported over the network at the request of the control station.
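The key property of the failure storage unit is that its contents survive a restart. A minimal sketch of that behavior follows, with an append-only file standing in for the nonvolatile memory. The class name `FailureStore`, the JSON-lines layout, and the temporary path are assumptions for illustration, not details from the patent.

```python
import json
import os
import tempfile


class FailureStore:
    """Append-only failure log backed by a file (stand-in for nonvolatile memory)."""

    def __init__(self, path):
        self.path = path

    def append(self, info):
        # Write one record per line and force it to stable storage
        # before returning, so a restart right afterwards does not lose it.
        with open(self.path, "a", encoding="utf-8") as f:
            f.write(json.dumps(info) + "\n")
            f.flush()
            os.fsync(f.fileno())

    def history(self):
        if not os.path.exists(self.path):
            return []
        with open(self.path, encoding="utf-8") as f:
            return [json.loads(line) for line in f if line.strip()]


path = os.path.join(tempfile.mkdtemp(), "failure_history.log")
FailureStore(path).append({"failure_type": "critical", "pc": 4198956})

# Simulate a restart: a fresh object reads back what was stored earlier.
after_restart = FailureStore(path)
history = after_restart.history()
```

Writing the record before triggering the restart matches the order described for the base station, which stores the failure information first and restarts afterwards.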

The operation of the failure processing unit configured as above will now be described.

When the failure detection unit 400 detects a failure, it transmits failure detection information to the failure management unit 410. The failure management unit 410 classifies the failure as critical, major, or minor.

If the classified failure is critical, the failure management unit 410 determines whether the processor provides a switchover function.

If the processor provides a switchover function, the failure management unit 410 performs the switchover; if it does not, the failure management unit 410 takes an action such as hardware replacement.

If the classified failure is major or minor, the failure management unit 410 handles the failure using a failure handling routine. The failure management unit 410 then transmits the failure information to the failure storage unit 420.

The failure storage unit 420 stores the transmitted failure information and then transmits the failure history information to the failure reporting unit 430 through the failure management unit 410.

The failure reporting unit 430 transmits the failure history information received from the failure storage unit 420 to the control station.

The failure history information stored in the failure storage unit 420 is described with reference to FIG. 5.

FIG. 5 is a diagram showing a failure history database according to a preferred embodiment of the present invention.

Referring to FIG. 5, the failure history database stores the IP address, the failure type, the failure occurrence time, the program counter (PC), the register contents, and the memory at the failure location.

The IP address is a base station identification address that indicates which base station experienced the failure.

The failure type is expressed as critical, major, or minor, or as a bus error, a system error, and the like.
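The fields of the failure history database can be pictured as a single record type. The sketch below is one possible layout, assuming Python field names, an ISO 8601 timestamp, and JSON serialization; none of these representations are specified in the patent, which lists only the fields themselves.

```python
import json
from dataclasses import asdict, dataclass
from datetime import datetime


@dataclass
class FailureRecord:
    ip_address: str       # identifies the base station that failed
    failure_type: str     # e.g. "critical", "major", "minor", "bus error"
    occurred_at: str      # failure occurrence time (ISO 8601, assumed format)
    program_counter: int  # PC value at the time of the failure
    registers: dict       # register name -> value (names are hypothetical)
    memory_dump: str      # hex string of memory at the failure location

    def to_json(self):
        return json.dumps(asdict(self), sort_keys=True)

    @classmethod
    def from_json(cls, text):
        return cls(**json.loads(text))


record = FailureRecord(
    ip_address="10.0.0.7",
    failure_type="bus error",
    occurred_at=datetime(2002, 9, 26, 12, 0, 0).isoformat(),
    program_counter=0x00401A2C,
    registers={"r0": 0, "sp": 0x7FFF0000},
    memory_dump="deadbeef",
)
restored = FailureRecord.from_json(record.to_json())
```

Serializing each record to text makes it straightforward to keep it in a file-like store and to send it over a TCP/IP network, as the description envisages.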

FIG. 6 is a flowchart showing a failure handling method according to a preferred embodiment of the present invention.

Referring to FIG. 6, when the base station detects a failure (S600), it stores the failure information (S602). The base station then restarts the processor (S604) and transmits the stored failure history information to the control station/BSM (S606).

The control station/BSM receives the failure history information transmitted from the base station and performs statistical processing on it (S608). That is, the control station/BSM statistically processes the failure history information by classifying it by time period, by failure type, and by processor.
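The statistical processing in step S608 can be sketched as simple counting along the three classifications named above. In this illustration the hour of day stands in for the time period and the base station IP address stands in for the processor identifier; both choices, along with the function name `summarize` and the record keys, are assumptions rather than details from the patent.

```python
from collections import Counter
from datetime import datetime


def summarize(records):
    """Count failures by hour of day, by failure type, and by processor."""
    by_hour, by_type, by_processor = Counter(), Counter(), Counter()
    for r in records:
        by_hour[datetime.fromisoformat(r["occurred_at"]).hour] += 1
        by_type[r["failure_type"]] += 1
        by_processor[r["ip_address"]] += 1
    return {
        "by_hour": dict(by_hour),
        "by_type": dict(by_type),
        "by_processor": dict(by_processor),
    }


records = [
    {"ip_address": "10.0.0.7", "failure_type": "critical",
     "occurred_at": "2002-09-26T03:15:00"},
    {"ip_address": "10.0.0.7", "failure_type": "minor",
     "occurred_at": "2002-09-26T03:40:00"},
    {"ip_address": "10.0.0.9", "failure_type": "critical",
     "occurred_at": "2002-09-26T14:05:00"},
]
stats = summarize(records)
```

Grouping records this way is what lets an operator notice, for example, that one processor fails repeatedly or that failures cluster at a particular time.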

The control station/BSM then determines whether a failure history information request command has been received from the operator (S610).

If it is determined in step S610 that a failure history information request command has been received from the operator, the control station/BSM transmits the failure history information to that operator (S612).

If it is determined in step S610 that no failure history information request command has been received from the operator, the control station/BSM transmits the failure history information to the operator at a predetermined period (S614). The control station/BSM transmits the failure history information to the operator in the form of an e-mail or a message.

The operator receives the failure history information and can thus carry out maintenance and respond to the failure more quickly (S616).
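The choice between on-request and periodic delivery in steps S610 to S614 reduces to a small decision function. The sketch below assumes times expressed in seconds and a hypothetical function name `report_reason`; the patent does not state how the period is measured or tracked.

```python
def report_reason(request_pending, now, last_sent, period):
    """Decide why, if at all, the failure history should be sent now.

    Returns "on_request" (S612), "periodic" (S614), or None when neither
    an operator request nor the reporting period calls for a transmission.
    """
    if request_pending:              # S610: operator request received
        return "on_request"
    if now - last_sent >= period:    # no request: fall back to the period
        return "periodic"
    return None


# Example with a one-hour period (3600 seconds, an assumed value).
reason_on_request = report_reason(True, now=100, last_sent=0, period=3600)
reason_periodic = report_reason(False, now=3600, last_sent=0, period=3600)
reason_none = report_reason(False, now=100, last_sent=0, period=3600)
```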

FIG. 7 is a flowchart showing a failure handling method of a base station according to a preferred embodiment of the present invention.

Referring to FIG. 7, when the base station detects a failure (S700), it classifies the failure and then stores it (S702). That is, the base station classifies the failure as critical, major, or minor.

The base station then determines whether the classified failure is critical (S704). If it is determined in step S704 that the failure is critical, the base station determines whether the processor in which the failure occurred provides a switchover function (S706). If it is determined in step S706 that the processor provides a switchover function, the base station performs the switchover (S708) and then transmits the failure history information to the control station (S710).

If it is determined in step S706 that the processor does not provide a switchover function, an action such as hardware replacement is taken and the processor is restarted (S712). The base station then transmits the failure history information to the control station (S710).

If it is determined in step S704 that the classified failure is major or minor, the base station handles the failure using a failure handling routine (S714). The base station then transmits the failure history information to the control station (S710).
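The branching of FIG. 7 can be summarized as a function that returns the sequence of actions for a given failure. This is a sketch of the control flow only; the action names, the `Severity` enum, and the `has_switchover` flag are hypothetical labels for the steps described above.

```python
from enum import Enum


class Severity(Enum):
    CRITICAL = "critical"
    MAJOR = "major"
    MINOR = "minor"


def handle_failure(severity, has_switchover):
    """Return the ordered actions a base station takes for one failure."""
    actions = ["classify_and_store"]              # S702
    if severity is Severity.CRITICAL:             # S704
        if has_switchover:                        # S706
            actions.append("switch_over")         # S708
        else:
            actions.append("restart_processor")   # S712
    else:
        actions.append("run_failure_routine")     # S714
    actions.append("send_history_to_control_station")  # S710
    return actions


critical_with_switchover = handle_failure(Severity.CRITICAL, True)
critical_without_switchover = handle_failure(Severity.CRITICAL, False)
minor_failure = handle_failure(Severity.MINOR, True)
```

Every path ends with the transmission in S710, so the control station receives a history entry regardless of how the failure was handled.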

The present invention is not limited to the above embodiments, and many variations are of course possible within the spirit of the present invention by those of ordinary skill in the art.

As described above, the present invention can provide a failure handling method and apparatus for an exchange in which failures are reported and handled by an automatic procedure without operator intervention.

In addition, the present invention can provide a failure handling method and apparatus that can be applied not only to equipment classified as a processor group in a mobile communication exchange system but also, as a failure handling scheme, to any system that is equipped with a CPU and a nonvolatile storage device and is connected to a network using the TCP/IP protocol.

Claims

1. A failure handling method of an exchange, comprising:

when a failure occurs in a base station, classifying and storing the failure, taking action according to the classified failure type, and transmitting failure history information to a control station/BSM; and

statistically processing, at the control station/BSM, the failure history information transmitted from the base station and transmitting the result to an operator.

2. The method of claim 1,

wherein the failure history information includes at least one of the IP address of the base station in which the failure occurred, the failure type, the failure occurrence time, the program counter, the register contents, and the memory at the failure location.

3. The method of claim 1,

wherein, if the failure is fatal, it is determined whether the failed processor provides a switchover function; and

if the failed processor provides a switchover function, the switchover is performed and the failure history information is then transmitted to the control station/BSM.

4. The method of claim 3,

wherein, if the failed processor does not provide a switchover function, the processor is restarted and the failure history information is then transmitted to the control station.

5. The method of claim 1,

wherein, if the failure is not fatal, a failure handling routine is run to handle the failure and the failure history information is then transmitted to the control station.

6. The method of claim 1, comprising:

determining whether a failure history information request command for the statistically processed failure history information has been received from the operator; and

if the failure history information request command has been received from the operator, transmitting the statistically processed failure history information to the operator.

7. The method of claim 6,

wherein, if the failure history information request command has not been received from the operator, the failure history information is transmitted to the operator at a predetermined period.

8. The method of any one of claims 1, 6, and 7,

wherein the failure history information is transmitted to the operator in the form of at least one of an e-mail, a text message, a voice message, and a file.

9. A failure handling apparatus of an exchange, comprising:

a failure detection unit that detects the occurrence of a failure;

a failure management unit that classifies the failure and takes action on it;

a failure storage unit that stores failure information transmitted from the failure management unit; and

a failure reporting unit that periodically transmits failure history information stored in the failure storage unit.

10. The apparatus of claim 9,

wherein the failure storage unit is a nonvolatile memory.