KR20020093544A

KR20020093544A - Duplication method for real-time fault tolerance in wireless communication system

Info

Publication number: KR20020093544A
Application number: KR1020010032346A
Authority: KR
Inventors: 강권일
Original assignee: 주식회사 하이닉스반도체
Priority date: 2001-06-09
Filing date: 2001-06-09
Publication date: 2002-12-16
Also published as: KR100455872B1

Abstract

PURPOSE: A duplication method for admitting fault in real time in a wireless communication system is provided to perform a successive service as to a call in progress as well as a set call when the fault is generated during the operation of a system by transmitting a changed item of a memory to a standby side while application tasks are operated by a message unit and logging a message for recovery when the application tasks are switched. CONSTITUTION: When performing an initialization, a registration message for registering its storage region is transmitted to a duplication module(201). An IPC(Inter Processor Communication) terminal receives an external message(202), and a corresponding application block processes the received message(203) and transmits backup data to the duplication module(204). It is judged whether the backup data are transmitted to the standby side(205). If the backup data are transmitted to the standby side, a check point message is transmitted to the duplication module(206).

Description

Duplication method for real-time fault tolerance in wireless communication system

본 발명은 무선통신시스템에서 실시간 결함 허용을 위한 이중화 방법과 상기 방법을 실현시키기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체에 관한 것으로, 특히 응용 태스크(Task)들이 메시지 단위로 동작을 하면서 메모리의 변경사항을 스탠바이(Sandby)측에 전달하고, 절체시 복구(recovery)를 위해 메시지를 로깅(logging)함으로써, 시스템 동작 중에 장애가 발생하더라도 설정된 호 뿐만 아니라 진행중인 호에 대해서도 연속적인 서비스가 가능하도록 시스템의 안정성을 높여주는 실시간 결함 허용을 위한 이중화 방법과 상기 방법을 실현시키기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체에 관한 것이다.The present invention relates to a duplication method for real-time fault tolerance in a wireless communication system and a computer-readable recording medium recording a program for realizing the method. Particularly, an application task operates in a message unit, By sending the changes to the standby side and logging messages for recovery during switchover, the system can provide continuous service not only for established calls but also for ongoing calls, even if a failure occurs during system operation. The present invention relates to a duplication method for real-time defect tolerance that improves stability and a computer-readable recording medium having recorded thereon a program for realizing the method.

본 발명은 일예로 차세대이동통신(IMT-2000) 시스템의 무선망제어기(RNC : Radio Network Control)를 결함 허용 시스템으로 개발하기 위한 것이나, 이는 단지 RNC에 국한된 기술이 아니라 실시간적인 결함 허용 방식을 채택하는 시스템에 모두 활용할 수 있다. 특히, IMT-2000 시스템에서 노드(Node)-B와 교환기에도 적용이 가능하다.The present invention, for example, to develop a radio network controller (RNC: Radio Network Control) of the next generation mobile communication (IMT-2000) system as a fault-tolerant system, which adopts a real-time fault-tolerant method, not just RNC. It can be used for all systems. In particular, it can be applied to the Node-B and the switch in the IMT-2000 system.

종래의 이중화 방식으로는 크게 동시동작방식, 동시쓰기방식, 상태체크포인팅방식 등으로 나누어진다.Conventional duplication methods are largely divided into a simultaneous operation method, a simultaneous write method, a state check pointing method.

먼저, 동시동작방식은 액티브(Active) 프로세서와 스탠바이(Standby) 프로세서가 독립된 하드웨어 모듈에서 같은 코드를 수행함으로써, 한 모듈이 고장에 의한 절체가 요구될 때에 다른 모듈의 외부통신 경로만 변경해주면 되기 때문에 실시간 결함 허용이 가능하다. 그러나, 초기에 액티브 프로세서와 스탠바이 프로세서가 동시에 시동되어야 하는 제약이 있으며, 스탠바이가 늦게 동작 되었다면 액티브 프로세서의 기억장소를 일괄 복제하는 과정이 필요하다.First, in the simultaneous operation method, since the active processor and the standby processor execute the same code in separate hardware modules, only one external module needs to change the external communication path when a module needs to be replaced by a fault. Real time fault tolerance is possible. However, there is a limitation that the active processor and the standby processor must be started at the same time in the beginning, and if the standby operation is late, a process of collectively replicating the storage of the active processor is required.

무선망제어기(RNC)에서는 실행의 기본단위가 호 처리 메시지이기 때문에 어떤 일시적인 결함으로 인해 각 태스크간의 콘택스트가 일치하지 않는 경우가 발생할 수 있으며, 이는 시스템의 안정성에 문제가 될 수 있다. 동시 동작 방식은 절체시 완벽하게 연속적으로 서비스를 할 수 있으나, 하드웨어의 성능이 급격히 좋아짐에 따라 두 프로세서간의 동기를 맞추기가 어려운 문제점이 있다.In RNC, since the basic unit of execution is the call processing message, there may be a case where the context is not matched between tasks due to a temporary defect, which may cause a problem of system stability. Simultaneous operation method can service completely continuously when switching, but there is a problem that it is difficult to synchronize the synchronization between the two processors as the performance of hardware rapidly improves.

다음, 동시 쓰기 방식은 액티브 프로세서가 기억장소의 내용을 변경하면 동시에 스탠바이 프로세서의 기억장소에도 똑같이 적용되어져야 한다. 이를 위해서는 공유 기억장치를 이용하거나, 동시 쓰기를 지원하는 특별한 하드웨어가 필요하다. 고장에 의해 절체가 요구될 때에는 액티브 프로세서의 콘택스트(Context)만을 복사하여 자신의 콘택스트에 적용한 후, 활성화하여 액티브 프로세서가 수행하던 작업을 연속적으로 수행한다. 이 방식은 액티브측의 오동작이 그대로 스탠바이측에 넘겨지기 때문에 듀얼 다운(dual down)이 발생하며, 동시쓰기를 위한 하드웨어 구성이 쉽지 않고, 특별한 하드웨어 장비가 필요하므로 비용도 그만큼 더 추가되는 문제점이 있다.Next, the simultaneous write method should be applied to the storage of the standby processor at the same time as the active processor changes the contents of the storage. This requires special hardware that uses shared storage or supports concurrent writes. When switching is required due to a failure, only the context of the active processor is copied and applied to its own context, which is then activated to continuously perform the tasks performed by the active processor. This method has a dual down because the malfunction of the active side is passed to the standby side as it is, and the hardware configuration for simultaneous writing is not easy, and special hardware equipment is required, which adds cost. .

마지막으로, 상태 체크 포인팅 기법은 액티브 프로세서만 동작을 하다가 상태변화가 발생하면 스탠바이 프로세서로 알려주는 방식이다. 액티브 프로세서의 작업 중 오류나 결함 발생시 스탠바이 프로세서는 최근의 체크포인트 데이터를 토대로 복구를 한다. 로그(log) 정보로는 콘텍스트 상태 정보 및 레지스터 값, 입력 큐(Input Queue) 등이 있다. 이 방식은 중요한 작업마다 상태정보를 갖게 되므로 복구율이 높은 편이지만 이를 위한 구현이 매우 어렵고 이중화 모듈의 오버헤드(overhaed)가 크다는 단점이 있다.Lastly, the state check pointing method operates only the active processor and notifies the standby processor when a state change occurs. If an error or fault occurs during the operation of the active processor, the standby processor recovers based on the latest checkpoint data. Log information includes context status information, register values, and input queues. This method has high recovery rate because it has status information for each important task. However, this method is difficult to implement and has a large overhead of the redundant module.

자동 체크포인팅 기법은 이중화를 지원하는 OS(Operation System)차원에서 자동적으로 로그 정보를 생성하여 스탠바이 프로세서로 전달하는 방식으로 이는 상태 체크포인팅 방식의 구현 부담을 줄이기 위해 제안되었다. 그러나, 이를 지원하는 OS를 새로 구축해야 하는 비용 상승 문제와 스탠바이로 전송되는 데이터의 양이 급증되므로 관리하는데 많은 비용이 드는 문제점이 있다.The automatic checkpointing technique automatically generates log information at the operating system (OS) level that supports redundancy and delivers it to the standby processor. This is proposed to reduce the burden of implementing the status checkpointing method. However, there is a problem in that a cost increase to build a new OS that supports this, and the amount of data to be sent to the standby is rapidly increased, which is expensive to manage.

본 발명은, 상기한 바와 같은 문제점을 해결하기 위하여 제안된 것으로, 응용 태스크(Task)들이 메시지 단위로 동작을 하면서 메모리의 변경사항을 스탠바이측에 전달하고, 절체시 복구를 위해 메시지를 로깅(logging)함으로써, 시스템 동작 중에 장애 발생시 설정된 호 뿐만 아니라 진행중인 호에 대해서도 연속적인 서비스가 가능하도록 하는 실시간 결함 허용을 위한 이중화 방법과 상기 방법을 실현시키기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체를 제공하는데 그 목적이 있다.The present invention has been proposed to solve the above problems, the application tasks (Tasks) to operate in the unit of a message to transfer the changes of the memory to the standby side, logging the message (logging) for recovery when switching By providing a duplication method for real-time fault tolerance that enables continuous service for ongoing calls as well as calls established when a failure occurs during system operation, and a computer-readable recording medium recording a program for realizing the method. The purpose is.

도 1 은 본 발명이 적용되는 연동 이중화 결함 허용 시스템의 구성예시도.1 is an exemplary view illustrating a configuration of an interworking redundancy defect tolerance system to which the present invention is applied.

도 2 는 본 발명에 따른 무선통신시스템에서 실시간 결함 허용을 위한 이중화 방법에 대한 일실시예 흐름도.2 is a flowchart illustrating a duplication method for real-time fault tolerance in a wireless communication system according to the present invention.

도 3a 는 본 발명에 따른 실시간 결함 허용을 위한 이중화 방법 중 액티브측 이중화 모듈의 동작 과정에 대한 일실시예 흐름도.3A is a flowchart illustrating an operation of an active-side duplication module of a duplication method for real-time fault tolerance according to the present invention.

도 3b 는 본 발명에 따른 실시간 결함 허용을 위한 이중화 방법 중 스탠바이 측 이중화 모듈의 동작 과정에 대한 일실시예 흐름도.3B is a flowchart illustrating an operation of a standby side duplication module in a duplication method for real time fault tolerance according to the present invention;

도 4 는 본 발명에 따른 실시간 결함 허용을 위한 이중화 방법 중 절체시 스탠바이측의 복구 과정에 대한 일실시예 흐름도.Figure 4 is a flow diagram of an embodiment of a recovery process of the standby side when switching among the duplication method for real-time fault tolerance according to the present invention.

도 5 는 본 발명에 따른 실시간 결함 허용을 위한 이중화 방법 중 초기 동기화 절차에 대한 일실시예 흐름도.5 is a flow diagram of an embodiment of an initial synchronization procedure of a duplication method for real time fault tolerance according to the present invention;

도 6 은 본 발명에 따른 로직 레벨에서 이중화 제어 알고리즘을 나타낸 일예시도.6 is an exemplary diagram illustrating a redundancy control algorithm at a logic level in accordance with the present invention.

* 도면의 주요 부분에 대한 부호의 설명* Explanation of symbols for the main parts of the drawings

11 : 액티브 프로세서12 : 스탠바이 프로세서11 Active Processor 12 Standby Processor

상기 목적을 달성하기 위한 본 발명은, 무선통신시스템에서 실시간 결함 허용을 위한 이중화 방법에 있어서, 액티브측과 스탠바이측에 적재되는 각 응용 태스크(Task)들을 메시지 단위로 동작시키기 위하여, 액티브 프로세서 및 스탠바이 프로세서를 초기화시키는 제 1 단계; 상기 각 응용 태스크들을 자신의 메모리에 응용 블록별로 등록한 후, 액티브 프로세서를 실행시키는 제 2 단계; 및 상기 응용 태스크들을 메시지 단위로 동작시키면서 메모리의 변경 사항을 스탠바이 프로세서로 전달하고, 상기 액티브측의 결함으로 인한 절체 발생시, 복구를 위한 메시지에 대한 로그 정보를 작성하여 상기 스탠바이측으로 전송하는 제 3 단계를 포함하여 이루어진 것을 특징으로 한다.The present invention for achieving the above object, in a duplication method for real-time fault tolerance in a wireless communication system, in order to operate each of the application tasks (Tasks) loaded on the active side and standby side by message unit, active processor and standby A first step of initializing the processor; A second step of registering each application task in its memory for each application block and executing an active processor; And transmitting a change in memory to a standby processor while operating the application tasks in a message unit, and in the event of a transfer due to a fault on the active side, creating log information on a message for recovery and transmitting the changed log information to the standby side. Characterized in that comprises a.

또한, 본 발명은, 프로세서를 구비한 무선통신시스템에, 액티브측과 스탠바이측에 적재되는 각 응용 태스크(Task)들을 메시지 단위로 동작시키기 위하여, 액티브 프로세서 및 스탠바이 프로세서를 초기화시키는 제 1 기능; 상기 각 응용 태스크들을 자신의 메모리에 응용 블록별로 등록한 후, 액티브 프로세서를 실행시키는 제 2 기능; 및 상기 응용 태스크들을 메시지 단위로 동작시키면서 메모리의 변경 사항을 스탠바이 프로세서로 전달하고, 상기 액티브측의 결함으로 인한 절체 발생시, 복구를 위한 메시지에 대한 로그 정보를 작성하여 상기 스탠바이측으로 전송하는 제 3 기능을 실현시키기 위한 프로그램을 기록한 컴퓨터로 읽을 수 있는 기록매체를 제공한다.The present invention also provides a wireless communication system including a processor, comprising: a first function of initializing an active processor and a standby processor to operate each application task loaded on the active side and the standby side in a message unit; A second function of executing the active processor after registering the respective application tasks in its own memory for each application block; And a third function of transmitting a change in memory to a standby processor while operating the application tasks in a message unit, and writing log information on a message for recovery and transmitting the changed information to a standby side when a changeover occurs due to a fault on the active side. A computer readable recording medium having recorded thereon a program for realizing the present invention is provided.

본 발명은, 응용 태스크들이 메시지 단위로 동작을 하면서 메모리의 변경사항을 스탠바이측에 전달하는 방식을 기본으로 하며, 절체시 복구를 위해 메시지를 로깅하는 방식인 응용 블록 연동 이중화 방식이다. 액티브-스탠바이 프로세서간의 동기화 기능을 메시지 단위로 하였기 때문에, 응용 태스크의 콘택스트 상태 정보 및 레지스터 값 등이 필요하지 않다. 그리고, 응용 태스크들이 직접 메시지를 처리하고 난 다음에 그 메시지에 대해서 로그 정보를 작성하여 스탠바이로 전송함으로써, 동기가 맞춰지기 때문에 더욱 정확하고 쉽게 동기를 맞출 수 있으며, 프로그램의 구현도 용이하다. 또한, 오동작으로 인해 치명적인 장애가 발생할지라도, 곧 바로 절체됨으로써 잘못된 정보를 스탠바이로 전송하지 않는다. 이는 듀얼 다운을 막을 수 있는 결과를 낳는다.The present invention is based on a method of transferring application memory changes to the standby side while application tasks operate on a per message basis, and is an application block interworking duplexing scheme in which a message is logged for recovery when switching. Since the synchronization function between the active and standby processors is in units of messages, the contact status information and the register value of the application task are not necessary. Application tasks directly process the message, and then write log information about the message and send it to the standby, so synchronization is more accurate and easier to synchronize, and program implementation is easier. In addition, even if a fatal failure occurs due to a malfunction, it is immediately switched over and does not transmit wrong information to the standby. This results in a dual down prevention.

상술한 목적, 특징들 및 장점은 첨부된 도면과 관련한 다음의 상세한 설명을 통하여 보다 분명해 질 것이다. 이하, 첨부된 도면을 참조하여 본 발명에 따른 바람직한 일실시예를 상세히 설명한다.The above objects, features and advantages will become more apparent from the following detailed description taken in conjunction with the accompanying drawings. Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings.

도 1 은 본 발명이 적용되는 연동 이중화 결함 허용 시스템의 구성예시도이다.1 is an exemplary configuration diagram of an interworking redundancy defect tolerance system to which the present invention is applied.

도 1에 도시된 바와 같이, 이중화를 위한 액티브 프로세서와 스탠바이 프로세서간의 경로(Path)는 신뢰성있게 데이터 및 제어 신호가 전송되어야 함으로 100베이스-T 이더넷(100base-T Ethernet)으로 구축한다.As shown in FIG. 1, a path between an active processor and a standby processor for redundancy is constructed as 100base-T Ethernet because data and control signals must be transmitted reliably.

호 처리를 담당하는 프로세서는 크게 호 설정 및 호 해제로 나누어진다. 호 설정 후에는 결함이 발생하더라도 설정된 호에 대해서는 호가 유지되나, 호 설정 과정 중에 결함이 발생하게 되면 진행중인 호는 해제되는 문제점이 발생하게 된다. 따라서, 호 설정 및 해제에 대한 과정을 트랜잭션으로 정하는 것보다는 기억장소의 내용을 변경하는 메시지 단위로 상태를 일치시킨다면, 진행중인 호에 대해서도 서비스를 연속적으로 할 수가 있어 효율적이라 할 수 있다.The processor in charge of call processing is divided into call setup and call release. After a call is set up, even if a fault occurs, the call is maintained for the set call. However, if a fault occurs during the call setup process, the call in progress is released. Therefore, if the status is matched by a message unit that changes the contents of a storage location, rather than a process for establishing and releasing a call, the service can be continuously performed even for an ongoing call.

도 2 는 본 발명에 따른 무선통신시스템에서 실시간 결함 허용을 위한 이중화 방법에 대한 일실시예 흐름도이다.2 is a flowchart illustrating a duplication method for real-time fault tolerance in a wireless communication system according to the present invention.

도 2에 도시된 바와 같이, 초기화시에 이중화 모듈로 자신의 기억장소 영역을 등록하는 등록 메시지를 전송한다(201).As shown in FIG. 2, upon initialization, a registration message for registering its storage area is transmitted to the duplication module (201).

이어서, IPC(Inter Processor Communication) 단에서 외부 메시지를 수신하여(202) 해당 응용 블록에서 메시지를 처리한 후(203), 이중화 모듈로 백업 데이터(backup data)를 전송한다(204).Subsequently, an external message is received at the Inter Processor Communication (IPC) terminal (202), the message is processed in the corresponding application block (203), and then backup data (backup data) is transmitted to the duplication module (204).

그리고 나서, 스탠바이측으로 백업 데이터를 전송할 것인지를 판단하여(205), 백업 데이터를 전송하지 않으면 IPC단에서 외부메시지를 수신하는 과정(202)으로 진행하고, 백업 데이터를 전송하면 이중화 모듈로 체크포인트 메시지를 전송한다(206).Then, it is determined whether to transmit the backup data to the standby side (205). If the backup data is not transmitted, the process proceeds to the step 202 of receiving an external message from the IPC stage. Send 206.

즉, 응용 블록 기반 이중화 방식은 액티브측과 스탠바이측에 적재되는 각 응용 태스크(Task)들은 초기화시에 이중화 모듈로 자신의 기억장소 영역을 등록한다. 이과정은 액티브 프로세서와 스탠바이 프로세서 양쪽에서 똑같은 절차에 의해 수행되어야 한다. 그런 다음 액티브 프로세서 및 스탠바이 프로세서는 동시에 호 메시지를 받는다.That is, in the application block-based duplication method, each application task loaded on the active side and the standby side registers its storage area with the duplication module at initialization. This process must be performed by the same procedure on both the active and standby processors. The active processor and standby processor then receive the call message at the same time.

도 3a 는 본 발명에 따른 실시간 결함 허용을 위한 이중화 방법 중 액티브측 이중화 모듈의 동작 과정에 대한 일실시예 흐름도이다.3A is a flowchart illustrating an operation of an active side duplication module in a duplication method for real time fault tolerance according to the present invention.

도 3a에 도시된 바와 같이, 액티브측 이중화 모듈에서의 동작 과정은, 외부 통신 채널(수신/송신(Rx/Tx))이 인에이블(enable) 상태이면(301) 각 응용 블록들로부터 등록 메시지를 수신하여(302) 각 응용 블록별로 기억장소 영역을 관리하도록 한다(303).As shown in FIG. 3A, the operation process of the active-side redundancy module may register a registration message from each application block if the external communication channel (receive / transmit (Rx / Tx)) is enabled (301). Received (302) to manage the storage area for each application block (303).

이어서, IPC단을 통해 메시지 정보를 수신하고(304), 스탠바이측의 이중화 모듈로 메시지 정보를 전송하며(305), 응용 블록으로부터 백업 메시지를 수신하여(306) 각 응용 블록별로 백업 데이터를 관리한다(307).Subsequently, the message information is received through the IPC terminal (304), the message information is transmitted to the redundant module on the standby side (305), and the backup message is received from the application block (306) to manage the backup data for each application block. (307).

이후, 응용 블록들로부터 체크포인트(checkpoint) 메시지를 수신하여(308) 각 응용 블록별로 관리하는 백업 데이터 및 메시지 정보를 스탠바이측의 이중화 모듈로 전송한다(309).Thereafter, a checkpoint message is received from the application blocks (308), and the backup data and message information managed for each application block are transmitted to the redundancy module on the standby side (309).

즉, 액티브측에 있는 IPC단은 이 메시지를 바로 해당응용 태스크로 전달하여 처리되도록 하고, 동시에 스탠바이측에 메시지에 대한 정보를 전송하도록 한다. 그리고, 메시지 처리 결과로 인해 기억장소의 내용이 변경되었다면 백업 메시지(변경된 내역 및 그 메시지에 대한 정보)를 이중화 모듈로 전송한다. 이중화 모듈은 이를 응용 태스크별로 관리하다가 그 응용 태스크가 체크포인트(checkpoint)를 호출하면 이를 스탠바이측의 이중화 모듈로 전송한다. 여기서, 체크포인트를 두는 이유는 응용 블록이 알아서 백업 메시지를 관리할 수 있도록 응용성을 부여하기 위함이다.That is, the IPC side on the active side forwards this message to the corresponding application task to be processed and at the same time transmits information about the message to the standby side. If the content of the storage location is changed as a result of the message processing, the backup message (the changed details and information on the message) is transmitted to the duplication module. The redundancy module manages this for each application task. When the application task calls a checkpoint, the redundancy module sends it to the redundancy module on the standby side. Here, the reason for the checkpoint is to give applicability so that the application block can manage the backup message by itself.

도 3b 는 본 발명에 따른 실시간 결함 허용을 위한 이중화 방법 중 스탠바이측 이중화 모듈의 동작 과정에 대한 일실시예 흐름도이다.3B is a flowchart illustrating an operation of a standby side duplication module in a duplication method for real time fault tolerance according to the present invention.

도 3b에 도시된 바와 같이, 스탠바이측 이중화 모듈에서의 동작 과정은, 외부 통신 채널(송신(Tx))이 디스에이블(disable) 상태이면(311), 각 응용 블록들로부터 등록 메시지를 수신하여(312), 각 응용 블록별로 기억장소 영역을 관리하도록 한다(313).As shown in FIG. 3B, the operation process in the standby-side redundancy module may receive a registration message from respective application blocks when the external communication channel (transmission (Tx)) is disabled (311) (see FIG. 3B). 312, the storage area is managed for each application block (313).

이어서, IPC단에서 메시지 정보를 수신하여(314), 각 응용 블록별로 메시지들을 액티브측에서 처리된 메시지순으로 정렬해서 해쉬 테이블로 관리한다(315).Subsequently, the message information is received at the IPC stage (314), and the messages are sorted in order of messages processed by the active side for each application block and managed as a hash table (315).

이후, 액티브측의 이중화 모듈로부터 백업 메시지를 수신하고(316), 백업 데이터를 기억장치 영역에 적용하고, 메시지 정보를 이용하여 액티브측에서 처리된 메시지를 제거한다(317).Thereafter, the backup message is received from the duplication module on the active side (316), the backup data is applied to the storage area, and the message processed by the active side is removed using the message information (317).

즉, 스탠바이측에서는 IPC단에서 외부 전송(Tx)을 차단하고, 외부로부터 입력된 메시지에 대해서도 해당 응용 태스크로 전송하지 않고 큐 형태로 저장한다. 그리고, 효율적인 메시지 관리 차원에서 각 메시지들에 대해서 태스크 아이디(Task ID)를 키로 하는 해쉬 테이블을 작성하여 관리한다. 액티브측에서 메시지에 대한 정보를 보내오면 이를 응용 태스크별로 관리하는 해쉬 테이블에 삽입하고, 데이터 백업 메시지를 전달받으면 자신의 기억장소에 이를 반영한다. 그리고, 백업 메시지와 함께 전송되어온 메시지 정보를 이용하여 해쉬 테이블에서 일치하는 해당 태스크를 조사하고, 그 중에서 일치하는 메시지를 찾아 그 메시지까지 이전의 모든 메시지를 삭제한다.That is, the standby side blocks the external transmission (Tx) at the IPC terminal and stores the message input from the outside in the form of a queue without transmitting the message to the corresponding application task. And, for efficient message management, a hash table with a task ID as a key is created and managed for each message. When the active side sends information about a message, it inserts it into a hash table managed by each application task, and when a data backup message is received, it is reflected in its own storage. Then, using the message information transmitted along with the backup message, the corresponding task is examined in the hash table, the matching message is found among them, and all previous messages are deleted.

도 4 는 본 발명에 따른 실시간 결함 허용을 위한 이중화 방법 중 절체시 스탠바이측의 복구 과정에 대한 일실시예 흐름도이다.Figure 4 is a flow diagram of an embodiment of a recovery process of the standby side when switching among the duplication method for real-time fault tolerance according to the present invention.

도 4에 도시된 바와 같이, 절체시 스탠바이측의 복구 과정은, 외부 통신 채널(수신/송신(Rx/Tx))이 인에이블(enable) 상태이면(401), IPC단의 버퍼에 남아있으면서 해쉬 테이블로 관리되는 메시지를 해당 응용 블록으로 전송한다(402).As shown in FIG. 4, the standby process recovery procedure is performed when the external communication channel (receive / transmit (Rx / Tx)) is enabled (401) and remains in the buffer of the IPC terminal while hashing. The message managed by the table is transmitted to the corresponding application block (402).

이후, 해쉬 테이블에 있는 모든 메시지를 해당 응용 블록으로 모두 전송했으면 액티브 상태로 역할을 변경한다(403).After all messages in the hash table have been transmitted to the corresponding application block, the role is changed to an active state (403).

즉, 액티브 프로세서에서 결함으로 인해 절체가 발생하면, 스탠바이 프로세서는 입력 큐에 삭제되지 않고 남아있는 메시지 및 새로 입력되는 호 처리 메시지를 처리한다. 이 과정에서 동일한 메시지를 이중으로 처리하는 경우가 발생할 수 있는데 이는 쓰레기 자원을 생성하게 되는 원인이 되며, 자원의 고갈을 초래할 수 있다. 그러나, 응용 블록 연동 이중화 방식에서는 이를 해결하기 위해 응용 태스크에서 그 자원이 유용한지 쓰레기인지를 체크하여 해결할 수 있다.That is, when a switchover occurs due to a defect in the active processor, the standby processor processes the remaining message and the newly input call processing message without being deleted from the input queue. In this process, the same message may be processed twice, which causes the generation of garbage resources and may cause the resource to be exhausted. However, in the application block interworking duplication method, the application task can solve the problem by checking whether the resource is useful or garbage.

도 5 는 본 발명에 따른 실시간 결함 허용을 위한 이중화 방법 중 초기 동기화 절차에 대한 일실시예 흐름도이다.5 is a flowchart illustrating an initial synchronization procedure of a duplexing method for real-time fault tolerance according to the present invention.

도 5에 도시된 바와 같이, 스탠바이측 이중화 모듈이 액티브측 이중화 모듈로 메모리 복사 요청 메시지를 전송하면(501), 스탠바이측 이중화 모듈을 통해 메모리 복사 요청 메시지를 수신한다(502).As illustrated in FIG. 5, when the standby side duplication module transmits a memory copy request message to the active side duplication module (501), the standby side duplication module receives a memory copy request message through the standby side duplication module (502).

이어서, 액티브측 이중화 모듈이 복제될 메모리 영역의 해당 응용 블록의 백업 메시지를 홀딩(holding)하고(503), 해당 응용 블록의 메모리 영역을 스탠바이측 이중화 모듈로 전송한 후(504), 해당 응용 블록의 홀딩을 해제하면(505), 스탠바이측 이중화 모듈은 이를 메모릴 영역에 반영한다(506).Subsequently, the active side redundancy module holds a backup message of the corresponding application block of the memory area to be replicated (503), transmits the memory area of the corresponding application block to the standby side redundancy module (504), and then the corresponding application block. When releasing of (505), the standby side redundancy module reflects this to the memory area (506).

즉, 초기에 액티브 프로세서가 먼저 시동되고 스탠바이가 늦게 시동되었을 때 액티브측과 스탠바이측의 기억장소 내용이 틀리므로, 이를 동기화시켜주기 위해 메모리 복제가 필요하게 된다. 그러나, 이를 일괄적으로 복제하게 되면 어느 한 순간에는 많은 양의 데이터 전송으로 인해 다른 용도의 메시지들을 잃어버릴 수 있다. 이를 해결하기 위한 방안으로, 응용 태스크별로 관리되는 기억장소 영역을 차례로 복제하며, 복제되는 기억장소 영역과 관련 있는 응용 태스크에 대해서는 백업 메시지를 전송하지 않도록 일시적으로 백업 메시지를 홀딩하는 기법을 사용한다.In other words, when the active processor is initially started and the standby is started late, the memory contents of the active side and the standby side are different, so memory duplication is necessary to synchronize them. However, replicating them in bulk can result in the loss of messages for other purposes at one moment due to the large amount of data transfer. As a solution to this problem, a storage area managed by each application task is sequentially copied, and a technique of temporarily holding a backup message so as not to transmit a backup message for an application task related to the replicated storage area is used.

이렇게 함으로써, 다른 응용 태스크들이 전송하는 메시지의 손실을 방지할 수 있다. 그리고 액티브, 스탠바이 역할 결정은 하드웨어 레벨에서 장애 체크(도 6 참조)를 통해 결정되도록 한다.This prevents the loss of messages sent by other application tasks. Active, standby role determination is made through a failure check (see FIG. 6) at the hardware level.

상술한 바와 같은 본 발명의 방법은 프로그램으로 구현되어 컴퓨터로 읽을 수 있는 기록매체(씨디롬, 램, 롬, 플로피 디스크, 하드 디스크, 광자기 디스크 등)에 저장될 수 있다.The method of the present invention as described above may be implemented as a program and stored in a computer-readable recording medium (CD-ROM, RAM, ROM, floppy disk, hard disk, magneto-optical disk, etc.).

이상에서 설명한 본 발명은 전술한 실시예 및 첨부된 도면에 의해 한정되는 것이 아니고, 본 발명의 기술적 사상을 벗어나지 않는 범위 내에서 여러 가지 치환, 변형 및 변경이 가능하다는 것이 본 발명이 속하는 기술분야에서 통상의 지식을 가진 자에게 있어 명백할 것이다.The present invention described above is not limited to the above-described embodiments and the accompanying drawings, and various substitutions, modifications, and changes are possible in the art without departing from the technical spirit of the present invention. It will be apparent to those of ordinary knowledge.

상기한 바와 같은 본 발명은, 차세대이동통신(IMT-2000) 시스템의 무선망제어기(RNC)를 결함 허용 시스템으로 개발하기 위하여, 응용 태스크들이 메시지 단위로 동작을 하면서 메모리의 변경사항을 스탠바이측에 전달하여, 절체시 복구를 위해 메시지를 로깅(logging)하는 방식을 사용하기 때문에 시스템 동작 중에 장애가 발생하더라도 설정된 호 뿐만 아니라 진행중인 호에 대해서도 연속적인 서비스를 제공할 수 있다. 이는 시스템의 안정성을 극대화시켜 효울적인 서비스를 제공할 수 있으며, 유지보수 비용을 절감시킴으로써, 서비스 가격의 인하라는 경제적 효과를 가져올 수 있다.The present invention as described above, in order to develop a radio network controller (RNC) of the next generation mobile communication (IMT-2000) system as a fault-tolerant system, the application tasks are operated in the unit of messages, the change of memory to the standby side By using this method, a message logging method is used for recovery when switching. Therefore, even if a failure occurs during system operation, continuous service can be provided not only for the established call but also for the ongoing call. This can maximize the stability of the system to provide efficient service, and by reducing the maintenance cost, the reduction in service price can bring an economic effect.

또한, 본 발명은, 호 설정을 요구하는 호, 호 설정을 진행중인 호, 이미 설정된 호에 대해서 장애가 발생하더라도 연속적인 서비스를 할 수 있으므로 간접적으로는 큰 경제적 손실을 방지할 수 있는 우수한 효과가 있다.In addition, the present invention has an excellent effect of preventing a large economic loss indirectly because continuous service can be performed even if a failure occurs for a call requiring call setup, a call in progress of call setup, or a call already established.

Claims

In the duplication method for real-time fault tolerance in a wireless communication system,

A first step of initializing the active processor and the standby processor to operate each application task loaded on the active side and the standby side in a message unit;

A second step of registering each application task in its memory for each application block and executing an active processor; And

The third step of transferring the changes of the memory to the standby processor while operating the application tasks in the message unit, and in the event of a transfer due to a fault on the active side, to write log information on the message for recovery to send to the standby side

Redundancy method for real-time fault tolerance, including.

The method of claim 1,

The first step is,

During initialization, if the active processor is started first and the standby is started late, the memory contents of the active side and the standby side are wrong and memory duplication is necessary to synchronize the memory areas. A method of redundancy for real-time fault tolerance, which employs a technique of duplicating and temporarily holding a backup message so as not to send a backup message for application tasks related to the area of memory being replicated.

The method according to claim 1 or 2,

The second step,

Through the IPC (Inter Processor Communication) terminal on the active side, the message is delivered to the corresponding application task to be processed and the information on the message is transmitted to the standby side. If a change is made, a backup message (change history and information on the message) is transmitted to the redundancy module, and the redundancy module manages it for each application task, and when the application task calls a checkpoint, Redundancy method for real-time fault tolerance, characterized in that the transmission to the redundancy module.

The method of claim 3, wherein

The third step,

Blocking the external transmission (Tx) in the IPC stage, and stores the message input from the outside in the form of a queue without transmitting to the application task, and task ID for each message for efficient message management It creates and manages a hash table with keys, and inserts it into the hash table managing the application task when information on the message is delivered from the active side, and reflects it in its own storage when the data backup message is received. Investigate the corresponding task in the hash table using the message information transmitted with the message, find a matching message among them, and delete all previous messages up to the message. .

The method of claim 4, wherein

The transfer recovery process due to the defect of the active side,

When a transfer occurs due to a defect in the active processor, the standby processor processes the remaining message and the newly input call processing message without being deleted from the input queue, and checks whether the resource is useful or garbage in the application task. Redundancy method for real-time fault tolerance, characterized in that the recovery.

In a wireless communication system having a processor,

A first function of initializing the active processor and the standby processor to operate each application task loaded on the active side and the standby side in a message unit;

A second function of executing the active processor after registering the respective application tasks in its own memory for each application block; And

A third function of transmitting a change in memory to a standby processor while operating the application tasks in a message unit, and writing log information on a message for recovery and transmitting the changed information to a standby side when a changeover occurs due to a fault on the active side;

A computer-readable recording medium having recorded thereon a program for realizing this.