CN1134135C - Communication method applicable to double-network fault-tolerance system - Google Patents

Communication method applicable to double-network fault-tolerance system

Info

Publication number
CN1134135C
Authority
CN
China
Prior art keywords
data
frame
communication
fault
send
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB001275682A
Other languages
Chinese (zh)
Other versions
CN1354575A (en)
Inventor
胡德波
涂晓平
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
CHINA TECHNOLOGY EXCHANGE CO., LTD.
State Grid Beijing Electric Power Co Ltd
State Grid Economic and Technological Research Institute
Original Assignee
ZTE Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ZTE Corp
Priority to CNB001275682A
Publication of CN1354575A
Application granted
Publication of CN1134135C
Anticipated expiration
Expired - Fee Related

Landscapes

  • Data Exchanges In Wide-Area Networks (AREA)
  • Computer And Data Communications (AREA)

Abstract

The present invention discloses a communication method applicable to a double-network fault-tolerant system. In the method, data is sent using a 'send, wait for acknowledgment, retransmit on timeout' mechanism and received using a 'verify, filter retransmitted data, acknowledge' mechanism. If a fault detection point is triggered during communication, failover is carried out, and the send time and send count of the data frames in the send queue that were sent over the failed connection but have not been acknowledged are reinitialized, so that these data frames are resent over the other, normal connection. The method not only raises the reliability of the system but also guarantees that no data is lost during network failover.

Description

A communication method applicable to a double-network fault-tolerant system
The present invention relates to the field of communications, and in particular to a communication method for a double-network fault-tolerant system on a local area network (LAN).
At present, application systems built on local area networks are growing steadily. Their functions and scale keep expanding, and they are used in finance, telecommunications, electric power, factories and mines, schools, government and other sectors, reaching almost every part of the national economy. In these LAN-based application systems, the communication reliability of the network is critical to normal operation, because a network communication failure prevents the system from operating normally. In earlier systems, however, network communication reliability did not receive enough attention, because it was generally believed to be sufficiently high. As these application systems grow in function and scale, the reliability requirements rise as well.
There are two approaches to improving the communication reliability of a network system: fault avoidance and fault tolerance. Fault avoidance means preventing faults from occurring. It improves system reliability by strictly screening the components that make up the system, testing the system rigorously, and reducing interference from the external environment. Fault avoidance can improve reliability to a certain degree, but it has limits and is not suited to applications with especially high reliability requirements, which is why the other approach, fault tolerance, emerged. Fault tolerance means tolerating faults: the system still completes its work correctly when a hardware fault occurs or a software error is present. A commonly used fault-tolerance technique today is dual-network fault tolerance, which improves the reliability of network communication through redundant network cards, network cables and hubs. In such a double-network fault-tolerant system the data communication mechanism is very important. Only a suitable mechanism can guarantee both high reliability and high efficiency of communication and ensure that no data is lost during failover between the two networks; otherwise the benefit of dual-network fault tolerance cannot be realized. The communication mechanisms in traditional dual-network systems do not adequately solve the problems of high communication reliability and lossless failover, so the benefit of dual-network fault tolerance has not been fully realized.
The object of the present invention is to overcome the above problems of low data communication reliability and easy data loss by proposing a communication method for a double-network fault-tolerant system that not only gives the system very high reliability but also causes no data loss during failover after a network fault.
To achieve this object, the communication method of the double-network fault-tolerant system mainly comprises:
1. The double-network fault-tolerant system converts data with frame structure F1, passed in by the application module, into data with frame structure F2 and stores it in the send queue;
2. According to the requirements of the application system, data is sent or received, or both are carried out simultaneously;
3. If a fault detection point is triggered during communication processing, failover and fault recovery processing are carried out.
Frame structure F1 comprises the host ID, the data type, the data frame length, and the data frame.
Frame structure F2 comprises the host ID, the data type, the data frame length, the data frame, the sequence number, the send time, and the send count.
In this communication method, sending data in step 2 uses a "send -> wait for acknowledgment -> retransmit on timeout" mechanism, and receiving data uses a "verify -> filter retransmitted data -> acknowledge" mechanism.
The present invention is described in further detail below with reference to the drawings and embodiments:
Fig. 1.1 and Fig. 1.2 show the structures of the data frames F1 and F2 used by the present invention, respectively.
Fig. 2 is the data sending flow of the present invention.
Fig. 3 is the data receiving flow of the present invention.
As shown in Fig. 1.1, frame structure F1 is the data structure used for interaction between the double-network fault-tolerant system and the upper-layer application system. It is used when the application system wants to send data and when the double-network fault-tolerant system passes received data up to the application system.
Frame structure F1 comprises the host ID, the data type, the data frame length, and the data frame. For a frame to be sent, the host ID is the ID of the destination host; for a received frame, it is the ID of the host the data came from. The host ID is associated with IP addresses: because each host has two IP addresses, one host ID corresponds to both of them, and the mapping is kept in a configuration file or stored in a database for centralized management. In a frame to be sent, the data type indicates the importance of the data and determines the frame's position in the send queue: 0 is the highest priority and is placed closest to the front of the send list, 1 is next, and so on. In a received frame, the data type indicates the category of the data and distinguishes ordinary data from channel status. Channel status includes: link initialization succeeded, link initialization failed, link interrupted, link recovered, send queue full, send queue changed from full to having free space, and so on. Returning the channel status lets the upper-layer application perform flow control. The data frame length is the length of the data frame that the application system sends or receives. The data frame is the actual data that the application system sends or receives.
Frame structure F2 is the communication frame structure used internally by the double-network fault-tolerant system. As shown in Fig. 1.2, frame structure F2 comprises the host ID, the data type, the data frame length, the data frame, the sequence number, the send time, and the send count.
The sequence number is a unique ID for each frame of data. After a host receives data and the checksum verifies correctly, it sends the sequence number back to the sender; this is the acknowledgment returned by the receiver. The send time is the time at which the frame was last sent; it is used to decide whether the frame has timed out and therefore needs to be retransmitted. The send count is the number of times the frame has been sent; it is used to decide whether the frame has exceeded the send limit, in which case the communication connection is considered to have failed. The remaining fields have the same meaning as in frame structure F1.
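The two frame structures and the conversion from F1 to F2 can be sketched as follows. This is only an illustration under stated assumptions, not the patented implementation: the field names, the in-memory types, the counter used to generate sequence numbers, and the helper names `to_f2` and `enqueue` are all hypothetical, since the text specifies only which fields each frame contains and that data type 0 is queued first.

```python
from dataclasses import dataclass
from itertools import count
from typing import Optional


@dataclass
class FrameF1:
    host_id: int      # destination host when sending, source host when receiving
    data_type: int    # priority when sending (0 is highest)
    length: int       # length of the data frame in bytes
    payload: bytes    # the application data itself


@dataclass
class FrameF2:
    host_id: int
    data_type: int
    length: int
    payload: bytes
    seq: int                           # unique per frame, echoed back as the acknowledgment
    last_sent: Optional[float] = None  # time of the last send, None if never sent
    send_count: int = 0                # how many times the frame has been sent


_next_seq = count(1)  # assumed sequence-number source; the text does not specify one


def to_f2(f1: FrameF1) -> FrameF2:
    """Wrap an application frame (F1) in the internal frame structure (F2)."""
    return FrameF2(f1.host_id, f1.data_type, f1.length, f1.payload, seq=next(_next_seq))


def enqueue(send_queue: list, frame: FrameF2) -> None:
    """Insert by priority: lower data_type goes first, FIFO within one priority."""
    pos = len(send_queue)
    for i, queued in enumerate(send_queue):
        if queued.data_type > frame.data_type:
            pos = i
            break
    send_queue.insert(pos, frame)
```

A new F2 frame starts with no send time and a send count of zero, which is also the state that failover restores when it reinitializes a frame.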
As shown in Fig. 2, sending data in the present invention comprises the following steps:
First, determine whether the send queue is empty. If it is empty, periodically send a probe data frame to check whether the network connection is normal. If it is not empty, take the first frame and determine whether the time since it was last sent exceeds the timeout; if not, return directly; if so, go to the next step. If a normal communication connection exists, determine whether the frame's send count exceeds the limit. If it does not, resend the frame, increment its send count by 1, and set its send time to the current time. If the send count exceeds the limit, the communication connection is considered to have failed: mark the connection as interrupted and start the switchover process. During failover, the send time and send count of the frames in the send queue that were sent over this connection but have not been acknowledged are reinitialized, so all frames that have not been acknowledged are resent over the other, normal connection, which guarantees that no data is lost.
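One pass of the sending flow can be sketched as below. The timeout and send limit values, the `Frame` fields, the `conns` dictionary of connection states, and the `transmit` callback are assumptions made for illustration; the text gives the decision order but no concrete values or interfaces. For simplicity the sketch sends a probe on every call with an empty queue, whereas the text sends probes periodically.

```python
from dataclasses import dataclass
from typing import Optional

TIMEOUT = 2.0   # seconds to wait for an acknowledgment (assumed value)
MAX_SENDS = 3   # send limit per frame (assumed value)


@dataclass
class Frame:
    seq: int
    payload: bytes
    last_sent: Optional[float] = None  # None means never sent
    send_count: int = 0
    conn: Optional[str] = None         # connection the frame was last sent over


def send_pass(queue, conns, active, now, transmit):
    """Run one pass of the send flow and return the action taken.

    queue    -- list of Frame, head first
    conns    -- dict mapping connection name to True (normal) / False (interrupted)
    active   -- name of the connection currently used for sending
    transmit -- callable(conn_name, frame_or_probe) that puts data on the wire
    """
    if not queue:
        transmit(active, "PROBE")      # idle: check that the connection still works
        return "probe"
    frame = queue[0]
    if frame.last_sent is not None and now - frame.last_sent < TIMEOUT:
        return "wait"                  # still waiting for the acknowledgment
    if not conns[active]:
        return "failover"              # no normal connection to send on
    if frame.send_count >= MAX_SENDS:
        conns[active] = False          # mark the connection as interrupted
        return "failover"
    transmit(active, frame)
    frame.send_count += 1
    frame.last_sent = now
    frame.conn = active
    return "sent"
```

Note that a sent frame stays at the head of the queue; it is removed only when its acknowledgment arrives, which is handled on the receive side.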
As shown in Fig. 3, receiving data in the present invention comprises the following steps:
When data is received, first determine whether the checksum is correct. If it is incorrect, return directly; if it is correct, go to the next step. If the data is an acknowledgment frame returned by the peer, extract the sequence number from it and delete the corresponding data frame from the send queue. If the data is a probe data frame that the peer sent while its send queue was idle, send an acknowledgment frame directly. If the data is ordinary data, the receiver consults a list it keeps of the sequence numbers of recently received data frames. If the sequence number of this frame is already in the list, the frame is a duplicate, so it is not processed and an acknowledgment frame is sent directly; otherwise the data is assembled into a frame and passed up to the application system for processing, and then an acknowledgment frame is sent. In this way duplicate data is discarded and the same data is not processed twice.
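The receiving flow can be sketched as follows. Several details are assumptions: frames are modeled as dictionaries with a `kind` field, CRC-32 from `zlib` stands in for the unspecified checksum, and the recent-sequence list is a bounded `deque` whose size `RECENT_LIMIT` is arbitrary.

```python
import zlib
from collections import deque

RECENT_LIMIT = 1024  # how many recent sequence numbers to remember (assumed)


def make_frame(kind, seq, payload=b""):
    """Build a frame dict with a CRC-32 checksum (the checksum choice is an assumption)."""
    return {"kind": kind, "seq": seq, "payload": payload, "crc": zlib.crc32(payload)}


def checksum_ok(frame):
    return zlib.crc32(frame["payload"]) == frame["crc"]


def handle_received(frame, send_queue, recent_seqs, deliver, send_ack):
    """Process one received frame and return the action taken."""
    if not checksum_ok(frame):
        return "dropped"                        # bad checksum: ignore, sender will retry
    if frame["kind"] == "ack":
        # The peer confirmed a frame: remove it from our send queue.
        send_queue[:] = [f for f in send_queue if f["seq"] != frame["seq"]]
        return "acked"
    if frame["kind"] == "probe":
        send_ack(frame["seq"])                  # idle-time probe: just acknowledge
        return "probe-ack"
    if frame["seq"] in recent_seqs:
        send_ack(frame["seq"])                  # duplicate: acknowledge again, do not deliver
        return "duplicate"
    recent_seqs.append(frame["seq"])
    deliver(frame["payload"])                   # hand the data up to the application
    send_ack(frame["seq"])
    return "delivered"
```

Acknowledging a duplicate again matters: the duplicate usually means the first acknowledgment was lost, so staying silent would make the sender keep retransmitting until it wrongly declares the connection failed.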
The application of this method in a system is described in detail below:
Under normal conditions, the double-network fault-tolerant system is equipped with two hubs, and each host has two network cards and two IP addresses (IP1 and IP2). Two hosts that need to communicate establish two connections simultaneously, one through IP1 and one through IP2. Each connection can carry normal data communication, and both connections share the same send data queue.
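As a small illustration of this topology, the mapping from a host ID to its two IP addresses and the resulting pair of connections might look like the sketch below. The `HOSTS` table, the addresses (taken from documentation-only ranges) and the function name `connection_pairs` are hypothetical; the text says only that the mapping lives in a configuration file or a database.

```python
# Hypothetical host table: host ID -> (IP1, IP2), one address per network.
HOSTS = {
    1: ("192.0.2.1", "198.51.100.1"),
    2: ("192.0.2.2", "198.51.100.2"),
}


def connection_pairs(local_id, peer_id, hosts=HOSTS):
    """Return the two (local address, peer address) pairs, one per network."""
    local, peer = hosts[local_id], hosts[peer_id]
    return {
        "IP1": (local[0], peer[0]),  # connection over the first network
        "IP2": (local[1], peer[1]),  # connection over the second network
    }
```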
First, the application system passes data with frame structure F1 to the double-network fault-tolerant system, which converts the received data so that it has frame structure F2. When a frame is to be sent, first determine whether the time since it was last sent exceeds the timeout; if not, process the next frame; if so, go to the next step. If a normal communication connection exists, determine whether the frame's send count exceeds the limit. If it does not, send the frame, increment its send count by 1, and set its send time to the current time. A data frame is not deleted immediately after it is sent; it is deleted only after the receiver's acknowledgment arrives. If the send count exceeds the limit, the communication connection is considered to have failed: mark the connection as interrupted, trigger the fault detection point, and start the switchover process.
Because two connections are maintained simultaneously between two communicating hosts, when one connection fails the other, normal connection can be set as the primary connection, which achieves fast failover. When the switchover starts, the frames in the send queue that were sent over the failed connection but have not yet been acknowledged are reinitialized, that is, their send time and send count are reset. Because both network connections use the same send queue and the "send -> wait for acknowledgment -> retransmit on timeout" mechanism is used, all unacknowledged frames are then resent over the other, normal connection. This guarantees that no data is lost and also simplifies processing. Fault recovery begins after the switchover completes. The fault-tolerant system continuously checks whether the failed connection has recovered; if it has, the connection is either set to standby (active/standby operating mode) or added back into normal communication (load-sharing operating mode).
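The switchover step can be sketched as below, assuming each queued frame records which connection it was last sent over. The `Frame` fields, the `conns` state dictionary and the function name `fail_over` are illustrative assumptions, and fault recovery (probing the failed connection and returning it to standby or load sharing) is left out.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Frame:
    seq: int
    conn: Optional[str] = None         # connection the frame was last sent over
    last_sent: Optional[float] = None  # None means "send as soon as possible"
    send_count: int = 0


def fail_over(queue, conns, failed):
    """Mark `failed` as interrupted, reset its unacknowledged frames,
    and return the name of a remaining normal connection (or None)."""
    conns[failed] = False
    for frame in queue:
        # Everything still in the queue is unacknowledged by definition.
        if frame.conn == failed:
            frame.last_sent = None     # forget the send time ...
            frame.send_count = 0       # ... and the send count
            frame.conn = None
    for name, ok in conns.items():
        if ok:
            return name                # becomes the new primary connection
    return None
```

Because the frames never leave the shared queue, no data has to be copied between connections: resetting the two fields is enough for the ordinary timeout-retransmission pass to pick them up on the new primary connection.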
After data is received, it is processed only if the checksum is judged correct. If the received data is an acknowledgment frame returned by the peer, the sequence number is extracted from it and the corresponding data frame is deleted from the send queue. If the received data is a probe data frame sent by the peer while its send queue was idle, an acknowledgment frame is sent directly. If the received data is ordinary data, the same frame may have been sent more than once, because the communication system uses timeout retransmission. The receiver therefore keeps a list of the sequence numbers of recently received data frames and uses it to determine whether the data is a duplicate. A duplicate is not processed; otherwise the data is assembled into a frame and passed up to the application system for processing, and then an acknowledgment frame is sent.
With the communication method of the present invention, the "send -> wait for acknowledgment -> retransmit on timeout" communication mechanism is applied in a double-network fault-tolerant system to achieve reliable communication, and filtering of retransmitted data is added to receive processing so that no data frame is processed twice. On top of this mechanism, because both network connections use the same send queue, failover only requires reinitializing the frames in the send queue that were sent over the failed connection but not yet acknowledged. These frames are then resent over the other, normal connection, which guarantees that no data is lost and at the same time simplifies processing.

Claims (5)

1. A communication method applicable to a double-network fault-tolerant system, characterized in that it mainly comprises the following steps:
Step 1: the double-network fault-tolerant system converts data with frame structure F1, passed in by the application system, into data with frame structure F2 and stores it in the send queue;
Step 2: according to the requirements of the application system, data is sent or received, or both are carried out simultaneously;
Step 3: if a fault detection point is triggered during communication processing, failover and fault recovery processing are carried out;
wherein frame structure F1 comprises the host ID, the data type, the data frame length, and the data frame;
and frame structure F2 comprises the host ID, the data type, the data frame length, the data frame, the sequence number, the send time, and the send count.
2. The communication method as claimed in claim 1, characterized in that sending data in step 2 uses a "send -> wait for acknowledgment -> retransmit on timeout" mechanism, and receiving data uses a "verify -> filter retransmitted data -> acknowledge" mechanism.
3. The communication method as claimed in claim 2, characterized in that receiving data in step 2 specifically comprises:
first determining whether the checksum is correct; if it is incorrect, returning directly; if it is correct, going to the next step;
if the received data is an acknowledgment frame returned by the peer, extracting the sequence number from it and deleting the corresponding data frame from the send queue;
if the received data is a probe data frame sent by the peer while its send queue was idle, sending an acknowledgment frame directly;
if the received data is ordinary data, determining from the receiver's list of recently received data frames whether the sequence number of this data frame is already present; if it is, sending an acknowledgment frame directly; otherwise assembling the data into a frame, passing it up to the application system for processing, and then sending an acknowledgment frame.
4. The communication method as claimed in claim 2, characterized in that sending data in step 2 specifically comprises:
first determining whether the send queue is empty; if it is empty, periodically sending a probe data frame; if it is not empty, going to the next step;
taking the first frame and determining whether the time since it was last sent exceeds the timeout; if not, returning directly; if so, determining whether a normal communication connection exists;
if no normal communication connection exists, marking the connection as interrupted, triggering the fault detection point, and starting the failover process; otherwise determining whether the frame's send count exceeds the limit;
if the send count does not exceed the limit, resending the frame, incrementing its send count by 1, and setting its send time to the current time;
if the send count exceeds the limit, considering the communication connection to have failed, marking the connection as interrupted, triggering the fault detection point, and starting the failover process.
5. The communication method as claimed in claim 4, characterized in that the failover process comprises: reinitializing the send time and send count of the data frames in the send queue that were sent over the failed connection but have not been acknowledged, so that all unacknowledged data frames are resent over the other, normal connection.
CNB001275682A 2000-11-22 2000-11-22 Communication method applicable to double-network fault-tolerance system Expired - Fee Related CN1134135C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB001275682A CN1134135C (en) 2000-11-22 2000-11-22 Communication method applicable to double-network fault-tolerance system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB001275682A CN1134135C (en) 2000-11-22 2000-11-22 Communication method applicable to double-network fault-tolerance system

Publications (2)

Publication Number Publication Date
CN1354575A CN1354575A (en) 2002-06-19
CN1134135C true CN1134135C (en) 2004-01-07

Family

ID=4592581

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB001275682A Expired - Fee Related CN1134135C (en) 2000-11-22 2000-11-22 Communication method applicable to double-network fault-tolerance system

Country Status (1)

Country Link
CN (1) CN1134135C (en)

Also Published As

Publication number Publication date
CN1354575A (en) 2002-06-19

Legal Events

Date Code Title Description
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C06 Publication
PB01 Publication
C14 Grant of patent or utility model
GR01 Patent grant
C56 Change in the name or address of the patentee

Owner name: ZTE CO., LTD.

Free format text: FORMER NAME OR ADDRESS: SHENZHENG CITY ZTE CO., LTD.

CP03 Change of name, title or address

Address after: 518057 Zhongxing building, science and technology south road, Nanshan District hi tech Industrial Park, Guangdong, Shenzhen

Patentee after: ZTE Corporation

Address before: 518057, Shenzhen Nanshan District hi tech Industrial Park, science and technology south road, ZTE building, 6 floor of the Ministry of law

Patentee before: Zhongxing Communication Co., Ltd., Shenzhen City

ASS Succession or assignment of patent right

Owner name: STATE GRID BEIJING ELECTRIC POWER COMPANY CHINA TE

Effective date: 20140129

Owner name: BEIJING POWER ECONOMIC RESEARCH INSTITUTE

Free format text: FORMER OWNER: ZTE CORPORATION

Effective date: 20140129

COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: 518057 SHENZHEN, GUANGDONG PROVINCE TO: 100055 XICHENG, BEIJING

TR01 Transfer of patent right

Effective date of registration: 20140129

Address after: 100055 No. 15 West Street, Guanganmen station, Beijing, Xicheng District

Patentee after: State Power Economic Research Institute

Patentee after: State Grid Beijing Electric Power Company

Patentee after: CHINA TECHNOLOGY EXCHANGE CO., LTD.

Address before: 518057 Nanshan District science and Technology Industrial Park, Guangdong high tech Industrial Park, ZTE building

Patentee before: ZTE Corporation

C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20040107

Termination date: 20131122