CN105812181A - Distributed communication system maintenance method for high-speed maglev transportation simulation

Info

Publication number
CN105812181A
Authority
CN
China
Prior art keywords
address list
hosts
communication system
main frame
distributed communication
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610136060.5A
Other languages
Chinese (zh)
Other versions
CN105812181B (en)
Inventor
袁时金
马骕
穆斌
林晔
张旭彤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tongji University
Original Assignee
Tongji University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2016-03-10
Filing date
2016-03-10
Publication date
2016-07-27
Application filed by Tongji University
Priority to CN201610136060.5A
Publication of CN105812181A
Application granted
Publication of CN105812181B
Legal status: Expired - Fee Related

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/01 Protocols
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/14 Network analysis or design
    • H04L41/145 Network analysis or design involving simulating, designing, planning or modelling of a network
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/08 Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0823 Errors, e.g. transmission errors
    • H04L43/0829 Packet loss
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/08 Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0823 Errors, e.g. transmission errors
    • H04L43/0847 Transmission error
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/18 Protocol analysers
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/50 Network services
    • H04L67/60 Scheduling or organising the servicing of application requests, e.g. requests for application data transmissions using the analysis and optimisation of the required network resources
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L69/00 Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
    • H04L69/16 Implementation or adaptation of Internet protocol [IP], of transmission control protocol [TCP] or of user datagram protocol [UDP]
    • H04L69/164 Adaptation or special uses of UDP protocol

Abstract

The invention relates to a distributed communication system maintenance method for high-speed maglev transportation simulation. The distributed communication system comprises at least one host, and the method comprises the following steps: (1) the software structure of the communication middleware is layered; (2) an IP address list is initialized; (3) a worker thread sends a heartbeat message to all hosts in sequence and updates, in the IP address list, the entry timestamps of the hosts that return a heartbeat response message; (4) whether a host fault exists is judged according to the entry timestamps; if so, the method goes to step (5); otherwise, the worker thread updates the entry timestamps of the hosts in the IP address list; (5) the hosts that have failed are defined as faulty hosts and the fault information is transmitted to all other hosts; the faulty hosts are then repaired, after which the recovery information is transmitted to all other hosts, so that the system resumes normal communication. Compared with the prior art, the method has the advantages of improving the stability and fault tolerance of the system, among others.

Description

A distributed communication system maintenance method for high-speed maglev transportation simulation
Technical field
The invention belongs to the field of distributed system communication, and in particular relates to a distributed communication system maintenance method for high-speed maglev transportation simulation.
Background art
In a distributed system, the modules cooperate to carry out the functions of the system by exchanging messages, so the performance and quality of service of message communication directly determine the quality of service that the distributed system delivers. In the cross-platform interactive environment of high-speed maglev transportation simulation, the subsystem hosts communicate with one another, and the internal structure of the subsystems changes from time to time.
Summary of the invention
The purpose of the present invention is to overcome the defects of the above prior art by providing a distributed communication system maintenance method for high-speed maglev transportation simulation that improves system stability and fault tolerance.
The purpose of the present invention can be achieved through the following technical solution: a distributed communication system maintenance method for high-speed maglev transportation simulation, the distributed communication system comprising at least one host, the method comprising the following steps:
(1) the software structure of the communication middleware is layered;
(2) the IP address list is initialized;
(3) a worker thread sends a heartbeat message to all hosts in sequence and updates, in the IP address list, the entry timestamp of each host that returns a heartbeat response message;
(4) according to the entry timestamps from step (3), it is judged whether a host has failed; if so, go to step (5); otherwise, the worker thread updates the host's entry timestamp in the IP address list;
(5) the host that has failed is defined as the failed host and the fault information is transmitted to all other hosts; the failed host is then repaired, and after the fault is repaired the recovery information is transmitted to all other hosts, so that the system resumes normal communication.
Step (1) is specifically: a variant of the "data-logic-presentation" three-tier structure is adopted, and the software structure of the communication middleware is divided into a network communication layer, a reliability assurance layer and an IPC layer. The network communication layer and the reliability assurance layer are both implemented using the ADAPTIVE Communication Environment (ACE), and are used to complete the design of the classes in the reliability assurance layer and the definition of the IP address list and the communication state maintenance message. The IP address list records the state of the hosts in the network and is maintained by the worker thread CThreadForNetManage in the reliability assurance layer.
Step (2) is specifically: when the network manager class in the reliability assurance layer finishes reading the configuration file, the worker thread inserts the IP addresses read from the configuration file into the IP address list, the connection between each IP address and its host is assumed to be valid by default, and the insertion time of each IP address is written into the IP address list as its timestamp. When the communication layer starts for the first time, its IP address list is empty. The network manager class is a class in the communication middleware that provides functions, such as initNetBoss, for initializing, adding and deleting related resources; in the present invention it mainly refers to functions such as initNetBoss in the CNetBoss class.
Step (3) is specifically: under normal conditions, starting K seconds after the IP address list has been initialized (0 < K ≤ 15) and then every N seconds (5 ≤ N ≤ 30), the worker thread sends a heartbeat message to all hosts in sequence. After the worker thread receives the heartbeat response message with the corresponding sequence number, it updates the corresponding host's entry timestamp in the IP address list to the time at which the heartbeat response message was received. N defaults to 15 seconds and can be specified in the configuration file.
Step (4) is specifically: it is judged whether the difference between the entry timestamp and the current time is greater than or equal to 2N ± 1 seconds; if so, a host has failed and step (5) is performed; otherwise, the worker thread updates the host's entry timestamp in the IP address list to the time of the heartbeat response message.
Repairing the failed host in step (5) is specifically: the validity field of the failed host's entry timestamp is changed to invalid, heartbeat messages are no longer sent to the failed host, and the network manager class is notified to delete the resources related to the failed host. If an application queries the network state at this point, it also receives a notification that the failed host is unreachable. In addition to updating the local IP address list, the worker thread CThreadNetManage also sends a host network fault broadcast to the other hosts in the IP address list, excluding the failed host.
In step (5), after the fault is repaired, the network manager class reads the configuration file and adds the IP addresses of the hosts with which transmission is required to the IP address list, then sends heartbeat messages to these hosts to test the connection state. The first host that receives this heartbeat message sets all entries related to the failed host in its IP address list to valid, and then sends a host network fault recovery broadcast message to all other hosts in the IP address list. Through a short period of recursive broadcasting, the system recovers from the fault state quickly, which reduces the time spent running in an abnormal state and improves reliability.
Compared with the prior art, the present invention has the following advantages:
(1) in the present invention, the network communication layer and the reliability assurance layer are both implemented using the ADAPTIVE Communication Environment (ACE), and a link information protocol exchanged between subsystems is defined so that connection states are exchanged in real time; state maintenance is thereby completed efficiently across the whole communication layer, the temporal coupling between subsystems is reduced, and system stability and fault tolerance are improved;
(2) through recursive broadcasting, the system recovers from the fault state quickly, which reduces the time spent running in an abnormal state and improves reliability.
Brief description of the drawings
Fig. 1 is the flow chart of the present invention.
Detailed description
The present invention is described in detail below with reference to the drawing and a specific embodiment.
As shown in Fig. 1, a distributed communication system maintenance method for high-speed maglev transportation simulation, in which the distributed communication system comprises at least one host, comprises the following steps:
(1) A variant of the "data-logic-presentation" three-tier structure is adopted, and the software structure of the communication middleware is divided into a network communication layer, a reliability assurance layer and an IPC layer. The network communication layer and the reliability assurance layer are both implemented using the ADAPTIVE Communication Environment (ACE), and are used to complete the design of the classes in the reliability assurance layer and the definition of the IP address list and the communication state maintenance message. The IP address list records the state of the hosts in the network and is maintained by the worker thread CThreadForNetManage in the reliability assurance layer. In this variant layered structure, the layers interact through interfaces; the components within a layer are tightly coupled, while the layers themselves are loosely coupled.
The design of the classes in the reliability assurance layer is completed as follows: CWorkerThread is defined to inherit from the ACE thread wrapper class ACE_Task, and the CNetboss class is defined to be responsible for creating, managing and destroying the objects in the whole layer; the CNetboss class is the core of the whole reliability assurance layer.
The IP address list and the communication state maintenance message are defined. The IP address list records the state of the hosts in the network; each entry in the list represents one host connected to the local machine, and the list is maintained by the worker thread CThreadForNetManage in the reliability assurance layer. The communication state maintenance message belongs to the reliability assurance layer protocol; its operation codes 0x01 to 0x05 correspond, in order, to the data message, the heartbeat message, the heartbeat response message, the host network fault broadcast and the host network fault recovery broadcast.
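The operation codes listed above can be illustrated with a minimal Python sketch. Only the five code values and their order come from the description; the enum member names, the 5-byte header layout (1-byte operation code followed by a 4-byte sequence number in network byte order) and the helper functions are assumptions made for illustration, since the patent does not specify the wire format and the actual middleware is built on ACE in C++.

```python
import struct
from enum import IntEnum
from typing import Tuple


class OpCode(IntEnum):
    """Operation codes 0x01-0x05 in the order given in the description."""
    DATA = 0x01
    HEARTBEAT = 0x02
    HEARTBEAT_RESPONSE = 0x03
    FAULT_BROADCAST = 0x04
    RECOVERY_BROADCAST = 0x05


# Hypothetical header: 1-byte operation code + 4-byte sequence number,
# network byte order. This layout is an assumption, not taken from the patent.
HEADER = struct.Struct("!BI")


def pack_message(op: OpCode, seq: int, payload: bytes = b"") -> bytes:
    """Serialize one communication state maintenance message."""
    return HEADER.pack(op, seq) + payload


def unpack_message(raw: bytes) -> Tuple[OpCode, int, bytes]:
    """Parse a message back into (operation code, sequence number, payload)."""
    op, seq = HEADER.unpack_from(raw)
    return OpCode(op), seq, raw[HEADER.size:]
```

Carrying a sequence number in every message is what lets the worker thread match a heartbeat response to the heartbeat that triggered it, as step (3) requires.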
(2) When the communication layer starts for the first time, its IP address list is empty. When the initNetboss (void) function of the network manager class in the reliability assurance layer finishes reading the configuration file, the worker thread CThreadNetManage inserts the IP addresses read from the configuration file into the IP address list, the connection between each IP address and its host is assumed to be valid by default, and the insertion time of each IP address is written into the IP address list as its timestamp. The network manager class is a class in the communication middleware that provides functions, such as initNetBoss, for initializing, adding and deleting related resources; in the present invention it mainly refers to functions such as initNetBoss in the CNetBoss class.
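A minimal sketch of this initialization, in Python rather than the ACE/C++ of the actual middleware. The `HostEntry` type, its field names and the `init_ip_address_list` function are hypothetical; the behavior they model (every configured address inserted as valid, stamped with its insertion time) follows the step described above.

```python
import time
from dataclasses import dataclass
from typing import Dict, Iterable, Optional


@dataclass
class HostEntry:
    """One row of the IP address list (field names are assumptions)."""
    timestamp: float    # entry timestamp, in seconds
    valid: bool = True  # validity field; connections are assumed valid by default


def init_ip_address_list(config_ips: Iterable[str],
                         now: Optional[float] = None) -> Dict[str, HostEntry]:
    """Build the IP address list from the addresses read from the configuration file.

    The list starts empty; each address is inserted as valid and stamped
    with the insertion time.
    """
    stamp = time.time() if now is None else now
    return {ip: HostEntry(timestamp=stamp) for ip in config_ips}
```

The optional `now` argument exists only so that the sketch can be exercised deterministically; a real implementation would read the clock at insertion time.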
(3) Under normal conditions, starting K seconds after the IP address list has been initialized (0 < K ≤ 15 seconds) and then every N seconds (5 ≤ N ≤ 30), the worker thread sends a heartbeat message to all hosts in sequence. After the worker thread receives the heartbeat response message with the corresponding sequence number, it updates the corresponding host's entry timestamp in the IP address list to the time at which the heartbeat response message was received. N defaults to 15 seconds and can be specified in the configuration file.
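The bookkeeping in this step can be sketched as follows. The `HeartbeatTracker` class and its method names are hypothetical, the transport is abstracted behind a `send` callback, and the timer that would call `send_round` every N seconds is left out; only the range check on N, the per-heartbeat sequence number and the timestamp update on a matching response are taken from the description.

```python
import itertools
from typing import Callable, Dict, Iterable


class HeartbeatTracker:
    """Tracks outstanding heartbeats by sequence number (illustrative only)."""

    def __init__(self, hosts: Iterable[str], start_time: float,
                 interval_n: float = 15.0) -> None:
        if not 5 <= interval_n <= 30:
            raise ValueError("N must satisfy 5 <= N <= 30 seconds")
        self.interval_n = interval_n
        # Entry timestamp per host, initially the insertion time.
        self.timestamps: Dict[str, float] = {ip: start_time for ip in hosts}
        self._seq = itertools.count(1)
        self._pending: Dict[int, str] = {}  # sequence number -> host

    def send_round(self, send: Callable[[str, int], None]) -> None:
        """Send one heartbeat to every host in sequence."""
        for ip in self.timestamps:
            seq = next(self._seq)
            self._pending[seq] = ip
            send(ip, seq)

    def on_response(self, seq: int, received_at: float) -> bool:
        """Update the entry timestamp when a matching response arrives."""
        ip = self._pending.pop(seq, None)
        if ip is None:
            return False  # unknown or duplicate sequence number; ignore
        self.timestamps[ip] = received_at
        return True
```

For brevity the sketch never expires unanswered sequence numbers; a longer-running implementation would need to prune `_pending`.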
(4) It is judged whether the difference between the entry timestamp and the current time is greater than or equal to 2N ± 1 seconds; if so, a host has failed and step (5) is performed; otherwise, the worker thread updates the host's entry timestamp in the IP address list to the time of the heartbeat response message.
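The timeout test can be written as a short function. How the "± 1" is meant to be applied is not spelled out in the description, so this sketch treats it as a configurable tolerance between -1 and +1 second added to 2N; that reading, along with the function and parameter names, is an assumption.

```python
from typing import Dict, List


def failed_hosts(timestamps: Dict[str, float], now: float,
                 interval_n: float = 15.0, tolerance: float = 0.0) -> List[str]:
    """Return the hosts whose entry timestamp is at least 2N (+/- 1) seconds old.

    `tolerance` models the "± 1 second" in the description as a value in
    [-1, 1]; this interpretation is an assumption.
    """
    if not -1.0 <= tolerance <= 1.0:
        raise ValueError("tolerance must be within [-1, 1] seconds")
    threshold = 2 * interval_n + tolerance
    return [ip for ip, ts in timestamps.items() if now - ts >= threshold]
```

With the default N of 15 seconds, a host is reported once roughly two consecutive heartbeat rounds have passed without a response, so a single lost packet does not trigger a fault.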
(5) The host that has failed is defined as the failed host, and the failed host is then repaired. The specific repair process is as follows: the validity field of the failed host's entry timestamp is changed to invalid, heartbeat messages are no longer sent to the failed host, and the network manager class is notified to delete the resources related to the failed host. If an application queries the network state at this point, it also receives a notification that the failed host is unreachable. In addition to updating the local IP address list, the worker thread CThreadNetManage also sends a host network fault broadcast to the other hosts in the IP address list, excluding the failed host.
After the fault is repaired, the network manager class reads the configuration file and adds the IP addresses of the hosts with which transmission is required to the IP address list, then sends heartbeat messages to these hosts to test the connection state. The first host that receives this heartbeat message sets all entries related to the failed host in its IP address list to valid, and then sends a host network fault recovery broadcast message to all other hosts in the IP address list. Through a short period of recursive broadcasting, the system recovers from the fault state quickly, which reduces the time spent running in an abnormal state and improves reliability.
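The fault and recovery propagation of step (5) can be simulated in memory. In this sketch the `Cluster` class stands in for the IP address lists held by all hosts, and a method call stands in for delivery of a broadcast message; both are hypothetical simplifications. The rule that a host forwards the recovery broadcast only when its own entry actually changes is one possible reading of "recursive broadcasting", chosen here because it guarantees termination.

```python
from typing import Dict, List


class Cluster:
    """In-memory stand-in for every host's IP address list (illustrative only)."""

    def __init__(self, ips: List[str]) -> None:
        # valid[a][b] is host a's validity field for host b.
        self.valid: Dict[str, Dict[str, bool]] = {
            a: {b: True for b in ips if b != a} for a in ips
        }

    def report_fault(self, detector: str, failed: str) -> None:
        """Detector marks the failed host invalid, then broadcasts the fault."""
        self.valid[detector][failed] = False
        for peer in self.valid[detector]:
            if peer != failed:
                # Effect of the peer receiving the fault broadcast.
                self.valid[peer][failed] = False

    def report_recovery(self, receiver: str, recovered: str) -> None:
        """Receiver marks the host valid again and forwards the recovery broadcast.

        A host forwards only if its own entry changed, so each host forwards
        at most once and the recursion terminates (assumed interpretation).
        """
        if self.valid[receiver][recovered]:
            return
        self.valid[receiver][recovered] = True
        for peer in self.valid[receiver]:
            if peer != recovered:
                self.report_recovery(peer, recovered)
```

For example, with hosts `a` to `d`, `report_fault("a", "d")` leaves `d` marked invalid on `a`, `b` and `c`; a later `report_recovery("b", "d")` restores the entry on all three.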

Claims (7)

1. A distributed communication system maintenance method for high-speed maglev transportation simulation, the distributed communication system comprising at least one host, characterized in that the method comprises the following steps:
(1) the software structure of the communication middleware is layered;
(2) the IP address list is initialized;
(3) a worker thread sends a heartbeat message to all hosts in sequence and updates, in the IP address list, the entry timestamp of each host that returns a heartbeat response message;
(4) according to the entry timestamps from step (3), it is judged whether a host has failed; if so, go to step (5); otherwise, the worker thread updates the host's entry timestamp in the IP address list;
(5) the host that has failed is defined as the failed host and the fault information is transmitted to all other hosts; the failed host is then repaired, and after the fault is repaired the recovery information is transmitted to all other hosts, so that the system resumes normal communication.
2. The distributed communication system maintenance method for high-speed maglev transportation simulation according to claim 1, characterized in that step (1) is specifically: the software structure of the communication middleware is divided into a network communication layer, a reliability assurance layer and an IPC layer, the network communication layer and the reliability assurance layer are both implemented using the ADAPTIVE Communication Environment (ACE), and are used to complete the design of the classes in the reliability assurance layer.
3. The distributed communication system maintenance method for high-speed maglev transportation simulation according to claim 2, characterized in that step (2) is specifically: when the network manager class in the reliability assurance layer finishes reading the configuration file, the worker thread inserts the IP addresses read from the configuration file into the IP address list, the connection between each IP address and its host is assumed to be valid by default, and the insertion time of each IP address is written into the IP address list as its timestamp;
the network manager class is a class in the communication middleware that provides the initNetBoss function for initializing, adding and deleting related resources.
4. The distributed communication system maintenance method for high-speed maglev transportation simulation according to claim 1, characterized in that step (3) is specifically: starting K seconds after the IP address list has been initialized (0 < K ≤ 15) and then every N seconds (5 ≤ N ≤ 30), the worker thread sends a heartbeat message to all hosts in sequence; after the worker thread receives the heartbeat response message with the corresponding sequence number, it updates the corresponding host's entry timestamp in the IP address list to the time at which the heartbeat response message was received.
5. The distributed communication system maintenance method for high-speed maglev transportation simulation according to claim 1, characterized in that step (4) is specifically: it is judged whether the difference between the entry timestamp and the current time is greater than or equal to 2N ± 1 seconds; if so, a host has failed and step (5) is performed; otherwise, the worker thread updates the host's entry timestamp in the IP address list to the time of the heartbeat response message.
6. The distributed communication system maintenance method for high-speed maglev transportation simulation according to claim 3, characterized in that repairing the failed host in step (5) is specifically: the validity field of the failed host's entry timestamp is changed to invalid, heartbeat messages are no longer sent to the failed host, and the network manager class is notified to delete the resources related to the failed host.
7. The distributed communication system maintenance method for high-speed maglev transportation simulation according to claim 3, characterized in that, in step (5), after the fault is repaired, the network manager class reads the configuration file and adds the IP addresses of the hosts with which transmission is required to the IP address list, then sends heartbeat messages to these hosts to test the connection state; the first host that receives this heartbeat message sets all entries related to the failed host in its IP address list to valid, and then sends a host network fault recovery broadcast message to all other hosts in the IP address list.
CN201610136060.5A 2016-03-10 2016-03-10 A kind of distributed communication system maintaining method towards high speed Maglev emulation Expired - Fee Related CN105812181B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610136060.5A CN105812181B (en) 2016-03-10 2016-03-10 A kind of distributed communication system maintaining method towards high speed Maglev emulation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610136060.5A CN105812181B (en) 2016-03-10 2016-03-10 A kind of distributed communication system maintaining method towards high speed Maglev emulation

Publications (2)

Publication Number Publication Date
CN105812181A (en) 2016-07-27
CN105812181B (en) 2019-08-02

Family

ID=56467993

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610136060.5A Expired - Fee Related CN105812181B (en) 2016-03-10 2016-03-10 A kind of distributed communication system maintaining method towards high speed Maglev emulation

Country Status (1)

Country Link
CN (1) CN105812181B (en)

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1323040A2 (en) * 2000-09-08 2003-07-02 Goahead Software Inc. A system and method for managing clusters containing multiple nodes
CN1661968A (en) * 2004-02-27 2005-08-31 联想(北京)有限公司 Method for implementig mechanism of variable speed heartbeat based on network and load in host
CN101296182A (en) * 2008-05-20 2008-10-29 华为技术有限公司 Data transmission control method and data transmission control device
CN102821114A (en) * 2011-06-07 2012-12-12 南京中兴新软件有限责任公司 Distributed memory system and implementation method of timestamps thereof
CN102811415A (en) * 2012-07-17 2012-12-05 北京交通大学 Train-ground communication method of urban railway transportation

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
李风忠,穆斌: "跨平台通信中间件的研究与实现", 《计算机光盘软件与应用》 *
邬国安等: "分布式仿真系统的自适应通信中间件设计", 《计算机技术与发展》 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114513437A (en) * 2022-01-20 2022-05-17 杭州朗和科技有限公司 Network testing method, device, medium and computing equipment
CN115277482A (en) * 2022-06-10 2022-11-01 浙江清捷智能科技有限公司 Online detection method for industrial edge equipment
CN115277482B (en) * 2022-06-10 2023-08-22 浙江清捷智能科技有限公司 On-line detection method for industrial edge equipment

Also Published As

Publication number Publication date
CN105812181B (en) 2019-08-02

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20190802
