CN105812181A - Distributed communication system maintenance method facing to high-speed maglev transportation simulation - Google Patents
Distributed communication system maintenance method facing to high-speed maglev transportation simulation Download PDFInfo
- Publication number
- CN105812181A CN105812181A CN201610136060.5A CN201610136060A CN105812181A CN 105812181 A CN105812181 A CN 105812181A CN 201610136060 A CN201610136060 A CN 201610136060A CN 105812181 A CN105812181 A CN 105812181A
- Authority
- CN
- China
- Prior art keywords
- address list
- hosts
- communication system
- main frame
- distributed communication
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/01—Protocols
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/14—Network analysis or design
- H04L41/145—Network analysis or design involving simulating, designing, planning or modelling of a network
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/08—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
- H04L43/0823—Errors, e.g. transmission errors
- H04L43/0829—Packet loss
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/08—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
- H04L43/0823—Errors, e.g. transmission errors
- H04L43/0847—Transmission error
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/18—Protocol analysers
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L67/00—Network arrangements or protocols for supporting network services or applications
- H04L67/50—Network services
- H04L67/60—Scheduling or organising the servicing of application requests, e.g. requests for application data transmissions using the analysis and optimisation of the required network resources
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L69/00—Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
- H04L69/16—Implementation or adaptation of Internet protocol [IP], of transmission control protocol [TCP] or of user datagram protocol [UDP]
- H04L69/164—Adaptation or special uses of UDP protocol
Abstract
The invention relates to a distributed communication system maintenance method facing to high-speed maglev transportation simulation, the distributed communication system comprises at least one host, and the method comprises the following steps: (1) software structure layering is performed on a communication middleware; (2) an IP address list is initialized; (3) a worker thread transmits a heartbeat message to all hosts in sequence and updates entry timestamps of hosts which return heartbeat respond message in the IP address list; (4) according to the entry timestamps, if host faults exist is judged, if yes, user should turns to step (5), otherwise, the worker thread updates the entry timestamps of hosts in the IP address list; (5) the hosts which have fault are defined as fault hosts, and the fault information can be transmit to all other hosts, then fault hosts are repaired, after that, the repaired information is transmitted to all other hosts, so that the system resumes to communicate as normal. Compared with methods in the prior art, the method provided by the invention has advantages of improving stability, fault tolerance and the like of the system.
Description
Technical field
The invention belongs to communication of distributed system field, especially relate to a kind of distributed communication system maintaining method towards high speed Maglev emulation.
Background technology
In distributed system, having worked in coordination with the function of distributed system between modules each through interaction message, the performance of message communicating and service quality directly determine the service quality that distributed system realizes.Emulate in this cross-platform interchange environment at high speed Maglev, mutual communication between subsystem main frame, and change during the structure of subsystem internal.
Summary of the invention
Defect that the purpose of the present invention is contemplated to overcome above-mentioned prior art to exist and a kind of distributed communication system maintaining method towards high speed Maglev emulation promoting system stability and fault-tolerance is provided.
The purpose of the present invention can be achieved through the following technical solutions: a kind of distributed communication system maintaining method towards high speed Maglev emulation, and described distributed communication system includes at least one main frame, comprises the following steps:
(1) communication middleware software configuration is layered;
(2) IP address list is initialized;
(3) worker thread sends heartbeat message successively to All hosts, and updates the entry time stamp having the main frame returning heart beating response message in IP address list;
(4) stabbing according to entry time in step (3), it may be judged whether have hostdown, if it is, go to step (5), otherwise, worker thread updates main frame entry time stamp in IP address list;
(5) it is failed host by the host definition broken down, and fault message is transferred to other All hosts, then, repair failed host, after fault restoration, recovery information is transferred to other All hosts, thus system recovery proper communication.
Described step (1) is particularly as follows: adopt the variant of " data-logic-performance " three-decker, communication middleware software configuration is divided into Web communication layer, guaranteed reliability's layer and IPC layer, described Web communication layer and guaranteed reliability's layer all use adaptive communications environment ACE to realize, and have been used for the design of class in guaranteed reliability's layer and the definition of IP address list and communications status maintenance packet.IP address list is used for recording Host Status in network, worker thread CThreadForNetManage in guaranteed reliability's layer be responsible for maintenance.
Described step (2) is particularly as follows: when network house keeper's class in described guaranteed reliability's layer completes the reading of configuration file, the IP address read in configuration file is inserted in IP address list by worker thread, and the connection of default ip address and main frame is all effective, and the insertion time of IP address is write in IP address list as timestamp.When communication layers first time starts, its IP address list is empty, described network house keeper's class is a class in communication middleware, is provided with for initializing the functions such as initNetBoss with additions and deletions related resource, the function such as initNetBoss referring mainly to CNetBoss apoplexy due to endogenous wind in the present invention.
Described step (3) is particularly as follows: under normal condition, after IP address list has initialized the K second, 0 < K≤15, every the N second, 5≤N≤30, worker thread sends heartbeat message successively to All hosts, after worker thread receives the heart beating response message of corresponding serial number, at the entry time of IP address list, corresponding main frame is stabbed the time point being updated to reception heart beating response message.N is defaulted as 15 seconds, can specify in configuration file.
Described step (4) is particularly as follows: judge that whether described entry time stamp and current point in time difference are more than or equal to 2N ± 1 second, if, main frame is then had to break down, perform step (5), otherwise, main frame entry time stamp in IP address list is updated to the time point of heart beating response message by worker thread.
Repair host fault in described step (5) particularly as follows: effectiveness field during the entry time of failed host is stabbed change into invalid, and no longer sending heartbeat message to failed host, the resource relevant with failed host is deleted by notice network house keeper's class simultaneously.If now application program requester network state, the inaccessible notice of fault also can be obtained.Except updating the IP address list of this locality, worker thread CThreadNetManage also other host broadcast except failed host can send a mainframe network fault broadcast in IP address list.
In described step (5), after fault restoration, network house keeper's class reads configuration file, the IP address having the main frame of transmission demand is joined in IP address list, then send heartbeat message to these main frames and test connection status, entries relevant with failed host all in IP address list are set to effectively by first main frame receiving this heartbeat message, then send a mainframe network fault restoration broadcasting packet to every other main frame in IP address list.Recursion through the short time is broadcasted, and system is recovered from malfunction at short notice, decreases the time run under abnormality, improves reliability.
Compared with prior art, the invention have the advantages that
(1) in the present invention, Web communication layer and guaranteed reliability's layer all use adaptive communications environment ACE to realize, define mutual link information agreement between a subsystem and carry out real-time exchange connection status, thus being efficiently completed state-maintenance in whole communication layers, reduce the time coupling between subsystem, promote system stability and fault-tolerance;
(2) being broadcasted by recursion, system is recovered from malfunction at short notice, decreases the time run under abnormality, improves reliability.
Accompanying drawing explanation
Fig. 1 is the flow chart of the present invention.
Detailed description of the invention
Below in conjunction with the drawings and specific embodiments, the present invention is described in detail.
As it is shown in figure 1, a kind of distributed communication system maintaining method towards high speed Maglev emulation, distributed communication system includes at least one main frame, comprises the following steps:
(1) variant of " data-logic-performance " three-decker is adopted, communication middleware software configuration is divided into Web communication layer, guaranteed reliability's layer and IPC layer, Web communication layer and guaranteed reliability's layer all use adaptive communications environment ACE to realize, and have been used for the design of class in guaranteed reliability's layer and the definition of IP address list and communications status maintenance packet.IP address list is used for recording Host Status in network, worker thread CThreadForNetManage in guaranteed reliability's layer be responsible for maintenance.In the hierarchy of this " deformation ", being interacted by interface between each level, level internal height couples, loose couplings between level.
Complete the design of class in guaranteed reliability's layer: definition CWorkerThread is to inherit ACE thread encapsulation class ACE_Task, definition CNetboss class has been responsible in whole level the generation of original target, management and destruction, and CNetboss class is the core of whole guaranteed reliability's property layer.
Definition IP address list and communications status maintenance packet, be used for recording Host Status in network, and in IP address list, each represents a main frame being connected with the machine, worker thread CThreadForNetManage in guaranteed reliability's layer be responsible for maintenance.Communications status maintenance packet belongs to guaranteed reliability's layer protocol, and operation coding is corresponding in turn to data message, heartbeat message, heart beating response message, the broadcast of mainframe network fault and the broadcast of mainframe network fault restoration from 0x01-0x05.
(2), when communication layers first time starts, its IP address list is empty.When initNetboss (void) function of the network house keeper's apoplexy due to endogenous wind in guaranteed reliability's layer completes the reading of configuration file, the IP address read in configuration file is inserted in IP address list by worker thread CThreadNetManage, and the connection of default ip address and main frame is all effective, and the insertion time of IP address is write in IP address list as timestamp.Network house keeper's class is a class in communication middleware, is provided with for initializing the functions such as initNetBoss with additions and deletions related resource, the function such as initNetBoss referring mainly to CNetBoss apoplexy due to endogenous wind in the present invention.
(3) under normal condition, after IP address list has initialized the K second, 0 < K≤15 second, every the N second, 5≤N≤30, worker thread sends heartbeat message successively to All hosts, after worker thread receives the heart beating response message of corresponding serial number, at the entry time of IP address list, corresponding main frame is stabbed the time point being updated to reception heart beating response message.N is defaulted as 15 seconds, can specify in configuration file.
(4) judge that whether entry time stamp and current point in time difference are more than or equal to 2N ± 1 second, if it is, there is main frame to break down, perform step (5), otherwise, main frame entry time stamp in IP address list is updated to the time point of heart beating response message by worker thread.
(5) it is failed host by the host definition broken down, then failed host is repaired, it is invalid that concrete repair process is that the effectiveness field in being stabbed the entry time of failed host changes into, and no longer sending heartbeat message to failed host, the resource relevant with failed host is deleted by notice network house keeper's class simultaneously.If now application program requester network state, the inaccessible notice of fault also can be obtained.Except updating the IP address list of this locality, worker thread CThreadNetManage also other host broadcast except failed host can send a mainframe network fault broadcast in IP address list;
After fault restoration, network house keeper's class reads configuration file, the IP address having the main frame of transmission demand is joined in IP address list, then send heartbeat message to these main frames and test connection status, entries relevant with failed host all in IP address list are set to effectively by first main frame receiving this heartbeat message, then send a mainframe network fault restoration broadcasting packet to every other main frame in IP address list.Recursion through the short time is broadcasted, and system is recovered from malfunction at short notice, decreases the time run under abnormality, improves reliability.
Claims (7)
1., towards a distributed communication system maintaining method for high speed Maglev emulation, described distributed communication system includes at least one main frame, it is characterised in that comprise the following steps:
(1) communication middleware software configuration is layered;
(2) IP address list is initialized;
(3) worker thread sends heartbeat message successively to All hosts, and updates the entry time stamp having the main frame returning heart beating response message in IP address list;
(4) stabbing according to entry time in step (3), it may be judged whether have hostdown, if it is, go to step (5), otherwise, worker thread updates main frame entry time stamp in IP address list;
(5) it is failed host by the host definition broken down, and fault message is transferred to other All hosts, then, repair failed host, after fault restoration, recovery information is transferred to other All hosts, thus system recovery proper communication.
2. a kind of distributed communication system maintaining method towards high speed Maglev emulation according to claim 1, it is characterized in that, described step (1) particularly as follows: be divided into Web communication layer, guaranteed reliability's layer and IPC layer by communication middleware software configuration, described Web communication layer and guaranteed reliability's layer all use adaptive communications environment ACE to realize, and have been used in guaranteed reliability's layer the design of class.
3. a kind of distributed communication system maintaining method towards high speed Maglev emulation according to claim 2, it is characterized in that, described step (2) is particularly as follows: when network house keeper's class in described guaranteed reliability's layer completes the reading of configuration file, the IP address read in configuration file is inserted in IP address list by worker thread, and the connection of default ip address and main frame is all effective, and the insertion time of IP address is write in IP address list as timestamp
Described network house keeper's class is a class in communication middleware, is provided with for initializing the initNetBoss function with additions and deletions related resource.
4. a kind of distributed communication system maintaining method towards high speed Maglev emulation according to claim 1, it is characterized in that, described step (3) is particularly as follows: after IP address list initialized the K second, 0 < K≤15, every the N second, 5≤N≤30, worker thread sends heartbeat message successively to All hosts, after worker thread receives the heart beating response message of corresponding serial number, corresponding main frame is stabbed, at the entry time of IP address list, the time point being updated to reception heart beating response message.
5. a kind of distributed communication system maintaining method towards high speed Maglev emulation according to claim 1, it is characterized in that, described step (4) is particularly as follows: judge that whether described entry time stamp and current point in time difference are more than or equal to 2N ± 1 second, if, then there is hostdown, performing step (5), otherwise, main frame entry time stamp in IP address list is updated to the time point of heart beating response message by worker thread.
6. a kind of distributed communication system maintaining method towards high speed Maglev emulation according to claim 3, it is characterized in that, described step (5) repairs failed host particularly as follows: effectiveness field during the entry time of failed host is stabbed change into invalid, and no longer sending heartbeat message to failed host, the resource relevant with failed host is deleted by notice network house keeper's class simultaneously.
7. a kind of distributed communication system maintaining method towards high speed Maglev emulation according to claim 3, it is characterized in that, in described step (5), after fault restoration, network house keeper's class reads configuration file, the IP address having the main frame of transmission demand is joined in IP address list, then send heartbeat message to these main frames and test connection status, entries relevant with failed host all in IP address list are set to effectively by first main frame receiving this heartbeat message, then a mainframe network fault restoration broadcasting packet is sent to every other main frame in IP address list.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610136060.5A CN105812181B (en) | 2016-03-10 | 2016-03-10 | A kind of distributed communication system maintaining method towards high speed Maglev emulation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610136060.5A CN105812181B (en) | 2016-03-10 | 2016-03-10 | A kind of distributed communication system maintaining method towards high speed Maglev emulation |
Publications (2)
Publication Number | Publication Date |
---|---|
CN105812181A true CN105812181A (en) | 2016-07-27 |
CN105812181B CN105812181B (en) | 2019-08-02 |
Family
ID=56467993
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610136060.5A Expired - Fee Related CN105812181B (en) | 2016-03-10 | 2016-03-10 | A kind of distributed communication system maintaining method towards high speed Maglev emulation |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105812181B (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114513437A (en) * | 2022-01-20 | 2022-05-17 | 杭州朗和科技有限公司 | Network testing method, device, medium and computing equipment |
CN115277482A (en) * | 2022-06-10 | 2022-11-01 | 浙江清捷智能科技有限公司 | Online detection method for industrial edge equipment |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1323040A2 (en) * | 2000-09-08 | 2003-07-02 | Goahead Software Inc. | A system and method for managing clusters containing multiple nodes |
CN1661968A (en) * | 2004-02-27 | 2005-08-31 | 联想(北京)有限公司 | Method for implementig mechanism of variable speed heartbeat based on network and load in host |
CN101296182A (en) * | 2008-05-20 | 2008-10-29 | 华为技术有限公司 | Data transmission control method and data transmission control device |
CN102811415A (en) * | 2012-07-17 | 2012-12-05 | 北京交通大学 | Train-ground communication method of urban railway transportation |
CN102821114A (en) * | 2011-06-07 | 2012-12-12 | 南京中兴新软件有限责任公司 | Distributed memory system and implementation method of timestamps thereof |
-
2016
- 2016-03-10 CN CN201610136060.5A patent/CN105812181B/en not_active Expired - Fee Related
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1323040A2 (en) * | 2000-09-08 | 2003-07-02 | Goahead Software Inc. | A system and method for managing clusters containing multiple nodes |
CN1661968A (en) * | 2004-02-27 | 2005-08-31 | 联想(北京)有限公司 | Method for implementig mechanism of variable speed heartbeat based on network and load in host |
CN101296182A (en) * | 2008-05-20 | 2008-10-29 | 华为技术有限公司 | Data transmission control method and data transmission control device |
CN102821114A (en) * | 2011-06-07 | 2012-12-12 | 南京中兴新软件有限责任公司 | Distributed memory system and implementation method of timestamps thereof |
CN102811415A (en) * | 2012-07-17 | 2012-12-05 | 北京交通大学 | Train-ground communication method of urban railway transportation |
Non-Patent Citations (2)
Title |
---|
李风忠,穆斌: "跨平台通信中间件的研究与实现", 《计算机光盘软件与应用》 * |
邬国安等: "分布式仿真系统的自适应通信中间件设计", 《计算机技术与发展》 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114513437A (en) * | 2022-01-20 | 2022-05-17 | 杭州朗和科技有限公司 | Network testing method, device, medium and computing equipment |
CN115277482A (en) * | 2022-06-10 | 2022-11-01 | 浙江清捷智能科技有限公司 | Online detection method for industrial edge equipment |
CN115277482B (en) * | 2022-06-10 | 2023-08-22 | 浙江清捷智能科技有限公司 | On-line detection method for industrial edge equipment |
Also Published As
Publication number | Publication date |
---|---|
CN105812181B (en) | 2019-08-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1921369B (en) | Adapting method for network connection | |
CN110365644B (en) | Method for constructing high-performance monitoring platform of networking equipment | |
CN106709043A (en) | Data synchronous loading method based on database log | |
CN110752943B (en) | Distributed fault diagnosis system and method for power transmission line | |
CN101404588B (en) | Front end equipment management method, platform and system | |
CN110955725A (en) | Industrial internet identification analysis root node data synchronization method | |
CN104579937A (en) | server operating system and operating method thereof | |
WO2015149523A1 (en) | Method and apparatus for rapid sending and detection processing of oam packet, and storage medium | |
CN105812181A (en) | Distributed communication system maintenance method facing to high-speed maglev transportation simulation | |
CN110690994A (en) | Universal dual-redundancy network card switching method | |
CN102223378B (en) | File synchronization method and system | |
CN102857380A (en) | Remote upgrading method for power line carrier communication routing | |
CN101227333A (en) | Disaster tolerance network managing system and login method of network managing subscriber end | |
CN103269277B (en) | For finding method and system, the first network element and the webmaster equipment of link | |
CN108040041B (en) | Image difference transmission protocol design system and method based on service drive | |
CN100514928C (en) | Method for rapidly forming dendriform topology structure of network equipment | |
CN102045187A (en) | Method and equipment for realizing HA (high-availability) system with checkpoints | |
CN106487598A (en) | The many examples of isomery redundancy Snmp agreement realize system and its implementation | |
CN104468232A (en) | Method and system for using xml file and mib to complete network device configuration | |
CN100413248C (en) | Improved method and system for carrying out charging based on flow | |
CN102412949B (en) | Method for supporting double control boards by NAT (network address translation) port resources in distributed system | |
CN101841425A (en) | Network backup method, device and system without proxy | |
CN105634955B (en) | Flow table updating method and device | |
CN102970221A (en) | Redundancy communication method among nodes based on multiple networks | |
CN110557369A (en) | high-speed data processing platform based on domestic operating system kernel mode |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20190802 |
|
CF01 | Termination of patent right due to non-payment of annual fee |