CN101686261A - RAC-based redundant server system - Google Patents

RAC-based redundant server system Download PDF

Info

Publication number
CN101686261A
CN101686261A CN200910194974A CN200910194974A CN101686261A CN 101686261 A CN101686261 A CN 101686261A CN 200910194974 A CN200910194974 A CN 200910194974A CN 200910194974 A CN200910194974 A CN 200910194974A CN 101686261 A CN101686261 A CN 101686261A
Authority
CN
China
Prior art keywords
node
host
rac
application
virtual
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN200910194974A
Other languages
Chinese (zh)
Inventor
周庭梁
张立鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Casco Signal Ltd
Original Assignee
Casco Signal Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Casco Signal Ltd filed Critical Casco Signal Ltd
Priority to CN200910194974A priority Critical patent/CN101686261A/en
Publication of CN101686261A publication Critical patent/CN101686261A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Hardware Redundancy (AREA)

Abstract

The invention relates to a RAC-based redundant server system, which comprises N nodes, wherein each node operates an application server, a database server and a watchdog system; each node is connectedwith a shared disk array device; one end of each node is connected into a private network uniformly; the other end of each node is connected to a public network uniformly; each node has a virtual IPof the public network; a public network IP address accessed by a user is a main virtual IP address; a server having the main virtual IP is called as a host computer; other servers are standby computers; and the host computer and the standby computers are switched by the watchdog system. Compared with the prior art, the RAC-based redundant server system has the advantages of low cost, good expansibility, good reliability, short switching time and the like.

Description

A kind of redundant server system based on RAC
Technical field
The present invention relates to redundant server system, relate in particular to a kind of redundant server system based on RAC (the real application cluster of real application clusters).
Background technology
Along with the raising of domestic information degree, more and more higher to the online availability requirement of using system, generally require service uninterruptedly was provided in 7*24 hour, and system redundancy is to realize a kind of effective means of above-mentioned purpose.
The typical application system comprises application server and two parts of data server, and is corresponding, and system redundancy also should cover the redundant and redundant two parts of data server of application server.
Redundant for using, implementation mainly contains Clustering or load-balancing technique (component software load balancing and hardware load balancing again); And for database redundancy, main implementation is to adopt the data-base cluster technology.
Before Oracle 10g released, the redundancy of database also depended on the cluster of operating system, and after Oracle10g released, database itself had comprised cluster external member (RAC).
Share under the situation of a cover hardware platform at application server and data server,, both increased the buying expenses of software, increased system maintenance personnel's burden again if use operating system cluster or third party's cluster to realize the redundancy of using.The RAC that how to make full use of Oracle realizes system redundancy, has just become a good problem to study.
Summary of the invention
Purpose of the present invention is exactly in order to overcome the defective that above-mentioned prior art exists, and a kind of redundant server system based on RAC of with low cost, favorable expandability is provided.
Purpose of the present invention can be achieved through the following technical solutions:
A kind of redundant server system based on RAC, this system comprises N node, wherein each node all moves application server, database server, watchdog system, described each node links to each other with the Disk Array of sharing, the unified private network that inserts of one end of described each node, the unified public network that inserts of the other end of described each node, described each node all has the virtual IP address of a public network, a public network IP address accessed by the user is main virtual ip address, the server that has main virtual IP address is called main frame, other server then is a standby host, and active and standby machine switches to be realized by watchdog system.
Described active and standby machine switching comprises following flow process:
(1) because whole system is the RAC cluster, so we use the host identification of main virtual IP address conduct, after system start-up, if node has been obtained main virtual IP address then has been become main frame, otherwise be standby host, in the active and standby machine running, whether the watchdog system surveillance application is normal;
(2) unusual if host application occurs, then discharge host identification (stopping the RAC service) by watchdog system, make it become standby host, if certain standby host is obtained host identification, then this standby host becomes main frame;
(3) unusual if standby host occurs, watchdog system is attempted restarting application, recovers normal if restart the back application, then switch back to the standby host state, otherwise shutdown excludes whole cluster system with it;
(4) if main frame shuts down or restarts, main frame discharges host identification automatically, and promptly RAC discharges all virtual IP addresses that are bundled in this machine automatically, comprise main virtual IP address, RAC transfers to other nodes with all virtual IP addresses of this node simultaneously, at this moment, certain standby host will obtain host identification, becomes main frame;
(5) if standby host shuts down or restarts, then the RAC of standby host discharges all virtual IP addresses that all are bundled in this machine automatically, transfers to other nodes, but does not influence host work;
The flow process of described watchdog system work is as follows:
1) whether inquiry uses normal after the system start-up;
2) if normal, dormancy is inquiry again after one second;
3) if use unusual (losing response as using thread), watchdog system judges whether present node is host node, and promptly whether present node has main virtual IP address;
4) if present node is a host node, then discharge host identification;
5) restart application;
6) after application was restarted, whether watchdog system is inquired about application again normal, if normal, then continues the poll application state;
7) if application state is undesired, then shutdown.
Compared with prior art, the present invention has the following advantages:
(1) with low cost: as to make full use of the RAC external member that database itself comprises, saved the cost of buying the cluster of operating system, realized that 24 hours of server are online.
(2) favorable expandability: along with the expansion of business, system is free to add server, need not to change the existing application configuration.
(3) good reliability: the reliability of this framework mainly depends on the reliability of RAC.
(4) switching time is shorter: redundant machine forwards the switching time of normal use to smaller or equal to 30s.
Description of drawings
Fig. 1 is the structural representation of a kind of redundant server system based on RAC of the present invention;
Fig. 2 is the active and standby machine switching flow figure of a kind of redundant server system based on RAC of the present invention;
Fig. 3 is the watchdog system workflow diagram of a kind of redundant server system based on RAC of the present invention.
Embodiment
The present invention will be further described below in conjunction with specific embodiment.
Embodiment 1
As Fig. 1, Fig. 2, shown in Figure 3, a kind of redundant server system based on RAC, this system comprises N node 1, wherein each node 1 all moves application server, database server, watchdog system, described each node links to each other with the Disk Array of sharing 2, the unified private network 3 that inserts of one end of described each node, the unified public network 4 that inserts of the other end of described each node, described each node all has the virtual IP address of a public network, a public network IP address accessed by the user is main virtual ip address, the server that has main virtual IP address is called main frame, and other server then is a standby host, and active and standby machine switches to be realized by watchdog system.
Described active and standby machine switching comprises following flow process:
1) because whole system is the RAC cluster, so we use the host identification of main virtual IP address conduct.After system start-up,, otherwise be standby host if node has been obtained main virtual IP address then become main frame.In the active and standby machine running, whether the watchdog system surveillance application is normal.
2) unusual if host application occurs, then discharge host identification (stopping the RAC service) by watchdog system, make it become standby host; If certain standby host is obtained host identification, then this standby host becomes main frame.
3) unusual if standby host occurs, watchdog system is attempted restarting application, recovers normal if restart the back application, then switch back to the standby host state, otherwise shutdown excludes whole cluster system with it.
4) if main frame shuts down or restarts, main frame discharges host identification automatically, and promptly RAC discharges all virtual IP addresses that are bundled in this machine automatically, comprise main virtual IP address, RAC transfers to other nodes with all virtual IP addresses of this node simultaneously, at this moment, certain standby host will obtain host identification, becomes main frame.
5) if standby host shuts down or restarts, then the RAC of standby host discharges all virtual IP addresses that all are bundled in this machine automatically, transfers to other nodes, but does not influence host work.
The flow process of described watchdog system work is as follows:
In the 301st step, whether inquiry uses normal after the system start-up.
In the 302nd step, if normal, dormancy is inquiry again after one second.
In the 303rd step, if use unusual (losing response as using thread), watchdog system judges whether present node is host node, and promptly whether present node has main virtual IP address.
In the 304th step,, then discharge host identification if present node is a host node.
In the 305th step, restart application.
In the 306th step, after application was restarted, whether watchdog system is inquired about application again normal, if normal, then continues the poll application state.
The 307th step, if application state is undesired, then shutdown.
Embodiment 2
This invention is applied to somewhere factories and miness transportation Production Scheduling System (hereinafter to be referred as " transportation Production Scheduling System "):
The transportation Production Scheduling System comprises two-server, adopts redundancy structure, is deployed in the central machine room of company.The shared cover disk array of two-server, all data of system all are stored in this disk array, have guaranteed that effectively data and application 24 hours are online.
The application of system comprises two parts: based on the ERP system of B/S structure, reach the interface service program based on TCP/IP.The application of two-server all is in heat and is equipped with state, provides service by unified main VIP to the user.
In the service of central server deploy house dog, realized the automatic switchover when application system breaks down, guaranteed that effectively 24 hours of application are online.
In addition, this method is safeguarded the timing unit of system becomes possibility, as: can regularly equipment be safeguarded by select time, can further improve the availability of system like this.
Prove through field practice,, can effectively reduce the cost of maintenance, improve the reliabilty and availability of system based on the redundant server framework of RAC.

Claims (3)

1. redundant server system based on RAC, it is characterized in that, this system comprises N node, wherein each node all moves application server, database server, watchdog system, described each node links to each other with the Disk Array of sharing, the unified private network that inserts of one end of described each node, the unified public network that inserts of the other end of described each node, described each node all has the virtual IP address of a public network, a public network IP address accessed by the user is main virtual ip address, the server that has main virtual IP address is called main frame, and other server then is a standby host, and active and standby machine switches to be realized by watchdog system.
2. the redundant server system based on RAC according to claim 1 is characterized in that, described active and standby machine switching comprises following flow process:
(1) because whole system is the RAC cluster, so we use the host identification of main virtual IP address conduct, after system start-up, if node has been obtained main virtual IP address then has been become main frame, otherwise be standby host, in the active and standby machine running, whether the watchdog system surveillance application is normal;
(2) unusual if host application occurs, then discharge host identification (stopping the RAC service) by watchdog system, make it become standby host, if certain standby host is obtained host identification, then this standby host becomes main frame;
(3) unusual if standby host occurs, watchdog system is attempted restarting application, recovers normal if restart the back application, then switch back to the standby host state, otherwise shutdown excludes whole cluster system with it;
(4) if main frame shuts down or restarts, main frame discharges host identification automatically, and promptly RAC discharges all virtual IP addresses that are bundled in this machine automatically, comprise main virtual IP address, RAC transfers to other nodes with all virtual IP addresses of this node simultaneously, at this moment, certain standby host will obtain host identification, becomes main frame;
(5) if standby host shuts down or restarts, then the RAC of standby host discharges all virtual IP addresses that all are bundled in this machine automatically, transfers to other nodes, but does not influence host work.
3. the redundant server system based on RAC according to claim 1 is characterized in that, the flow process of described watchdog system work is as follows:
1) whether inquiry uses normal after the system start-up;
2) if normal, dormancy is inquiry again after one second;
3) if use unusual (losing response as using thread), watchdog system judges whether present node is host node, and promptly whether present node has main virtual IP address;
4) if present node is a host node, then discharge host identification;
5) restart application;
6) after application was restarted, whether watchdog system is inquired about application again normal, if normal, then continues the poll application state;
7) if application state is undesired, then shutdown.
CN200910194974A 2009-09-01 2009-09-01 RAC-based redundant server system Pending CN101686261A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN200910194974A CN101686261A (en) 2009-09-01 2009-09-01 RAC-based redundant server system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN200910194974A CN101686261A (en) 2009-09-01 2009-09-01 RAC-based redundant server system

Publications (1)

Publication Number Publication Date
CN101686261A true CN101686261A (en) 2010-03-31

Family

ID=42049229

Family Applications (1)

Application Number Title Priority Date Filing Date
CN200910194974A Pending CN101686261A (en) 2009-09-01 2009-09-01 RAC-based redundant server system

Country Status (1)

Country Link
CN (1) CN101686261A (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102510343A (en) * 2011-11-16 2012-06-20 广东新支点技术服务有限公司 Highly available cluster system feign death solution based on both remote detection and power management
CN102521060A (en) * 2011-11-16 2012-06-27 广东新支点技术服务有限公司 Pseudo halt solving method of high-availability cluster system based on watchdog local detecting technique
CN104283710A (en) * 2014-08-18 2015-01-14 四川长虹电器股份有限公司 Database cluster fault handling method and management server
CN105843713A (en) * 2016-04-01 2016-08-10 杭州沃趣网络科技有限公司 Method for realizing Oracle RAC (real application cluster) through shared-nothing storage of dual system
CN106982259A (en) * 2017-04-19 2017-07-25 聚好看科技股份有限公司 The failure solution of server cluster
CN114827080A (en) * 2022-06-06 2022-07-29 武汉四通信息服务有限公司 IP switching method and system

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102510343A (en) * 2011-11-16 2012-06-20 广东新支点技术服务有限公司 Highly available cluster system feign death solution based on both remote detection and power management
CN102521060A (en) * 2011-11-16 2012-06-27 广东新支点技术服务有限公司 Pseudo halt solving method of high-availability cluster system based on watchdog local detecting technique
CN104283710A (en) * 2014-08-18 2015-01-14 四川长虹电器股份有限公司 Database cluster fault handling method and management server
CN105843713A (en) * 2016-04-01 2016-08-10 杭州沃趣网络科技有限公司 Method for realizing Oracle RAC (real application cluster) through shared-nothing storage of dual system
CN105843713B (en) * 2016-04-01 2019-06-28 杭州沃趣科技股份有限公司 A kind of method that dual systems realizes Oracle RAC without shared storage
CN106982259A (en) * 2017-04-19 2017-07-25 聚好看科技股份有限公司 The failure solution of server cluster
CN114827080A (en) * 2022-06-06 2022-07-29 武汉四通信息服务有限公司 IP switching method and system
CN114827080B (en) * 2022-06-06 2022-09-23 武汉四通信息服务有限公司 IP switching method and system

Similar Documents

Publication Publication Date Title
CN102402395B (en) Quorum disk-based non-interrupted operation method for high availability system
CN102394774B (en) Service state monitoring and failure recovery method for controllers of cloud computing operating system
EP2281240B1 (en) Maintaining data integrity in data servers across data centers
EP2648114B1 (en) Method, system, token conreoller and memory database for implementing distribute-type main memory database system
US8032786B2 (en) Information-processing equipment and system therefor with switching control for switchover operation
CN101689114B (en) Dynamic cli mapping for clustered software entities
US8245077B2 (en) Failover method and computer system
CN104408071A (en) Distributive database high-availability method and system based on cluster manager
CN103019889A (en) Distributed file system and failure processing method thereof
CN103199972A (en) Double machine warm backup switching method and warm backup system achieved based on SOA and RS485 bus
CN103346903A (en) Dual-machine backup method and device
CN102394914A (en) Cluster brain-split processing method and device
CN103581225A (en) Distributed system node processing task method
CN101686261A (en) RAC-based redundant server system
CN108984349B (en) Method and device for electing master node, medium and computing equipment
CN102761528A (en) System and method for data management
CN103345470A (en) Database disaster tolerance method, database disaster tolerance system and server
CN102681917A (en) Operating system (OS) and recovery method thereof
CN111935244B (en) Service request processing system and super-integration all-in-one machine
CN102045187B (en) Method and equipment for realizing HA (high-availability) system with checkpoints
JP5285045B2 (en) Failure recovery method, server and program in virtual environment
CN104052799B (en) A kind of method that High Availabitity storage is realized using resource ring
CN113515316A (en) Novel edge cloud operating system
US10067841B2 (en) Facilitating n-way high availability storage services
JP2005055995A (en) Storage control method and server system with redundancy function

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20100331