CN101686261A - RAC-based redundant server system - Google Patents
- Publication number
- CN101686261A
- Authority
- CN
- China
- Prior art keywords
- node
- host
- rac
- application
- virtual
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Landscapes
- Hardware Redundancy (AREA)
Abstract
The invention relates to a RAC-based redundant server system comprising N nodes, each of which runs an application server, a database server and a watchdog system. Every node is connected to a shared disk array; one end of each node is connected to a common private network and the other end to a common public network. Each node has a virtual IP address on the public network. The public IP address that users access is the main virtual IP address; the server holding the main virtual IP address is called the host, and the other servers are standby machines. Switching between the host and the standby machines is performed by the watchdog system. Compared with the prior art, the system offers low cost, good scalability, good reliability and a short switching time.
Description
Technical field
The present invention relates to redundant server systems, and in particular to a redundant server system based on RAC (Real Application Clusters).
Background art
As the level of informatization in China rises, the requirements for the online availability of application systems grow ever higher. Systems are generally required to provide uninterrupted service 24 hours a day, 7 days a week, and system redundancy is an effective means of achieving this.
A typical application system consists of two parts, an application server and a data server; accordingly, system redundancy should cover both application-server redundancy and data-server redundancy.
For application redundancy, the main implementations are clustering and load balancing (the latter divided into software load balancing and hardware load balancing); for database redundancy, the main implementation is database clustering.
Before Oracle 10g was released, database redundancy also depended on operating-system clustering; since the release of Oracle 10g, the database itself has included a cluster suite (RAC).
When the application server and the data server share one hardware platform, using operating-system clustering or third-party clustering to make the application redundant both increases software purchase costs and adds to the burden on system maintenance staff. How to make full use of Oracle RAC to achieve system redundancy has therefore become a problem worth studying.
Summary of the invention
The purpose of the present invention is to overcome the defects of the prior art described above by providing a low-cost, readily scalable redundant server system based on RAC.
The purpose of the present invention can be achieved through the following technical solution:
A redundant server system based on RAC comprises N nodes, each of which runs an application server, a database server and a watchdog system. Each node is connected to a shared disk array; one end of each node is connected to a common private network, and the other end to a common public network. Each node has a virtual IP address on the public network. The public IP address that users access is the main virtual IP address: the server holding the main virtual IP address is called the host, the other servers are standby machines, and switching between the host and the standby machines is carried out by the watchdog system.
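The role rule in this paragraph (a node is the host exactly when it holds the main virtual IP address) can be illustrated with a minimal Python sketch. The `Node` class, the node names and the addresses (taken from the 192.0.2.0/24 documentation range) are hypothetical, and the sketch assumes the main virtual IP is held in addition to each node's own virtual IP, which the text does not state explicitly.

```python
from dataclasses import dataclass, field
from typing import Set

MAIN_VIP = "192.0.2.10"  # hypothetical main virtual IP address


@dataclass
class Node:
    name: str
    # Public virtual IP addresses currently bound to this node.
    vips: Set[str] = field(default_factory=set)

    def role(self, main_vip: str = MAIN_VIP) -> str:
        """A node is the host exactly when it holds the main virtual IP."""
        return "host" if main_vip in self.vips else "standby"


nodes = [
    Node("node1", {MAIN_VIP, "192.0.2.11"}),
    Node("node2", {"192.0.2.12"}),
    Node("node3", {"192.0.2.13"}),
]
roles = {n.name: n.role() for n in nodes}
print(roles)
```

Because the role is derived from the address binding rather than stored separately, moving the main virtual IP address is all that is needed to change which node is the host.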
The host/standby switching comprises the following flow:
(1) because the whole system is a RAC cluster, the main virtual IP address is used as the host identity. After system startup, a node that has obtained the main virtual IP address becomes the host; otherwise it is a standby machine. While the host and the standby machines are running, the watchdog system monitors whether the application is normal;
(2) if the application on the host becomes abnormal, the watchdog system releases the host identity (by stopping the RAC service), so that the node becomes a standby machine; whichever standby machine obtains the host identity then becomes the host;
(3) if the application on a standby machine becomes abnormal, the watchdog system attempts to restart the application; if the application recovers after the restart, the node returns to the standby state; otherwise the node is shut down and thereby excluded from the cluster;
(4) if the host shuts down or restarts, it releases the host identity automatically: RAC automatically releases all virtual IP addresses bound to that machine, including the main virtual IP address, and transfers them to other nodes; a standby machine then obtains the host identity and becomes the host;
(5) if a standby machine shuts down or restarts, RAC on that machine automatically releases all virtual IP addresses bound to it and transfers them to other nodes, without affecting the work of the host;
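The effect of cases (2), (4) and (5) on the host identity can be traced with a toy in-memory model. This is only a sketch: the `Cluster` class and its methods are hypothetical, it does not call any RAC interface, and the rule that released addresses go to the first remaining node is an assumption (the text only says they move to "other nodes"). Case (3), the restart attempt on a standby machine, is not modeled.

```python
from typing import Dict, List, Optional, Set


class Cluster:
    """Toy model of which node holds which virtual IPs (not real RAC behaviour)."""

    def __init__(self, node_names: List[str], main_vip: str) -> None:
        self.main_vip = main_vip
        self.up: List[str] = list(node_names)  # nodes currently in the cluster
        # Each node starts with its own VIP; the first node also gets the main VIP.
        self.vips: Dict[str, Set[str]] = {n: {f"vip-{n}"} for n in node_names}
        self.vips[node_names[0]].add(main_vip)

    def host(self) -> Optional[str]:
        """The host is whichever running node holds the main VIP."""
        for name in self.up:
            if self.main_vip in self.vips[name]:
                return name
        return None

    def release_host_identity(self, name: str) -> None:
        """Case (2): the watchdog on a faulty host gives up the main VIP."""
        if self.main_vip not in self.vips[name]:
            return
        self.vips[name].discard(self.main_vip)
        others = [n for n in self.up if n != name]
        if others:
            # Assumption: the first other running node acquires the main VIP.
            self.vips[others[0]].add(self.main_vip)

    def node_down(self, name: str) -> None:
        """Cases (4)/(5): a shutdown releases every VIP bound to the node."""
        self.up.remove(name)
        released = self.vips.pop(name)
        if self.up:
            # Assumption: all released VIPs move to the first surviving node.
            self.vips[self.up[0]] |= released


cluster = Cluster(["a", "b", "c"], main_vip="main-vip")
history = [cluster.host()]
cluster.release_host_identity("a")  # case (2): application fault on the host
history.append(cluster.host())
cluster.node_down("c")              # case (5): a standby machine shuts down
history.append(cluster.host())
cluster.node_down("b")              # case (4): the current host shuts down
history.append(cluster.host())
print(history)  # ['a', 'b', 'b', 'a']
```

In the trace, the host changes only when the main virtual IP address moves; the standby shutdown in the third step leaves the host untouched, as case (5) requires.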
The watchdog system works as follows:
1) after system startup, query whether the application is normal;
2) if it is normal, sleep for one second and query again;
3) if the application is abnormal (for example, an application thread has stopped responding), the watchdog system determines whether the current node is the host node, that is, whether the current node holds the main virtual IP address;
4) if the current node is the host node, release the host identity;
5) restart the application;
6) after the application has restarted, the watchdog system queries again whether the application is normal; if so, it continues polling the application state;
7) if the application state is still abnormal, shut down the node.
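The seven steps can be sketched as a polling loop. This is an illustration under stated assumptions rather than the patented implementation: the function name `run_watchdog` and every callback (`app_is_healthy`, `holds_main_vip`, `release_host_identity`, `restart_app`, `shut_down`) are hypothetical hooks that a real deployment would bind to its own health probe, to the RAC service control and to the operating system. The `max_polls` and `sleep` parameters exist only so the loop can be exercised without waiting.

```python
import time
from typing import Callable, Optional


def run_watchdog(
    app_is_healthy: Callable[[], bool],
    holds_main_vip: Callable[[], bool],
    release_host_identity: Callable[[], None],
    restart_app: Callable[[], None],
    shut_down: Callable[[], None],
    poll_interval: float = 1.0,
    max_polls: Optional[int] = None,
    sleep: Callable[[float], None] = time.sleep,
) -> None:
    """Poll the application; on a fault, release the host identity and restart."""
    polls = 0
    while max_polls is None or polls < max_polls:
        polls += 1
        if app_is_healthy():          # steps 1-2: healthy, sleep and query again
            sleep(poll_interval)
            continue
        if holds_main_vip():          # steps 3-4: a faulty host gives up the main VIP
            release_host_identity()
        restart_app()                 # step 5
        if not app_is_healthy():      # steps 6-7: still faulty after the restart
            shut_down()
            return


# Simulated run: healthy, fault (recovers after restart), fault (does not recover).
health = iter([True, False, True, False, False])
state = {"host": True}
events = []


def release() -> None:
    state["host"] = False
    events.append("release")


run_watchdog(
    app_is_healthy=lambda: next(health),
    holds_main_vip=lambda: state["host"],
    release_host_identity=release,
    restart_app=lambda: events.append("restart"),
    shut_down=lambda: events.append("shutdown"),
    sleep=lambda seconds: None,       # skip real sleeping in the simulation
)
print(events)  # ['release', 'restart', 'restart', 'shutdown']
```

Passing the actions in as callables keeps the control flow separate from the platform-specific commands, so the loop itself can be checked with a scripted health sequence as above.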
Compared with the prior art, the present invention has the following advantages:
(1) Low cost: it makes full use of the RAC suite included in the database itself, saving the cost of purchasing operating-system clustering while keeping the servers online 24 hours a day.
(2) Good scalability: as the business expands, servers can be added freely without changing the existing application configuration.
(3) Good reliability: the reliability of this architecture depends mainly on the reliability of RAC.
(4) Short switching time: a redundant machine takes no more than 30 s to switch over to normal service.
Description of drawings
Fig. 1 is a structural diagram of the RAC-based redundant server system of the present invention;
Fig. 2 is a flow chart of host/standby switching in the RAC-based redundant server system of the present invention;
Fig. 3 is a flow chart of the watchdog system in the RAC-based redundant server system of the present invention.
Embodiment
The present invention is further described below with reference to a specific embodiment.
As shown in Fig. 1, Fig. 2 and Fig. 3, a redundant server system based on RAC comprises N nodes 1, each of which runs an application server, a database server and a watchdog system. Each node is connected to a shared disk array 2; one end of each node is connected to a common private network 3, and the other end to a common public network 4. Each node has a virtual IP address on the public network. The public IP address that users access is the main virtual IP address: the server holding the main virtual IP address is called the host, the other servers are standby machines, and switching between the host and the standby machines is carried out by the watchdog system.
The host/standby switching comprises the following flow:
1) Because the whole system is a RAC cluster, the main virtual IP address is used as the host identity. After system startup, a node that has obtained the main virtual IP address becomes the host; otherwise it is a standby machine. While the host and the standby machines are running, the watchdog system monitors whether the application is normal.
2) If the application on the host becomes abnormal, the watchdog system releases the host identity (by stopping the RAC service), so that the node becomes a standby machine; whichever standby machine obtains the host identity then becomes the host.
3) If the application on a standby machine becomes abnormal, the watchdog system attempts to restart the application; if the application recovers after the restart, the node returns to the standby state; otherwise the node is shut down and thereby excluded from the cluster.
4) If the host shuts down or restarts, it releases the host identity automatically: RAC automatically releases all virtual IP addresses bound to that machine, including the main virtual IP address, and transfers them to other nodes; a standby machine then obtains the host identity and becomes the host.
5) If a standby machine shuts down or restarts, RAC on that machine automatically releases all virtual IP addresses bound to it and transfers them to other nodes, without affecting the work of the host.
The watchdog system works as follows:
Step 301: after system startup, query whether the application is normal.
Step 302: if it is normal, sleep for one second and query again.
Step 303: if the application is abnormal (for example, an application thread has stopped responding), the watchdog system determines whether the current node is the host node, that is, whether the current node holds the main virtual IP address.
Step 304: if the current node is the host node, release the host identity.
Step 305: restart the application.
Step 306: after the application has restarted, the watchdog system queries again whether the application is normal; if so, it continues polling the application state.
Step 307: if the application state is still abnormal, shut down the node.
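The numbered steps 301 to 307 can also be read as a small state machine. The sketch below is one possible encoding, not taken from the patent drawings: the `Step` enum names, the `next_step` transition function and the `walk` helper are hypothetical, and it assumes that a successful re-query in step 306 returns to the one-second sleep of step 302. For simplicity `is_host` is fixed for a whole walk, so a walk is meaningful only up to its first fault.

```python
from enum import Enum
from typing import Iterable, List, Optional


class Step(Enum):
    QUERY = 301        # query whether the application is normal
    SLEEP = 302        # healthy: sleep one second, then query again
    CHECK_HOST = 303   # faulty: does this node hold the main virtual IP?
    RELEASE = 304      # host node: release the host identity
    RESTART = 305      # restart the application
    REQUERY = 306      # query again after the restart
    SHUTDOWN = 307     # still faulty: shut the node down


def next_step(step: Step, app_ok: bool, is_host: bool) -> Optional[Step]:
    """Return the step that follows `step`, or None once the node has shut down."""
    if step is Step.QUERY:
        return Step.SLEEP if app_ok else Step.CHECK_HOST
    if step is Step.SLEEP:
        return Step.QUERY
    if step is Step.CHECK_HOST:
        return Step.RELEASE if is_host else Step.RESTART
    if step is Step.RELEASE:
        return Step.RESTART
    if step is Step.RESTART:
        return Step.REQUERY
    if step is Step.REQUERY:
        # Assumption: a recovered application resumes polling via the sleep step.
        return Step.SLEEP if app_ok else Step.SHUTDOWN
    return None  # Step.SHUTDOWN is terminal


def walk(health: Iterable[bool], is_host: bool) -> List[int]:
    """Follow the transitions, consuming one health reading per query step."""
    readings = iter(health)
    step: Optional[Step] = Step.QUERY
    app_ok = True
    path: List[int] = []
    while step is not None:
        path.append(step.value)
        if step in (Step.QUERY, Step.REQUERY):
            reading = next(readings, None)
            if reading is None:
                break  # no more readings: stop the simulated walk here
            app_ok = reading
        step = next_step(step, app_ok, is_host)
    return path


host_path = walk([True, False, False], is_host=True)
standby_path = walk([False, False], is_host=False)
print(host_path)     # [301, 302, 301, 303, 304, 305, 306, 307]
print(standby_path)  # [301, 303, 305, 306, 307]
```

The two walks show the only difference between a host and a standby machine in this flow: the host passes through step 304 to release the host identity before the restart, while the standby machine goes straight from step 303 to step 305.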
The invention has been applied to the transportation production scheduling system of a factory and mining site (hereinafter the "transportation production scheduling system"):
The transportation production scheduling system comprises two servers in a redundant configuration, deployed in the company's central machine room. The two servers share one disk array, in which all system data are stored, effectively keeping data and applications online 24 hours a day.
The system's applications comprise two parts: an ERP system based on a B/S (browser/server) architecture, and an interface service program based on TCP/IP. The applications on both servers are in hot-standby state and serve users through a single main VIP.
A watchdog service is deployed on the central servers to switch over automatically when the application system fails, effectively keeping the applications online 24 hours a day.
In addition, this approach makes scheduled maintenance of individual machines possible: equipment can be maintained at chosen times, which further improves the availability of the system.
Field practice has shown that the RAC-based redundant server architecture effectively reduces maintenance costs and improves the reliability and availability of the system.
Claims (3)
1. A redundant server system based on RAC, characterized in that the system comprises N nodes, each of which runs an application server, a database server and a watchdog system; each node is connected to a shared disk array; one end of each node is connected to a common private network, and the other end of each node is connected to a common public network; each node has a virtual IP address on the public network; the public IP address that users access is the main virtual IP address; the server holding the main virtual IP address is called the host, and the other servers are standby machines; and switching between the host and the standby machines is carried out by the watchdog system.
2. The redundant server system based on RAC according to claim 1, characterized in that the host/standby switching comprises the following flow:
(1) because the whole system is a RAC cluster, the main virtual IP address is used as the host identity; after system startup, a node that has obtained the main virtual IP address becomes the host, and otherwise it is a standby machine; while the host and the standby machines are running, the watchdog system monitors whether the application is normal;
(2) if the application on the host becomes abnormal, the watchdog system releases the host identity (by stopping the RAC service), so that the node becomes a standby machine; whichever standby machine obtains the host identity then becomes the host;
(3) if the application on a standby machine becomes abnormal, the watchdog system attempts to restart the application; if the application recovers after the restart, the node returns to the standby state; otherwise the node is shut down and thereby excluded from the cluster;
(4) if the host shuts down or restarts, it releases the host identity automatically: RAC automatically releases all virtual IP addresses bound to that machine, including the main virtual IP address, and transfers them to other nodes; a standby machine then obtains the host identity and becomes the host;
(5) if a standby machine shuts down or restarts, RAC on that machine automatically releases all virtual IP addresses bound to it and transfers them to other nodes, without affecting the work of the host.
3. The redundant server system based on RAC according to claim 1, characterized in that the watchdog system works as follows:
1) after system startup, query whether the application is normal;
2) if it is normal, sleep for one second and query again;
3) if the application is abnormal (for example, an application thread has stopped responding), the watchdog system determines whether the current node is the host node, that is, whether the current node holds the main virtual IP address;
4) if the current node is the host node, release the host identity;
5) restart the application;
6) after the application has restarted, the watchdog system queries again whether the application is normal; if so, it continues polling the application state;
7) if the application state is still abnormal, shut down the node.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200910194974A CN101686261A (en) | 2009-09-01 | 2009-09-01 | RAC-based redundant server system |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200910194974A CN101686261A (en) | 2009-09-01 | 2009-09-01 | RAC-based redundant server system |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101686261A true CN101686261A (en) | 2010-03-31 |
Family
ID=42049229
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200910194974A Pending CN101686261A (en) | 2009-09-01 | 2009-09-01 | RAC-based redundant server system |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101686261A (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102510343A (en) * | 2011-11-16 | 2012-06-20 | 广东新支点技术服务有限公司 | Highly available cluster system feign death solution based on both remote detection and power management |
CN102521060A (en) * | 2011-11-16 | 2012-06-27 | 广东新支点技术服务有限公司 | Pseudo halt solving method of high-availability cluster system based on watchdog local detecting technique |
CN104283710A (en) * | 2014-08-18 | 2015-01-14 | 四川长虹电器股份有限公司 | Database cluster fault handling method and management server |
CN105843713A (en) * | 2016-04-01 | 2016-08-10 | 杭州沃趣网络科技有限公司 | Method for realizing Oracle RAC (real application cluster) through shared-nothing storage of dual system |
CN106982259A (en) * | 2017-04-19 | 2017-07-25 | 聚好看科技股份有限公司 | The failure solution of server cluster |
CN114827080A (en) * | 2022-06-06 | 2022-07-29 | 武汉四通信息服务有限公司 | IP switching method and system |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102510343A (en) * | 2011-11-16 | 2012-06-20 | 广东新支点技术服务有限公司 | Highly available cluster system feign death solution based on both remote detection and power management |
CN102521060A (en) * | 2011-11-16 | 2012-06-27 | 广东新支点技术服务有限公司 | Pseudo halt solving method of high-availability cluster system based on watchdog local detecting technique |
CN104283710A (en) * | 2014-08-18 | 2015-01-14 | 四川长虹电器股份有限公司 | Database cluster fault handling method and management server |
CN105843713A (en) * | 2016-04-01 | 2016-08-10 | 杭州沃趣网络科技有限公司 | Method for realizing Oracle RAC (real application cluster) through shared-nothing storage of dual system |
CN105843713B (en) * | 2016-04-01 | 2019-06-28 | 杭州沃趣科技股份有限公司 | A kind of method that dual systems realizes Oracle RAC without shared storage |
CN106982259A (en) * | 2017-04-19 | 2017-07-25 | 聚好看科技股份有限公司 | The failure solution of server cluster |
CN114827080A (en) * | 2022-06-06 | 2022-07-29 | 武汉四通信息服务有限公司 | IP switching method and system |
CN114827080B (en) * | 2022-06-06 | 2022-09-23 | 武汉四通信息服务有限公司 | IP switching method and system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102402395B (en) | Quorum disk-based non-interrupted operation method for high availability system | |
CN102394774B (en) | Service state monitoring and failure recovery method for controllers of cloud computing operating system | |
EP2281240B1 (en) | Maintaining data integrity in data servers across data centers | |
EP2648114B1 (en) | Method, system, token conreoller and memory database for implementing distribute-type main memory database system | |
US8032786B2 (en) | Information-processing equipment and system therefor with switching control for switchover operation | |
CN101689114B (en) | Dynamic cli mapping for clustered software entities | |
US8245077B2 (en) | Failover method and computer system | |
CN104408071A (en) | Distributive database high-availability method and system based on cluster manager | |
CN103019889A (en) | Distributed file system and failure processing method thereof | |
CN103199972A (en) | Double machine warm backup switching method and warm backup system achieved based on SOA and RS485 bus | |
CN103346903A (en) | Dual-machine backup method and device | |
CN102394914A (en) | Cluster brain-split processing method and device | |
CN103581225A (en) | Distributed system node processing task method | |
CN101686261A (en) | RAC-based redundant server system | |
CN108984349B (en) | Method and device for electing master node, medium and computing equipment | |
CN102761528A (en) | System and method for data management | |
CN103345470A (en) | Database disaster tolerance method, database disaster tolerance system and server | |
CN102681917A (en) | Operating system (OS) and recovery method thereof | |
CN111935244B (en) | Service request processing system and super-integration all-in-one machine | |
CN102045187B (en) | Method and equipment for realizing HA (high-availability) system with checkpoints | |
JP5285045B2 (en) | Failure recovery method, server and program in virtual environment | |
CN104052799B (en) | A kind of method that High Availabitity storage is realized using resource ring | |
CN113515316A (en) | Novel edge cloud operating system | |
US10067841B2 (en) | Facilitating n-way high availability storage services | |
JP2005055995A (en) | Storage control method and server system with redundancy function |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C12 | Rejection of a patent application after its publication | ||
RJ01 | Rejection of invention patent application after publication | Application publication date: 20100331 |