CN103905234A - Method and system for improving resource availability in distributed system - Google Patents



Publication number
CN103905234A
Authority
CN
China
Prior art keywords
resource
data
service
event
database
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201210580070.XA
Other languages
Chinese (zh)
Inventor
孙晓光
朱海东
王明哲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
BEIJING YOYO TIANYU SYSTEM TECHNOLOGY CO LTD
Original Assignee
BEIJING YOYO TIANYU SYSTEM TECHNOLOGY CO LTD
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by BEIJING YOYO TIANYU SYSTEM TECHNOLOGY CO LTD
Priority to CN201210580070.XA
Publication of CN103905234A
Legal status: Pending


Abstract

The invention relates to the technical field of distributed computer systems and provides a method and system for improving resource availability in a distributed system. The method comprises the following steps: reporting the basic information of a resource when the resource is deployed; monitoring in real time the state of the resources running on a system platform and the device capability and state of the resource runtime environment; determining, from the resource state and the device capability and state, whether an event occurrence condition is satisfied, and raising the corresponding abnormal event according to the result; and, in response to each abnormal event, providing a resource provider with a resource migration guide service and/or providing a resource user with a resource access guide service. By managing, monitoring and allocating the basic resources in a large-scale distributed system, and by providing users with a tracking guide service for resource access and migration based on monitored events, the technical solution ensures the quality of service and the disaster recovery capability when users access resources, and also ensures that non-hardware resources are provided continuously and reliably.

Description

Method and system for improving resource availability in a distributed system
Technical field
The present invention relates to the technical field of computer networks, and in particular to a method and system for improving resource availability in a distributed system.
Background art
With the diversification of computing devices and the growth of the Internet, the IT industry is moving toward large-scale distribution, massive data volumes, highly concurrent access and personalized services. Cloud computing, which has risen in recent years, is one such large-scale distributed system: it provides users with on-demand, dynamically scalable services and/or resources over the Internet, integrates computing resources, and substantially reduces the cost of service while offering users a wide variety of services.
For a distributed system, however, the difficulty of management and maintenance grows exponentially as the system scale and data volume increase, and a failure can cause immeasurable losses to users or to the system. Even without a failure, load imbalance caused by poor management inconveniences users. Moreover, because existing network systems often face millions or even hundreds of millions of network services and/or user accesses, users of a large-scale distributed system often find it difficult to locate application endpoints quickly. How to improve system availability is therefore a problem that every distributed system must address.
In the prior art, cluster systems generally provide high availability through redundancy. The most common approach is to duplicate system components and to switch immediately to a standby component when a component becomes unavailable. A robust high-availability system usually means a system that continues to run even after a hardware or software failure; such a system has no single point of failure (a single point of failure being a single component whose failure makes the whole system unavailable).
The approach used in cluster systems, however, is aimed mostly at hardware failures. A large-scale distributed system such as a cloud computing platform contains a large number of services and virtual resources, and failures of these non-hardware resources affect normal operation just as hardware failures do. Switching hardware every time a service or virtual resource fails would clearly degrade system performance severely. Simple hardware redundancy therefore cannot solve the high-availability problem of a cloud computing platform, and component redundancy further increases the difficulty of system management (for example, load balancing or locating services and data).
Summary of the invention
(1) Technical problem to be solved
To solve the problem that resource availability is difficult to guarantee in the large-scale distributed systems of the prior art, the invention provides a method and system for improving resource availability in a distributed system.
(2) Technical solution
To solve the above technical problem, the invention adopts the following technical solution:
First, the invention provides a method for improving resource availability in a distributed system, the method comprising the steps of:
S1, when a resource is deployed, reporting to the system platform the basic information of the resource as it runs on the system platform;
S2, monitoring in real time the state of the resources running on the system platform and the device capability and state of the resource runtime environment;
S3, determining, from the resource state and the device capability and state, whether an event occurrence condition is met, and raising the corresponding abnormal event according to the result;
S4, in response to each abnormal event, providing a resource migration guide service to the resource provider and/or a resource access guide service to the resource user.
Preferably, in step S3, the abnormal events comprise a first-access event, an access exception event, an alarm event and a resource migration event.
Preferably, in step S4, in response to a first-access event, the resource access guide service is provided to the resource user; in response to a resource migration event, the resource migration guide service is first provided to the resource provider, and after the migration is complete the resource access guide service is provided to the resource user.
Preferably, in step S4, in response to an access exception event or an alarm event, the resource state is determined first. If the resource is running normally, load balancing is performed in the resource runtime environment and the resource access guide service is provided to the resource user; otherwise, the resource migration guide service is first provided to the resource provider, and after the migration is complete the resource access guide service is provided to the resource user.
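The event-to-response rules of step S4 can be read as a small decision table. The following Python sketch is one possible illustration, not the patent's implementation; the names `EventType` and `plan_response` and the action labels are hypothetical.

```python
from enum import Enum, auto


class EventType(Enum):
    """The four abnormal event kinds named for step S3."""
    FIRST_ACCESS = auto()
    ACCESS_EXCEPTION = auto()
    ALARM = auto()
    MIGRATION = auto()


def plan_response(event_type, resource_running_normally=True):
    """Return the ordered actions taken in step S4 for one abnormal event.

    The action labels are illustrative placeholders for the guide services.
    """
    if event_type is EventType.FIRST_ACCESS:
        return ["access_guide"]
    if event_type is EventType.MIGRATION:
        return ["migration_guide", "access_guide"]
    # Access exception or alarm: the resource state decides the branch.
    if resource_running_normally:
        return ["load_balance", "access_guide"]
    return ["migration_guide", "access_guide"]
```

In every branch the access guide comes last, which matches the text: the user is only directed to a resource once it is running normally or has been migrated.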
Preferably, in step S1, reporting the basic information comprises the steps of:
S11, the client places the registration request data in a message queue through a SOAP/HTTP interface; the resource registration module takes the registration request data from the queue and submits it to a directory service (such as UDDI);
S12, the directory service writes the data to the registry/database and generates a service key for the resource;
S13, the resource registration module returns the registration result and the service key to the client, and the client stores the service key.
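Steps S11 to S13 can be sketched in memory as follows. This is a simplified illustration under stated assumptions: the SOAP/HTTP transport and the UDDI directory are replaced by a local queue and a dictionary, the service key is assumed to be a random UUID (the source does not say how keys are generated), and the class names are hypothetical.

```python
import queue
import uuid


class DirectoryService:
    """Stand-in for the directory service; the registry is a plain dict."""

    def __init__(self):
        self.registry = {}

    def publish(self, record):
        # S12: persist the record and generate a service key (assumed: UUID).
        service_key = str(uuid.uuid4())
        self.registry[service_key] = dict(record)
        return service_key


class RegistrationModule:
    """Stand-in for the resource registration module."""

    def __init__(self, directory):
        self.requests = queue.Queue()
        self.directory = directory

    def submit(self, record):
        # S11 (client side): enqueue the registration request data.
        self.requests.put(record)

    def process_one(self):
        # S11 (module side) and S13: dequeue, publish, return result and key.
        record = self.requests.get_nowait()
        service_key = self.directory.publish(record)
        return {"status": "registered", "service_key": service_key}
```

A client would call `submit` with its registration data and keep the `service_key` from the returned result for later updates.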
Preferably, in step S2, monitoring comprises the steps of:
S21, receiving a notification from the resource registration module and obtaining the newly registered information;
S22, according to a preset monitoring schedule policy, sending a monitoring request to the monitored resource server over an NRPE SSL channel;
S23, on receiving the request, the monitored resource server runs a plug-in to collect the current status data of the resource;
S24, the monitored resource server returns the collected status data to the resource monitoring module via the NRPE protocol;
S25, writing the received data to the database.
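One monitoring pass over steps S22 to S25 might look like the sketch below. It makes several assumptions: the scheduling policy is a fixed polling interval, the NRPE request and plug-in execution are replaced by a caller-supplied `collect` function, and the database is a list. None of these names come from the source.

```python
def due_resources(resources, last_polled, now, interval):
    """S22: select resources whose polling interval has elapsed."""
    return [name for name in resources
            if now - last_polled.get(name, float("-inf")) >= interval]


def run_monitoring_cycle(resources, last_polled, collect, store, now,
                         interval=60.0):
    """Poll every due resource once and record its status.

    `collect(name)` stands in for the remote request and plug-in run
    (S22 to S24); `store.append` stands in for the database write (S25).
    """
    polled = due_resources(resources, last_polled, now, interval)
    for name in polled:
        status = collect(name)
        store.append({"resource": name, "time": now, **status})
        last_polled[name] = now
    return polled
```

Passing `now` explicitly keeps the sketch deterministic; a real scheduler would read the clock itself.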
Preferably, the method further synchronizes data among the multiple active database instances that serve as data sources.
Preferably, the data synchronization comprises the steps of:
at the source database, obtaining the data change information through the database operation interface;
applying preset filtering and conversion to the data change information and transmitting it to the target database over a JMS message bus;
replicating the data change information into the target database through the database operation interface, thereby synchronizing the two databases incrementally.
In another aspect, the invention also provides a system for improving resource availability in a distributed system, the system comprising:
a registration module, for reporting to the system platform, when a resource is deployed, the basic information of the resource as it runs on the system platform;
a monitoring module, for monitoring in real time the state of the resources running on the system platform and the device capability and state of the resource runtime environment;
an event module, for determining, from the resource state and the device capability and state, whether an event occurrence condition is met, and raising the corresponding abnormal event according to the result;
a guide module, for providing, in response to each abnormal event, a resource migration guide service to the resource provider and/or a resource access guide service to the resource user.
Preferably, the system further comprises a data synchronization module for synchronizing data among the multiple active database instances that serve as data sources.
(3) Beneficial effects
In the technical solution of the invention, the basic resources in a large-scale distributed system are managed, monitored and allocated, and users are provided with a tracking guide service for resource access and migration based on monitored events. This ensures the quality of service and the disaster recovery capability when users access resources, and also ensures that non-hardware resources are provided continuously and reliably. In addition, through the guide service, the technical solution makes system management more transparent and concise while strengthening the availability of basic and computing resources, and reduces the difficulty of resource management and maintenance.
Detailed description of the embodiments
The technical solutions in the embodiments of the invention are described clearly and completely below. The described embodiments are evidently only some, not all, of the embodiments of the invention. All other embodiments obtained by a person of ordinary skill in the art from these embodiments without creative effort fall within the scope of protection of the invention.
Traditional computer systems mostly run on mutually independent hardware resources, so high availability has mainly meant redundancy of hardware resources. In current large-scale distributed systems, and in cloud computing platforms in particular, system resources are further subdivided into services, data, virtual machines and/or physical devices, and simple hardware redundancy cannot meet the high-availability requirements of such systems.
In an embodiment of the invention, the method for improving resource availability can run as a service on the system platform, monitoring the operation of the platform and of the services on it, and managing and maintaining the resources in the platform. The basic purpose of this service is to give the large-scale distributed system platform that uses it, and the other services running on that platform, a highly reliable and highly trustworthy guarantee for their resources (generally services and/or data). In a preferred embodiment, the technical solution manages and maintains a cloud computing platform that provides resource support and a runtime environment for various network services, including server resources, storage resources, network resources and the software environment that supports running services and data (such as the operating system, the J2EE service runtime environment and database software). The non-hardware resources in the cloud computing platform exist mainly as services or data. Preferably, the services are J2EE Servlets based on an SOA architecture that expose Web Service interfaces; the data sources of the services are relational databases; and both services and data sources are deployed with multiple backups in the system.
In this preferred embodiment, the method for improving resource availability in a distributed system generally comprises the basic steps of resource information management, monitoring, event alarming and service guidance. Each step is described below.
First, the basic information of every resource running on the system platform is obtained. Operating and maintaining a system first requires knowing the basic condition of what is being scheduled; only then can scheduling be planned within the capability of the scheduled resources. In traditional load balancing based on hardware resources, information is usually gathered by periodically collecting the hardware capability and state of the devices. In a cloud computing platform, however, data and services change dynamically: new data sources or services may join or leave the system at any time, and the capability of each resource can also be set dynamically, so it is difficult to know key information such as the capability, content and access method of a data and/or service provider accurately when a resource first starts running. For this reason, embodiments of the invention provide a registration procedure in which the basic information of a resource is actively reported when a data source and/or service is deployed, for example by having the data and service providers register the above key information with the system platform. If the basic condition of a resource need not be known at the start of operation, the key information can instead be inferred by statistical analysis of long-term logs, but this inevitably affects the stability of the system in its early stage.
Second, the state of services and data sources, together with the resource capability and state of their runtime environment, is monitored in real time. After obtaining the registration information of the data and service providers, the system monitors the state information of each resource in real time, such as availability and load. The system also monitors in real time the resource capability and state of the runtime environment of the services and data sources. This information is the basis on which the system implements its control and scheduling functions (such as access navigation, migration navigation and access control).
Third, the monitored state is used to determine whether the occurrence condition of a system event is met, and a specific abnormal event is raised according to the result. So that monitoring, management and maintenance do not disturb the normal operation of resources, embodiments of the invention control resources in an event-triggered manner. Specifically, when the basic information of a resource is reported to the system (that is, at registration with the system platform), the definitions of its abnormal events must be supplied, including the event occurrence condition, the external processing interface and the external notification interface. The event processing module periodically reads resource monitoring information from the database and evaluates it against the expression of each abnormal event. If a predefined occurrence condition is met, the module raises the event information as a prompt and then calls the event's external processing interface or external notification interface to trigger the next operation.
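The periodic evaluation described above can be sketched as follows. This is an assumption-laden illustration: the occurrence condition is modeled as a Python predicate rather than a stored expression, the external processing or notification interface is modeled as a callback, and `EventDefinition` and `evaluate` are hypothetical names.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List


@dataclass
class EventDefinition:
    """An abnormal event definition supplied at registration time."""
    name: str
    condition: Callable[[Dict[str, Any]], bool]      # occurrence condition
    handler: Callable[[str, Dict[str, Any]], None]   # external interface


def evaluate(definitions: List[EventDefinition],
             samples: List[Dict[str, Any]]) -> List[str]:
    """Check each monitoring sample against every definition.

    When a condition holds, the event is raised by calling its handler;
    the names of all raised events are returned in order.
    """
    raised = []
    for sample in samples:
        for definition in definitions:
            if definition.condition(sample):
                definition.handler(definition.name, sample)
                raised.append(definition.name)
    return raised
```

For example, an alarm definition could use `lambda s: s["load"] > 0.9` as its condition; the threshold is purely illustrative.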
Finally, a resource guide service is provided to the users of the system platform. The basic goal of the invention is for the system to remain usable under any circumstances. Because a cloud computing platform involves many types and large numbers of resources, and the users who provide and consume those resources are numerous and widely distributed, the convenience of user access must be considered alongside reliability (a resource that users find hard to reach is generally also regarded as unreliable). For this reason, embodiments of the invention provide a guide service to users (both resource providers and resource users) according to a predetermined policy, based on resource capability and usage. Specifically, based on the various abnormal events, when a resource fails or must be adjusted because of its usage (whether automatically by the system or manually by the provider), a resource migration guide service is provided to the resource provider; when a resource is running normally or has been migrated successfully, a resource access guide service is provided to the resource user. In this way resources can be allocated dynamically and reasonably to the nodes, users can be given the optimal access node and route, data and service scheduling and access become transparent, and the reliability of the system as a whole is ensured.
In a preferred embodiment, because cloud computing must often deal with heterogeneous resources and resource providers seldom pay attention to compatibility with other resources, the invention supports two modes for each of the above steps. In the first mode, the technical solution defines a standard interface specification; if the cloud platform and/or the services running on it support this specification, a service can register itself actively through the interface, and the system can call the same interfaces directly to monitor resources and services. In the second mode, if the cloud platform and/or its services cannot support the interface specification directly, the invention provides a remote service agent through which the system performs the corresponding registration and monitoring functions. In a preferred embodiment, the remote service agent can be the NRPE (Nagios Remote Plugin Executor) daemon.
Specifically, the registration step is implemented by the registration module of the system (which exists as a registration service running in the system); the registration module is further divided into a service registration module and a data registration module. Depending on whether the data or service supports the standard registration interface specification, registration takes one of two modes, direct registration or proxy registration:
A. Direct registration: after a database or service has started and finished initializing, it calls the registration module interface (SOAP/HTTP) directly to register;
B. Proxy registration: the remote service agent, according to the system configuration, monitors the database or service and, once it has started and finished initializing, calls the registration module interface (SOAP/HTTP) to register on its behalf.
Whichever mode is used, the registration procedure carried out through the interface call is as follows:
1) The client registration module (a client application running on the service and/or data server) places the registration request data in a message queue through the SOAP/HTTP interface, and the service registration module and data registration module take the registration request data from the queue;
2) The system registration module submits the registration data to the directory service through the service registration module and data registration module. In general, the registration information of a service includes the service name and description, service access endpoint, service access binding protocol, service interface, service function operations, service invocation information and service state query method; the registration information of a data source includes the data source name and description, data access endpoint, data access binding protocol, data definition information and data state query method.
3) The directory service writes the registration data to the registry/database and generates the service key of the service or data;
4) The system registration module returns the registration result and the service key to the client registration module, and the client stores the service key.
The monitoring step is implemented by the monitoring module of the system. Over an SSL (Secure Sockets Layer) connection, the system calls the remote service agent deployed on each server to be monitored in order to monitor the system state, the resource usage and the state of the services running on that server. Through the remote service agent the system obtains real-time service availability and running state information, and it promptly updates the resource service information database with the data obtained.
The resource monitoring procedure is as follows:
1) Receive a notification from the system registration module and obtain the newly registered information;
2) According to a preset monitoring schedule policy, send a monitoring request to the monitored service and/or data server over the NRPE SSL channel;
3) On receiving the request, the monitored service and/or data server runs a plug-in to collect the current status data of the resource;
4) The monitored service and/or data server returns the collected status data to the resource monitoring module via the NRPE protocol;
5) Write the received data to the database.
Event triggering and handling are the purpose of monitoring and the starting point of resource guidance. To reduce the impact of the monitoring service on resource operation, the monitoring module runs as an independent service (or server system process), periodically analyzes the collected data, determines from the definitions of the abnormal events whether a corresponding event has occurred, and calls the relevant interface to notify the guide module of the abnormal event for further processing.
The resource guide is what makes high availability achievable in the technical solution of the invention. It is subdivided into an access guide and a migration guide. On the one hand, it assists resource migration (copy and transfer) when the system fails or the load is unbalanced, which ensures the reliability of resources; on the other hand, it provides transparent access control in an environment of large data volumes, many users and wide distribution, which ensures the ease of use of resources. The access guide directs a resource user to the resource it needs. It responds to specific access events, such as a user's first-access event for a resource: from the provider's registration information, the resource capability, the collected real-time resource usage information (such as resource load) and a predefined access policy, it computes the optimal access endpoint and provides it to the user. The same optimal endpoint can be given directly to the same user on later accesses to the resource. If the user's accesses fail repeatedly, the resource does not respond for a long time, or the system load is seriously unbalanced, an access exception event or alarm event is usually raised and may trigger a resource migration; once the migrated resource is running stably, the system again provides the resource access guide service to the user. Under normal circumstances, the resource access guide service proceeds as follows:
Receive the resource access request sent by the resource user and, based on the request, query the directory service to obtain the list of resources that satisfy it;
For each resource in the list, retrieve the monitoring information to obtain the resource's basic information, resource state information, and device capability and state information;
Compute the optimal resource access endpoint and return the result to the resource user.
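The final step of the access guide can be illustrated with a minimal selection function. The source does not specify how the optimal endpoint is computed; the sketch below assumes one simple policy (lowest load-to-capacity ratio among available candidates), and the field names are hypothetical.

```python
def choose_access_endpoint(candidates):
    """Pick an access endpoint from the monitored candidate list.

    Each candidate is a dict with 'endpoint', 'available', 'load' and a
    positive 'capacity'. The policy used here, lowest relative load among
    available candidates, is an assumption made for illustration.
    Returns None when no candidate is available.
    """
    usable = [c for c in candidates if c["available"]]
    if not usable:
        return None
    best = min(usable, key=lambda c: c["load"] / c["capacity"])
    return best["endpoint"]
```

A `None` result corresponds to the case where no resource can serve the request, which in the described method would normally surface as an access exception event.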
The resource migration guide applies when a resource must be moved dynamically to another backup because of a hardware or software failure or because of system load (in which case an access exception event or alarm event is usually raised), or when the resource provider initiates a migration itself (in which case a resource migration event is raised). From the registration information of the resource, the capability of the relevant system resources, the collected real-time resource usage information (such as resource load) and a predefined migration policy, the guide computes the optimal migration endpoint for the resource. Using the recorded basic information and state information, the resource associated with the event is then moved in its entirety to that endpoint. After the migration is complete, the registration information of the resource is updated accordingly, and the resource access guide is started when users next access the resource.
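The endpoint choice and the registry update at the end of a migration can be sketched as below. The migration policy is not given in the source; this sketch assumes the target is the healthy node with the most spare capacity, omits the actual copying of the resource, and uses hypothetical names throughout.

```python
def migrate(registry, service_key, nodes):
    """Choose a migration endpoint and update the registration record.

    `registry` maps service keys to records holding an 'endpoint';
    `nodes` is a list of dicts with 'endpoint', 'healthy', 'capacity'
    and 'load'. Copying the resource itself is out of scope here.
    Returns the new endpoint, or None if no target is suitable.
    """
    record = registry[service_key]
    targets = [n for n in nodes
               if n["healthy"] and n["endpoint"] != record["endpoint"]]
    if not targets:
        return None
    # Assumed policy: prefer the node with the most spare capacity.
    best = max(targets, key=lambda n: n["capacity"] - n["load"])
    record["endpoint"] = best["endpoint"]
    return best["endpoint"]
```

Because the registry is updated in place, a later access guide lookup would see the new endpoint, which mirrors the order of operations in the text.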
Further, to ensure the reliability of data and the success rate of service migration, so that a service can be moved dynamically while keeping its current data content, the invention can also synchronize data among the multiple active database instances that serve as data sources. In a preferred embodiment, the system deploys a relational database replication and synchronization tool on the active node of each database to synchronize data incrementally: at the source database, the tool obtains the data change information through the database operation interface; after preset filtering and conversion, the information is transmitted reliably to the target database over a JMS message bus; the data changes are then replicated into the target database through the database operation interface, so that the databases at the two ends are synchronized incrementally. Data synchronization in the preferred embodiment supports Oracle and MySQL databases. The data change information is obtained mainly by analyzing the data change logs of the database (Oracle or MySQL), for example with a tool provided by the database vendor (such as Oracle GoldenGate). The collected data change information is then converted automatically into an executable sequence of SQL statements in text form. According to a preset policy, the generated SQL text is filtered and subjected to statement conversion, encoding conversion, type conversion and so on, to meet the requirements of the business system and the remote database. The SQL text is then transmitted reliably and efficiently to one or more remote endpoints, with support for incremental transmission and resumable transmission. Finally, the received SQL text is applied to the target database: executing it there completes the data replication and synchronization. The technical solution of the invention also provides database checkpoint support, for implementing checkpoint-based incremental data replication and synchronization.
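Checkpoint-based incremental replication of SQL text can be sketched with the standard `sqlite3` module. This is an illustration only: the change log is modeled as a list of (sequence number, SQL text) pairs, the JMS transport and log mining are omitted, SQLite stands in for Oracle or MySQL, and `replicate` and `accept` are hypothetical names.

```python
import sqlite3


def replicate(change_log, target, checkpoint, accept=lambda sql: True):
    """Apply changes newer than `checkpoint` to the target database.

    `change_log` is an ordered list of (sequence_number, sql_text) pairs.
    `accept` models the preset filtering policy. Returns the new
    checkpoint, i.e. the sequence number of the last change processed.
    """
    last = checkpoint
    for sequence, sql in change_log:
        if sequence <= checkpoint:
            continue  # already replicated in an earlier run
        if accept(sql):
            target.execute(sql)
        last = sequence
    target.commit()
    return last
```

Storing the returned checkpoint lets a later run resume after an interruption without reapplying earlier statements, which is the role the text assigns to checkpoint support.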
A person of ordinary skill in the art will appreciate that all or some of the steps of the methods in the above embodiments can be carried out by hardware under the instruction of a program. The program can be stored in a computer-readable storage medium and, when executed, carries out the steps of the methods in the above embodiments; the storage medium can be ROM/RAM, a magnetic disk, a hard disk, an optical disc, a memory card or the like. Accordingly, corresponding to the method of the invention, the invention also includes a high-availability security enhancement service system, the system comprising:
a registration module, for having data and/or service providers register with the cloud computing platform when a data source and/or service is deployed, and for obtaining the basic information of every resource running on the cloud computing platform;
a monitoring module, for monitoring in real time the state of services and data sources together with the resource capability and state of their runtime environment;
a navigation module, for providing, according to the basic information registered by the provider, the resource capability, the state information monitored in real time and the navigation policy, a migration navigation service to the resource provider when a resource fails or must be adjusted because of its usage, and an access navigation service to the resource user when a resource is running normally or has been migrated.
In summary, the technical solution of the invention allocates and monitors the basic resources in a large-scale distributed system (a cloud computing platform in particular) and provides registration management together with follow-up services for access and migration navigation, which strengthens the availability of basic and computing resources. In addition, through the navigation service, the technical solution strengthens the ability to guarantee service under network congestion and the ability to guarantee disaster recovery when a regional network is paralyzed.
The above embodiments only illustrate the invention and do not limit it. A person of ordinary skill in the relevant technical field can make various changes and modifications without departing from the spirit and scope of the invention; all equivalent technical solutions therefore also fall within the scope of the invention, and the scope of patent protection of the invention is defined by the claims.

Claims (10)

1. A method for improving resource availability in a distributed system, characterized in that the method comprises the steps of:
S1, when a resource is deployed, reporting to the system platform the basic information of the resource as it runs on the system platform;
S2, monitoring in real time the state of the resources running on the system platform and the device capability and state of the resource runtime environment;
S3, determining, from the resource state and the device capability and state, whether an event occurrence condition is met, and raising the corresponding abnormal event according to the result;
S4, in response to each abnormal event, providing a resource migration guide service to the resource provider and/or a resource access guide service to the resource user.
2. The method according to claim 1, characterized in that, in step S3, the exception events comprise a first-access event, an access exception event, an alarm event and a resource migration event.
3. The method according to claim 2, characterized in that, in step S4, in response to a first-access event, a resource access guide service is provided for the resource user; in response to a resource migration event, a resource migration guide service is first provided for the resource provider and, after the resource has been migrated, a resource access guide service is provided for the resource user.
4. The method according to claim 2, characterized in that, in step S4, in response to an access exception event or an alarm event, the resource state is determined first; if the resource is running normally, load balancing is performed in the resource running environment and a resource access guide service is provided for the resource user; otherwise, a resource migration guide service is first provided for the resource provider and, after the resource has been migrated, a resource access guide service is provided for the resource user.
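The event handling recited in claims 2 to 4 can be read as a small dispatch table. The following Python sketch is illustrative only and is not part of the claims: the names `Event`, `Resource`, `handle_event` and the action labels are hypothetical, and each guide service is reduced to a string so that only the ordering of actions is shown.

```python
from __future__ import annotations

from dataclasses import dataclass
from enum import Enum, auto


class Event(Enum):
    """The four event types named in claim 2 (identifiers are hypothetical)."""
    FIRST_ACCESS = auto()
    ACCESS_EXCEPTION = auto()
    ALARM = auto()
    RESOURCE_MIGRATION = auto()


@dataclass
class Resource:
    name: str
    running_normally: bool


def handle_event(event: Event, resource: Resource) -> list[str]:
    """Return the ordered actions taken in response to an event (step S4)."""
    if event is Event.FIRST_ACCESS:
        # Claim 3: a first access only needs the access guide.
        return ["access_guide"]
    if event is Event.RESOURCE_MIGRATION:
        # Claim 3: guide the provider through migration, then guide the user.
        return ["migration_guide", "access_guide"]
    # Claim 4: for an access exception or an alarm, check the resource state first.
    if resource.running_normally:
        return ["load_balance", "access_guide"]
    return ["migration_guide", "access_guide"]
```

In this reading, the resource user always ends with an access guide; what differs between events is whether load balancing or a migration precedes it.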
5. The method according to claim 1, characterized in that, in step S1, the process of reporting the basic information comprises the steps of:
S11, the client places registration request data into a message queue through a SOAP/HTTP interface, and the resource registration module takes the registration request data from the queue and submits the data to a directory service;
S12, the directory service writes the data into a registry/database and generates a service key for the resource;
S13, the resource registration module returns the registration result and the service key to the client, and the service key is stored at the client.
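Steps S11 to S13 describe a queue-backed registration flow. The sketch below is a minimal in-process illustration under stated assumptions: the SOAP/HTTP interface is omitted, a standard-library `queue.Queue` stands in for the message queue, a dictionary stands in for the registry/database, and the service key is a random UUID (the claim does not say how the key is generated). All class and method names are hypothetical.

```python
import queue
import uuid


class DirectoryService:
    """Stand-in for the directory service of step S12."""

    def __init__(self):
        self.registry = {}  # assumption: a dict replaces the registry/database

    def register(self, request: dict) -> str:
        # S12: persist the data and generate a service key for the resource.
        service_key = uuid.uuid4().hex  # assumption: key format is not specified
        self.registry[service_key] = dict(request)
        return service_key


class RegistrationModule:
    """Stand-in for the resource registration module of steps S11 and S13."""

    def __init__(self, directory: DirectoryService):
        self.requests = queue.Queue()  # assumption: in-process message queue
        self.directory = directory

    def submit(self, request: dict) -> None:
        # S11 (client side): place the registration request in the queue.
        self.requests.put(request)

    def process_one(self) -> dict:
        # S11 (module side): take one request and submit it to the directory.
        request = self.requests.get_nowait()
        service_key = self.directory.register(request)
        # S13: return the result and the key; the client stores the key.
        return {"status": "ok", "service_key": service_key}
```

A client would call `submit(...)`, and later keep the `service_key` from the returned result for subsequent requests about that resource.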
6. The method according to claim 5, characterized in that, in step S2, the monitoring process comprises the steps of:
S21, receiving a notification from the resource registration module and obtaining the newly registered information data;
S22, initiating, according to a preset monitoring scheduling strategy, a monitoring request to the monitored resource server through an NRPE SSL channel;
S23, after receiving the request, the monitored resource server executes a plug-in to collect the current state data of the resource, thereby completing the monitoring request;
S24, the monitored resource server returns the collected state data to the resource monitoring module via the NRPE protocol;
S25, writing the received collected data into a database.
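One polling cycle of steps S22 to S25 can be sketched as follows. This is an assumption-laden illustration, not the patented implementation: the NRPE request over SSL is replaced by an injected callable `query_agent`, so no real NRPE client API is used or implied; the scheduling strategy is reduced to "poll every registered resource once"; and an in-memory SQLite table stands in for the monitoring database. The function, table and column names are hypothetical.

```python
import sqlite3
import time
from typing import Callable, Dict


def open_store() -> sqlite3.Connection:
    """Create an in-memory table for collected samples (hypothetical schema)."""
    db = sqlite3.connect(":memory:")
    db.execute(
        "CREATE TABLE samples "
        "(resource TEXT, host TEXT, status TEXT, collected_at REAL)"
    )
    return db


def poll_once(
    db: sqlite3.Connection,
    resources: Dict[str, str],          # resource name -> monitored server host
    query_agent: Callable[[str], str],  # stand-in for the NRPE/SSL request
    now: Callable[[], float] = time.time,
) -> None:
    """Query every registered resource once and store the results."""
    for name, host in resources.items():
        # S22-S24: send the monitoring request and receive the status data.
        status = query_agent(host)
        # S25: write the collected data into the database.
        db.execute(
            "INSERT INTO samples (resource, host, status, collected_at) "
            "VALUES (?, ?, ?, ?)",
            (name, host, status, now()),
        )
    db.commit()
```

Injecting `query_agent` keeps the sketch testable without a monitored server; in a deployment it would wrap whatever NRPE client the platform actually uses.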
7. The method according to any one of claims 1 to 6, characterized in that the method further performs data synchronization among multiple active database entities serving as data sources.
8. The method according to claim 7, characterized in that the data synchronization specifically comprises the steps of:
at the source database end, obtaining data change information in the database through a database operation interface;
performing preset filtering and transformation on the data change information, and transmitting it to the target database end over a JMS message bus;
replicating the data change information into the target database through a database operation interface, thereby performing incremental data synchronization between the databases at the two ends.
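The three synchronization steps of claim 8 (capture, filter and transmit, replicate) can be illustrated with the sketch below. It rests on several assumptions that are not in the claim: a standard-library `queue.Queue` stands in for the JMS message bus, databases are modeled as nested dictionaries, a change record is a dictionary with the hypothetical fields `op`, `table`, `key` and `row`, and the "preset filtering" is a simple table allow-list with an identity transformation.

```python
import queue
from typing import Iterable, Optional, Set


def filter_and_transform(change: dict, tables: Set[str]) -> Optional[dict]:
    """Keep only changes to replicated tables (assumed filtering rule)."""
    if change["table"] not in tables:
        return None
    return dict(change)  # assumption: the transformation is a plain copy


def publish(changes: Iterable[dict], bus: queue.Queue, tables: Set[str]) -> None:
    """Source side: filter captured changes and put them on the bus."""
    for change in changes:
        message = filter_and_transform(change, tables)
        if message is not None:
            bus.put(message)


def apply_pending(bus: queue.Queue, target: dict) -> int:
    """Target side: replay queued changes; returns how many were applied."""
    applied = 0
    while True:
        try:
            message = bus.get_nowait()
        except queue.Empty:
            return applied
        table = target.setdefault(message["table"], {})
        if message["op"] == "delete":
            table.pop(message["key"], None)
        else:  # treat every other operation as an insert-or-update
            table[message["key"]] = message["row"]
        applied += 1
```

Because only change records cross the bus, the target is brought up to date incrementally rather than by copying whole tables, which is the point of the final step of the claim.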
9. A system for improving resource availability in a distributed system, characterized in that the system comprises:
a registration module, configured to report to the system platform, when a resource is deployed, the basic information of the resource as it runs on the system platform;
a monitoring module, configured to monitor in real time the state of the resources running on the system platform and the device capability and state of the resource running environment;
an event module, configured to determine, according to the resource state and the device capability and state, whether an event occurrence condition is met, and to throw the corresponding exception event according to the determination result;
a guide module, configured to, in response to each exception event, provide a resource migration guide service for the resource provider and/or provide a resource access guide service for the resource user.
10. The system according to claim 9, characterized in that the system further comprises a data synchronization module, configured to perform data synchronization among multiple active database entities serving as data sources.
CN201210580070.XA 2012-12-28 2012-12-28 Method and system for improving resource availability in distributed system Pending CN103905234A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210580070.XA CN103905234A (en) 2012-12-28 2012-12-28 Method and system for improving resource availability in distributed system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210580070.XA CN103905234A (en) 2012-12-28 2012-12-28 Method and system for improving resource availability in distributed system

Publications (1)

Publication Number Publication Date
CN103905234A true CN103905234A (en) 2014-07-02

Family

ID=50996390

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210580070.XA Pending CN103905234A (en) 2012-12-28 2012-12-28 Method and system for improving resource availability in distributed system

Country Status (1)

Country Link
CN (1) CN103905234A (en)

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100214940A1 (en) * 2009-02-23 2010-08-26 Macauley Daniel W Methods and Systems for Monitoring Changes Made to a Network that Alter the Services Provided to a Server
CN102135929A (en) * 2010-01-21 2011-07-27 腾讯科技(深圳)有限公司 Distributed fault-tolerant service system
CN101969391A (en) * 2010-10-27 2011-02-09 北京邮电大学 Cloud platform supporting fusion network service and operating method thereof
CN102624919A (en) * 2012-03-30 2012-08-01 电子科技大学 Distributed service integrated system for service-oriented architecture and application method thereof

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104063560A (en) * 2014-07-08 2014-09-24 广东轩辕网络科技股份有限公司 Dispatching system and method based on cloud computing platform
CN104063560B (en) * 2014-07-08 2018-04-17 广东轩辕网络科技股份有限公司 Scheduling system and dispatching method based on cloud computing platform
CN104270434A (en) * 2014-09-22 2015-01-07 珠海许继芝电网自动化有限公司 Service state monitoring system based on cloud service
CN104394198A (en) * 2014-11-10 2015-03-04 中国电子科技集团公司第二十八研究所 A global scheduling method based on an ESB
CN104394198B (en) * 2014-11-10 2017-08-25 中国电子科技集团公司第二十八研究所 A kind of overall scheduling method based on ESB
CN104503974B (en) * 2014-11-17 2017-07-18 杭州斯凯网络科技有限公司 A kind of relational database automatic optimization method based on cloud platform
CN104503974A (en) * 2014-11-17 2015-04-08 杭州斯凯网络科技有限公司 Automatic optimization method of relational database on the basis of cloud platform
CN106598723A (en) * 2015-10-19 2017-04-26 北京国双科技有限公司 Configuration method and device for resources in distributed system
CN105589924A (en) * 2015-11-23 2016-05-18 江苏瑞中数据股份有限公司 Transaction granularity synchronizing method of database
CN106685684A (en) * 2015-12-22 2017-05-17 北京轻元科技有限公司 System-level management method of container in cloud calculating
CN106685684B (en) * 2015-12-22 2019-06-11 北京轻元科技有限公司 The system-level management method of container in cloud computing
WO2017107484A1 (en) * 2015-12-23 2017-06-29 深圳市华讯方舟软件技术有限公司 Cloud computing monitoring method and device
CN109643264A (en) * 2016-06-24 2019-04-16 施耐德电子系统美国股份有限公司 Dynamically promote method, system and the equipment of non-boundary, high availability M..N active configuration management using supplemental resources
CN109643264B (en) * 2016-06-24 2023-01-03 施耐德电子系统美国股份有限公司 Method, system and apparatus for dynamically facilitating M: N work configuration management with supplemental resources
CN109413125A (en) * 2017-08-18 2019-03-01 北京京东尚科信息技术有限公司 The method and apparatus of dynamic regulation distributed system resource
CN112116270A (en) * 2020-09-27 2020-12-22 成都中科合迅科技有限公司 Scientific computing service arrangement system based on heterogeneous computing resources
CN116170448A (en) * 2023-04-20 2023-05-26 河北先见软件科技股份有限公司 Cross-organization data sharing method and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20140702