WO2015071975A1 - Application- and data-distribution management method, application- and data-distribution management system, and storage medium - Google Patents

Application- and data-distribution management method, application- and data-distribution management system, and storage medium Download PDF

Info

Publication number
WO2015071975A1
WO2015071975A1 PCT/JP2013/080672 JP2013080672W WO2015071975A1 WO 2015071975 A1 WO2015071975 A1 WO 2015071975A1 JP 2013080672 W JP2013080672 W JP 2013080672W WO 2015071975 A1 WO2015071975 A1 WO 2015071975A1
Authority
WO
WIPO (PCT)
Prior art keywords
condition
application
data
user
slo
Prior art date
Application number
PCT/JP2013/080672
Other languages
French (fr)
Japanese (ja)
Inventor
洋 中越
崇利 加藤
Original Assignee
株式会社日立製作所
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 株式会社日立製作所 filed Critical 株式会社日立製作所
Priority to PCT/JP2013/080672 priority Critical patent/WO2015071975A1/en
Publication of WO2015071975A1 publication Critical patent/WO2015071975A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3442Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment for planning or managing the needed capacity
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3452Performance evaluation by statistical analysis
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/50Network service management, e.g. ensuring proper service fulfilment according to agreements

Definitions

  • the present invention relates to a technique for managing the arrangement of applications and data.
  • Cloud computing that provides information technology as a service is widespread.
  • Many cloud computing service providers (hereinafter referred to as cloud providers) that provide various cloud computing services have emerged, and cloud providers have built data centers to provide cloud computing services in a wide area on a global scale. Yes.
  • Non-Patent Document 1 discloses a technique for automatically arranging applications and application data in consideration of position information of end users who use application services.
  • Non-Patent Document 1 various constraint information, specifically, end information of the end user or the terminal used by the end user, position information of the data center, characteristic information of the application and application data, Service level target (SLO: Service Level Objects) information, data center characteristic information, data center cost information, and the like that are set for each service and transaction configured by applications and application data are input as constraint information. Then, the application and application data must be arranged in a data center where the application and application data are distributed so as to satisfy all the input constraint information. This arrangement plan is a NP (Non-deterministic Polynomial) difficult problem and requires a huge amount of calculation.
  • NP Non-deterministic Polynomial
  • application indicates an application binary that provides a part of the service to the end user.
  • Application data indicates data accessed by the application binary.
  • KPI Key Performance Indicator
  • SLO is a conditional statement related to the performance of the computer system such as “response time of 1 second or less”.
  • KPI items and SLO items items targeted by KPI and SLO, such as “contract rate” and “response time”, will be referred to as KPI items and SLO items, respectively, and values that KPI and SLO should satisfy such as “20% or more” and “1 second or less”.
  • threshold are referred to as KPI standard and SLO standard, respectively.
  • the contract rate indicates, for example, the ratio of users who have purchased a product among users who have visited the WEB site.
  • the biggest factor that requires an enormous amount of calculation is that even if there are many SLOs, they are set only for each transaction, and the number of combinations of patterns for arranging applications and application data to satisfy the SLOs is enormous. is there. Specifically, if the number of data centers is 100 and the number of applications and application data is 1000, the number of calculation patterns is 100 ⁇ 1000 (100 to the 1000th power).
  • the present invention has been made in view of the above problems, and an object of the present invention is to reduce the amount of calculation required for an application or application data arrangement plan that satisfies SLO.
  • the present invention is an application and data arrangement management method in which a management computer having a processor and a memory and connected to a data center and a user terminal manages an application and data arrangement that can be divided into one or more data centers.
  • An agent deployed in the data center monitors the data center, and an agent deployed in the user terminal monitors the user terminal, and the agent of the data center and the user terminal
  • the third step of collecting from the agent of the terminal and the management computer have a correlation between the user trace information and the first condition, with a predetermined service level target of the application and data as a second condition
  • the present invention it is possible to greatly reduce the amount of calculation required for the target application satisfying the service level and the data allocation plan. For example, if the number of data centers is 100 and the number of applications and application data is 1000, the number of patterns to be calculated is about 100 * 1000.
  • FIG. 1A to 1C are block diagrams of a computer system according to an embodiment of the present invention.
  • FIG. 1A is a block diagram illustrating an example of functions of a data center and a user terminal.
  • FIG. 1B is a block diagram illustrating an example of a management computer that manages cloud computing.
  • FIG. 1C is a block diagram illustrating an example of the configuration of the data center.
  • the computer system of the present invention includes a plurality of user terminals 1-1 to 1-n, a plurality of data centers 2-1 to 2-n, and data centers 2-1 to 2- A management computer 100 that manages n is connected.
  • the user terminal 1-1 to 1-n is generically referred to as a user terminal 1
  • the data center 2-1 to 2-n is generically referred to as a data center 2 (hereinafter DC2).
  • User terminal 1 uses services provided on cloud computing.
  • a plurality of computing nodes are installed in DC2, and cloud computing is operated.
  • DC2 indicates a data center.
  • DC2 is indicated. Indicates a rack in which a plurality of computing nodes are combined or a cluster constituted by a plurality of racks.
  • the user application 10 of the user terminal 1 is software that processes a part of the service provided by the service provider (or DC2), and is downloaded to the user terminal 1.
  • the user application 10 is a service GUI that is executed on, for example, a browser application installed in the user terminal 1 and receives user operations.
  • FIG. 1C is a block diagram illustrating an example of DC2.
  • DC2-1 will be described, but DC2-2 to 2-n have the same configuration.
  • the DC 2-1 includes a plurality of computing nodes (# 1 to #n) 200-1 to 200-n, a storage apparatus 250, and a network 260 that interconnects the gateway apparatus 270.
  • a generic name of the nodes 200-1 to 200-n is represented by a node 200.
  • Each node 200 is connected to the user terminal 1 and the management computer 100 from the external network 50 via the internal network 260 and the gateway device 270.
  • the node (# 1) 200-1 includes a physical computer 201-1, a virtualization unit 202-1 that assigns the computer resources of the physical computer 201-1 to one or more virtual computers 210-1 to 210-n, and each virtual machine 201-1. And applications 14-1 to 14-n running on the OS 211-1 to 211-n of the computers 210-1 to 210-n.
  • the OS 211-1 to 211 -n is generically referred to as OS 211
  • the applications 14-1 to 14 -n are generically referred to as application 14.
  • the physical computer 201-1 includes a processor 2011 and a memory 2012.
  • the storage apparatus 250 stores the OS 211, the application 14, or data 140 used by the application 14.
  • the DC 2-1 generates one or more virtual machines 210 according to a command from the management computer 100, executes the OS 211 and the application 14, and provides the service of the application 14 to the user terminal 1.
  • the hardware and software shown in FIG. 1C are DC2 resources, and the management computer 100 manages the resources for each DC2.
  • an application 14 that provides a service to the user terminal 1 a user trace acquisition unit 11 that acquires user trace information such as a history of use of the application 14 by the user terminal 1, and a user trace acquisition unit 11 have acquired.
  • Accepts requests from a user trace storage unit 12 that stores user trace information a resource information acquisition unit 15 that acquires DC2-1 resource information, a resource information storage unit 16 that stores resource information, and a management computer 100
  • a transmission / reception unit 131 that transmits user trace information or resource information.
  • the user trace acquisition unit 11 and the resource information acquisition unit 15 can be executed by a predetermined virtual computer 210 or physical computer 201, and the user trace storage unit 12 and the resource information storage unit 16 can be set in the storage apparatus 250. .
  • the user trace acquisition unit 11 and the resource information acquisition unit 15 function as an agent of the management computer 100 in each DC2.
  • the user terminal 1 provides an application 10 that executes a predetermined process such as a process of connecting to the application 14 of the DC 2 and receiving a user input, and a transmission / reception unit 13 that communicates with the outside.
  • the user terminal 1 includes a user trace acquisition unit 11A that acquires performance information such as response time viewed from the user terminal, and a user trace storage unit 12A that stores user trace information acquired by the user trace acquisition unit 11A.
  • the user terminal 1 includes a processor and a memory (not shown). Further, the user trace acquisition unit 11 ⁇ / b> A and the user trace storage unit 12 ⁇ / b> A function as an agent of the management computer 100 in each user terminal 1.
  • the user trace acquisition unit 11 is implemented at least in the DC 2 and, if possible, is implemented in the user terminal 1 as the user trace acquisition unit 11 A, and the actual service usage history (service action log) by the user using the user terminal 1 is recorded. Acquired and stored in the user trace storage unit 12A.
  • the user trace information means that each DC 2 or an agent in each user terminal 1 monitors requests exchanged between users who use the user terminal 1, all applications 14 and data 140, and communication and processing related to responses. This is information gathered for each service transaction (or workload) by acquiring information related to logs and performance. Therefore, a set of user trace information related to one transaction (hereinafter referred to as transaction trace) is composed of one or more user trace information. User trace information collected by each agent is collected in the management computer 100.
  • the user trace information can be acquired using a known or well-known method. For example, “Dapper, a large-scale distributed systems tracing infrastructure, structure.” (Sigelman, B. H., Barroso, L. A., Burs, M., P. M., Peter. ... & Shanbag, C. 2010, Google research) may be applied.
  • the information acquired by the user trace information includes information on the tables T1 and T2 shown in FIGS. 2A and 2B.
  • FIG. 2A is a diagram illustrating an example of a table T1 that stores information in units of user traces.
  • FIG. 2B is a diagram illustrating an example of a table T2 that stores information in transaction units.
  • a table T1 includes a Trace ID 301 for storing an identifier of user trace information, a Parent ID 302 for storing an identifier having a parent relationship between user traces, and a Sibling ID 303 for storing an identifier having a sibling relationship between user traces.
  • the Destination Location 312, the Response Time 313 that stores the response time for the trace, the Throughput 314 that stores the throughput of the transaction, and the TAT 315 that stores the turnaround time of the transaction constitute one entry.
  • the location information (309, 312) may be expressed in latitude and longitude as shown in the table T1, or may be converted by a GeoIP database service or the like by holding network location information such as an IP address (for example, , Quova IP Geo-Location Database. ⁇ Http://www.quova.com>).
  • part of the service is an application (APP in the figure) It is composed of A and application B, and it is recorded that the service is established by two connections of the user terminal 1 used by the user from application A and application A to application B.
  • APP application
  • SLO Service Level Objects
  • DC2 and network 50 such as response time, throughput, and TAT, as described above, and excludes security, availability, reliability, and the like.
  • the table T2 includes a transaction ID 321 storing an identifier for each transaction, a parent ID 322 storing an identifier of a transaction having a parent relationship between transactions, and an identifier of a transaction having a sibling relationship between transactions.
  • Sibling ID 323 to be stored Trace Set 324 to store a set of Trace IDs included in the transaction, Location 325 to store location information where the transaction has occurred, and Conversion 1 (326) to store the results of conversion 1 and conversion 2 respectively And one entry from Conversion 2 (327) and Sales 328 for storing the sales amount It is made.
  • KPI is set for each transaction. Even if the KPI is set for each service, the same KPI may be set for all transactions belonging to the service.
  • the location 325 is used when user trace information is divided and collected for each predetermined area and analyzed for each area.
  • the KPI may be set for each area where the service is deployed.
  • the user trace information is managed for each predetermined area, and the SLO is set for each area as described later.
  • the above Conversion 1/2 (326, 327) indicates the achievement result of KPI. Specifically, “Is this registered?”, “Is the frequency of service usage more than twice / month”, “Number of social activities performed” Indicates whether or not the KPI item has been achieved in the user trace information such as “Is it once or more than a month?”, “Has the target billing amount been achieved”, or the like.
  • KPI is achieved
  • “True” is stored in Conversion 1/2 (326, 327)
  • “False” is stored.
  • the number of KPI items that are items targeted by the KPI in other words, the number of conversion items can be changed according to the set KPI.
  • KPI covers items related to business income such as sales and profits, and user satisfaction related to them, and does not cover expenses such as expenses and expenses (expenses and expenses).
  • indicators such as “income per expense” such as sales against expenses and sales and profits against cost of sales (cost rate and profit ratio) are subject to KPI.
  • the application 14 executed by the virtual computer 210 of DC2 processes a part of the service provided by the service provider.
  • a Web service it is an application that transmits a file constituting the Web service in response to a request from the user terminal 1, or in the case of an electronic commerce (EC) service, an application that searches for product data.
  • EC electronic commerce
  • the resource information acquisition unit 15 collects information regarding the operating status of the computing nodes in the DC 2 and stores the information in the resource information storage unit 16.
  • the resource information indicates, for example, a processor usage rate, storage capacity, network usage amount, application log, and the like.
  • acquisition of resource information can apply a well-known or well-known method. For example, “Nagios-The Industry Standard in IT Infrastructure Monitoring” (http://www.nagios.org/) may be applied.
  • the user trace information stored in the user trace storage unit 12 and the resource information stored in the resource information storage unit 16 are transmitted to the transmission / reception unit 20 of the SLO management unit 3 of the management computer 100 by the transmission / reception unit 13.
  • the transmission timing may be periodic, or may be event-driven triggered by acquisition of user trace information and resource information.
  • the management computer 100 includes a processor 110, a memory 120, a storage 130, and a management console 5, and executes an SLO management unit 3, an arrangement plan unit 41, and an arrangement execution unit 42.
  • the management console 5 includes an input device and an output device, and receives input from the service provider 4 or a system administrator.
  • the SLO management unit 3 of the management computer 100 manages the arrangement of the application 14 and data 140 to the distributed DCs 2-1 to 2-n.
  • the mounting location of the SLO management unit 3 is not limited to the management computer 100 of the present embodiment, but is the physical computer 201 and the virtual computer 210 in the DC2. Note that the SLO management unit 3 may be arranged in a distributed manner in each DC 2 to be distributed, or may be centrally arranged in one DC 2.
  • the user trace accumulation unit 21 accumulates user trace information transmitted from the DC 2 and the user terminal 1 by the transmission / reception unit 20.
  • the resource information accumulation unit 40 receives and accumulates the resource information transmitted from the DC 2 by the transmission / reception unit 20.
  • the SLO management unit 3 when the SLO management unit 3 is distributed and distributed to each DC 2, the user trace accumulation unit 21 and the resource information accumulation unit 40 are connected to the DC 2-1 to 2 -n and the user who are in charge of each SLO management unit 3. Only user trace information transmitted from the terminal 1 and resource information transmitted from the DCs 2-1 to 2-n in charge of each SLO management unit 3 are accumulated.
  • the SLO management unit 3 when the SLO management unit 3 is centrally arranged in one DC 2, the user trace accumulation unit 21 and the resource information accumulation unit 40 respectively have user trace information transmitted from all DCs 2 and all user terminals 1, Resource information transmitted from all DCs 2 is accumulated.
  • the important SLO selection unit 22 extracts the KPI actual measurement value and the SLO actual measurement value for each transaction from the user trace information acquired from the user trace accumulation unit 21, and selects an SLO item highly related to the KPI. Detailed processing of the SLO management unit 3 including the important SLO selection unit 22 will be described later with reference to FIG.
  • the SLO standard derivation unit 26 uses the KPI and SLO stored in the KPI storage unit 24 and the SLO storage unit 25 for each SLO item highly relevant to the KPI selected by the important SLO selection unit 22, respectively.
  • the “application unit” and the “data unit” indicate units that can be arranged by dividing the application 14 and the data 140 into one or more data centers 2 corresponding to the distributed DCs 2-1 to 2-n. .
  • the KPI storage unit 24 stores the KPI designated by the service provider 4 from the management console 5 via the setting input unit 23.
  • FIG. 5A is a diagram illustrating an example of the KPI storage unit 24.
  • the table T6 includes one entry from the Item 361 and the Value 362.
  • the table T6 includes KPIs having a click through rate (Click Through Rate) of 15% or more, a sales (Sales Performance) of 100 million yen or more, and a profit rate (Profit Rate) of 20% or more.
  • Click Through Rate Click Through Rate
  • Sales Performance sales Performance
  • Profile Rate profit rate
  • FIG. 6A is a screen image for setting a KPI provided by the management computer 100.
  • the setting input unit 23 of the SLO management unit 3 outputs the screen G1 to the management console 5.
  • the management console 5 receives the KPI setting from the service provider 4.
  • a target transaction G11, a KPI item G12, and a KPI standard G13 can be set.
  • the KPI item G12 and the KPI standard G13 can be selected from values set in advance by a pull-down menu. Further, a new target transaction can be added by operating the “ADD” button.
  • the set KPI information can be stored in the KPI storage unit 24 by operating the “Save” button.
  • the SLO storage unit 25 stores the SLO as an initial value designated by the service provider 4 from the management console 5 via the setting input unit 23.
  • An example of the SLO stored in the SLO storage unit 25 is shown in a table T7 in FIG. 5B.
  • FIG. 5B is a diagram illustrating an example of the SLO storage unit 25.
  • the table T7 constitutes one entry from Item 371 and Value 372.
  • Table T7 includes SLOs with a response time of 500 ms or less, a throughput of 3 Mbps or more, and a TAT of 1 second or less.
  • the SLO output unit 27 outputs the transaction unit or application and data unit SLO standard derived by the SLO standard deriving unit 26 to the management console 5 or the like.
  • a known or publicly known technique may be applied to the output method and output format.
  • the arrangement planning unit 41 plans the arrangement location of the application 14 and the data 140 on the DC 2 by using the SLO standard derived for each application and data by the SLO standard deriving unit 26 and the resource information of the resource information accumulation unit 40. .
  • the arrangement execution unit 42 executes distribution of the application 14 and the data 140 to the DC 2 based on the arrangement plan output from the arrangement planning unit 41.
  • the processor 110 operates as a functional unit that provides a predetermined function by performing processing according to a program of each functional unit.
  • the processor 110 functions as the SLO management unit 3 by performing processing according to the SLO management program.
  • the processor 110 also operates as a function unit that provides the functions of a plurality of processes executed by each program.
  • a computer and a computer system are an apparatus and a system including these functional units.
  • Information such as programs and tables for realizing each function of the SLO management unit 3 includes storage 130, nonvolatile semiconductor memory, hard disk drive, storage device such as SSD (Solid State Drive), or IC card, SD card, DVD, etc. Can be stored in any computer-readable non-transitory data storage medium.
  • FIG. 3 is a flowchart illustrating an example of processing performed in the SLO management unit 3.
  • the user trace acquisition unit 11 deployed in the user terminal 1 and the DC 2 collects user trace information, respectively (F1).
  • the SLO management unit 3 of the management computer 100 acquires user trace information from the DC2 and the user trace storage units 12 and 12A of the user terminal 1 via the transmission / reception unit 20, and stores them in the user trace accumulation unit 21 (F2). .
  • the user trace accumulating unit 21 aggregates the user trace information collected by the user trace acquisition units 11 and 11A as agents of each DC 2 and user terminal 1 in the management computer 100.
  • the user trace accumulating unit 21 does not store the raw data of the user trace information shown in the table T1 and the table T2 as they are, but instead of the raw data of the user trace information as shown in the tables T3 to T5 shown in FIGS. User trace information of users may be stored together.
  • FIG. 2C is a diagram illustrating an example of a table T3 that stores user traces of a plurality of users for a certain period in a transaction unit and stores them as transaction traces for KPI items.
  • FIG. 2D is a diagram illustrating an example of a table T4 that stores user traces of a plurality of users for a certain period as transaction traces in a transaction unit with respect to the SLO item.
  • FIG. 2E is a diagram illustrating an example of a table T5 in which user trace information of a plurality of users for a certain period is collected in units of one trace regarding the SLO item.
  • the table T3 in FIG. 2C stores a transaction ID 331 for storing a transaction identifier, a period 332 for storing an accumulation period, a trace set 333 for storing a set of trace IDs included in the transaction, and position information where the transaction has occurred.
  • One entry is composed of Location 334, Conversion 1 Rate (335) for storing the ratios of Conversion 1 and Conversion 2, respectively, Conversion 2 Rate (336), and Sales 3337 for storing the sales amount.
  • the table T4 in FIG. 2D includes a transaction ID 341 for storing a transaction identifier, a location 342 for storing position information where the transaction has occurred, a response time 343 for storing a response time for the trace, and a turnaround time of the transaction.
  • One entry is composed of the TAT 345 to be stored.
  • the table T5 of FIG. 2E includes a trace ID 351 for storing user trace information identifiers, a transaction ID 352 for storing transaction identifiers, a period 353 for storing an accumulation period, and a response time 354 for storing response times for the traces.
  • One entry consists of Throughput 355 that stores the throughput of the transaction and TAT 356 that stores the turnaround time of the transaction.
  • a period 332 indicating an accumulation period is added to the table T3 with respect to the table T2.
  • KPI items relating to rates are itemized, such as Conversion Rate 1/2 (335, 336).
  • the values to be collected are in accordance with the KPI items and criteria. When the average is adopted in KPI, these items may be averaged. Thereafter, the KPI item information in the table T3 is used as the KPI actual measurement value.
  • there is an item indicating the total sum of user trace information to be accumulated such as a Sales item 337.
  • the table T4 similarly does not limit the values collected for the SLO items, such as the average, mode, and median. Thereafter, the SLO item information in the table T4 is used as the actual SLO value.
  • the important SLO selection unit 22 of the SLO management unit 3 is the SLO that is important for the KPI item. Select an item (F5).
  • the important SLO item refers to an SLO item having a high correlation with the KPI item or having a high causal relationship.
  • a known or well-known method can be used as a method for deriving an important SLO item among SLO items having a high correlation (or causal relationship) with a KPI item.
  • the SLO management unit 3 may perform multiple regression analysis using the KPI actual value acquired from the table T3 as an objective variable and the SLO actual value acquired from the table T4 in FIG. 2D as an explanatory variable.
  • explanatory variables it is desirable to use a plurality of measured SLO values with different types of SLO items.
  • the KPI actual values and SLO actual values in the table T3 in FIG. 2C and the table T4 in FIG. 2D are not normalized. It ’s fine.
  • the t value can be treated as an index indicating the height (or importance) of the correlation between the KPI item and the SLO item.
  • K is a matrix whose elements are k1, k2 and KPI measured values.
  • KPI Conversion 1 Rate 335
  • Transaction ID 331 of Table T3 is set to values of Conversion 1 Rate 335 of 100 and 101 in the figure for each of k1 and k2.
  • Si is a matrix having SLO actual values of SLO item i as elements such as si1 and si2, and values such as response time 343 and throughput 344 of table T4 are entered in si1 and si2, respectively.
  • S is a matrix having Si as an element
  • is a partial regression coefficient matrix
  • is an intercept matrix.
  • T indicates that the matrix is a transposed matrix.
  • an SLO item satisfying a certain standard or a higher-level SLO item is set as an important SLO item.
  • a predetermined value can be used for the above-mentioned “constant standard” and “higher number to be extracted”, or a threshold value for the t value calculated by the above-described t-test can be used.
  • steps F7 and F8 are executed for all the important SLO items calculated in step F5 (F6).
  • the processing returns to step F4.
  • step F7 the transaction unit SLO standard deriving unit 30 of the SLO standard deriving unit 26 derives an SLO standard along the KPI.
  • a flowchart shown in FIG. 4A is shown as an example of a method for deriving an SLO standard along the KPI.
  • FIG. 4A is a flowchart showing an example of the SLO standard derivation process performed by the transaction unit SLO standard derivation unit 30 of the management computer 100.
  • the transaction unit SLO standard derivation unit 30 refers to the table T3, acquires the actual measured value of the KPI, and refers to the table T4 to acquire the correlation of the important SLO actual measured value (F21).
  • the table T3 and the table T4 generated by the user trace information by a large number of users calculate a large number of measured KPI values and a large number of SLO measured values, and the distribution around the SLO items and the KPI items as axes. To do.
  • a distribution approximate curve is obtained from the distribution calculated in step F21, and the intersection of the distribution curve and the KPI line is obtained (F22).
  • KPI items related to income such as sales, profit ratio, user satisfaction, etc.
  • SLO items related to performances such as response time, throughput, TAT, etc. in this example have a correlation
  • the distribution is a power distribution.
  • the first quadrant with the SLO item (Response Time) and the KPI item (Conversion 1 Rate) as the axes is divided into four areas by two straight lines, the KPI standard and the SLO standard. Then, there is a transaction trace that is plotted in a region that satisfies the SLO criterion but does not satisfy the KPI criterion.
  • a transaction trace in which an error from the distribution approximate curve is plotted within a predetermined range is highly likely that the KPI has not been achieved due to a setting failure of the SLO standard.
  • the transaction trace of the plot with a small error from the distribution approximate curve (within a predetermined range) satisfies the SLO standard. Can be interpreted as being plotted in a region not satisfying the KPI standard.
  • the transaction trace in which the error from the distribution approximate curve is plotted within a predetermined range is expected to be plotted in a region that satisfies the KPI criterion, and the SLO criterion is changed to a new SLO criterion.
  • the value of the SLO item axis at the intersection obtained in step F22 is set as the SLO reference along the KPI (F23).
  • the SLO management unit 3 can automatically derive the SLO standard along the KPI.
  • FIG. 7 is a graph showing a correlation between the KPI actual measurement value described in Step F21 and the SLO actual measurement value acquired from the table T4.
  • FIG. 7 shows an example of the present invention in which the horizontal axis is Response Time as the SLO item and the vertical axis is Conversion 1 Rate as the KPI item.
  • a vertical line S1 indicates the SLO standard
  • a horizontal line K1 indicates the KPI standard
  • C1 indicates a distribution approximate curve.
  • A1 in the figure indicates an area where transaction traces that satisfy both the SLO standard and the KPI standard are plotted (user plot points in the figure), and A2 is a transaction that satisfies the SLO standard but does not satisfy the KPI.
  • A3 indicates an area where a trace is plotted, and A3 indicates an area where a transaction trace that does not satisfy both the SLO criterion and the KPI criterion is plotted.
  • the process of step F22 is performed for the purpose of moving the transaction trace plotted in the area A2 to the area A1 that satisfies the KPI.
  • FIG. 8 is a graph illustrating the correlation between the measured KPI value shown in step F21 and the measured SLO value obtained from the table T4, as in FIG.
  • I1 indicates an intersection between the distribution approximate curve C1 and the KPI standard K1
  • S2 indicates a new SLO reference line passing through the intersection I1.
  • a new SLO reference line S2 passing through the intersection I1 is set as the SLO reference along the KPI.
  • it can be expected that transaction traces that satisfy the KPI standard will increase by changing the new SLO standard S2 in a direction that shortens the response time.
  • the application / data unit SLO standard deriving unit 31 of the SLO standard deriving unit 26 expands the transaction unit SLO standard along the KPI into the application 14 and the data 140 unit. (F8).
  • FIG. 4B is a flowchart illustrating an example of processing for setting an SLO standard for each application 14 and data 140.
  • the application / data unit SLO standard deriving unit 31 refers to the table T3 to acquire the KPI actual measurement value as the objective variable, and refers to the table T5 to acquire the SLO actual measurement value of the application 14 and the data 140 unit as the explanatory variable.
  • the application / data unit SLO criterion derivation unit 31 generates a model of the above equation (1) from the acquired objective variable and explanatory variable, and calculates a regression line by multiple regression analysis.
  • the partial regression coefficient of the regression line may be t-tested and the calculated t values may be ranked.
  • Si is a matrix having SLO actual values of SLO items i as elements such as si1 and si2, and the same SLO items of different user trace information are entered.
  • si1 and si2 are stored in si1 and si2 in table T5.
  • S is a matrix having Si as an element, and S1, S2,... Have different transaction IDs.
  • is a partial regression coefficient matrix, and ⁇ is an intercept matrix.
  • T indicates that the matrix is a transposed matrix.
  • the application / data unit SLO criterion derivation unit 31 extracts the contribution rate (or the degree of influence) of the application 14 and the data 140 to the corresponding SLO item by the multiple regression analysis described above (F30).
  • the application / data unit SLO criterion derivation unit 31 calculates a large number of SLO actual measurement values, in other words, in order to increase n in the above equation (1), the application 14 is within a range that satisfies the SLO.
  • the data 140 is distributed in a plurality of DCs 2-1 to 2-n. With this distributed arrangement, user trace information and transaction traces having different actual SLO values can be generated, so that a more accurate contribution rate can be derived.
  • the application / data unit SLO criterion deriving unit 31 determines the new SLO criterion derived in step F23 of FIG. 4A as the SLO criterion for each application 14 and data 140 according to the contribution rate derived in step F30. Is set (F31).
  • Step F31 a known or well-known method can be employed. For example, when the SLO item is response time or TAT, the contribution rate is normalized, and the reciprocal number is multiplied by the SLO criterion derived in step F23, thereby calculating the SLO criterion for each of the application 14 and the data 140.
  • the SLO item is throughput
  • the SLO standard for each of the application 14 and the data 140 is calculated by multiplying the normalized value of the contribution rate by the new SLO standard derived in step F23.
  • a known or well-known technique can be adopted as a specific method for adjusting the SLO standard.
  • a minimum required SLO standard may be selected. Specifically, when the response time standard for KPI-1 is 100 ms and the response time standard for KPI-2 is 50 ms, in order to satisfy both KPI-1 and KPI-2, What is necessary is just to employ 50 ms.
  • an SLO that takes into account the costs for the arrangement of the application 14 and the data 140
  • the standard can be set. Similar to the above example, if the response time standard is 100 ms for the profit rate (KPI-3) with respect to the operation management cost and the response time standard is 50 ms for the profit rate (KPI-4), KPI-
  • KPI- 3 the profit rate
  • KPI-4 the response time standard
  • the SLO management unit 3 performs the above processing for all transactions (F3 loop), and then ends.
  • the execution timing of the processing in FIG. 3 is arbitrary.
  • the SLO standard according to the present embodiment is determined based on the history of processing by the user terminal 1 up to the time when the processing of FIG. 3 is executed. For this reason, if the process of FIG. 3 is periodically executed at long-term intervals, the user's action using the user terminal 1 is highly likely to deviate from the SLO standard. Therefore, it is preferable to execute the processing of FIG. 3 by periodic execution at short intervals or event driven after a certain period after service update.
  • FIG. 6B shows an example of the output to the management console 5 based on the important SLO items of the transaction unit or the application 14 and the data 140 unit and the SLO standard.
  • FIG. 6B is a screen image in which the output unit 27 of the management computer 100 outputs SLO information to the management console 5.
  • the screen G2 can display the target transaction G21, the target application / data G22, the SLO item G23, and the SLO standard G24.
  • the SLO management unit 3 outputs a screen G2 to the management console 5.
  • the arrangement planning unit 41 formulates an arrangement plan for the application 14 and the data 140 using the important SLO items and SLO criteria of the transaction unit or application and data unit derived by the processing of FIG.
  • a well-known or well-known technique can be employed for the method of arrangement planning.
  • the arrangement plan of the application 14 and the data 140 includes the location of the DC 2 that provides the application 14 and the data 140, the designation of the physical computer 201 and the virtual computer 210 that execute the application 14, and the like. Since the SLO standard is set for the application 14 and the data 140 unit, the arrangement of the application 14 and the data 140 between the DCs 2 or in the DC 2 is planned without considering the dependency relationship between the applications 14 and the data 140. It is possible.
  • the SLO standard for the application 14 and the data 140 unit set by the present embodiment is determined in order from the transaction trace close to the user using the user terminal 1 in the transaction trace. Therefore, the arrangement may be planned in order from the application 14 or data 140 close to the user terminal 1 in the order of the application 14 directly accessed by the user terminal 1, the application 14 accessed by the application 14, or the data 140.
  • the placement planning unit 41 places the application 14 and the data 140 on the resource of DC2, if there are a plurality of DC2s that satisfy the derived SLO criterion, the application is intended to improve the processing accuracy of step F8. 14 and the replica of the data 140 are generated and arranged in a plurality of DCs 2 that satisfy the redundantly derived SLO criterion.
  • the method of deriving the SLO from the KPI proposed in the present embodiment is a method of adjusting to the SLO according to the KPI based on the actual end-user behavior. If the measured SLO values of all end users are the same, the user plot points in FIG. 7 have a distribution like a straight line parallel to the vertical axis, and it becomes difficult to adjust to the desired SLO for KPI. Basically, since performance is not guaranteed particularly in cloud computing, the performance related to applications and data arranged in one place fluctuates. However, by placing applications and data in a plurality of data centers, a user trace having a larger fluctuation can be obtained, and as a result, an SLO along the KPI can be derived more accurately.
  • the number of calculation patterns is about 100 * 1000, which greatly increases the amount of calculation for the arrangement plan compared to the conventional example. Can be reduced.
  • the service provider 4 simply sets the KPI, and the SLO according to the KPI is automatically set, and the automatically set SLO Automatically, the application 14 and the data 140 are placed at a location (DC2) along the KPI.
  • DC2 location
  • a KPI is set for each target transaction.
  • a KPI may be set for each service provided by the application 14.
  • the management computer 100 that manages the arrangement of the application 14 and the data 140 in the computer system that can divide the service of the application 14 that can be divided and can be provided by one or more data centers 2, For each service or transaction, a KPI (first condition) including business conditions is received, and the reference values are automatically updated for items related to the service level target (second condition) of the application 14 and data 140.
  • KPI first condition
  • second condition service level target
  • the agent In the data center 2 to which the computer (virtual computer 210 or physical computer 201) on which the application 14 operates belongs, the agent (user trace acquisition unit 11) is operated to monitor the user terminal 1 that uses the service of the application 14.
  • the agent of each data center 2 acquires user trace information including a log of service (application 14) used by the user terminal 1 and performance information.
  • the management computer 100 that manages the arrangement of the application 14 and the data 140 to the plurality of data centers 2 collects user trace information of the user terminal 1 using the application 14 and the data 140 from the agent of each data center 2.
  • the management computer 100 calculates the importance (or contribution) for the SLO correlated with the KPI from the user trace information, and extracts the important SLO items. Then, the management computer 100 calculates a distribution approximation curve from the user trace information for the important SLO items correlated with the KPI, derives a new SLO criterion that satisfies the KPI criterion from the distribution approximation curve and the KPI criterion, and obtains the current SLO. Change the standard to the new SLO standard.
  • the management computer 100 calculates the contribution rate (or the degree of influence) of the application 14 and the data 140 to the SLO item for the SLO item for the KPI. Then, the management computer 100 calculates the SLO standard of the application 14 and the SLO standard of the data 140 from the contribution rate and the new SLO standard.
  • the SLO standard since the SLO standard is set in units of the application 14 and the data 140, the arrangement of the application 14 and the data 140 can be planned without considering the dependency relationship between the application 14 and the data 140. Is possible. And in this invention, the computational complexity concerning arrangement
  • the configuration of the computer, the processing unit, and the processing unit described in the present invention may be partially or entirely realized by dedicated hardware.
  • the various software exemplified in the present embodiment can be stored in various recording media (for example, non-transitory storage media) such as electromagnetic, electronic, and optical, and through a communication network such as the Internet. It can be downloaded to a computer.
  • recording media for example, non-transitory storage media
  • a communication network such as the Internet. It can be downloaded to a computer.
  • the present invention is not limited to the above-described embodiments, and includes various modifications.
  • the above-described embodiments have been described in detail for easy understanding of the present invention, and are not necessarily limited to those having all the configurations described.

Abstract

In this invention, a management computer: accepts first conditions, including business conditions, for each transaction used by a user terminal; collects, from data centers and agents on user terminals that used applications, user-trace information for said user terminals; computes, from said user-trace information, importance degrees for second conditions, namely application service-level targets, that are correlated with the abovementioned first conditions; extracts important-second-condition items consisting of items corresponding to second conditions having importance degrees greater than or equal to a prescribed threshold; computes a distribution-approximating curve for said important-second-condition items from the user-trace information; computes, from said distribution-approximating curve and the first conditions, new second conditions that satisfy the first conditions; computes a contribution ratio for each application with respect to said new second conditions; and computes, from said contribution ratios and the new second conditions, criteria for the new second conditions with respect to the applications.

Description

アプリケーション及びデータの配置管理方法、アプリケーション及びデータの配置管理システム及び記憶媒体Application and data arrangement management method, application and data arrangement management system, and storage medium
 本発明は、アプリケーション及びデータの配置を管理する技術に関する。 The present invention relates to a technique for managing the arrangement of applications and data.
 情報技術をサービスとして提供するクラウドコンピューティングが普及している。様々なクラウドコンピューティングサービスを提供するクラウドコンピューティングサービスプロバイダー(以降、クラウドプロバイダー)が多数出現し、また、クラウドプロバイダーは世界規模で広域にクラウドコンピューティングサービスを提供するためのデータセンターを構築している。 Cloud computing that provides information technology as a service is widespread. Many cloud computing service providers (hereinafter referred to as cloud providers) that provide various cloud computing services have emerged, and cloud providers have built data centers to provide cloud computing services in a wide area on a global scale. Yes.
 クラウドコンピューティングサービスの提供を受けるアプリケーションサービスプロバイダーは、複数のクラウドプロバイダーが提供し、広域に展開されるデータセンターに対して、アプリケーションやアプリケーションデータを配置しなければならない。非特許文献1には、アプリケーションサービスを利用するエンドユーザーの位置情報等を考慮して、自動的にアプリケーションやアプリケーションデータを配置する技術が開示されている。 Application service providers that receive cloud computing services must place applications and application data in data centers provided by multiple cloud providers and deployed in a wide area. Non-Patent Document 1 discloses a technique for automatically arranging applications and application data in consideration of position information of end users who use application services.
 しかしながら、上記非特許文献1記載の技術では、様々な制約情報、具体的には、エンドユーザーもしくはエンドユーザーが利用する端末の位置情報や、データセンターの位置情報、アプリケーションやアプリケーションデータの特性情報、アプリケーションやアプリケーションデータにより構成されるサービスやトランザクション毎に設定されるサービスレベル目標(SLO:Service Level Objectives)の情報、データセンターの特性情報、データセンターのコスト情報、などを制約情報として入力とする。そして、入力された制約情報全てを満足するように、アプリケーションやアプリケーションデータを分散するデータセンターに配置しなければならない。この配置計画はNP(Non-deterministic Polynomial)困難問題であり膨大な計算量を必要とする、という問題があった。 However, in the technology described in Non-Patent Document 1, various constraint information, specifically, end information of the end user or the terminal used by the end user, position information of the data center, characteristic information of the application and application data, Service level target (SLO: Service Level Objects) information, data center characteristic information, data center cost information, and the like that are set for each service and transaction configured by applications and application data are input as constraint information. Then, the application and application data must be arranged in a data center where the application and application data are distributed so as to satisfy all the input constraint information. This arrangement plan is a NP (Non-deterministic Polynomial) difficult problem and requires a huge amount of calculation.
 ここで、アプリケーションはエンドユーザーにサービスの一部を提供するアプリケーションバイナリを示す。アプリケーションデータはアプリケーションバイナリがアクセスするデータを示す。アプリケーションやアプリケーションデータにより構成され、エンドユーザーの登録処理や商品検索処理など、それぞれが異なる処理内容を実行する単位をトランザクションとし、複数のトランザクションから構成されるものをサービスとする。 Here, application indicates an application binary that provides a part of the service to the end user. Application data indicates data accessed by the application binary. A unit composed of an application and application data, each of which executes different processing contents such as an end user registration process and a product search process, is a transaction, and a service composed of a plurality of transactions is a service.
 上記SLOは重要業績評価指標(KPI:Key Performance Indicator)、及び人間工学などの絶対基準に基づき決定される。KPIは「契約率20%以上」などの経営上の条件文であり、SLOは「応答時間が1秒以下」などの計算機システムの性能に関する条件文である。以降、「契約率」や「応答時間」などKPIやSLOが対象とする項目をそれぞれKPI項目、SLO項目と呼び、「20%以上」や「1秒以下」などKPIやSLOが満足すべき値(例えば、閾値)を、それぞれKPI基準、SLO基準と呼ぶ。なお、上記契約率は、例えば、WEBサイトを訪れたユーザーのうち、商品を購入したユーザーの比率を示す。 The above SLO is determined based on an absolute standard such as an important performance evaluation index (KPI: Key Performance Indicator) and ergonomics. KPI is a management conditional statement such as “contract rate of 20% or more”, and SLO is a conditional statement related to the performance of the computer system such as “response time of 1 second or less”. In the following, items targeted by KPI and SLO, such as “contract rate” and “response time”, will be referred to as KPI items and SLO items, respectively, and values that KPI and SLO should satisfy such as “20% or more” and “1 second or less”. (For example, threshold) are referred to as KPI standard and SLO standard, respectively. The contract rate indicates, for example, the ratio of users who have purchased a product among users who have visited the WEB site.
 膨大な計算量を必要とする最大の要因は、SLOが多くてもトランザクション単位にしか設定されず、そのSLOを満足するためにアプリケーションやアプリケーションデータを配置するパターンの組合せ数が膨大であるためである。具体的には、データセンター数を100、アプリケーションやアプリケーションデータ数を1000とすると、計算パターン数は100^1000(100の1000乗)となる。 The biggest factor that requires an enormous amount of calculation is that even if there are many SLOs, they are set only for each transaction, and the number of combinations of patterns for arranging applications and application data to satisfy the SLOs is enormous. is there. Specifically, if the number of data centers is 100 and the number of applications and application data is 1000, the number of calculation patterns is 100 ^ 1000 (100 to the 1000th power).
 そこで本発明は、上記問題点に鑑みて成されたもので、SLOを満足するアプリケーションやアプリケーションデータの配置計画にかかる計算量を削減することを目的とする。 Therefore, the present invention has been made in view of the above problems, and an object of the present invention is to reduce the amount of calculation required for an application or application data arrangement plan that satisfies SLO.
 本発明は、プロセッサとメモリを有してデータセンターとユーザー端末に接続された管理計算機が、1以上のデータセンターに分割可能なアプリケーション及びデータの配置を管理するアプリケーション及びデータの配置管理方法であって、前記データセンター内に配備されたエージェントが、前記データセンターを監視し、また、前記ユーザー端末内に配備されたエージェントが、前記ユーザー端末を監視し、前記データセンターのエージェントと前記ユーザー端末のエージェントが、前記ユーザー端末を操作し、前記アプリケーションのサービスを利用するユーザーのサービス行動ログ及び性能情報を含むユーザートレース情報を取得する第1のステップと、前記管理計算機が、前記ユーザー端末が利用したサービスまたはトランザクション毎に、ビジネスの条件を含む第1の条件を受け付ける第2のステップと、前記管理計算機が、アプリケーション及びデータを利用したユーザー端末のユーザートレース情報を、前記データセンターの前記エージェントと前記ユーザー端末の前記エージェントから収集する第3のステップと、前記管理計算機は、前記アプリケーション及びデータの所定のサービスレベル目標を第2の条件として、前記ユーザートレース情報から前記第1の条件に相関関係がある第2の条件について重要度を算出し、前記重要度が所定の閾値以上の第2の条件に対応する項目を、重要第2条件項目として抽出する第4のステップと、前記管理計算機が、前記重要第2条件項目と前記第1の条件の項目について、前記ユーザートレース情報から分布近似曲線を算出し、前記分布近似曲線と前記第1の条件から当該第1の条件を満足する新たな第2の条件を算出し、現在の第2の条件を新たな第2の条件に変更する第5のステップと、前記管理計算機が、前記第1の条件に対する前記新たな第2の条件項目について、前記アプリケーション及びデータそれぞれの当該第2の条件項目への寄与率を算出する第6のステップと、前記管理計算機が、前記寄与率と新たな第2の条件から前記アプリケーションに対する新たな第2の条件の基準と、前記データに対する新たな第2の条件の基準をそれぞれ算出する第7のステップと、前記算出された前記アプリケーションと前記データに対する新たな第2の条件の基準を満たす1以上のデータセンターに対して、前記アプリケーションと前記データを分散配置する第8のステップと、を含む。 The present invention is an application and data arrangement management method in which a management computer having a processor and a memory and connected to a data center and a user terminal manages an application and data arrangement that can be divided into one or more data centers. An agent deployed in the data center monitors the data center, and an agent deployed in the user terminal monitors the user terminal, and the agent of the data center and the user terminal A first step in which an agent operates the user terminal and obtains user trace information including service behavior log and performance information of a user who uses the application service; and the management computer uses the user terminal Service or transaction A second step of accepting a first condition including a business condition for each application, and a user trace information of a user terminal using the application and data by the management computer, the agent of the data center, and the user The third step of collecting from the agent of the terminal and the management computer have a correlation between the user trace information and the first condition, with a predetermined service level target of the application and data as a second condition A fourth step of calculating an importance level for a second condition, and extracting an item corresponding to a second condition with the importance level equal to or greater than a predetermined threshold as an important second condition item; and For the important second condition item and the first condition item, the distribution approximate curve is obtained from the user trace information. And calculating a new second condition that satisfies the first condition from the distribution approximate curve and the first condition, and changing the current second condition to a new second condition. And a sixth step in which the management computer calculates a contribution rate of the application and data to the second condition item for the new second condition item for the first condition, and A seventh step in which the management computer calculates a new second condition criterion for the application and a new second condition criterion for the data from the contribution rate and the new second condition; The application and the data are distributed to one or more data centers that satisfy the new second condition criterion for the calculated application and the data. And an eighth step.
 本発明によれば、サービスレベルを満足する目標アプリケーション及びデータの配置計画にかかる計算量を大幅に削減できる。例えば、データセンター数を100、アプリケーションやアプリケーションデータ数を1000とすると、計算するパターン数は100*1000程度となる。 According to the present invention, it is possible to greatly reduce the amount of calculation required for the target application satisfying the service level and the data allocation plan. For example, if the number of data centers is 100 and the number of applications and application data is 1000, the number of patterns to be calculated is about 100 * 1000.
本発明の実施例を示し、クラウドコンピューティングで構成されるデータセンターとユーザー端末の機能の一例を示すブロック図である。It is a block diagram which shows the Example of this invention and shows an example of the function of the data center comprised by cloud computing, and a user terminal. 本発明の実施例を示し、クラウドコンピューティングの管理計算機の一例を示すブロック図である。It is a block diagram which shows the Example of this invention and shows an example of the management computer of cloud computing. 本発明の実施例を示し、データセンターの構成の一例を示すブロック図である。It is a block diagram which shows the Example of this invention and shows an example of a structure of a data center. 本発明の実施例を示し、ユーザートレース単位の情報を格納するテーブルの一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the table which stores the information of a user trace unit. 本発明の実施例を示し、トランザクション単位の情報を格納するテーブルの一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the table which stores the information of a transaction unit. 本発明の実施例を示し、KPI項目に関して、一定期間の複数ユーザーのユーザートレースをトランザクション単位にまとめたトランザクショントレースとして格納するテーブルの一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the table which stores as a transaction trace which put together the user trace of several users for a fixed period in transaction unit regarding a KPI item. 本発明の実施例を示し、SLO項目に関して、一定期間の複数ユーザーのユーザートレースをトランザクション単位にまとめたトランザクショントレースとして格納するテーブルの一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the table which stores as a transaction trace which put together the user trace of several users for a fixed period in transaction unit regarding SLO item. 本発明の実施例を示し、SLO項目に関して、一定期間の複数ユーザーのユーザートレース情報を、1トレース単位にまとめたテーブルの一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the table which put together the user trace information of the several period user for 1 period regarding SLO item. 本発明の実施例を示し、管理計算機で行われる処理の一例を示すフローチャートである。It is a flowchart which shows the Example of this invention and shows an example of the process performed with a management computer. 本発明の実施例を示し、管理計算機で行われるSLO基準の導出処理の一例を示すフローチャートである。It is a flowchart which shows the Example of this invention and shows an example of the derivation | leading-out process of the SLO reference | standard performed with a management computer. 本発明の実施例を示し、アプリケーション及びデータ毎にSLO基準を設定する処理の一例を示すフローチャートである。It is a flowchart which shows the Example of this invention and shows an example of the process which sets an SLO reference | standard for every application and data. 本発明の実施例を示し、KPI格納部の一例を示す図である。It is a figure which shows the Example of this invention and shows an example of a KPI storage part. 本発明の実施例を示し、SLO格納部の一例を示す図である。It is a figure which shows the Example of this invention and shows an example of a SLO storage part. 本発明の実施例を示し、管理計算機が提供するKPIの設定画面の一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the setting screen of KPI which a management computer provides. 本発明の実施例を示し、管理計算機が提供するKPIの設定画面の一例を示す図である。It is a figure which shows the Example of this invention and shows an example of the setting screen of KPI which a management computer provides. 本発明の実施例を示し、KPIとSLOの相関を例示するグラフである。It is a graph which shows the Example of this invention and illustrates the correlation of KPI and SLO. 本発明の実施例を示し、KPIに沿ったSLO基準を導出する操作の一例を示すグラフである。It is a graph which shows the Example of this invention and shows an example of operation which derives | leads-out the SLO reference | standard along KPI.
 以下、本発明の一実施形態について添付図面を用いて説明する。 Hereinafter, an embodiment of the present invention will be described with reference to the accompanying drawings.
 以下、図面を用いて本発明の実施例を詳細に説明する。なお、各図に示す同一の符号は同一の機能または構成を有するため、重複した符号の説明は省略する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. In addition, since the same code | symbol shown in each figure has the same function or structure, description of the overlapping code | symbol is abbreviate | omitted.
 図1A~図1Cは本発明の一実施形態に係る計算機システムのブロック図である。図1Aは、データセンターとユーザー端末の機能の一例を示すブロック図である。図1Bは、クラウドコンピューティングを管理する管理計算機の一例を示すブロック図である。図1Cは、データセンターの構成の一例を示すブロック図である。 1A to 1C are block diagrams of a computer system according to an embodiment of the present invention. FIG. 1A is a block diagram illustrating an example of functions of a data center and a user terminal. FIG. 1B is a block diagram illustrating an example of a management computer that manages cloud computing. FIG. 1C is a block diagram illustrating an example of the configuration of the data center.
 図1Aにおいて、本発明の計算機システムは、ネットワーク50を介して複数のユーザー端末1-1~1-nと、複数のデータセンター2-1~2-nと、データセンター2-1~2-nを管理する管理計算機100が接続される。ユーザー端末1-1~1-nの総称をユーザー端末1とし、データセンター2-1~2-nの総称をデータセンター2(以下、DC2)とする。 1A, the computer system of the present invention includes a plurality of user terminals 1-1 to 1-n, a plurality of data centers 2-1 to 2-n, and data centers 2-1 to 2- A management computer 100 that manages n is connected. The user terminal 1-1 to 1-n is generically referred to as a user terminal 1, and the data center 2-1 to 2-n is generically referred to as a data center 2 (hereinafter DC2).
 ユーザー端末1は、クラウドコンピューティング上で提供されるサービスを利用する。DC2は複数のコンピューティングノードが設置され、クラウドコンピューティングが稼働する。なお、本実施例及び図面ではデータセンター2とするが、複数のコンピューティングノードをまとめたクラスタを示す場合もある。よって、本実施例が広域に分散した環境に適用される場合にはDC2はデータセンターを示し、一方、本実施例がデータセンター内などの局所に分散した環境に適用される場合には、DC2は複数のコンピューティングノードをまとめたラックや、複数のラックにより構成されるクラスタを指す。 User terminal 1 uses services provided on cloud computing. A plurality of computing nodes are installed in DC2, and cloud computing is operated. In addition, although it is set as the data center 2 in a present Example and drawing, the cluster which put together the some computing node may be shown. Therefore, when this embodiment is applied to an environment distributed over a wide area, DC2 indicates a data center. On the other hand, when this embodiment is applied to a locally distributed environment such as in a data center, DC2 is indicated. Indicates a rack in which a plurality of computing nodes are combined or a cluster constituted by a plurality of racks.
 ユーザー端末1のユーザーアプリケーション10は、サービス事業者(またはDC2)が提供するサービスの一部を処理するソフトウェアであり、ユーザー端末1にダウンロードされる。ユーザーアプリケーション10は、例えば、ユーザー端末1に実装されるブラウザアプリケーション上で実行され、ユーザーの操作を受け付けるサービスGUIである。 The user application 10 of the user terminal 1 is software that processes a part of the service provided by the service provider (or DC2), and is downloaded to the user terminal 1. The user application 10 is a service GUI that is executed on, for example, a browser application installed in the user terminal 1 and receives user operations.
 <データセンター及びユーザー端末の構成>
 図1Cは、DC2の一例を示すブロック図である。図示では、DC2-1について説明するが、DC2-2~2-nについても同様の構成である。
<Data center and user terminal configuration>
FIG. 1C is a block diagram illustrating an example of DC2. In the figure, DC2-1 will be described, but DC2-2 to 2-n have the same configuration.
 DC2-1は、複数のコンピューティングノード(#1~#n)200-1~200-nと、ストレージ装置250と、ゲートウェイ装置270とを相互に接続するネットワーク260と、を有する。ノード200-1~200-nの総称を、ノード200で表す。各ノード200は、内部のネットワーク260とゲートウェイ装置270を介して外部のネットワーク50からユーザー端末1や管理計算機100に接続されている。 The DC 2-1 includes a plurality of computing nodes (# 1 to #n) 200-1 to 200-n, a storage apparatus 250, and a network 260 that interconnects the gateway apparatus 270. A generic name of the nodes 200-1 to 200-n is represented by a node 200. Each node 200 is connected to the user terminal 1 and the management computer 100 from the external network 50 via the internal network 260 and the gateway device 270.
 ノード(#1)200-1は、物理計算機201-1と、物理計算機201-1の計算機資源を1以上の仮想計算機210-1~210-nに割り当てる仮想化部202-1と、各仮想計算機210-1~210-nのOS211-1~211-n上で稼動するアプリケーション14-1~14-nと、を含む。OS211-1~211-nの総称をOS211とし、同様にアプリケーション14-1~14-nの総称をアプリケーション14とする。なお、物理計算機201-1は、プロセッサ2011とメモリ2012とを含んで構成される。 The node (# 1) 200-1 includes a physical computer 201-1, a virtualization unit 202-1 that assigns the computer resources of the physical computer 201-1 to one or more virtual computers 210-1 to 210-n, and each virtual machine 201-1. And applications 14-1 to 14-n running on the OS 211-1 to 211-n of the computers 210-1 to 210-n. The OS 211-1 to 211 -n is generically referred to as OS 211, and the applications 14-1 to 14 -n are generically referred to as application 14. The physical computer 201-1 includes a processor 2011 and a memory 2012.
 ストレージ装置250は、OS211やアプリケーション14またはアプリケーション14が利用するデータ140などが格納される。 The storage apparatus 250 stores the OS 211, the application 14, or data 140 used by the application 14.
 DC2-1では、管理計算機100からの指令に応じて、1以上の仮想計算機210を生成して、OS211、アプリケーション14を実行させ、アプリケーション14のサービスをユーザー端末1に提供する。 The DC 2-1 generates one or more virtual machines 210 according to a command from the management computer 100, executes the OS 211 and the application 14, and provides the service of the application 14 to the user terminal 1.
 上記図1Cのハードウェア及びソフトウェアが、DC2のリソースであり、管理計算機100がDC2毎に管理するリソースである。 The hardware and software shown in FIG. 1C are DC2 resources, and the management computer 100 manages the resources for each DC2.
 次に、図1Aを参照して、DC2とユーザー端末1の機能の一例について説明する。DC2-1では、ユーザー端末1にサービスを提供するアプリケーション14と、ユーザー端末1がアプリケーション14を利用した履歴等のユーザートレース情報を取得するユーザートレース取得部11と、ユーザートレース取得部11が取得したユーザートレース情報を格納するユーザートレース格納部12と、DC2-1のリソース情報を取得するリソース情報取得部15と、リソース情報を格納するリソース情報格納部16と、管理計算機100等からの要求を受け付けて、ユーザートレース情報またはリソース情報を送信する送受信部131と、を提供する。 Next, an example of functions of the DC 2 and the user terminal 1 will be described with reference to FIG. 1A. In the DC 2-1, an application 14 that provides a service to the user terminal 1, a user trace acquisition unit 11 that acquires user trace information such as a history of use of the application 14 by the user terminal 1, and a user trace acquisition unit 11 have acquired. Accepts requests from a user trace storage unit 12 that stores user trace information, a resource information acquisition unit 15 that acquires DC2-1 resource information, a resource information storage unit 16 that stores resource information, and a management computer 100 A transmission / reception unit 131 that transmits user trace information or resource information.
 ユーザートレース取得部11やリソース情報取得部15は、所定の仮想計算機210または物理計算機201で実行することができ、ユーザートレース格納部12及びリソース情報格納部16はストレージ装置250に設定することができる。また、ユーザートレース取得部11やリソース情報取得部15は、各DC2で管理計算機100のエージェントとして機能する。 The user trace acquisition unit 11 and the resource information acquisition unit 15 can be executed by a predetermined virtual computer 210 or physical computer 201, and the user trace storage unit 12 and the resource information storage unit 16 can be set in the storage apparatus 250. . The user trace acquisition unit 11 and the resource information acquisition unit 15 function as an agent of the management computer 100 in each DC2.
 ユーザー端末1は、DC2のアプリケーション14に接続してユーザーの入力を受け付ける処理などの所定の処理を実行するアプリケーション10と、外部との通信を行う送受信部13とを提供する。ユーザー端末1は、ユーザー端末から見た応答時間などの性能情報を取得するユーザートレース取得部11Aと、ユーザートレース取得部11Aが取得したユーザートレース情報を格納するユーザートレース格納部12Aにて構成される。なお、ユーザー端末1は、図示しないプロセッサとメモリを備える。また、ユーザートレース取得部11Aやユーザートレース格納部12Aは、各ユーザー端末1で管理計算機100のエージェントとして機能する。 The user terminal 1 provides an application 10 that executes a predetermined process such as a process of connecting to the application 14 of the DC 2 and receiving a user input, and a transmission / reception unit 13 that communicates with the outside. The user terminal 1 includes a user trace acquisition unit 11A that acquires performance information such as response time viewed from the user terminal, and a user trace storage unit 12A that stores user trace information acquired by the user trace acquisition unit 11A. . The user terminal 1 includes a processor and a memory (not shown). Further, the user trace acquisition unit 11 </ b> A and the user trace storage unit 12 </ b> A function as an agent of the management computer 100 in each user terminal 1.
 ユーザートレース取得部11は、少なくともDC2に実装され、可能であればユーザー端末1にユーザートレース取得部11Aとして実装され、ユーザー端末1を利用するユーザーによる実際のサービス利用の履歴(サービス行動ログ)を取得し、ユーザートレース格納部12Aに格納する。 The user trace acquisition unit 11 is implemented at least in the DC 2 and, if possible, is implemented in the user terminal 1 as the user trace acquisition unit 11 A, and the actual service usage history (service action log) by the user using the user terminal 1 is recorded. Acquired and stored in the user trace storage unit 12A.
 ユーザートレース情報とは、ユーザー端末1を利用するユーザーや全てのアプリケーション14及びデータ140の間でやり取りされる要求や、応答にかかる通信や処理を各DC2や各ユーザー端末1内のエージェントが監視し、ログや性能に関する情報を取得してサービスのトランザクション(またはワークロード)毎にまとめた情報である。それゆえ、一つのトランザクションにかかるユーザートレース情報の集合(以降、トランザクショントレース)は、1つ以上のユーザートレース情報で構成される。各エージェントが収集したユーザートレース情報は、管理計算機100に集められる。 The user trace information means that each DC 2 or an agent in each user terminal 1 monitors requests exchanged between users who use the user terminal 1, all applications 14 and data 140, and communication and processing related to responses. This is information gathered for each service transaction (or workload) by acquiring information related to logs and performance. Therefore, a set of user trace information related to one transaction (hereinafter referred to as transaction trace) is composed of one or more user trace information. User trace information collected by each agent is collected in the management computer 100.
 ユーザートレース情報の取得は、公知または周知の手法を採用することができる。例えば、「Dapper, a large-scale distributed systems tracing infrastructure.」(Sigelman, B. H., Barroso, L. A., Burrows, M., Stephenson, P., Plakal, M., Beaver, D., ... & Shanbhag, C. 2010、 Google research)に開示される手法を適用すればよい。 The user trace information can be acquired using a known or well-known method. For example, “Dapper, a large-scale distributed systems tracing infrastructure, structure.” (Sigelman, B. H., Barroso, L. A., Burs, M., P. M., Peter. ... & Shanbag, C. 2010, Google research) may be applied.
 なお、ユーザートレース情報で取得する情報には、図2A、図2Bに示すテーブルT1及びテーブルT2の情報を含むことが好ましい。 In addition, it is preferable that the information acquired by the user trace information includes information on the tables T1 and T2 shown in FIGS. 2A and 2B.
 図2Aは、ユーザートレース単位の情報を格納するテーブルT1の一例を示す図である。図2Bは、トランザクション単位の情報を格納するテーブルT2の一例を示す図である。 FIG. 2A is a diagram illustrating an example of a table T1 that stores information in units of user traces. FIG. 2B is a diagram illustrating an example of a table T2 that stores information in transaction units.
 図2Aにおいて、テーブルT1は、ユーザートレース情報の識別子を格納するTrace ID301と、ユーザートレース間で親関係のある識別子を格納するParent ID302と、ユーザートレース間で兄弟関係のある識別子を格納するSibling ID303と、当該ユーザートレース情報が含まれるトランザクションの識別子を格納するTransaction ID304と、当該トレース情報の開始時刻を格納するStart Time305と、当該トレース情報の終了時刻を格納するEnd Time306と、当該トレースの発信元の識別子を格納するSource307と、当該トレースの発信元が配置されるDCの識別子を格納するSource DC308、当該トレースの発信元の位置に関する情報を格納するSource Location309と、当該トレースの受信先の識別子を格納するDestination310と、当該トレースの受信先が配置されるDCの識別子を格納するDestination DC311と、当該トレースの受信先の位置に関する情報を格納するDestination Location312と、当該トレースにかかる応答時間を格納するResponse Time313と、当該トランザクションのスループットを格納するThroughput314と、当該トランザクションのターンアラウンドタイムを格納するTAT315と、からひとつのエントリーを構成する。 In FIG. 2A, a table T1 includes a Trace ID 301 for storing an identifier of user trace information, a Parent ID 302 for storing an identifier having a parent relationship between user traces, and a Sibling ID 303 for storing an identifier having a sibling relationship between user traces. A transaction ID 304 for storing an identifier of a transaction including the user trace information, a start time 305 for storing a start time of the trace information, an end time 306 for storing an end time of the trace information, and a source of the trace Source 307 for storing the identifier of the source, Source DC 308 for storing the identifier of the DC where the source of the trace is arranged, and information on the location of the source of the trace Stores the Source Location 309, the Destination 310 that stores the identifier of the destination of the trace, the Destination DC 311 that stores the identifier of the DC where the destination of the trace is placed, and the information about the location of the destination of the trace The Destination Location 312, the Response Time 313 that stores the response time for the trace, the Throughput 314 that stores the throughput of the transaction, and the TAT 315 that stores the turnaround time of the transaction constitute one entry.
 なお、位置に関する情報(309、312)はテーブルT1に示す通り、緯度と経度による表記でも良く、IPアドレスなどのネットワーク位置情報を保持しておき、GeoIPデータベースサービスなどで変換しても良い(例えば、Quova IP Geo-Location Database. <http://www.quova.com>)。 The location information (309, 312) may be expressed in latitude and longitude as shown in the table T1, or may be converted by a GeoIP database service or the like by holding network location information such as an IP address (for example, , Quova IP Geo-Location Database. <Http://www.quova.com>).
 ここで、テーブルT1におけるトレースID="1"とトレースID="2"のエントリーにおいて、それぞれの開始時間と終了時間、接続関係(発信元及び受信先)情報によると、サービスの一部はアプリ(図中APP)AとアプリBで構成され、ユーザーが使用するユーザー端末1からアプリA、アプリAからアプリBという2つの接続により、サービスが成立していることが記録されている。 Here, in the entries of trace ID = “1” and trace ID = “2” in the table T1, according to the respective start time, end time, and connection relationship (source and destination) information, part of the service is an application (APP in the figure) It is composed of A and application B, and it is recorded that the service is established by two connections of the user terminal 1 used by the user from application A and application A to application B.
 つまり、トレースID="1"は、トレースID="2"の親となる関係がある。このとき、トレースID="1"の応答時間313、スループット314、TAT315は、トレースID="2"の応答時間313、スループット314、TAT315を含まず、トレースID="1"の通信及び処理のみにかかった応答時間313、スループット314、TAT315とする。 That is, the trace ID = “1” has a relationship that becomes the parent of the trace ID = “2”. At this time, the response time 313, the throughput 314, and the TAT 315 of the trace ID = “1” do not include the response time 313, the throughput 314, and the TAT 315 of the trace ID = “2”, and only the communication and processing of the trace ID = “1”. , Response time 313, throughput 314, and TAT 315.
 本実施例において、SLO(Service Level Objectives)とは前述のとおり応答時間やスループット、TATといったDC2やネットワーク50等の性能に関する項目を対象とし、セキュリティ、可用性、信頼性などは対象外である。SLOの各項目は、KPI(Key Performance Indicator=重要業績評価指標)を達成するための項目で、管理計算機100を利用するシステム管理者やサービスプロバイダー4などが予め設定することができる。 In the present embodiment, SLO (Service Level Objects) covers items related to performance such as DC2 and network 50 such as response time, throughput, and TAT, as described above, and excludes security, availability, reliability, and the like. Each item of the SLO is an item for achieving KPI (Key Performance Indicator = key performance evaluation index), and can be set in advance by a system administrator or service provider 4 using the management computer 100.
 次に、図2Bにおいて、テーブルT2は、トランザクション単位の識別子を格納するTransaction ID321と、トランザクション間で親関係のあるトランザクションの識別子を格納するParent ID322と、トランザクション間で兄弟関係のあるトランザクションの識別子を格納するSibling ID323と、当該トランザクションが含むTrace IDの集合を格納するTrace Set324と、当該トランザクションが発生した位置情報を格納するLocation325と、コンバージョン1及びコンバージョン2の結果をそれぞれ格納するConversion 1(326)と、Conversion 2(327)と、売上金額を格納するSales328と、からひとつのエントリーが構成される。 Next, in FIG. 2B, the table T2 includes a transaction ID 321 storing an identifier for each transaction, a parent ID 322 storing an identifier of a transaction having a parent relationship between transactions, and an identifier of a transaction having a sibling relationship between transactions. Sibling ID 323 to be stored, Trace Set 324 to store a set of Trace IDs included in the transaction, Location 325 to store location information where the transaction has occurred, and Conversion 1 (326) to store the results of conversion 1 and conversion 2 respectively And one entry from Conversion 2 (327) and Sales 328 for storing the sales amount It is made.
 本実施例ではKPIがトランザクション単位に設定されることを想定している。なお、KPIがサービス単位に設定される場合であっても、サービスに属する全てのトランザクションに対して、同一のKPIを設定すれば良い。 In this embodiment, it is assumed that KPI is set for each transaction. Even if the KPI is set for each service, the same KPI may be set for all transactions belonging to the service.
 上記Location325は、ユーザートレース情報を所定の地域毎に分割して収集し、地域毎に分析を行う場合に利用する。KPIはサービスが展開する地域毎に設定される場合があり、この場合において、所定の地域毎にユーザートレース情報を管理し、後述するように、SLOも地域毎に設定を行う。 The location 325 is used when user trace information is divided and collected for each predetermined area and analyzed for each area. The KPI may be set for each area where the service is deployed. In this case, the user trace information is managed for each predetermined area, and the SLO is set for each area as described later.
 上記Conversion 1/2(326、327)はKPIの達成結果を示しており、具体的には「本登録したか」、「サービス利用頻度が2回/1月以上か」、「ソーシャル活動実施回数が1回/1月以上か」、「目標課金額を達成したか」など、当該ユーザートレース情報においてKPI項目が達成されたか否かを示す。KPIが達成された場合には、Conversion 1/2(326、327)に"True"が格納され、達成されなかった場合には"False"が格納される。ここで、本実施例において、KPIが対象とする項目であるKPI項目数、換言するとConversion項目数については、設定されるKPIに応じて変更することができる。 The above Conversion 1/2 (326, 327) indicates the achievement result of KPI. Specifically, “Is this registered?”, “Is the frequency of service usage more than twice / month”, “Number of social activities performed” Indicates whether or not the KPI item has been achieved in the user trace information such as “Is it once or more than a month?”, “Has the target billing amount been achieved”, or the like. When KPI is achieved, “True” is stored in Conversion 1/2 (326, 327), and when it is not achieved, “False” is stored. Here, in the present embodiment, the number of KPI items that are items targeted by the KPI, in other words, the number of conversion items can be changed according to the set KPI.
 本実施例において、KPIとは売上高や利益、それらに関連するユーザー満足度などビジネスのインカム(income)に関する項目を対象とし、経費やコストなどの費用(costs and expenses)は対象外である。なお、経費に対する売上高や売上原価に対する売上高や利益(原価率や利益率)といった、「費用あたりのインカム」等の指標はKPIの対象項目とする。 In this embodiment, KPI covers items related to business income such as sales and profits, and user satisfaction related to them, and does not cover expenses such as expenses and expenses (expenses and expenses). Note that indicators such as “income per expense” such as sales against expenses and sales and profits against cost of sales (cost rate and profit ratio) are subject to KPI.
 図1Aにおいて、DC2の仮想計算機210で実行されるアプリケーション14は、サービス事業者が提供するサービスの一部を処理する。Webサービスであれば、Webサービスを構成するファイルをユーザー端末1からの要求に応じて送信するアプリケーションや、電子商取引(EC)サービスの場合、商品データの検索処理を行うアプリケーションである。 In FIG. 1A, the application 14 executed by the virtual computer 210 of DC2 processes a part of the service provided by the service provider. In the case of a Web service, it is an application that transmits a file constituting the Web service in response to a request from the user terminal 1, or in the case of an electronic commerce (EC) service, an application that searches for product data.
 リソース情報取得部15は、DC2内のコンピューティングノードの稼働状況に関する情報を収集し、リソース情報格納部16に格納する。リソース情報とは、例えば、プロセッサ使用率やストレージ容量、ネットワーク使用量や、アプリケーションログなどを示す。なお、リソース情報の取得は、公知または周知の手法を適用することができる。例えば、"Nagios - The Industry Standard in IT Infrastructure Monitoring"(http://www.nagios.org/)等を適用してもよい。 The resource information acquisition unit 15 collects information regarding the operating status of the computing nodes in the DC 2 and stores the information in the resource information storage unit 16. The resource information indicates, for example, a processor usage rate, storage capacity, network usage amount, application log, and the like. In addition, acquisition of resource information can apply a well-known or well-known method. For example, “Nagios-The Industry Standard in IT Infrastructure Monitoring” (http://www.nagios.org/) may be applied.
 ユーザートレース格納部12に格納されたユーザートレース情報及びリソース情報格納部16に格納されたリソース情報は、送受信部13により管理計算機100のSLO管理部3の送受信部20に送信される。なお、送信タイミングは、定期的であってもよいし、ユーザートレース情報及びリソース情報の取得を契機とするイベントドリブンであっても良い。 The user trace information stored in the user trace storage unit 12 and the resource information stored in the resource information storage unit 16 are transmitted to the transmission / reception unit 20 of the SLO management unit 3 of the management computer 100 by the transmission / reception unit 13. The transmission timing may be periodic, or may be event-driven triggered by acquisition of user trace information and resource information.
 <管理計算機の構成>
 図1Bにおいて、管理計算機100は、プロセッサ110と、メモリ120と、ストレージ130と、管理コンソール5と、を備えて、SLO管理部3と、配置計画部41と、配置実行部42を実行する。管理コンソール5は、入力装置と出力装置を備えて、サービスプロバイダー4やシステム管理者などからの入力を受け付ける。
<Configuration of management computer>
In FIG. 1B, the management computer 100 includes a processor 110, a memory 120, a storage 130, and a management console 5, and executes an SLO management unit 3, an arrangement plan unit 41, and an arrangement execution unit 42. The management console 5 includes an input device and an output device, and receives input from the service provider 4 or a system administrator.
 管理計算機100のSLO管理部3は、分散したDC2-1~2-nへのアプリケーション14及びデータ140の配置を管理する。なお、SLO管理部3の実装場所は、本実施例の管理計算機100に限定されるものではなく、DC2内の物理計算機201や仮想計算機210である。なお、SLO管理部3は、分散される各DC2に分散して配置されてもよいし、集中的に一つのDC2に配置されても良い。 The SLO management unit 3 of the management computer 100 manages the arrangement of the application 14 and data 140 to the distributed DCs 2-1 to 2-n. The mounting location of the SLO management unit 3 is not limited to the management computer 100 of the present embodiment, but is the physical computer 201 and the virtual computer 210 in the DC2. Note that the SLO management unit 3 may be arranged in a distributed manner in each DC 2 to be distributed, or may be centrally arranged in one DC 2.
 SLO管理部3は、ユーザートレース集積部21は送受信部20によりDC2及びユーザー端末1から送信されるユーザートレース情報を集積する。リソース情報集積部40は、DC2から送信されたリソース情報を送受信部20で受け付けて、集積する。 In the SLO management unit 3, the user trace accumulation unit 21 accumulates user trace information transmitted from the DC 2 and the user terminal 1 by the transmission / reception unit 20. The resource information accumulation unit 40 receives and accumulates the resource information transmitted from the DC 2 by the transmission / reception unit 20.
 ここで、SLO管理部3が分散されて各DC2に分散配置される場合、ユーザートレース集積部21及びリソース情報集積部40は、各SLO管理部3が担当するDC2-1~2-n及びユーザー端末1から送信されるユーザートレース情報、各SLO管理部3が担当するDC2-1~2-nから送信されるリソース情報のみを集積する。一方、SLO管理部3が集中的に一つのDC2に配置される場合、ユーザートレース集積部21及びリソース情報集積部40それぞれは、全てのDC2及び全てのユーザー端末1から送信されるユーザートレース情報、全てのDC2から送信されるリソース情報を集積する。 Here, when the SLO management unit 3 is distributed and distributed to each DC 2, the user trace accumulation unit 21 and the resource information accumulation unit 40 are connected to the DC 2-1 to 2 -n and the user who are in charge of each SLO management unit 3. Only user trace information transmitted from the terminal 1 and resource information transmitted from the DCs 2-1 to 2-n in charge of each SLO management unit 3 are accumulated. On the other hand, when the SLO management unit 3 is centrally arranged in one DC 2, the user trace accumulation unit 21 and the resource information accumulation unit 40 respectively have user trace information transmitted from all DCs 2 and all user terminals 1, Resource information transmitted from all DCs 2 is accumulated.
 重要SLO選定部22は、ユーザートレース集積部21から取得したユーザートレース情報から、トランザクション毎のKPI実測値とSLO実測値を抽出し、KPIと関連性の高いSLO項目を選定する。なお、重要SLO選定部22を含むSLO管理部3の詳細な処理については図3を用いて後述する。 The important SLO selection unit 22 extracts the KPI actual measurement value and the SLO actual measurement value for each transaction from the user trace information acquired from the user trace accumulation unit 21, and selects an SLO item highly related to the KPI. Detailed processing of the SLO management unit 3 including the important SLO selection unit 22 will be described later with reference to FIG.
 SLO基準導出部26は、重要SLO選定部22で選定されたKPIと関連性の高いSLO項目に関して、KPI格納部24と、SLO格納部25にそれぞれ格納されるKPIとSLOを用いて、トランザクション単位でKPIに沿ったSLO基準を導出するトランザクション単位SLO基準導出部(第1SLO基準導出部)30と、アプリケーション14またはデータ140単位にKPIに沿ったSLO基準を導出するアプリケーション及びデータ単位SLO基準導出部(第2SLO基準導出部)31とを含む。なお、「アプリケーション単位」及び「データ単位」とは、分散するDC2-1~2-nに対応して、アプリケーション14及びデータ140を1以上のデータセンター2に分割して配置可能な単位を示す。 The SLO standard derivation unit 26 uses the KPI and SLO stored in the KPI storage unit 24 and the SLO storage unit 25 for each SLO item highly relevant to the KPI selected by the important SLO selection unit 22, respectively. The transaction unit SLO standard deriving unit (first SLO standard deriving unit) 30 for deriving the SLO standard along the KPI and the application and data unit SLO standard deriving unit for deriving the SLO standard along the KPI for the application 14 or the data 140 unit (Second SLO reference deriving unit) 31. The “application unit” and the “data unit” indicate units that can be arranged by dividing the application 14 and the data 140 into one or more data centers 2 corresponding to the distributed DCs 2-1 to 2-n. .
 KPI格納部24は、サービスプロバイダー4が管理コンソール5から指定したKPIを、設定入力部23を経由して、格納する。 The KPI storage unit 24 stores the KPI designated by the service provider 4 from the management console 5 via the setting input unit 23.
 KPI格納部24が格納するKPIの例を図5AのテーブルT6に示す。図5Aは、KPI格納部24の一例を示す図である。テーブルT6は、Item361と、Value362からひとつのエントリーを構成する。 An example of the KPI stored in the KPI storage unit 24 is shown in a table T6 in FIG. 5A. FIG. 5A is a diagram illustrating an example of the KPI storage unit 24. The table T6 includes one entry from the Item 361 and the Value 362.
 テーブルT6は、クリックスルー率(Click Through Rate)が15%以上、売上高(Sales Performance)が1億(100M)円以上、利益率(Profit Rate)が20%以上であるKPIを含む。また、サービスプロバイダー4が管理コンソール5からKPIを指定する際の画面の一例を図6Aに示す。 The table T6 includes KPIs having a click through rate (Click Through Rate) of 15% or more, a sales (Sales Performance) of 100 million yen or more, and a profit rate (Profit Rate) of 20% or more. An example of a screen when the service provider 4 specifies a KPI from the management console 5 is shown in FIG. 6A.
 図6Aは管理計算機100が提供するKPIを設定する画面イメージである。画面G1は、SLO管理部3の設定入力部23が管理コンソール5へ出力する。管理コンソール5は、サービスプロバイダー4からKPIの設定を受け付ける。 FIG. 6A is a screen image for setting a KPI provided by the management computer 100. The setting input unit 23 of the SLO management unit 3 outputs the screen G1 to the management console 5. The management console 5 receives the KPI setting from the service provider 4.
 画面G1は、対象トランザクションG11と、KPI項目G12と、KPI基準G13を設定することができる。KPI項目G12と、KPI基準G13はプルダウンメニューにより予め設定された値から選択することもできる。また、「ADD」ボタンの操作により、新たな対象トランザクションを加えることができる。設定が完了すると「保存」ボタンを操作することで、設定したKPIの情報をKPI格納部24に格納することができる。 In the screen G1, a target transaction G11, a KPI item G12, and a KPI standard G13 can be set. The KPI item G12 and the KPI standard G13 can be selected from values set in advance by a pull-down menu. Further, a new target transaction can be added by operating the “ADD” button. When the setting is completed, the set KPI information can be stored in the KPI storage unit 24 by operating the “Save” button.
 SLO格納部25は、サービスプロバイダー4が管理コンソール5から指定した初期値としてのSLOを、設定入力部23を経由して、格納する。SLO格納部25が格納するSLOの例を図5BのテーブルT7に示す。図5Bは、SLO格納部25の一例を示す図である。テーブルT7は、Item371と、Value372からひとつのエントリーを構成する。 The SLO storage unit 25 stores the SLO as an initial value designated by the service provider 4 from the management console 5 via the setting input unit 23. An example of the SLO stored in the SLO storage unit 25 is shown in a table T7 in FIG. 5B. FIG. 5B is a diagram illustrating an example of the SLO storage unit 25. The table T7 constitutes one entry from Item 371 and Value 372.
 テーブルT7では、応答時間が500ms以下、スループットが3Mbps以上、TATが1秒以下であるSLOを含む。 Table T7 includes SLOs with a response time of 500 ms or less, a throughput of 3 Mbps or more, and a TAT of 1 second or less.
 SLO出力部27は、SLO基準導出部26により導出されたトランザクション単位もしくはアプリケーション及びデータ単位のSLO基準を、管理コンソール5などに出力する。本実施例では出力方法や出力フォーマットについては、周知または公知の技術を適用すればよい。 The SLO output unit 27 outputs the transaction unit or application and data unit SLO standard derived by the SLO standard deriving unit 26 to the management console 5 or the like. In this embodiment, a known or publicly known technique may be applied to the output method and output format.
 配置計画部41は、SLO基準導出部26によりアプリケーション及びデータ単位に導出されたSLO基準と、リソース情報集積部40のリソース情報を用いて、DC2へのアプリケーション14及びデータ140の配置場所を計画する。 The arrangement planning unit 41 plans the arrangement location of the application 14 and the data 140 on the DC 2 by using the SLO standard derived for each application and data by the SLO standard deriving unit 26 and the resource information of the resource information accumulation unit 40. .
 配置実行部42は、配置計画部41の出力する配置計画に基づき、DC2へのアプリケーション14及びデータ140の配布を実行する。 The arrangement execution unit 42 executes distribution of the application 14 and the data 140 to the DC 2 based on the arrangement plan output from the arrangement planning unit 41.
 SLO管理部3と、配置計画部41と、配置実行部42の各機能部はプログラムとしてメモリ120にロードされる。プロセッサ110は、各機能部のプログラムに従って処理することによって、所定の機能を提供する機能部として稼働する。例えば、プロセッサ110は、SLO管理プログラムに従って処理することでSLO管理部3として機能する。他のプログラムについても同様である。さらに、プロセッサ110は、各プログラムが実行する複数の処理のそれぞれの機能を提供する機能部としても稼働する。計算機及び計算機システムは、これらの機能部を含む装置及びシステムである。 Each functional unit of the SLO management unit 3, the arrangement planning unit 41, and the arrangement execution unit 42 is loaded into the memory 120 as a program. The processor 110 operates as a functional unit that provides a predetermined function by performing processing according to a program of each functional unit. For example, the processor 110 functions as the SLO management unit 3 by performing processing according to the SLO management program. The same applies to other programs. Furthermore, the processor 110 also operates as a function unit that provides the functions of a plurality of processes executed by each program. A computer and a computer system are an apparatus and a system including these functional units.
 SLO管理部3の各機能を実現するプログラム、テーブル等の情報は、ストレージ130や不揮発性半導体メモリ、ハードディスクドライブ、SSD(Solid State Drive)等の記憶デバイス、または、ICカード、SDカード、DVD等の計算機読み取り可能な非一時的データ記憶媒体に格納することができる。 Information such as programs and tables for realizing each function of the SLO management unit 3 includes storage 130, nonvolatile semiconductor memory, hard disk drive, storage device such as SSD (Solid State Drive), or IC card, SD card, DVD, etc. Can be stored in any computer-readable non-transitory data storage medium.
 <管理計算機の処理>
 続けて、図3を用いてSLO管理部3の詳細な処理について説明する。図3は、SLO管理部3で行われる処理の一例を示すフローチャートである。
<Management computer processing>
Next, detailed processing of the SLO management unit 3 will be described with reference to FIG. FIG. 3 is a flowchart illustrating an example of processing performed in the SLO management unit 3.
 ユーザー端末1及びDC2に配備されるユーザートレース取得部11は、それぞれユーザートレース情報を収集する(F1)。管理計算機100のSLO管理部3は、送受信部20を経由して、DC2及びユーザー端末1のユーザートレース格納部12、12Aからユーザートレース情報を取得してユーザートレース集積部21に格納する(F2)。ユーザートレース集積部21は、各DC2やユーザー端末1のエージェントとしてのユーザートレース取得部11、11Aが収集したユーザートレース情報を管理計算機100に集約する。 The user trace acquisition unit 11 deployed in the user terminal 1 and the DC 2 collects user trace information, respectively (F1). The SLO management unit 3 of the management computer 100 acquires user trace information from the DC2 and the user trace storage units 12 and 12A of the user terminal 1 via the transmission / reception unit 20, and stores them in the user trace accumulation unit 21 (F2). . The user trace accumulating unit 21 aggregates the user trace information collected by the user trace acquisition units 11 and 11A as agents of each DC 2 and user terminal 1 in the management computer 100.
 ここで、ユーザートレース集積部21は、テーブルT1及びテーブルT2に示すユーザートレース情報の生データをそのまま格納するのでなく、図2C~図2Eに示すテーブルT3~T5のように一定期間もしくは、複数のユーザーのユーザートレース情報をまとめて格納しても良い。 Here, the user trace accumulating unit 21 does not store the raw data of the user trace information shown in the table T1 and the table T2 as they are, but instead of the raw data of the user trace information as shown in the tables T3 to T5 shown in FIGS. User trace information of users may be stored together.
 図2Cは、KPI項目に関して、一定期間の複数ユーザーのユーザートレースをトランザクション単位にまとめて、トランザクショントレースとして格納するテーブルT3の一例を示す図である。図2Dは、SLO項目に関して、一定期間の複数ユーザーのユーザートレースをトランザクション単位にまとめたトランザクショントレースとして格納するテーブルT4の一例を示す図である。図2Eは、SLO項目に関して、一定期間の複数ユーザーのユーザートレース情報を、1トレース単位にまとめたテーブルT5の一例を示す図である。 FIG. 2C is a diagram illustrating an example of a table T3 that stores user traces of a plurality of users for a certain period in a transaction unit and stores them as transaction traces for KPI items. FIG. 2D is a diagram illustrating an example of a table T4 that stores user traces of a plurality of users for a certain period as transaction traces in a transaction unit with respect to the SLO item. FIG. 2E is a diagram illustrating an example of a table T5 in which user trace information of a plurality of users for a certain period is collected in units of one trace regarding the SLO item.
 図2CのテーブルT3は、トランザクションの識別子を格納するTransaction ID331と、集積期間を格納するPeriod332と、当該トランザクションが含むTrace IDの集合を格納するTrace Set333と、当該トランザクションが発生した位置情報を格納するLocation334と、コンバージョン1及びコンバージョン2の比率をそれぞれ格納するConversion 1 Rate(335)と、Conversion 2 Rate(336)と、売上金額を格納するSales3337と、からひとつのエントリーが構成される。 The table T3 in FIG. 2C stores a transaction ID 331 for storing a transaction identifier, a period 332 for storing an accumulation period, a trace set 333 for storing a set of trace IDs included in the transaction, and position information where the transaction has occurred. One entry is composed of Location 334, Conversion 1 Rate (335) for storing the ratios of Conversion 1 and Conversion 2, respectively, Conversion 2 Rate (336), and Sales 3337 for storing the sales amount.
 図2DのテーブルT4は、トランザクションの識別子を格納するTransaction ID341と、当該トランザクションが発生した位置情報を格納するLocation342と、当該トレースにかかる応答時間を格納するResponse Time343と、当該トランザクションのターンアラウンドタイムを格納するTAT345と、からひとつのエントリーを構成する。 The table T4 in FIG. 2D includes a transaction ID 341 for storing a transaction identifier, a location 342 for storing position information where the transaction has occurred, a response time 343 for storing a response time for the trace, and a turnaround time of the transaction. One entry is composed of the TAT 345 to be stored.
 図2EのテーブルT5は、ユーザートレース情報の識別子を格納するTrace ID351と、トランザクションの識別子を格納するTransaction ID352と、集積期間を格納するPeriod353と、当該トレースにかかる応答時間を格納するResponse Time354と、当該トランザクションのスループットを格納するThroughput355と、当該トランザクションのターンアラウンドタイムを格納するTAT356と、からひとつのエントリーを構成する。 The table T5 of FIG. 2E includes a trace ID 351 for storing user trace information identifiers, a transaction ID 352 for storing transaction identifiers, a period 353 for storing an accumulation period, and a response time 354 for storing response times for the traces. One entry consists of Throughput 355 that stores the throughput of the transaction and TAT 356 that stores the turnaround time of the transaction.
 図2Cにおいて、テーブルT3はテーブルT2に対して、集積期間を示すPeriod332が追加される。また、複数のユーザートレース情報をまとめるため、Conversion Rate 1/2(335、336)のように、率に関するKPI項目が項目化される。なお、テーブルT3のKPI項目に関して、まとめられる値はKPI項目及び基準に従う。KPIにて平均が採用される場合、これらの項目も平均を用いれば良い。以降では、テーブルT3のKPI項目情報をKPI実測値として利用する。一方、Sales項337のように、集積されるユーザートレース情報の総和を示す項目も存在する。 In FIG. 2C, a period 332 indicating an accumulation period is added to the table T3 with respect to the table T2. In order to collect a plurality of pieces of user trace information, KPI items relating to rates are itemized, such as Conversion Rate 1/2 (335, 336). In addition, regarding the KPI items in the table T3, the values to be collected are in accordance with the KPI items and criteria. When the average is adopted in KPI, these items may be averaged. Thereafter, the KPI item information in the table T3 is used as the KPI actual measurement value. On the other hand, there is an item indicating the total sum of user trace information to be accumulated, such as a Sales item 337.
 図2Dにおいて、テーブルT4も同様に、SLO項目に関して、まとめられる値は平均、最頻値、中央値など限定しない。以降では、テーブルT4のSLO項目情報をSLO実測値として利用する。 In FIG. 2D, the table T4 similarly does not limit the values collected for the SLO items, such as the average, mode, and median. Thereafter, the SLO item information in the table T4 is used as the actual SLO value.
 続けて、図3では、全てのトランザクションそれぞれについて、いて(F3)、全てのKPI項目それぞれにおいて(F4)、SLO管理部3の重要SLO選定部22は当該トランザクションで、当該KPI項目にとって重要なSLO項目を選定する(F5)。 Next, in FIG. 3, for all transactions (F3) and for all KPI items (F4), the important SLO selection unit 22 of the SLO management unit 3 is the SLO that is important for the KPI item. Select an item (F5).
 ここで、重要SLO項目とは、当該KPI項目と相関の高い、もしくは高い因果関係のあるSLO項目を指す。 Here, the important SLO item refers to an SLO item having a high correlation with the KPI item or having a high causal relationship.
 本実施例では、KPI項目と相関(または因果関係)の高いSLO項目のうち、重要なSLO項目を導出する手法については、公知又は周知の手法を用いることができる。例えば、SLO管理部3は、テーブルT3から取得するKPI実測値を目的変数とし、図2DのテーブルT4から取得するSLO実測値を説明変数とする重回帰分析を行えば良い。なお、説明変数としては、SLO項目の種別が異なる複数のSLO実測値を用いることが望ましい。 In this embodiment, a known or well-known method can be used as a method for deriving an important SLO item among SLO items having a high correlation (or causal relationship) with a KPI item. For example, the SLO management unit 3 may perform multiple regression analysis using the KPI actual value acquired from the table T3 as an objective variable and the SLO actual value acquired from the table T4 in FIG. 2D as an explanatory variable. As explanatory variables, it is desirable to use a plurality of measured SLO values with different types of SLO items.
 具体的には、次の(1)式のモデルを生成し、回帰直線を算出する。
K = Sβ+ε、 where K = (k1、 k2、 ...、 kn)^T、S = (S1、 S2、 ...、 Si、 ...、 Sn)^T、 Si = (si1、 si2、 ...、 sip)、 β = (b1、 b2、 ...、 bp)^T、 ε = (e1、 e2、 ...、 en)^T     ・・・(1)
 図2CのテーブルT3及び図2DのテーブルT4のKPI実測値及びSLO実測値は、正規化されていないため、回帰直線の偏回帰係数をt検定し、算出されたt値をソートしてランキングすれば良い。ここで、t値は、KPI項目とSLO項目の相関関係の高さ(または重要度)を示す指標として扱うことができる。
Specifically, a model of the following equation (1) is generated and a regression line is calculated.
K = Sβ + ε, where K = (k1, k2, ..., kn) ^ T, S = (S1, S2, ..., Si, ..., Sn) ^ T, Si = (si1, si2, , Sip), β = (b1, b2, ..., bp) ^ T, ε = (e1, e2, ..., en) ^ T (1)
The KPI actual values and SLO actual values in the table T3 in FIG. 2C and the table T4 in FIG. 2D are not normalized. It ’s fine. Here, the t value can be treated as an index indicating the height (or importance) of the correlation between the KPI item and the SLO item.
 なお、上記(1)式において、Kはk1、k2とKPI実測値を要素とする行列である。例えば、当該KPIがConversion 1 Rate335である場合、k1、k2それぞれにはテーブルT3のTransaction ID331が図中100と101のConversion 1 Rate335の値が入る。 In the above equation (1), K is a matrix whose elements are k1, k2 and KPI measured values. For example, when the KPI is Conversion 1 Rate 335, Transaction ID 331 of Table T3 is set to values of Conversion 1 Rate 335 of 100 and 101 in the figure for each of k1 and k2.
 SiはSLO項目iのSLO実測値をsi1、si2のような要素とする行列であり、si1、si2にはそれぞれテーブルT4の応答時間343やスループット344などの値が入る。SはSiを要素とする行列、βは偏回帰係数行列、εは切片行列である。また、Tはかかる行列が転置行列であることを示す。 Si is a matrix having SLO actual values of SLO item i as elements such as si1 and si2, and values such as response time 343 and throughput 344 of table T4 are entered in si1 and si2, respectively. S is a matrix having Si as an element, β is a partial regression coefficient matrix, and ε is an intercept matrix. T indicates that the matrix is a transposed matrix.
 続いて、図3のステップF5では、重要なSLO項目の中から、一定の基準を満たすSLO項目、もしくは上位のSLO項目を重要SLO項目として設定する。本実施例では前述の「一定基準」及び「抽出する上位数」については、所定の値を用いることができ、あるいは上述のt検定で算出されたt値に対する閾値などを用いることができる。 Subsequently, in step F5 of FIG. 3, among important SLO items, an SLO item satisfying a certain standard or a higher-level SLO item is set as an important SLO item. In the present embodiment, a predetermined value can be used for the above-mentioned “constant standard” and “higher number to be extracted”, or a threshold value for the t value calculated by the above-described t-test can be used.
 続けて、ステップF5で算出された重要SLO項目の全てに対して、ステップF7、F8の処理を実行する(F6)。全ての重要SLO項目について上記処理が完了すると、ステップF4の処理に復帰する。 Subsequently, the processes in steps F7 and F8 are executed for all the important SLO items calculated in step F5 (F6). When the above processing is completed for all important SLO items, the processing returns to step F4.
 ステップF7では、SLO基準導出部26のトランザクション単位SLO基準導出部30が、当該KPIに沿ったSLO基準を導出する。本実施例では当該KPIに沿ったSLO基準を導出する手法の一例として図4Aに示すフローチャートを示す。 In step F7, the transaction unit SLO standard deriving unit 30 of the SLO standard deriving unit 26 derives an SLO standard along the KPI. In the present embodiment, a flowchart shown in FIG. 4A is shown as an example of a method for deriving an SLO standard along the KPI.
 図4Aは、管理計算機100のトランザクション単位SLO基準導出部30で行われるSLO基準の導出処理の一例を示すフローチャートである。 FIG. 4A is a flowchart showing an example of the SLO standard derivation process performed by the transaction unit SLO standard derivation unit 30 of the management computer 100.
 図4Aにおいて、トランザクション単位SLO基準導出部30は、テーブルT3を参照して、当該KPIの実測値を取得し、テーブルT4を参照して当該重要SLO実測値の相関を取得する(F21)。 4A, the transaction unit SLO standard derivation unit 30 refers to the table T3, acquires the actual measured value of the KPI, and refers to the table T4 to acquire the correlation of the important SLO actual measured value (F21).
 具体的には、多数のユーザーによるユーザートレース情報によって生成されたテーブルT3及びテーブルT4に含まれる多数のKPI実測値と多数のSLO実測値を、SLO項目と、KPI項目を軸とする分布を算出する。 Specifically, the table T3 and the table T4 generated by the user trace information by a large number of users calculate a large number of measured KPI values and a large number of SLO measured values, and the distribution around the SLO items and the KPI items as axes. To do.
 例えば、テーブルT3とテーブルT4に含まれる同一のTransaction ID(331、341)=100を持つトランザクションは、Conversion 1 Rate335として5%を持ち、Response Time343として1sを持つ。ゆえに、Response Time343を横軸、Conversion 1 Rate335を縦軸とするグラフを生成する場合、当該トランザクショントレースは[5%、1s]の座標にプロットされる。ステップF21で算出された分布にて、分布近似曲線を得て、当該分布曲線とKPI直線の交点を獲得する(F22)。 For example, a transaction having the same Transaction ID (331, 341) = 100 included in Table T3 and Table T4 has 5% as Conversion 1 Rate 335 and 1s as Response Time 343. Therefore, when generating a graph with the Response Time 343 as the horizontal axis and the Conversion 1 Rate 335 as the vertical axis, the transaction trace is plotted at the coordinates of [5%, 1s]. A distribution approximate curve is obtained from the distribution calculated in step F21, and the intersection of the distribution curve and the KPI line is obtained (F22).
 本実施例で対象とするKPIである売上高や利益率、ユーザー満足度などインカムに関するKPI項目と、本実施例で対象とする応答時間やスループット、TATなどの性能に関するSLO項目は相関を持ち、その分布はべき乗分布となることが知られている。 KPI items related to income such as sales, profit ratio, user satisfaction, etc., which are KPIs targeted in this example, and SLO items related to performances such as response time, throughput, TAT, etc. in this example have a correlation, It is known that the distribution is a power distribution.
 後述する図7のように、SLO項目(Response Time)とKPI項目(Conversion 1 Rate)を軸とする第一象限を、当該KPIの基準と当該SLOの基準の2つの直線により4つの領域に分割すると、当該SLO基準を満足するものの、当該KPI基準を満たさない領域にプロットされるトランザクショントレースが存在する。 As shown in FIG. 7 to be described later, the first quadrant with the SLO item (Response Time) and the KPI item (Conversion 1 Rate) as the axes is divided into four areas by two straight lines, the KPI standard and the SLO standard. Then, there is a transaction trace that is plotted in a region that satisfies the SLO criterion but does not satisfy the KPI criterion.
 この領域において、分布近似曲線との誤差が所定の範囲内にプロットされるトランザクショントレースは、当該SLO基準の設定不備により当該KPIが未達となった可能性が高い。換言すると、当該KPI基準に沿ったSLO基準を当該SLO基準として設定していないために、分布近似曲線との誤差が小さな(所定の範囲内)プロットのトランザクショントレースは、当該SLO基準を満足するものの、当該KPI基準を満たさない領域にプロットされたと解釈できる。それゆえ、分布近似曲線との誤差が所定の範囲内でプロットされるトランザクショントレースが、当該KPI基準を満たす領域にプロットされることを期待し、当該SLO基準を新たなSLO基準に変更する。 In this area, a transaction trace in which an error from the distribution approximate curve is plotted within a predetermined range is highly likely that the KPI has not been achieved due to a setting failure of the SLO standard. In other words, since the SLO standard along the KPI standard is not set as the SLO standard, the transaction trace of the plot with a small error from the distribution approximate curve (within a predetermined range) satisfies the SLO standard. Can be interpreted as being plotted in a region not satisfying the KPI standard. Therefore, the transaction trace in which the error from the distribution approximate curve is plotted within a predetermined range is expected to be plotted in a region that satisfies the KPI criterion, and the SLO criterion is changed to a new SLO criterion.
 ステップF22で得た交点のSLO項目軸の値を、当該KPIに沿ったSLO基準として設定する(F23)。ステップF23により、SLO管理部3は、KPIに沿ったSLO基準を自動的に導出できる。 The value of the SLO item axis at the intersection obtained in step F22 is set as the SLO reference along the KPI (F23). By step F23, the SLO management unit 3 can automatically derive the SLO standard along the KPI.
 図7を用いて上記ステップF21の処理の具体例を提示する。図7は、ステップF21で述べた当該KPI実測値と、テーブルT4から取得される当該SLO実測値の相関を示したグラフである。 A specific example of the processing in step F21 is presented using FIG. FIG. 7 is a graph showing a correlation between the KPI actual measurement value described in Step F21 and the SLO actual measurement value acquired from the table T4.
 図7は、本発明の一例として、横軸を当該SLO項目としてResponse Timeとし、縦軸を当該KPI項目としてConversion 1 Rateとした図である。図7において、縦線S1は当該SLO基準を示し、横線K1は当該KPI基準を示し、C1は分布近似曲線を示す。 FIG. 7 shows an example of the present invention in which the horizontal axis is Response Time as the SLO item and the vertical axis is Conversion 1 Rate as the KPI item. In FIG. 7, a vertical line S1 indicates the SLO standard, a horizontal line K1 indicates the KPI standard, and C1 indicates a distribution approximate curve.
 また、図中A1は当該SLO基準と当該KPI基準の両方を満たすトランザクショントレースがプロット(図中ユーザープロット点)されるエリアを示し、A2は当該SLO基準を満足するものの、当該KPIを満たさないトランザクショントレースがプロットされるエリアを示し、A3は当該SLO基準と当該KPI基準を共に満たさないトランザクショントレースがプロットされるエリアを示す。図7において、当該KPIを満足するエリアA1へ、A2のエリアにプロットされるトランザクショントレースを移動させることを目的に、上記ステップF22の処理を実施する。 A1 in the figure indicates an area where transaction traces that satisfy both the SLO standard and the KPI standard are plotted (user plot points in the figure), and A2 is a transaction that satisfies the SLO standard but does not satisfy the KPI. A3 indicates an area where a trace is plotted, and A3 indicates an area where a transaction trace that does not satisfy both the SLO criterion and the KPI criterion is plotted. In FIG. 7, the process of step F22 is performed for the purpose of moving the transaction trace plotted in the area A2 to the area A1 that satisfies the KPI.
 図8を用いて上記ステップF22及びF23の処理の具体例を示す。図8は図7と同様に、ステップF21で示した当該KPI実測値と、テーブルT4から取得される当該SLO実測値の相関を図示したグラフである。 A specific example of the processing of steps F22 and F23 will be described with reference to FIG. FIG. 8 is a graph illustrating the correlation between the measured KPI value shown in step F21 and the measured SLO value obtained from the table T4, as in FIG.
 図8において、I1は分布近似曲線C1とKPI基準K1の交点を示し、S2は交点I1を通る新たなSLO基準線を示す。図4AのステップF23では交点I1を通る新たなSLO基準線S2を当該KPIに沿ったSLO基準として設定する。図8の例では、新たなSLO基準S2を応答時間が短くなる方向に変更することで、KPI基準を満たすトランザクショントレースが増大することが期待できる。 8, I1 indicates an intersection between the distribution approximate curve C1 and the KPI standard K1, and S2 indicates a new SLO reference line passing through the intersection I1. In step F23 of FIG. 4A, a new SLO reference line S2 passing through the intersection I1 is set as the SLO reference along the KPI. In the example of FIG. 8, it can be expected that transaction traces that satisfy the KPI standard will increase by changing the new SLO standard S2 in a direction that shortens the response time.
 次に、図3のフローチャートでは、続けて、SLO基準導出部26のアプリ/データ単位SLO基準導出部31が、当該KPIに沿ったトランザクション単位のSLO基準を、アプリケーション14及びデータ140単位へと展開する(F8)。 Next, in the flowchart of FIG. 3, the application / data unit SLO standard deriving unit 31 of the SLO standard deriving unit 26 expands the transaction unit SLO standard along the KPI into the application 14 and the data 140 unit. (F8).
 図4Bを用いてステップF8の詳細を説明する。図4Bは、アプリケーション14及びデータ140毎にSLO基準を設定する処理の一例を示すフローチャートである。 Details of step F8 will be described with reference to FIG. 4B. FIG. 4B is a flowchart illustrating an example of processing for setting an SLO standard for each application 14 and data 140.
 アプリ/データ単位SLO基準導出部31は、テーブルT3を参照してKPI実測値を目的変数として取得し、テーブルT5を参照してアプリケーション14及びデータ140単位のSLO実測値を説明変数として取得する。アプリ/データ単位SLO基準導出部31は、取得した目的変数と説明変数から上記(1)式のモデルを生成し、重回帰分析により回帰直線を算出する。 The application / data unit SLO standard deriving unit 31 refers to the table T3 to acquire the KPI actual measurement value as the objective variable, and refers to the table T5 to acquire the SLO actual measurement value of the application 14 and the data 140 unit as the explanatory variable. The application / data unit SLO criterion derivation unit 31 generates a model of the above equation (1) from the acquired objective variable and explanatory variable, and calculates a regression line by multiple regression analysis.
 ここで、テーブルT3及びテーブルT5のKPI実測値及びSLO実測値は、正規化されていないため、回帰直線の偏回帰係数をt検定し、算出されたt値をランキングすれば良い。なお、上記ステップF5と異なり、上記(1)式において、SiはSLO項目iのSLO実測値がsi1、si2のような要素とする行列であり、異なるユーザートレース情報の同一のSLO項目が入る。 Here, since the KPI actual measurement values and SLO actual measurement values in the tables T3 and T5 are not normalized, the partial regression coefficient of the regression line may be t-tested and the calculated t values may be ranked. Note that, unlike step F5, in the above equation (1), Si is a matrix having SLO actual values of SLO items i as elements such as si1 and si2, and the same SLO items of different user trace information are entered.
 具体的に、応答時間と当該KPIの重回帰分析をする場合、テーブルT5にて、si1、si2には100ms、300msが格納される。SはSiを要素とする行列であり、S1、S2、...は異なるTransaction IDを持つ。βは偏回帰係数行列、εは切片行列である。また、Tはかかる行列が転置行列であることを示す。 Specifically, when performing a multiple regression analysis of the response time and the KPI, 100 ms and 300 ms are stored in si1 and si2 in table T5. S is a matrix having Si as an element, and S1, S2,... Have different transaction IDs. β is a partial regression coefficient matrix, and ε is an intercept matrix. T indicates that the matrix is a transposed matrix.
 このような重回帰分析を行うことで、当該KPIに対する、アプリケーション14及びデータ140それぞれの当該SLO項目への寄与率(もしくは影響度)を抽出できる。 By performing such multiple regression analysis, it is possible to extract the contribution rate (or degree of influence) of the application 14 and data 140 to the SLO item for the KPI.
 前述の重回帰分析により、アプリ/データ単位SLO基準導出部31は、当該KPIに対する、アプリケーション14及びデータ140それぞれの当該SLO項目への寄与率(もしくは影響度)を抽出する(F30)。 The application / data unit SLO criterion derivation unit 31 extracts the contribution rate (or the degree of influence) of the application 14 and the data 140 to the corresponding SLO item by the multiple regression analysis described above (F30).
 ここで、アプリ/データ単位SLO基準導出部31は、多数のSLO実測値を算出するため、換言すれば、上記(1)式におけるnを大きくするために、SLOを満たす範囲内で、アプリケーション14及びデータ140を複数のDC2-1~2-nに分散配置させておく。この分散配置により、異なるSLO実測値を持つユーザートレース情報及びトランザクショントレースを生成することができるため、より精度の高い寄与率を導出できる。 Here, the application / data unit SLO criterion derivation unit 31 calculates a large number of SLO actual measurement values, in other words, in order to increase n in the above equation (1), the application 14 is within a range that satisfies the SLO. The data 140 is distributed in a plurality of DCs 2-1 to 2-n. With this distributed arrangement, user trace information and transaction traces having different actual SLO values can be generated, so that a more accurate contribution rate can be derived.
 続けて、アプリ/データ単位SLO基準導出部31は、上記図4AのステップF23で導出した新たなSLO基準を、上記ステップF30で導出した寄与率に応じて、アプリケーション14及びデータ140毎のSLO基準を設定する(F31)。ステップF31の具体的な手法については、公知または周知の手法を採用することができる。例えば、SLO項目が応答時間やTATの場合、寄与率を正規化し、その逆数をステップF23で導出したSLO基準に乗算することで、アプリケーション14及びデータ140それぞれのSLO基準が算出される。また、SLO項目がスループットの場合、寄与率を正規化した値をステップF23で導出した新たなSLO基準に乗算することで、アプリケーション14及びデータ140それぞれのSLO基準が算出される。 Subsequently, the application / data unit SLO criterion deriving unit 31 determines the new SLO criterion derived in step F23 of FIG. 4A as the SLO criterion for each application 14 and data 140 according to the contribution rate derived in step F30. Is set (F31). As a specific method of Step F31, a known or well-known method can be employed. For example, when the SLO item is response time or TAT, the contribution rate is normalized, and the reciprocal number is multiplied by the SLO criterion derived in step F23, thereby calculating the SLO criterion for each of the application 14 and the data 140. When the SLO item is throughput, the SLO standard for each of the application 14 and the data 140 is calculated by multiplying the normalized value of the contribution rate by the new SLO standard derived in step F23.
 以上の処理を全ての重要SLO項目、全てのKPI項目で実施する(F6及びF4によるループ)。この結果、KPI項目毎に、重要SLO項目毎のSLO基準が導出される。それゆえ、全KPI項目をまたがり、同一のSLO項目のSLO基準を調整する(F9)。 The above processing is executed for all important SLO items and all KPI items (loop by F6 and F4). As a result, an SLO criterion for each important SLO item is derived for each KPI item. Therefore, the SLO standard of the same SLO item is adjusted across all KPI items (F9).
 本実施例ではSLO基準の調整の具体的な方法について、公知または周知の技術を採用することができる。例えば、いずれのKPIを満足するために、最低限必要となるSLO基準を選択すれば良い。具体的には、KPI-1に対して応答時間の基準が100ms、KPI-2に対して応答時間の基準が50msである場合、KPI-1とKPI-2の両者を満足するためには、50msを採用すれば良い。 In this embodiment, a known or well-known technique can be adopted as a specific method for adjusting the SLO standard. For example, in order to satisfy any KPI, a minimum required SLO standard may be selected. Specifically, when the response time standard for KPI-1 is 100 ms and the response time standard for KPI-2 is 50 ms, in order to satisfy both KPI-1 and KPI-2, What is necessary is just to employ 50 ms.
 なお、SLO基準調整にあたり、運用管理コストに対する売上高や利益といった、「運用管理費用あたりのインカム」となる指標をKPIとして設定することで、アプリケーション14及びデータ140の配置にかかるコストを考慮したSLO基準の設定が可能となる。前述の例と同様に、運用管理コストに対する利益率(KPI-3)に対して応答時間の基準が100ms、利益率(KPI-4)に対して応答時間の基準が50msである場合、KPI-2に沿ったSLO基準はサービスプロバイダー4にとって現実的でない値である。 It should be noted that, when adjusting the SLO standard, by setting an index to be an “income per operation management cost” such as sales and profit with respect to the operation management cost as a KPI, an SLO that takes into account the costs for the arrangement of the application 14 and the data 140 The standard can be set. Similar to the above example, if the response time standard is 100 ms for the profit rate (KPI-3) with respect to the operation management cost and the response time standard is 50 ms for the profit rate (KPI-4), KPI- The SLO criterion along 2 is an unrealistic value for the service provider 4.
 それゆえ、KPI-1に沿ったSLO基準を優先し、100msを採用することで、サービスプロバイダー4にとって妥当、かつKPIに沿ったSLO基準の設定が可能となる。 Therefore, by prioritizing the SLO standard according to KPI-1 and adopting 100 ms, it becomes possible for the service provider 4 to set the SLO standard according to the KPI.
 最後にSLO管理部3は、以上の処理を全てのトランザクションで実施し(F3のループ)てから、終了する。なお、図3の処理の実行タイミングは任意とする。但し、本実施例によるSLO基準は、図3の処理を実行する時点までのユーザー端末1による処理の履歴を基にして決定される。このため、図3の処理を長期的な間隔で定期的に実行すると、ユーザー端末1を利用するユーザーの行動がSLO基準と乖離する可能性が高い。それゆえ、図3の処理は、短い間隔での定期的な実行もしくはサービス更新後の一定期間後でのイベントドリブンによって実行するのが好ましい。 Finally, the SLO management unit 3 performs the above processing for all transactions (F3 loop), and then ends. Note that the execution timing of the processing in FIG. 3 is arbitrary. However, the SLO standard according to the present embodiment is determined based on the history of processing by the user terminal 1 up to the time when the processing of FIG. 3 is executed. For this reason, if the process of FIG. 3 is periodically executed at long-term intervals, the user's action using the user terminal 1 is highly likely to deviate from the SLO standard. Therefore, it is preferable to execute the processing of FIG. 3 by periodic execution at short intervals or event driven after a certain period after service update.
 図3の処理により導出された、トランザクション単位もしくはアプリケーション14及びデータ140単位の重要SLO項目、SLO基準は、SLO管理部3のSLO出力部27を経由して、管理コンソール5に出力される。トランザクション単位もしくはアプリケーション14及びデータ140単位の重要SLO項目、SLO基準の管理コンソール5への出力の一例を図6Bに示す。 3, the important SLO items and SLO criteria for each transaction or application 14 and data 140 derived by the processing of FIG. 3 are output to the management console 5 via the SLO output unit 27 of the SLO management unit 3. FIG. 6B shows an example of the output to the management console 5 based on the important SLO items of the transaction unit or the application 14 and the data 140 unit and the SLO standard.
 図6Bは、管理計算機100の出力部27がSLOの情報を、管理コンソール5に出力する画面イメージである。 FIG. 6B is a screen image in which the output unit 27 of the management computer 100 outputs SLO information to the management console 5.
 画面G2は、対象トランザクションG21と、対象アプリケーション/データG22と、SLO項目G23と、SLO基準G24を表示することができる。SLO管理部3は、図3の処理が完了すると画面G2を管理コンソール5に出力する。 The screen G2 can display the target transaction G21, the target application / data G22, the SLO item G23, and the SLO standard G24. When the processing of FIG. 3 is completed, the SLO management unit 3 outputs a screen G2 to the management console 5.
 また、図3の処理により導出された、トランザクション単位もしくはアプリケーション及びデータ単位の重要SLO項目、SLO基準を用いて、配置計画部41がアプリケーション14やデータ140の配置計画を策定する。本実施例では配置計画の方法につていては、公知または周知の技術を採用することができる。アプリケーション14やデータ140の配置計画は、アプリケーション14やデータ140を提供するDC2の位置や、アプリケーション14を実行する物理計算機201や仮想計算機210の指定等で構成される。なお、アプリケーション14及びデータ140単位にSLO基準が設定されるため、アプリケーション14及びデータ140間の配置に関する依存関係を考慮することなく、アプリケーション14及びデータ140のDC2間またはDC2内の配置を計画することが可能である。 In addition, the arrangement planning unit 41 formulates an arrangement plan for the application 14 and the data 140 using the important SLO items and SLO criteria of the transaction unit or application and data unit derived by the processing of FIG. In this embodiment, a well-known or well-known technique can be employed for the method of arrangement planning. The arrangement plan of the application 14 and the data 140 includes the location of the DC 2 that provides the application 14 and the data 140, the designation of the physical computer 201 and the virtual computer 210 that execute the application 14, and the like. Since the SLO standard is set for the application 14 and the data 140 unit, the arrangement of the application 14 and the data 140 between the DCs 2 or in the DC 2 is planned without considering the dependency relationship between the applications 14 and the data 140. It is possible.
 本実施例により設定されるアプリケーション14及びデータ140単位のSLO基準は、トランザクショントレースにおいて、ユーザー端末1を利用するユーザーに近いトランザクショントレースから順に決定されている。ゆえに、ユーザー端末1が直接アクセスするアプリケーション14、当該アプリケーション14がアクセスするアプリケーション14、もしくはデータ140という順に、ユーザー端末1に近いアプリケーション14もしくはデータ140から順に配置を計画すれば良い。なお、配置計画部41が、アプリケーション14及びデータ140をDC2のリソースへ配置する際に、導出したSLO基準を満たすDC2が複数存在する場合、ステップF8の処理精度を向上させることを目的に、アプリケーション14及びデータ140のレプリカを生成し、冗長的に導出したSLO基準を満たす複数のDC2に配置すると良い。これは、エンドユーザーから見たアプリケーション及びデータに対する応答時間などの性能にバラつきを発生させるためである。本実施例が提案するKPIからSLOを導出する方法は、実際のエンドユーザーの行動を基に、KPIに沿ったSLOへと調整する方法である。いずれのエンドユーザーのSLO実測値が同じであれば、図7のユーザープロット点は、縦軸に平行な直線のような分布となり、KPIにとって望ましいSLOへの調整が困難となる。基本的に、特にクラウドコンピューティングでは性能が保証されないため、1箇所に配置されたアプリケーション及びデータに関する性能は揺らぐことになる。しかしながら、複数のデータセンターへアプリケーション及びデータを配置することで、より大きな揺らぎを持つユーザートレースを獲得でき、その結果、より正確にKPIに沿ったSLOを導出できる。 The SLO standard for the application 14 and the data 140 unit set by the present embodiment is determined in order from the transaction trace close to the user using the user terminal 1 in the transaction trace. Therefore, the arrangement may be planned in order from the application 14 or data 140 close to the user terminal 1 in the order of the application 14 directly accessed by the user terminal 1, the application 14 accessed by the application 14, or the data 140. When the placement planning unit 41 places the application 14 and the data 140 on the resource of DC2, if there are a plurality of DC2s that satisfy the derived SLO criterion, the application is intended to improve the processing accuracy of step F8. 14 and the replica of the data 140 are generated and arranged in a plurality of DCs 2 that satisfy the redundantly derived SLO criterion. This is to cause variations in performance such as response time to applications and data viewed from the end user. The method of deriving the SLO from the KPI proposed in the present embodiment is a method of adjusting to the SLO according to the KPI based on the actual end-user behavior. If the measured SLO values of all end users are the same, the user plot points in FIG. 7 have a distribution like a straight line parallel to the vertical axis, and it becomes difficult to adjust to the desired SLO for KPI. Basically, since performance is not guaranteed particularly in cloud computing, the performance related to applications and data arranged in one place fluctuates. However, by placing applications and data in a plurality of data centers, a user trace having a larger fluctuation can be obtained, and as a result, an SLO along the KPI can be derived more accurately.
 本実施例に用いられる技術により、アプリケーション14及びデータ140単位にSLO基準が設定されるため、アプリケーション14及びデータ140間の配置に関する依存関係を考慮することなく、配置を計画することが可能である。 Since the SLO standard is set for each application 14 and data 140 by the technique used in the present embodiment, it is possible to plan the arrangement without considering the dependency relationship between the application 14 and the data 140. .
 具体的には、データセンターの数を100、アプリケーション14やデータ140の数を1000とすると、計算パターン数は100*1000程度となり、前記従来例に比して配置計画にかかる計算量を大幅に削減できる。 Specifically, if the number of data centers is 100, and the number of applications 14 and data 140 is 1000, the number of calculation patterns is about 100 * 1000, which greatly increases the amount of calculation for the arrangement plan compared to the conventional example. Can be reduced.
 また、本実施例によると、サービスプロバイダー4は、図6Aの画面G1で示すように、KPIを設定するだけで、当該KPIに沿ったSLOが自動的に設定され、また、自動設定されたSLOに基づき、自動的にアプリケーション14及びデータ140はKPIに沿った場所(DC2)へと配置される。これにより、広域にデータセンター2が分散した環境において、ビジネスの条件であるKPIを入力として、アプリケーション14及びデータ140の配置を最適化することが可能となる。 Further, according to the present embodiment, as shown in the screen G1 in FIG. 6A, the service provider 4 simply sets the KPI, and the SLO according to the KPI is automatically set, and the automatically set SLO Automatically, the application 14 and the data 140 are placed at a location (DC2) along the KPI. As a result, in an environment where the data centers 2 are distributed over a wide area, it is possible to optimize the arrangement of the application 14 and the data 140 using the KPI that is a business condition as an input.
 また、上記実施例では、対象とするトランザクション毎にKPIを設定する例を示したが、アプリケーション14が提供するサービス毎にKPIを設定してもよい。 In the above embodiment, an example is shown in which a KPI is set for each target transaction. However, a KPI may be set for each service provided by the application 14.
 また、上記実施例では、アプリケーション14及びデータ140のSLO基準から、アプリケーション14及びデータ140毎の寄与率を抽出して、アプリケーション14及びデータ140毎のSLOを算出する例を示したが、アプリケーション14のみについてSLO基準の設定と、SLOの算出を行うようにしてもよい。 In the above embodiment, an example is shown in which the contribution rate for each application 14 and data 140 is extracted from the SLO criteria of the application 14 and data 140, and the SLO for each application 14 and data 140 is calculated. Only SLO may be set and SLO may be calculated.
 <まとめ>
 以上のように、本発明では、分割可能なアプリケーション14のサービスを分割して、1以上のデータセンター2で提供可能な計算機システムにおいて、アプリケーション14及びデータ140の配置を管理する管理計算機100が、サービスまたはトランザクション毎に、ビジネスの条件を含むKPI(第1の条件)を受け付けて、アプリケーション14及びデータ140のサービスレベル目標(第2の条件)に関する項目について自動的に基準値を更新する。
<Summary>
As described above, in the present invention, the management computer 100 that manages the arrangement of the application 14 and the data 140 in the computer system that can divide the service of the application 14 that can be divided and can be provided by one or more data centers 2, For each service or transaction, a KPI (first condition) including business conditions is received, and the reference values are automatically updated for items related to the service level target (second condition) of the application 14 and data 140.
 アプリケーション14が稼働する計算機(仮想計算機210または物理計算機201)が所属するデータセンター2内で、エージェント(ユーザートレース取得部11)を稼働させ、アプリケーション14のサービスを利用するユーザー端末1を監視する。各データセンター2のエージェントは、ユーザー端末1が利用したサービス(アプリケーション14)のログ及び性能情報を含むユーザートレース情報を取得する。複数のデータセンター2へアプリケーション14を及びデータ140の配置を管理する管理計算機100は、アプリケーション14及びデータ140を利用したユーザー端末1のユーザートレース情報を各データセンター2のエージェントから収集する。 In the data center 2 to which the computer (virtual computer 210 or physical computer 201) on which the application 14 operates belongs, the agent (user trace acquisition unit 11) is operated to monitor the user terminal 1 that uses the service of the application 14. The agent of each data center 2 acquires user trace information including a log of service (application 14) used by the user terminal 1 and performance information. The management computer 100 that manages the arrangement of the application 14 and the data 140 to the plurality of data centers 2 collects user trace information of the user terminal 1 using the application 14 and the data 140 from the agent of each data center 2.
 管理計算機100は、ユーザートレース情報からKPIに相関関係があるSLOについて重要度(または貢献度)を算出し、重要SLO項目を抽出する。そして、管理計算機100は、KPIと相関する重要SLO項目についてユーザートレース情報から分布近似曲線を算出し、分布近似曲線とKPI基準から当該KPI基準を満足する新たなSLO基準を導出し、現在のSLO基準を新たなSLO基準に変更する。 The management computer 100 calculates the importance (or contribution) for the SLO correlated with the KPI from the user trace information, and extracts the important SLO items. Then, the management computer 100 calculates a distribution approximation curve from the user trace information for the important SLO items correlated with the KPI, derives a new SLO criterion that satisfies the KPI criterion from the distribution approximation curve and the KPI criterion, and obtains the current SLO. Change the standard to the new SLO standard.
 次に、管理計算機100は、KPIに対するSLO項目について、アプリケーション14及びデータ140それぞれの当該SLO項目への寄与率(もしくは影響度)を算出する。そして、管理計算機100は、寄与率と新たなSLO基準から、アプリケーション14のSLO基準と、データ140のSLO基準をそれぞれ算出する。 Next, the management computer 100 calculates the contribution rate (or the degree of influence) of the application 14 and the data 140 to the SLO item for the SLO item for the KPI. Then, the management computer 100 calculates the SLO standard of the application 14 and the SLO standard of the data 140 from the contribution rate and the new SLO standard.
 本発明によれば、アプリケーション14及びデータ140単位にSLO基準が設定されるため、アプリケーション14及びデータ140間の配置に関する依存関係を考慮することなく、アプリケーション14及びデータ140の配置を計画することが可能である。そして、本発明では、前記従来例に比して配置計画にかかる計算量を大幅に削減できる。また、本発明では、ユーザートレース情報の変化に応じてSLO基準を自動的に補正することができ、アプリケーション14及びデータ140の配置計画を常時適切に行うことができる。 According to the present invention, since the SLO standard is set in units of the application 14 and the data 140, the arrangement of the application 14 and the data 140 can be planned without considering the dependency relationship between the application 14 and the data 140. Is possible. And in this invention, the computational complexity concerning arrangement | positioning plan can be reduced significantly compared with the said prior art example. Further, according to the present invention, the SLO standard can be automatically corrected according to the change of the user trace information, and the arrangement plan of the application 14 and the data 140 can be always performed appropriately.
 なお、本発明において説明した計算機等の構成、処理部及び処理手段等は、それらの一部又は全部を、専用のハードウェアによって実現してもよい。 The configuration of the computer, the processing unit, and the processing unit described in the present invention may be partially or entirely realized by dedicated hardware.
 また、本実施例で例示した種々のソフトウェアは、電磁的、電子的及び光学式等の種々の記録媒体(例えば、非一時的な記憶媒体)に格納可能であり、インターネット等の通信網を通じて、コンピュータにダウンロード可能である。 In addition, the various software exemplified in the present embodiment can be stored in various recording media (for example, non-transitory storage media) such as electromagnetic, electronic, and optical, and through a communication network such as the Internet. It can be downloaded to a computer.
 また、本発明は上記した実施例に限定されるものではなく、様々な変形例が含まれる。例えば、上記した実施例は本発明をわかりやすく説明するために詳細に説明したものであり、必ずしも説明した全ての構成を備えるものに限定されるものではない。 Further, the present invention is not limited to the above-described embodiments, and includes various modifications. For example, the above-described embodiments have been described in detail for easy understanding of the present invention, and are not necessarily limited to those having all the configurations described.

Claims (15)

  1.  プロセッサとメモリを有してデータセンターとユーザー端末に接続された管理計算機が、1以上のデータセンターに分割可能なアプリケーション及びデータの配置を管理するアプリケーション及びデータの配置管理方法であって、
     前記データセンター内に配備されたエージェントが、前記データセンターを監視し、前記ユーザー端末内に配備されたエージェントが前記ユーザー端末を監視し、前記データセンターのエージェントと前記ユーザー端末のエージェントが、前記ユーザー端末を操作し、前記アプリケーションのサービスを利用するユーザーのサービス行動ログ及び性能情報を含むユーザートレース情報を取得する第1のステップと、
     前記管理計算機が、前記ユーザー端末が利用したサービスまたはトランザクション毎に、ビジネスの条件を含む第1の条件を受け付ける第2のステップと、
     前記管理計算機が、アプリケーション及びデータを利用したユーザー端末のユーザートレース情報を、前記データセンターの前記エージェントと前記ユーザー端末の前記エージェントから収集する第3のステップと、
     前記管理計算機は、前記アプリケーション及びデータの所定のサービスレベル目標を第2の条件として、前記ユーザートレース情報から前記第1の条件と相関関係がある第2の条件について重要度を算出し、前記重要度が所定の閾値以上の第2の条件に対応する項目を、重要第2条件項目として抽出する第4のステップと、
     前記管理計算機が、前記重要第2条件項目と第1の条件の項目について、前記ユーザートレース情報から分布近似曲線を算出し、前記分布近似曲線と前記第1の条件から当該第1の条件を満足する新たな第2の条件を算出し、現在の第2の条件を新たな第2の条件に変更する第5のステップと、
     前記管理計算機が、前記第1の条件に対する前記新たな第2の条件項目について、前記アプリケーション及びデータそれぞれの当該第2の条件項目への寄与率を算出する第6のステップと、
     前記管理計算機が、前記寄与率と新たな第2の条件から前記アプリケーションに対する新たな第2の条件の基準と、前記データに対する新たな第2の条件の基準をそれぞれ算出する第7のステップと、
     前記算出された前記アプリケーションと前記データに対する新たな第2の条件の基準を満たす1以上のデータセンターに対して、前記アプリケーションと前記データを分散配置する第8のステップと、
    を含むことを特徴とするアプリケーション及びデータの配置管理方法。
    A management computer having a processor and a memory and connected to a data center and a user terminal is an application and data arrangement management method for managing application and data arrangement that can be divided into one or more data centers,
    An agent deployed in the data center monitors the data center, an agent deployed in the user terminal monitors the user terminal, and the data center agent and the user terminal agent A first step of operating the terminal and obtaining user trace information including service behavior log and performance information of a user who uses the service of the application;
    A second step in which the management computer accepts a first condition including a business condition for each service or transaction used by the user terminal;
    A third step in which the management computer collects user trace information of a user terminal using an application and data from the agent of the data center and the agent of the user terminal;
    The management computer calculates importance for the second condition correlated with the first condition from the user trace information, with the predetermined service level target of the application and data as the second condition, and the importance A fourth step of extracting an item corresponding to a second condition whose degree is equal to or greater than a predetermined threshold as an important second condition item;
    The management computer calculates a distribution approximate curve from the user trace information for the important second condition item and the first condition item, and satisfies the first condition from the distribution approximate curve and the first condition. A fifth step of calculating a new second condition to change the current second condition to the new second condition;
    A sixth step in which the management computer calculates a contribution ratio of the application and data to the second condition item for the new second condition item for the first condition;
    A seventh step in which the management computer calculates a new second condition criterion for the application and a new second condition criterion for the data from the contribution rate and the new second condition;
    An eighth step of distributing and arranging the application and the data to one or more data centers that satisfy a new second condition criterion for the calculated application and the data;
    An application and data arrangement management method comprising:
  2.  請求項1に記載のアプリケーション及びデータの配置管理方法であって、
     前記第4のステップは、
     前記ユーザートレース情報から前記第1の条件に対応する実測値を取得して目的変数とし、前記ユーザートレース情報から前記第2の条件に対応する実測値を取得して説明変数とし、重回帰分析を行って前記重要第2条件項目を抽出することを特徴とするアプリケーション及びデータの配置管理方法。
    The application and data arrangement management method according to claim 1,
    The fourth step includes
    An actual value corresponding to the first condition is acquired from the user trace information as an objective variable, an actual value corresponding to the second condition is acquired from the user trace information as an explanatory variable, and multiple regression analysis is performed. An application and data arrangement management method, characterized in that the important second condition item is extracted.
  3.  請求項1に記載のアプリケーション及びデータの配置管理方法であって、
     前記第5のステップは、
     前記分布近似曲線と前記第1の条件の交点を通る第2の条件の値を、新たな第2の条件として設定することを特徴とするアプリケーション及びデータの配置管理方法。
    The application and data arrangement management method according to claim 1,
    The fifth step includes
    An application and data arrangement management method, wherein a value of a second condition passing through an intersection of the distribution approximate curve and the first condition is set as a new second condition.
  4.  請求項1に記載のアプリケーション及びデータの配置管理方法であって、
     前記第6のステップは、
     前記ユーザートレース情報から前記第1の条件に対応する実測値を取得して目的変数とし、前記ユーザートレース情報から前記アプリケーション及びデータ単位の第2の条件に対応する実測値を取得して説明変数とし、重回帰分析を行って前記寄与率を算出することを特徴とするアプリケーション及びデータの配置管理方法。
    The application and data arrangement management method according to claim 1,
    The sixth step includes
    An actual value corresponding to the first condition is acquired from the user trace information as an objective variable, and an actual value corresponding to the second condition of the application and data unit is acquired from the user trace information as an explanatory variable. An application and data arrangement management method, wherein the contribution rate is calculated by performing multiple regression analysis.
  5.  請求項1に記載のアプリケーション及びデータの配置管理方法であって、
     前記第4のステップは、
     全ての前記第1の条件と相関関係がある第2の条件について重要度を算出し、前記重要度が所定の閾値以上の第2の条件に対応する項目を、重要第2条件項目として抽出することを特徴とするアプリケーション及びデータの配置管理方法。
    The application and data arrangement management method according to claim 1,
    The fourth step includes
    Importance is calculated for the second condition correlated with all the first conditions, and an item corresponding to the second condition having the importance equal to or higher than a predetermined threshold is extracted as an important second condition item. And an arrangement management method for data.
  6.  分割可能なアプリケーションのサービスをユーザー端末に提供する1以上のデータセンターと、
     プロセッサとメモリを有し、前記データセンターと前記ユーザー端末に接続されて前記アプリケーションと前記アプリケーションが利用するデータの配置を管理する管理計算機と、を備えたアプリケーション及びデータの配置管理システムであって、
     前記ユーザー端末は、
     当該ユーザー端末を操作し、前記アプリケーションのサービスを利用するユーザーのサービス行動ログ及び性能情報を取得するエージェントを有し、
     前記データセンターは、
     前記アプリケーションを実行し、前記データセンターを監視し、前記ユーザー端末のエージェントが、前記ユーザー端末を監視し、前記データセンターのエージェントと前記ユーザー端末のエージェントが、前記ユーザー端末を操作して前記サービスを利用するユーザーのサービス行動ログ及び性能情報を含むユーザートレース情報を取得するエージェントを実行する計算機を有し、
     前記管理計算機は、
     前記ユーザー端末が利用したサービスまたはトランザクション毎に、ビジネスの条件を含む第1の条件を受け付ける設定入力部と、
     前記アプリケーション及びデータを利用したユーザー端末のユーザートレース情報を、前記データセンター内の前記エージェントと前記ユーザー端末の前記エージェントから収集するユーザートレース収集部と、
     前記第1の条件と前記ユーザートレース情報から前記アプリケーション及びデータに対するサービスレベル目標をそれぞれ設定するサービスレベル目標管理部と、を有し、
     前記サービスレベル目標管理部は、
     前記アプリケーション及びデータの所定のサービスレベル目標を第2の条件として、前記ユーザートレース情報から前記第1の条件と相関関係がある第2の条件について重要度を算出し、前記重要度が所定の閾値以上の第2の条件に対応する項目を、重要第2条件項目として抽出する重要SLO選定部と、
     前記重要第2条件項目と前記第1の条件の項目について、前記ユーザートレース情報から分布近似曲線を算出し、前記分布近似曲線と前記第1の条件から当該第1の条件を満足する新たな第2の条件を算出し、現在の第2の条件を新たな第2の条件に変更する第1SLO基準導出部と、
     前記管理計算機が、前記第1の条件に対する前記新たな第2の条件項目について、前記アプリケーション及びデータそれぞれの当該第2の条件項目への寄与率を算出し、前記寄与率と前記新たな第2の条件から、前記アプリケーションに対する新たな第2の条件の基準と、前記データに対する新たな第2の条件の基準をそれぞれ設定する第2SLO基準導出部と、
     前記アプリケーションと前記データに対する新たな第2の条件の基準を満たす1以上のデータセンターに対して、前記アプリケーションと前記データを分散配置する配置計画部と、
    を含むことを特徴とするアプリケーション及びデータの配置管理システム。
    One or more data centers that provide user terminal services with separable applications;
    An application and data arrangement management system comprising a processor and a memory, and connected to the data center and the user terminal to manage the application and the arrangement of data used by the application,
    The user terminal is
    An agent that operates the user terminal and obtains service behavior log and performance information of a user who uses the application service,
    The data center
    The application is executed, the data center is monitored, the user terminal agent monitors the user terminal, and the data center agent and the user terminal agent operate the user terminal to operate the service. A computer that executes an agent that acquires user trace information including service activity logs and performance information of users to be used;
    The management computer is
    A setting input unit that accepts a first condition including a business condition for each service or transaction used by the user terminal;
    A user trace collection unit for collecting user trace information of a user terminal using the application and data from the agent in the data center and the agent of the user terminal;
    A service level target management unit for setting service level targets for the application and data from the first condition and the user trace information, respectively,
    The service level target management unit includes:
    Using the predetermined service level target of the application and data as a second condition, the importance is calculated for the second condition correlated with the first condition from the user trace information, and the importance is a predetermined threshold value An important SLO selection unit that extracts an item corresponding to the second condition as an important second condition item;
    For the important second condition item and the first condition item, a distribution approximate curve is calculated from the user trace information, and a new first condition that satisfies the first condition from the distribution approximate curve and the first condition is calculated. A first SLO criterion derivation unit that calculates the second condition and changes the current second condition to a new second condition;
    The management computer calculates a contribution ratio of the application and data to the second condition item for the new second condition item for the first condition, and the contribution ratio and the new second condition item are calculated. A second SLO criterion derivation unit for setting a new second condition criterion for the application and a new second condition criterion for the data, respectively,
    An arrangement planning unit that distributes and arranges the application and the data with respect to one or more data centers that satisfy a new second condition criterion for the application and the data;
    An application and data arrangement management system comprising:
  7.  請求項6に記載のアプリケーション及びデータの配置管理システムであって、
     前記重要SLO選定部は、
     前記ユーザートレース情報から前記第1の条件に対応する実測値を取得して目的変数とし、前記ユーザートレース情報から前記第2の条件に対応する実測値を取得して説明変数とし、重回帰分析を行って前記重要第2条件項目を抽出することを特徴とするアプリケーション及びデータの配置管理システム。
    The application and data arrangement management system according to claim 6,
    The important SLO selection part is:
    An actual value corresponding to the first condition is acquired from the user trace information as an objective variable, an actual value corresponding to the second condition is acquired from the user trace information as an explanatory variable, and multiple regression analysis is performed. An application and data arrangement management system, wherein the important second condition item is extracted.
  8.  請求項6に記載のアプリケーション及びデータの配置管理システムであって、
     前記第1SLO基準導出部は、
     前記分布近似曲線と前記第1の条件の交点を通る第2の条件の値を、新たな第2の条件として設定することを特徴とするアプリケーション及びデータの配置管理システム。
    The application and data arrangement management system according to claim 6,
    The first SLO criterion derivation unit includes:
    An application and data arrangement management system characterized in that a value of a second condition passing through an intersection of the distribution approximate curve and the first condition is set as a new second condition.
  9.  請求項6に記載のアプリケーション及びデータの配置管理システムであって、
     前記第2SLO基準導出部は、
     前記ユーザートレース情報から前記第1の条件に対応する実測値を取得して目的変数とし、前記ユーザートレース情報から前記アプリケーション及びデータ単位の第2の条件に対応する実測値を取得して説明変数とし、重回帰分析を行って前記寄与率を算出することを特徴とするアプリケーション及びデータの配置管理システム。
    The application and data arrangement management system according to claim 6,
    The second SLO criterion derivation unit includes:
    An actual value corresponding to the first condition is acquired from the user trace information as an objective variable, and an actual value corresponding to the second condition of the application and data unit is acquired from the user trace information as an explanatory variable. An application and data arrangement management system, wherein the contribution rate is calculated by performing multiple regression analysis.
  10.  請求項6に記載のアプリケーション及びデータの配置管理システムであって、
     前記重要SLO選定部は、
     全ての前記第1の条件と相関関係がある第2の条件について重要度を算出し、前記重要度が所定の閾値以上の第2の条件に対応する項目を、重要第2条件項目として抽出することを特徴とするアプリケーション及びデータの配置管理システム。
    The application and data arrangement management system according to claim 6,
    The important SLO selection part is:
    Importance is calculated for the second condition correlated with all the first conditions, and an item corresponding to the second condition having the importance equal to or higher than a predetermined threshold is extracted as an important second condition item. An application and data arrangement management system characterized by the above.
  11.  プロセッサとメモリを有してデータセンターとユーザー端末に接続された管理計算機が、1以上のデータセンターに分割可能なアプリケーション及びデータの配置を管理するプログラムを格納した記憶媒体であって、
     前記データセンターに接続されたユーザー端末が利用したサービスまたはトランザクション毎に、ビジネスの条件を含む第1の条件を受け付ける第1の手順と、
     前記データセンターと前記ユーザー端末に配備されて、前記データセンターと前記ユーザー端末を監視し、前記ユーザー端末を操作し、前記アプリケーションのサービスを利用するユーザーのサービス行動ログ及び性能情報を含むユーザートレース情報を取得するエージェントから、前記アプリケーション及びデータを利用したユーザー端末のユーザートレース情報を収集する第2の手順と、
     前記アプリケーション及びデータの所定のサービスレベル目標を第2の条件として、前記ユーザートレース情報から前記第1の条件と相関関係がある第2の条件について重要度を算出し、前記重要度が所定の閾値以上の第2の条件に対応する項目を、重要第2条件項目として抽出する第3の手順と、
     前記重要第2条件項目と第1の条件の項目について、前記ユーザートレース情報から分布近似曲線を算出し、前記分布近似曲線と前記第1の条件から当該第1の条件を満足する新たな第2の条件を算出し、現在の第2の条件を新たな第2の条件に変更する第4の手順と、
     前記第1の条件に対する前記新たな第2の条件項目について、前記アプリケーション及びデータそれぞれの当該第2の条件項目への寄与率を算出する第5の手順と、
     前記管理計算機が、前記寄与率と新たな第2の条件から前記アプリケーションに対する新たな第2の条件の基準と、前記データに対する新たな第2の条件の基準をそれぞれ算出する第6の手順と、
     前記算出された前記アプリケーションと前記データに対する新たな第2の条件の基準を満たす1以上のデータセンターに対して、前記アプリケーションと前記データを分散配置する第7の手順と、
     を前記管理計算機に実行させるプログラムを格納した非一時的な計算機読み取り可能な記憶媒体。
    A management computer having a processor and a memory and connected to a data center and a user terminal is a storage medium storing a program for managing an application and data arrangement that can be divided into one or more data centers,
    A first procedure for receiving a first condition including a business condition for each service or transaction used by a user terminal connected to the data center;
    User trace information, which is deployed in the data center and the user terminal, monitors the data center and the user terminal, operates the user terminal, and uses the service of the application and includes service performance log and performance information A second procedure for collecting user trace information of a user terminal using the application and data from an agent that obtains
    Using the predetermined service level target of the application and data as a second condition, the importance is calculated for the second condition correlated with the first condition from the user trace information, and the importance is a predetermined threshold value A third procedure for extracting an item corresponding to the second condition as an important second condition item;
    For the important second condition item and the first condition item, a distribution approximate curve is calculated from the user trace information, and a new second condition that satisfies the first condition from the distribution approximate curve and the first condition is calculated. A fourth procedure for calculating the condition of the current and changing the current second condition to a new second condition;
    A fifth procedure for calculating a contribution ratio of the application and data to the second condition item for the new second condition item for the first condition;
    A sixth procedure in which the management computer calculates a new second condition criterion for the application and a new second condition criterion for the data from the contribution rate and the new second condition;
    A seventh procedure for distributing and distributing the application and the data to one or more data centers that satisfy a new second condition criterion for the calculated application and the data;
    A non-transitory computer-readable storage medium storing a program for causing the management computer to execute the program.
  12.  請求項11に記載の記憶媒体であって、
     前記第3の手順は、
     前記ユーザートレース情報から前記第1の条件に対応する実測値を取得して目的変数とし、前記ユーザートレース情報から前記第2の条件に対応する実測値を取得して説明変数とし、重回帰分析を行って前記重要第2条件項目を抽出することを特徴とする記憶媒体。
    The storage medium according to claim 11,
    The third procedure is:
    An actual value corresponding to the first condition is acquired from the user trace information as an objective variable, an actual value corresponding to the second condition is acquired from the user trace information as an explanatory variable, and multiple regression analysis is performed. A storage medium for performing the extraction of the important second condition item.
  13.  請求項11に記載の記憶媒体であって、
     前記第4の手順は、
     前記分布近似曲線と前記第1の条件の交点を通る第2の条件の値を、新たな第2の条件として設定することを特徴とする記憶媒体。
    The storage medium according to claim 11,
    The fourth procedure includes:
    A storage medium, wherein a value of a second condition passing through an intersection of the distribution approximate curve and the first condition is set as a new second condition.
  14.  請求項11に記載の記憶媒体であって、
     前記第5の手順は、
     前記ユーザートレース情報から前記第1の条件に対応する実測値を取得して目的変数とし、前記ユーザートレース情報から前記アプリケーション及びデータ単位の第2の条件に対応する実測値を取得して説明変数とし、重回帰分析を行って前記寄与率を算出することを特徴とする記憶媒体。
    The storage medium according to claim 11,
    The fifth procedure includes:
    An actual value corresponding to the first condition is acquired from the user trace information as an objective variable, and an actual value corresponding to the second condition of the application and data unit is acquired from the user trace information as an explanatory variable. A storage medium characterized by performing multiple regression analysis to calculate the contribution rate.
  15.  請求項11に記載の記憶媒体であって、
     前記第3の手順は、
     全ての前記第1の条件と相関関係がある第2の条件について重要度を算出し、前記重要度が所定の閾値以上の第2の条件に対応する項目を、重要第2条件項目として抽出することを特徴とする記憶媒体。
    The storage medium according to claim 11,
    The third procedure is:
    Importance is calculated for the second condition correlated with all the first conditions, and an item corresponding to the second condition having the importance equal to or higher than a predetermined threshold is extracted as an important second condition item. A storage medium characterized by that.
PCT/JP2013/080672 2013-11-13 2013-11-13 Application- and data-distribution management method, application- and data-distribution management system, and storage medium WO2015071975A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/JP2013/080672 WO2015071975A1 (en) 2013-11-13 2013-11-13 Application- and data-distribution management method, application- and data-distribution management system, and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2013/080672 WO2015071975A1 (en) 2013-11-13 2013-11-13 Application- and data-distribution management method, application- and data-distribution management system, and storage medium

Publications (1)

Publication Number Publication Date
WO2015071975A1 true WO2015071975A1 (en) 2015-05-21

Family

ID=53056945

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2013/080672 WO2015071975A1 (en) 2013-11-13 2013-11-13 Application- and data-distribution management method, application- and data-distribution management system, and storage medium

Country Status (1)

Country Link
WO (1) WO2015071975A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003242228A (en) * 2002-01-31 2003-08-29 Currie & Brown Japan Ltd Facility operation management system
JP2007529048A (en) * 2003-07-11 2007-10-18 インターナショナル・ビジネス・マシーンズ・コーポレーション System and method for monitoring and controlling business level service level agreements
JP2008027442A (en) * 2006-07-21 2008-02-07 Sony Computer Entertainment Inc Sub-task processor distribution scheduling

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003242228A (en) * 2002-01-31 2003-08-29 Currie & Brown Japan Ltd Facility operation management system
JP2007529048A (en) * 2003-07-11 2007-10-18 インターナショナル・ビジネス・マシーンズ・コーポレーション System and method for monitoring and controlling business level service level agreements
JP2008027442A (en) * 2006-07-21 2008-02-07 Sony Computer Entertainment Inc Sub-task processor distribution scheduling

Similar Documents

Publication Publication Date Title
US20210037097A1 (en) Cloud data storage location monitoring
US9544403B2 (en) Estimating latency of an application
US20190268428A1 (en) Systems and methods of determining a geographic location based conversion
US20150067171A1 (en) Cloud service brokering systems and methods
US9774654B2 (en) Service call graphs for website performance
US9766993B2 (en) Quality of information assessment in dynamic sensor networks
Trihinas et al. Monitoring elastically adaptive multi-cloud services
US20100077449A1 (en) Calculating multi-tenancy resource requirements and automated tenant dynamic placement in a multi-tenant shared environment
US20160225042A1 (en) Determining a cost of an application programming interface
US20150006279A1 (en) Systems and methods of generating a valid location cluster based on a location of a commercial entity
US10560353B1 (en) Deployment monitoring for an application
Abderrahim et al. A holistic monitoring service for fog/edge infrastructures: a foresight study
CN112822298A (en) Business service capacity expansion and reduction method, device, medium and electronic equipment
Barve et al. Fecbench: A holistic interference-aware approach for application performance modeling
Zheng et al. Probabilistic QoS aggregations for service composition
JP2016504687A (en) Management of information technology services
Yu et al. TraceRank: Abnormal service localization with dis‐aggregated end‐to‐end tracing data in cloud native systems
US20160225043A1 (en) Determining a cost of an application
Klaver et al. Towards independent run-time cloud monitoring
Efimov et al. Integration data model for continuous service delivery in cloud computing system
Rady Formal definition of service availability in cloud computing using OWL
WO2015071975A1 (en) Application- and data-distribution management method, application- and data-distribution management system, and storage medium
US20210227351A1 (en) Out of box user performance journey monitoring
US9755925B2 (en) Event driven metric data collection optimization
Chhetri et al. CL-SLAM: Cross-layer SLA monitoring framework for cloud service-based applications

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13897377

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 13897377

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: JP