CN106933718A - Method for monitoring performance and device - Google Patents

Info

Publication number
CN106933718A
Authority
CN
China
Prior art keywords
storage
application
operation object
server
performance information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201511025843.8A
Other languages
Chinese (zh)
Other versions
CN106933718B (en
Inventor
李夫路
黄飞腾
徐飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Priority to CN201511025843.8A priority Critical patent/CN106933718B/en
Publication of CN106933718A publication Critical patent/CN106933718A/en
Application granted granted Critical
Publication of CN106933718B publication Critical patent/CN106933718B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/30 Monitoring
    • G06F11/3003 Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3006 Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system is distributed, e.g. networked systems, clusters, multiprocessor systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/30 Monitoring
    • G06F11/3003 Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3034 Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system component is a storage system, e.g. DASD based or network based
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/30 Monitoring
    • G06F11/34 Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3452 Performance evaluation by statistical analysis

Abstract

The invention discloses a performance monitoring method and device, belonging to the field of computer technology. The method includes: receiving application-side performance information of operation objects, and receiving storage-side performance information of the operation objects; receiving a correspondence between a first identifier and a second identifier of each operation object; reading the first identifier of the operation object from the application-side performance information of each operation object; reading the second identifier of the operation object from the storage-side performance information of each operation object; determining, according to the correspondence, the application-side performance information and the storage-side performance information that correspond to each operation object; analyzing the application-side and storage-side performance information corresponding to each operation object; and determining, according to the analysis result, whether the application server or the storage server has a problem. The invention solves the problem that monitoring fails because it cannot be determined whether the application server or the storage server has a problem, and thereby improves the accuracy of monitoring.

Description

Performance monitoring method and device
Technical field
The present invention relates to the field of computer technology, and in particular to a performance monitoring method and device.
Background
A user generates data while using an application. To store that data, a developer can configure a storage server for the application server on which the application runs. The application can be software such as a database.
In the related art, the developer connects the application server to an external storage server over a network; the storage server can be a collection of storage devices of various specifications. While users use the application, the developer can monitor the performance of the application server and of the storage server. Typically, the performance of the application server is monitored on the application server side, and the performance of the storage server is monitored on the storage server side.
When the application server fails while operating on an operation object, the application server cannot monitor the performance of the storage server in operating on that object. It therefore cannot be determined whether the application server or the storage server has a problem, and monitoring fails.
Summary of the invention
To solve the problem that, when the application server fails while operating on an operation object, it cannot be determined whether the application server or the storage server has a problem and monitoring therefore fails, embodiments of the present invention provide a performance monitoring method and device. The technical solutions are as follows:
In a first aspect, a performance monitoring method is provided. The method is used in a management server that is connected to an application server and to a storage server, where the storage server provides data storage for the application server. The method includes:
The management server receives application-side performance information of operation objects sent by the application server, and receives storage-side performance information of the operation objects sent by the storage server. The management server also receives, from the application server, a correspondence between a first identifier and a second identifier of each operation object. The management server then reads the first identifier of the operation object from the application-side performance information of each operation object, and reads the second identifier of the operation object from the storage-side performance information of each operation object. According to the correspondence, the management server determines the application-side performance information and the storage-side performance information that correspond to each operation object. Finally, the management server analyzes the application-side and storage-side performance information corresponding to each operation object and determines, according to the analysis result, whether the application server or the storage server has a problem.
Because the management server determines the application-side and storage-side performance information corresponding to each operation object and then analyzes that information, it can determine from the analysis result whether the application server or the storage server has a problem. This solves the problem that monitoring fails because the faulty server cannot be identified, and improves the accuracy of monitoring.
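The matching step described above can be sketched in a few lines of Python. This is only an illustration under assumptions: the record types, field names, and the dictionary form of the correspondence (first identifier mapped to second identifier) are hypothetical and do not come from the patent text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AppRecord:
    first_id: str        # hypothetical: e.g. a file name or logical address
    latency_ms: float    # application-side latency for this operation object


@dataclass(frozen=True)
class StorageRecord:
    second_id: str       # hypothetical: e.g. a physical address
    latency_ms: float    # storage-side latency for this operation object


def correlate(app_records, storage_records, correspondence):
    """Pair application-side and storage-side records of the same operation object.

    `correspondence` maps a first identifier to a second identifier.
    Records without a counterpart on the other side are skipped.
    """
    storage_by_id = {rec.second_id: rec for rec in storage_records}
    pairs = []
    for app in app_records:
        second_id = correspondence.get(app.first_id)
        storage = storage_by_id.get(second_id)
        if storage is not None:
            pairs.append((app, storage))
    return pairs
```

Each returned pair holds the two views of one operation object, which is the input a later analysis step would need.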
In a first possible implementation of the first aspect, the first identifier includes at least one of a name and a logical address of the operation object, and the second identifier includes a physical address of the operation object;
The correspondence is obtained by the application server as follows: the application server obtains a first correspondence between the name and the logical address of each operation object, obtains a second correspondence between the name and the physical address of each operation object, and merges the first correspondence and the second correspondence that include the same name.
The application server collects the first correspondence and the second correspondence separately and merges them into the correspondence, so that the application-side and storage-side performance information of an operation object can be matched according to the correspondence. This solves the problem that application-side and storage-side performance information cannot be matched, and improves the accuracy of analysis.
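The merge on a shared name can be illustrated as follows. This is a minimal sketch that assumes each correspondence is held as a dictionary keyed by the operation object's name; the function name and the data layout are hypothetical.

```python
def merge_correspondences(name_to_logical, name_to_physical):
    """Merge two name-keyed correspondences into name -> (logical, physical).

    Only names present in both inputs are kept, since an entry without
    both addresses cannot link the two sides.
    """
    merged = {}
    for name, logical in name_to_logical.items():
        if name in name_to_physical:
            merged[name] = (logical, name_to_physical[name])
    return merged
```

For example, merging `{"a": "L1", "b": "L2"}` with `{"a": "P1", "c": "P3"}` keeps only the entry for `"a"`, because `"b"` and `"c"` each appear in one correspondence only.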
In a second possible implementation of the first aspect, when the application-side performance information includes application-side latency information of a data operation and the storage-side performance information includes storage-side latency information of the data operation, analyzing the application-side and storage-side performance information corresponding to each operation object includes:
subtracting the storage-side latency from the application-side latency to obtain a latency difference; detecting whether the latency difference is greater than a first threshold; and, when the detection result is that the latency difference is greater than the first threshold, generating an analysis result indicating that the application server has a first configuration problem.
Because the management server compares the latency difference with the first threshold, it can automatically determine whether the application server has a problem. This avoids the low efficiency and low accuracy of having a developer compare the latency difference with the first threshold manually, and improves the efficiency and accuracy of problem determination.
With reference to the second possible implementation of the first aspect, in a third possible implementation of the first aspect, analyzing the application-side and storage-side performance information corresponding to each operation object further includes:
detecting whether the storage-side latency is greater than a second threshold; and, when the detection result is that the storage-side latency is greater than the second threshold, generating an analysis result indicating that the storage server has a second configuration problem.
Because the management server compares the storage-side latency with the second threshold, it can automatically determine whether the storage server has a problem. This avoids the low efficiency and low accuracy of having a developer compare the storage-side latency with the second threshold manually, and improves the efficiency and accuracy of problem determination.
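The two threshold checks from the second and third possible implementations can be sketched together. The function name, the millisecond unit, and the result strings are assumptions made for illustration; the text does not specify concrete threshold values or result formats.

```python
def analyze_latency(app_latency_ms, storage_latency_ms,
                    first_threshold_ms, second_threshold_ms):
    """Return a list of findings for one operation object.

    - application latency minus storage latency > first threshold
      -> the application server is flagged
    - storage latency > second threshold
      -> the storage server is flagged
    """
    findings = []
    latency_difference = app_latency_ms - storage_latency_ms
    if latency_difference > first_threshold_ms:
        findings.append("application server: first configuration problem")
    if storage_latency_ms > second_threshold_ms:
        findings.append("storage server: second configuration problem")
    return findings
```

The checks are independent, so a single operation object can flag both servers, either one, or neither.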
In a second aspect, a performance monitoring device is provided. The device is used in a management server that is connected to an application server and to a storage server, where the storage server provides data storage for the application server. The device includes:
a receiving unit, configured to receive application-side performance information of operation objects sent by the application server, and to receive storage-side performance information of the operation objects sent by the storage server;
the receiving unit being further configured to receive a correspondence, sent by the application server, between a first identifier and a second identifier of each operation object;
a reading unit, configured to read the first identifier of the operation object from the application-side performance information of each operation object received by the receiving unit;
the reading unit being further configured to read the second identifier of the operation object from the storage-side performance information of each operation object received by the receiving unit;
a determining unit, configured to determine, according to the correspondence received by the receiving unit, the application-side performance information and the storage-side performance information that correspond to each operation object whose identifiers were read by the reading unit;
an analysis unit, configured to analyze the application-side and storage-side performance information corresponding to each operation object determined by the determining unit;
the determining unit being further configured to determine, according to the analysis result obtained by the analysis unit, whether the application server or the storage server has a problem.
Because the management server determines the application-side and storage-side performance information corresponding to each operation object and then analyzes that information, it can determine from the analysis result whether the application server or the storage server has a problem. This solves the problem that monitoring fails because the faulty server cannot be identified, and improves the accuracy of monitoring.
In a first possible implementation of the second aspect, the first identifier includes at least one of a name and a logical address of the operation object, and the second identifier includes a physical address of the operation object;
The correspondence is obtained by the application server as follows: the application server obtains a first correspondence between the name and the logical address of each operation object, obtains a second correspondence between the name and the physical address of each operation object, and merges the first correspondence and the second correspondence that include the same name.
The application server collects the first correspondence and the second correspondence separately and merges them into the correspondence, so that the application-side and storage-side performance information of an operation object can be matched according to the correspondence. This solves the problem that application-side and storage-side performance information cannot be matched, and improves the accuracy of analysis.
In a second possible implementation of the second aspect, when the application-side performance information includes application-side latency information and the storage-side performance information includes storage-side latency information, the analysis unit is specifically configured to:
subtract the storage-side latency from the application-side latency to obtain a latency difference; detect whether the latency difference is greater than a first threshold; and, when the detection result is that the latency difference is greater than the first threshold, generate an analysis result indicating that the application server has a problem.
Because the management server compares the latency difference with the first threshold, it can automatically determine whether the application server has a problem. This avoids the low efficiency and low accuracy of having a developer compare the latency difference with the first threshold manually, and improves the efficiency and accuracy of problem determination.
With reference to the second possible implementation of the second aspect, in a third possible implementation of the second aspect, the analysis unit is further configured to: detect whether the storage-side latency is greater than a second threshold; and, when the detection result is that the storage-side latency is greater than the second threshold, generate an analysis result indicating that the storage server has a problem.
Because the management server compares the storage-side latency with the second threshold, it can automatically determine whether the storage server has a problem. This avoids the low efficiency and low accuracy of having a developer compare the storage-side latency with the second threshold manually, and improves the efficiency and accuracy of problem determination.
Brief description of the drawings
To describe the technical solutions in the embodiments of the present invention more clearly, the accompanying drawings needed for describing the embodiments are briefly introduced below. The drawings described below show only some embodiments of the present invention; a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a first schematic structural diagram of the performance monitoring system provided by the embodiments of the present invention;
Fig. 2 is a second schematic structural diagram of the performance monitoring system provided by the embodiments of the present invention;
Fig. 3 is a third schematic structural diagram of the performance monitoring system provided by the embodiments of the present invention;
Fig. 4 is a fourth schematic structural diagram of the performance monitoring system provided by the embodiments of the present invention;
Fig. 5 is a fifth schematic structural diagram of the performance monitoring system provided by the embodiments of the present invention;
Fig. 6 is a sixth schematic structural diagram of the performance monitoring system provided by the embodiments of the present invention;
Fig. 7 is a seventh schematic structural diagram of the performance monitoring system provided by the embodiments of the present invention;
Fig. 8 is a schematic structural diagram of the management server provided by one embodiment of the present invention;
Fig. 9 is a flowchart of the performance monitoring method provided by one embodiment of the present invention;
Fig. 10A is a flowchart of the performance monitoring method provided by another embodiment of the present invention;
Fig. 10B is a schematic diagram of an interface displaying information, provided by another embodiment of the present invention;
Fig. 11 is a structural block diagram of the performance monitoring device provided by one embodiment of the present invention.
Detailed description
To make the objectives, technical solutions, and advantages of the present invention clearer, embodiments of the present invention are described in further detail below with reference to the accompanying drawings.
Refer to Fig. 1, which shows a first schematic structural diagram of the performance monitoring system provided by the exemplary embodiments of the present invention. The performance monitoring system includes an application server 110, a storage server 120, and a management server 130. The management server 130 is connected to the application server 110 and to the storage server 120. The storage server 120 is connected to the application server 110 and provides data storage for the application server 110.
Refer to the second schematic structural diagram of the performance monitoring system shown in Fig. 2. An application data collection program 111 runs on the application server 110; it collects the application-side performance information of the application server 110 and sends it to the management server 130. A storage data collection program 121 runs on the storage server 120; it collects the storage-side performance information of the storage server 120 and sends it to the management server 130. The management server 130 analyzes the application-side and storage-side performance information and determines, according to the analysis result, whether the application server 110 or the storage server 120 has a problem.
The application running on the application server 110 can be a local application, such as a local database, a virtual desktop, or a virtual machine, or a network application, such as a cloud database, an Internet of Things application, or an Internet of Vehicles application. It should be added that the present invention is described only by taking the application server 110 as an example. In an actual implementation, the application server 110 can be replaced with a terminal, in which case the application is an application installed on the terminal.
Refer to the third schematic structural diagram of the performance monitoring system shown in Fig. 3. The performance monitoring system can further include a result output device 140, to which the management server 130 sends the analysis result for output. The result output device 140 can be a display, a player, or the like, which is not limited by the present invention.
Refer to the fourth schematic structural diagram of the performance monitoring system shown in Fig. 4. An application performance statistics program 112 also runs on the application server 110; it collects the application-side performance information of the application server 110 and sends it to the application data collection program 111.
Refer to the fifth schematic structural diagram of the performance monitoring system shown in Fig. 5. The application data collection program 111 can include an agent data collection program 111a and a data distribution program 111b. The agent data collection program 111a is connected to the application performance statistics program 112, and the data distribution program 111b is connected to the management server 130. The agent data collection program 111a collects the application-side performance information sent by the application performance statistics program 112, processes it to obtain the final application-side performance information, and stores the result in a storage structure. For example, the agent data collection program 111a organizes the final application-side performance information into a file in XML format and stores the file in a database.
The data distribution program 111b parses the application-side performance information collected by the agent data collection program 111a and sends the parsed application-side performance information to the management server 130 according to a predetermined protocol. Specifically, when the application-side performance information has been organized into a file in XML format, the data distribution program 111b parses the file into application-side performance information and sends it to the management server 130 over the secure file transfer protocol (SFTP) or the Secure Shell (SSH) protocol.
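The XML round trip between the two programs can be illustrated with Python's standard `xml.etree.ElementTree` module. The element and attribute names (`performance`, `object`, `name`, `latency_ms`) are assumptions chosen for the sketch, since the text does not define a schema, and the transfer over SFTP or SSH is deliberately left out.

```python
import xml.etree.ElementTree as ET


def to_xml(records):
    """Serialize performance records (agent side) into an XML string."""
    root = ET.Element("performance")
    for rec in records:
        ET.SubElement(
            root,
            "object",
            name=rec["name"],
            latency_ms=str(rec["latency_ms"]),
        )
    return ET.tostring(root, encoding="unicode")


def from_xml(text):
    """Parse the XML string back into records (distribution side)."""
    root = ET.fromstring(text)
    return [
        {"name": elem.get("name"), "latency_ms": float(elem.get("latency_ms"))}
        for elem in root.iter("object")
    ]
```

In this sketch the parsed records are plain dictionaries, which a sender could then serialize in whatever form the predetermined protocol requires.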
Refer to the sixth schematic structural diagram of the performance monitoring system shown in Fig. 6. The performance monitoring system can include at least one application server 110 and at least one storage server 120. Fig. 6 takes as an example a performance monitoring system that includes two application servers 110 and two storage servers 120.
Refer to the seventh schematic structural diagram of the performance monitoring system shown in Fig. 7, in which the application server 110 is specifically a cloud application service 110 and the storage server 120 is specifically a cloud storage service 120. The performance monitoring system can include at least one cloud application service 110 and at least one cloud storage service 120; Fig. 7 takes as an example a system that includes one cloud application service 110 and one cloud storage service 120.
For ease of description, the following text describes the application server 110 as sending application-side performance information to the management server 130. This covers the implementation in which the application server 110 directly collects the application-side performance information and sends it to the management server 130; the implementation in which the application data collection program 111 directly collects the application-side performance information of the application server 110 and sends it to the management server 130; and the implementation in which the application performance statistics program 112 collects the application-side performance information and sends it to the application data collection program 111, which then sends it to the management server 130. Similarly, the following text describes the storage server 120 as sending storage-side performance information to the management server 130. This covers the implementation in which the storage server 120 directly collects the storage-side performance information and sends it to the management server 130, and the implementation in which the storage data collection program 121 directly collects the storage-side performance information of the storage server and sends it to the management server 130.
Refer to Fig. 8, which shows a schematic structural diagram of the management server provided by one exemplary embodiment of the present invention. The management server can be the management server shown in Fig. 1 to Fig. 7 and can include a receiver 810, a processor 820, and a transmitter 830. A person skilled in the art will understand that the structure shown in Fig. 8 does not limit the management server, which can include more or fewer components than shown, combine some components, or arrange the components differently. For example, the management server also includes a memory 840. Specifically:
The processor 820 is the control center of the management server. It connects the parts of the entire management server through various interfaces and lines, and performs the data analysis function by running or executing the software programs and/or modules stored in the memory 840 and by calling the application-side and storage-side performance information stored in the memory 840. Optionally, the processor 820 can include one or more processing cores. Optionally, the processor 820 can integrate an application processor and a modem processor, where the application processor mainly handles the operating system, application programs, and the like, and the modem processor mainly handles wireless communication. It can be understood that the modem processor need not be integrated into the processor 820; it can be implemented separately as a baseband chip.
The memory 840 can be used to store software programs and modules. The processor 820 performs the data analysis function by running the software programs and modules stored in the memory 840. The memory 840 mainly includes a program storage area and a data storage area. The program storage area can store an operating system 841, a receiving module 842, a reading module 843, a determining module 844, an analysis module 845, an application program 846 required by at least one other function, and the like. The data storage area can store data created through use of the management server (such as application-side performance information and storage-side performance information). In addition, the memory 840 can be implemented by any type of volatile or non-volatile storage device or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, a magnetic disk, or an optical disc.
The transmitter 830 can include a radio-frequency transmitting component, such as an antenna. The transmitter 830 carries the analysis result in a wireless signal for transmission. The wireless signal can be a time-frequency resource in GSM.
The receiver 810 can include a radio-frequency receiving component, such as an antenna. The receiver 810 receives the application-side and storage-side performance information carried in a wireless signal. The wireless signal can be a time-frequency resource in a mobile communication system.
Although not shown, the management server can optionally also include a power supply, a Bluetooth module, and the like, which are not described further here.
Refer to Fig. 9, which shows a flowchart of the performance monitoring method provided by one embodiment of the present invention. The method can be applied in the management server shown in Fig. 1 to Fig. 7. The performance monitoring method includes:
Step 901: receive application-side performance information of operation objects sent by the application server, and receive storage-side performance information of the operation objects sent by the storage server.
Step 902: receive a correspondence, sent by the application server, between a first identifier and a second identifier of each operation object.
Step 903: read the first identifier of the operation object from the application-side performance information of each operation object.
Step 904: read the second identifier of the operation object from the storage-side performance information of each operation object.
Step 905: determine, according to the correspondence, the application-side performance information and the storage-side performance information that correspond to each operation object.
Step 906: analyze the application-side and storage-side performance information corresponding to each operation object.
Step 907: determine, according to the analysis result, whether the application server or the storage server has a problem.
In summary, in the performance monitoring method provided by this embodiment of the present invention, the management server determines the application-side and storage-side performance information corresponding to each operation object and then analyzes that information, so it can determine from the analysis result whether the application server or the storage server has a problem. This solves the problem that monitoring fails because the faulty server cannot be identified, and improves the accuracy of monitoring.
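Steps 901 to 907 can be read as a small pipeline. The sketch below is one possible arrangement, not the patented implementation: records are assumed to be dictionaries with hypothetical `first_id` and `second_id` keys, the correspondence is assumed to map first identifiers to second identifiers, and the analysis itself is passed in as a caller-supplied function because the steps above do not fix a particular rule.

```python
def monitor(app_info, storage_info, correspondence, analyze):
    """Run steps 903-907 on inputs already received in steps 901-902.

    `analyze` is a hypothetical callback taking one application-side record
    and one storage-side record and returning a verdict for that object.
    """
    # Step 903: index application-side records by their first identifier.
    app_by_first = {rec["first_id"]: rec for rec in app_info}
    # Step 904: index storage-side records by their second identifier.
    storage_by_second = {rec["second_id"]: rec for rec in storage_info}

    verdicts = {}
    for first_id, second_id in correspondence.items():
        # Step 905: find both records that belong to the same operation object.
        app_rec = app_by_first.get(first_id)
        storage_rec = storage_by_second.get(second_id)
        if app_rec is None or storage_rec is None:
            continue  # one side reported nothing for this object
        # Steps 906-907: analyze the pair and record the verdict.
        verdicts[first_id] = analyze(app_rec, storage_rec)
    return verdicts
```

Keeping the analysis as a parameter lets the same matching logic serve different applications, whose performance information may differ.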
Because the method for monitoring performance that the present embodiment is provided goes for any application, and should for different With the application side performance information that application server is collected into is different, and corresponding, storage server is collected To storage side performance information be also different, for the ease of understanding and illustrating, the present embodiment with application be several As a example by according to storehouse, the method for monitoring performance to database is illustrated.Wherein, application side performance information is used In the performance of description application server, storage side performance information is used to describe the performance of storage server.
Refer to Figure 10A, which shows a flowchart of the performance monitoring method provided by another embodiment of the present invention. The performance monitoring method may be applied in the management server shown in Figures 1 to 7 and includes:
Step 1001: receive the application-side performance information of the operation object sent by the application server, and receive the storage-side performance information of the operation object sent by the storage server.
When the application is a database, the application-side performance information may include, but is not limited to, application-side time delay information and application-side average throughput information; correspondingly, the storage-side performance information may include, but is not limited to, storage-side time delay information and storage-side average throughput information.
The application-side time delay information refers to the operation delay of the database in the application server for an operation on a given operation object. The storage-side time delay information refers to the operation delay of the storage server, under the control of the application server, for an operation on a given operation object. The operation object may be a data file, or the LUN of one or more storage devices; this embodiment imposes no limitation. The operation performed on the operation object may be one of a read operation, a write operation, and a recovery (Redo) operation.
The application-side average throughput information is a parameter that measures the average data throughput of the application server while it operates on a given operation object. It may include, but is not limited to, the number of read/write/recovery operations performed per second (Input/Output Operations Per Second, IOPS for short), bandwidth, and so on. Here, bandwidth refers to the amount of data read, written, or recovered per second.
The following describes separately how the application server collects the application-side performance information and how the storage server collects the storage-side performance information.
First, the application-side performance information is application-side time delay information, and the storage-side performance information is storage-side time delay information.
1) Every first time period, the application server obtains the application-side time delay of the operations performed on each operation object within the first time period. The start moment of the application-side time delay is the moment at which the application server receives the operation input by the operation device, and the end moment is the moment at which the application server outputs to the operation device the operation result returned by the storage server. The application server obtains the first mark of the operation object, generates application-side time delay information carrying the application-side time delay and the first mark, and sends this information to the management server. The first mark includes at least one of the name of the operation object and the logical address of the operation object.
When the application server receives an operation on a given operation object input by an external operation device, it needs to send the operation to the storage server. The storage server processes the operation and returns the operation result to the application server, and the application server outputs the operation result to the operation device. For example, when the application server receives, through the application-side I/O interface, a read operation on a data file input by the operation device, it reads the first mark of the data file carried in the read operation, determines the physical address of the data file from the name of the data file in the first mark, and sends a read operation carrying the physical address to the storage server through the storage-side I/O interface. The storage server reads the content stored at the physical address and returns the content to the application server as the operation result through the storage-side I/O interface, and the application server outputs the content to the operation device through the application-side I/O interface.
As the above response flow shows, the application-side time delay, measured from receiving the operation input by the operation device to outputting the operation result to the operation device, can represent the performance of the application server. The application server can therefore send the application-side time delay information to the management server as application-side performance information.
Specifically, the application server measures the application-side time delay of an operation on a given operation object, obtains the first mark carried in the operation, and generates application-side time delay information carrying the application-side time delay and the first mark. Optionally, the application server may also obtain an application server identifier and add it to the application-side time delay information.
Because the application server may perform operations on multiple operation objects, collecting application-side time delay information for every operation object consumes considerable resources. Preferably, therefore, the application server may sample the application-side time delay of the operations performed on a given operation object within the first time period and generate application-side time delay information only for the operations in the sampling period, to save resources. The first time period can be set and modified as needed.
2) Every first time period, the storage server obtains the storage-side time delay of the operations performed on each operation object within the first time period. The start moment of the storage-side time delay is the moment at which the storage server receives the operation sent by the application server, and the end moment is the moment at which the storage server returns the operation result to the application server. The storage server obtains the second mark of the operation object, generates storage-side time delay information carrying the storage-side time delay and the second mark, and sends this information to the management server. The second mark includes the physical address of the operation object and is obtained by the application server by converting the first mark of the operation object.
As the above response flow shows, the storage-side time delay, measured from receiving the operation sent by the application server to returning the operation result to the application server, can represent the performance of the storage server. The storage server can therefore send the storage-side time delay information to the management server as storage-side performance information.
Specifically, the storage server measures the storage-side time delay of an operation on a given operation object, obtains the second mark carried in the operation, and generates storage-side time delay information carrying the storage-side time delay and the second mark. Optionally, the storage server may also obtain a storage server identifier and add it to the storage-side time delay information.
Because the storage server may perform operations on multiple operation objects, collecting storage-side time delay information for every operation object consumes considerable resources. Preferably, therefore, the storage server may sample the storage-side time delay of the operations performed on a given operation object within the first time period and generate storage-side time delay information only for the operations in the sampling period, to save resources.
It should be noted that the application server and the storage server use the same sampling time, so that both the application-side time delay information and the storage-side time delay information of the same operation object are sampled.
This embodiment does not limit the order in which the application server obtains the application-side time delay information and the storage server obtains the storage-side time delay information.
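The time delay collection described above can be sketched as follows. This is only an illustrative sketch, not the patent's implementation: the names `DelayInfo` and `measure_delay`, the injectable `clock` parameter, and the sample mark value are all assumptions introduced here. The same function could model either side, with the first mark on the application server or the second mark on the storage server.

```python
import time
from dataclasses import dataclass
from typing import Any, Callable, Optional, Tuple


@dataclass
class DelayInfo:
    """Hypothetical record for application-side or storage-side time delay information."""
    mark: str                        # first mark (name or logical address) or second mark (physical address)
    delay_s: float                   # end moment minus start moment, in seconds
    server_id: Optional[str] = None  # optional server identifier


def measure_delay(
    mark: str,
    operation: Callable[[], Any],
    clock: Callable[[], float] = time.monotonic,
    server_id: Optional[str] = None,
) -> Tuple[Any, DelayInfo]:
    """Run one operation and record the time between receiving it and returning its result."""
    start = clock()       # start moment: the operation is received
    result = operation()  # the operation is processed
    end = clock()         # end moment: the operation result is ready to be returned
    return result, DelayInfo(mark=mark, delay_s=end - start, server_id=server_id)
```

A monotonic clock is used by default so that wall-clock adjustments do not distort the measured delay; the clock is a parameter only to make the sketch easy to exercise with fixed values.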
Second, the application-side performance information is application-side average throughput information, and the storage-side performance information is storage-side average throughput information.
1) Every second time period, the application server collects each piece of application-side average throughput data counted within the second time period, obtains the first mark of the operation object, generates application-side average throughput information carrying the application-side average throughput data and the first mark, and sends the application-side average throughput information to the management server.
As the above response flow shows, the application-side IOPS and the application-side bandwidth can represent the performance of the application server. The application server can therefore send the application-side average throughput information to the management server as application-side performance information.
Specifically, when the application-side average throughput information is the application-side IOPS, the application server counts the number of operations per second on the application-side I/O interface while operations are performed on a given operation object, which gives the application-side IOPS; it then obtains the first mark carried in the operation and generates application-side average throughput information carrying the application-side IOPS and the first mark. When the application-side average throughput information is the application-side bandwidth, the application server counts the amount of application-side data per second while operations are performed on a given operation object, which gives the application-side bandwidth; it then obtains the first mark carried in the operation and generates application-side average throughput information carrying the application-side bandwidth and the first mark. Optionally, the application server may also obtain an application server identifier and add it to the application-side average throughput information.
The second time period can be set and modified as needed. Its duration may be the same as or different from that of the first time period; this embodiment imposes no limitation.
2) Every second time period, the storage server collects each piece of storage-side average throughput data counted within the second time period, obtains the second mark of the operation object, generates storage-side average throughput information carrying the storage-side average throughput data and the second mark, and sends the storage-side average throughput information to the management server.
As the above response flow shows, the storage-side IOPS and the storage-side bandwidth can represent the performance of the storage server. The storage server can therefore send the storage-side average throughput information to the management server as storage-side performance information.
Specifically, when the storage-side average throughput information is the storage-side IOPS, the storage server counts the number of operations per second on the storage-side I/O interface while operations are performed on a given operation object, which gives the storage-side IOPS; it then obtains the second mark carried in the operation and generates storage-side average throughput information carrying the storage-side IOPS and the second mark. When the storage-side average throughput information is the storage-side bandwidth, the storage server counts the amount of storage-side data per second while operations are performed on a given operation object, which gives the storage-side bandwidth; it then obtains the second mark carried in the operation and generates storage-side average throughput information carrying the storage-side bandwidth and the second mark. Optionally, the storage server may also obtain a storage server identifier and add it to the storage-side average throughput information.
This embodiment does not limit the order in which the application server obtains the application-side average throughput information and the storage server obtains the storage-side average throughput information.
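The two average throughput figures described above (IOPS and bandwidth) can be illustrated with a short sketch. It assumes, purely for illustration, that the server has logged the size in bytes of every operation performed on one operation object during one collection period; the function name `average_throughput` and its parameters are hypothetical and do not come from the patent.

```python
from typing import Sequence, Tuple


def average_throughput(op_sizes_bytes: Sequence[int], period_s: float) -> Tuple[float, float]:
    """Return (IOPS, bandwidth in bytes per second) for one operation object.

    op_sizes_bytes holds the data size of each operation observed in the period;
    period_s is the length of the period (for example, the second time period).
    """
    if period_s <= 0:
        raise ValueError("period_s must be positive")
    iops = len(op_sizes_bytes) / period_s       # operations per second
    bandwidth = sum(op_sizes_bytes) / period_s  # data volume per second
    return iops, bandwidth
```

For instance, 600 operations of 4096 bytes each in a 300-second period give 2 operations per second and 8192 bytes per second. The same calculation applies on the storage side using the counts seen at the storage-side I/O interface.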
It should be noted that the application server may actively send the application-side performance information to the management server, or may send it upon receiving a request from the management server. Similarly, the storage server may actively send the storage-side performance information to the management server, or may send it upon receiving a request from the management server. This embodiment imposes no limitation.
Step 1002: receive the corresponding relation between the first mark and the second mark of the operation object sent by the application server. The first mark includes at least one of the name and the logical address of the operation object, and the second mark includes the physical address of the operation object. The corresponding relation is obtained by the application server as follows: the application server obtains a first corresponding relation between the name and the logical address of each operation object, obtains a second corresponding relation between the name and the physical address of each operation object, and merges the first corresponding relation and the second corresponding relation that contain the same name.
The first corresponding relation may include the name and the logical address of the operation object, and the second corresponding relation may include the name and the physical address of the operation object. After the application server merges the first corresponding relation and the second corresponding relation, the resulting corresponding relation may be: the name, the logical address, and the physical address of the operation object.
Because the first mark includes at least one of the name and the logical address of the operation object, and the second mark includes the physical address of the operation object, the above corresponding relation can indicate the correspondence between the first mark and the second mark.
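The merge of the first and second corresponding relations can be sketched as a join on the object name. The dictionary representation, the function name `merge_relations`, and the sample names and addresses are assumptions made for illustration; the patent does not specify a data structure.

```python
from typing import Dict, List, Tuple


def merge_relations(
    name_to_logical: Dict[str, str],
    name_to_physical: Dict[str, str],
) -> List[Tuple[str, str, str]]:
    """Merge entries that share the same name into (name, logical address, physical address)."""
    merged = []
    for name in sorted(name_to_logical):
        # Only names present in both relations can be merged.
        if name in name_to_physical:
            merged.append((name, name_to_logical[name], name_to_physical[name]))
    return merged


# Hypothetical example data.
first_relation = {"datafile_01": "lv0:0x1000", "datafile_02": "lv0:0x2000"}
second_relation = {"datafile_01": "disk3:0x9F00", "datafile_03": "disk1:0x0100"}
relation = merge_relations(first_relation, second_relation)
```

Each merged entry links a first mark (name or logical address) to a second mark (physical address), which is what the management server needs in order to match the two kinds of performance information.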
After receiving the corresponding relation, the management server may store it.
It should be noted that, for reasons such as disk failure and backup, the physical address of an object in the storage server may change, so the application server needs to update the corresponding relation at predetermined intervals. In one possible implementation, the application server updates the corresponding relation every first time period and sends the updated corresponding relation to the management server together with the application-side performance information. In another possible implementation, because the physical address of an object may change infrequently, the application server may set a third time period, update the corresponding relation every third time period, and send the updated corresponding relation to the management server, where the duration of the third time period is longer than that of the first time period. For example, the first time period may last 5 minutes and the third time period may last 1 day; this embodiment imposes no limitation.
Step 1003: read the first mark of the operation object from the application-side performance information of each operation object.
The management server receives at least one piece of application-side performance information sent by the application server and at least one piece of storage-side performance information sent by the storage server. At this point, the management server needs to determine the application-side performance information and the storage-side performance information that correspond to the same operation object.
Specifically, because the application-side performance information carries the first mark, the management server can read the first mark directly from the application-side performance information.
Step 1004: read the second mark of the operation object from the storage-side performance information of each operation object.
Specifically, because the storage-side performance information carries the second mark, the management server can read the second mark directly from the storage-side performance information.
This embodiment does not limit the order in which step 1003 and step 1004 are performed.
Step 1005: determine, according to the corresponding relation, the application-side performance information and the storage-side performance information corresponding to each operation object.
After obtaining each first mark and each second mark, the management server determines the first mark and the second mark of each operation object according to the corresponding relation. For the first mark and the second mark of each operation object, the application-side performance information that includes the first mark and the storage-side performance information that includes the second mark are determined to be the application-side performance information and the storage-side performance information corresponding to that operation object.
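The matching in step 1005 can be sketched as a lookup driven by the corresponding relation. This sketch assumes the management server has indexed the received performance information by the mark it carries; the function name `pair_by_relation`, the tuple layout of the relation, and the sample values are illustrative assumptions only.

```python
from typing import Any, Dict, List, Tuple


def pair_by_relation(
    relation: List[Tuple[str, str, str]],  # (name, logical address, physical address)
    app_side: Dict[str, Any],              # application-side info keyed by first mark
    storage_side: Dict[str, Any],          # storage-side info keyed by second mark
) -> Dict[str, Tuple[Any, Any]]:
    """Pair application-side and storage-side information of the same operation object."""
    paired = {}
    for name, logical, physical in relation:
        # The first mark may be the name or the logical address.
        app_info = app_side.get(name, app_side.get(logical))
        # The second mark is the physical address.
        storage_info = storage_side.get(physical)
        if app_info is not None and storage_info is not None:
            paired[name] = (app_info, storage_info)
    return paired


# Hypothetical example data.
relation = [("datafile_01", "lv0:0x1000", "disk3:0x9F00")]
app_side = {"lv0:0x1000": {"delay_s": 3.0}}
storage_side = {"disk3:0x9F00": {"delay_s": 2.0}}
pairs = pair_by_relation(relation, app_side, storage_side)
```

Objects for which only one side has reported are simply left out of the result in this sketch; how such cases should be handled is not stated in the source.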
It should be noted that step 1002 only needs to be performed before step 1005; this embodiment does not limit the order of step 1002 relative to the other steps.
After determining the application-side performance information and the storage-side performance information corresponding to each operation object, the management server may send them to a result output device. The result output device may output the application-side performance information and the storage-side performance information of each operation object in correspondence, labeling the application-side performance information according to the application server identifier and the storage-side performance information according to the storage server identifier.
When the result output device is a display, the application-side performance information and the storage-side performance information can be displayed in correspondence directly in the interface, so that developers can intuitively see the relation between them, which improves the efficiency of obtaining information. The display modes are described below, taking a display as the example result output device.
In a first display mode, the display shows the above content in chronological order. In a second display mode, the display may show the above content by dimension, for example a space utilization dimension, an I/O heat dimension, an I/O size distribution dimension, and a time delay dimension.
Refer to the schematic diagram of the information display interface shown in Figure 10B, in which panel (1) shows IOPS, panel (2) shows time delay, and panel (3) shows bandwidth.
Developers can observe the content output by the result output device. When developers determine from the content that the application server has a problem, they may input first configuration information into the application server, and the application server fixes the problem according to the first configuration information. For example, when developers determine from the content that the task priority of the application server has a problem, they may input task priority configuration information into the application server, and the application server reconfigures the task priority according to that configuration information.
When developers determine from the content that the storage server has a problem, they may input second configuration information into the storage server, and the storage server fixes the problem according to the second configuration information. For example, when developers determine from the content that the storage space of the storage server is insufficient, they may input space configuration information into the storage server, and the storage server reallocates storage space according to the space configuration information.
Because developers determine from the content whether the application server or the storage server has a problem with relatively low efficiency and accuracy, in one possible implementation the management server may further analyze the above content to determine whether the application server or the storage server has a problem. In this case, step 1006 is performed.
Step 1006: when the application-side performance information includes application-side time delay information and the storage-side performance information includes storage-side time delay information, subtract the storage-side time delay from the application-side time delay to obtain a time delay difference; detect whether the time delay difference is greater than a first threshold; when the detection result is that the time delay difference is greater than the first threshold, generate an analysis result indicating that the application server has a problem.
The time delay difference is the time the application server itself consumes in responding to the operation. When the time delay difference is large, the application server cannot respond to the operation in time, and an analysis result indicating that the application server has a problem is generated. When the time delay difference is small, the application server can respond to the operation in time, and an analysis result indicating that the application server has no problem is generated.
For example, if the application-side time delay is 3 s and the storage-side time delay is 2 s, the calculated time delay difference is 1 s. If the first threshold is 0.5 s, an analysis result indicating that the application server has a problem is generated; if the first threshold is 2 s, an analysis result indicating that the application server has no problem is generated.
Step 1007: detect whether the storage-side time delay is greater than a second threshold; when the detection result is that the storage-side time delay is greater than the second threshold, generate an analysis result indicating that the storage server has a problem.
The storage-side time delay is the time the storage server itself consumes in responding to the operation. When the storage-side time delay is large, the storage server cannot respond to the operation in time, and an analysis result indicating that the storage server has a problem is generated. When the storage-side time delay is small, the storage server can respond to the operation in time, and an analysis result indicating that the storage server has no problem is generated.
For example, if the storage-side time delay is 2 s and the second threshold is 1 s, an analysis result indicating that the storage server has a problem is generated; if the second threshold is 3 s, an analysis result indicating that the storage server has no problem is generated.
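The two threshold checks in steps 1006 and 1007 can be combined in a small sketch, using the example values from the text. The function name `analyze_delays` and the shape of the returned dictionary are assumptions for illustration; the source only specifies the subtraction and the two comparisons.

```python
from typing import Dict, Union


def analyze_delays(
    app_delay_s: float,
    storage_delay_s: float,
    first_threshold_s: float,
    second_threshold_s: float,
) -> Dict[str, Union[float, bool]]:
    """Apply the step 1006 and step 1007 checks to one operation object."""
    # Step 1006: time the application server itself spends on the operation.
    delay_difference = app_delay_s - storage_delay_s
    return {
        "delay_difference_s": delay_difference,
        "application_server_problem": delay_difference > first_threshold_s,
        # Step 1007: time the storage server itself spends on the operation.
        "storage_server_problem": storage_delay_s > second_threshold_s,
    }


# Example values from the text: 3 s application-side delay, 2 s storage-side delay.
strict = analyze_delays(3.0, 2.0, first_threshold_s=0.5, second_threshold_s=1.0)
lenient = analyze_delays(3.0, 2.0, first_threshold_s=2.0, second_threshold_s=3.0)
```

With the stricter thresholds both servers are flagged, and with the more lenient thresholds neither is, which matches the worked examples in the text.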
Step 1008: determine, according to the analysis result, whether the application server or the storage server has a problem.
It should be noted that the management server may also store the analysis result in the storage server, so that other devices can subsequently obtain the analysis result from the storage server.
Optionally, the application server may also collect other performance information of the application server. Taking wait event information as an example of such other performance information, the performance monitoring method provided by this embodiment further includes: receiving the wait event information of the operation object sent by the application server; and determining, according to the wait event information, whether the application server has a problem.
Because the above wait event information is already counted on the application server, the application server can send it directly to the management server. Wait event information refers to the time consumed by the application server in a given wait event. For example, wait event information may include, but is not limited to: central processing unit (CPU) wait time, CPU wait task sequence, I/O wait time, non-I/O wait time, idle time, and sampling elapsed time.
In one possible implementation, the management server does not analyze the wait event information but sends it directly to the result output device, and the result output device displays it so that developers can analyze it and determine whether the application server or the storage server has a problem. In another possible implementation, the management server sets a wait threshold for each wait event and compares the wait event information with the corresponding wait threshold. When the wait event information is greater than the corresponding wait threshold, it is determined that the application server has a problem; when the wait event information is less than the corresponding wait threshold, it is determined that the application server has no problem.
Optionally, the application server may also obtain an application server identifier and add it to the wait event information.
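The second implementation, in which each wait event is compared with its own wait threshold, can be sketched as below. The event names, units, and the function name `exceeded_wait_events` are hypothetical; events without a configured threshold are ignored in this sketch, which is an assumption rather than something the source states.

```python
from typing import Dict


def exceeded_wait_events(
    wait_events_s: Dict[str, float],
    wait_thresholds_s: Dict[str, float],
) -> Dict[str, float]:
    """Return the wait events whose consumed time is greater than the corresponding threshold."""
    return {
        event: consumed
        for event, consumed in wait_events_s.items()
        if event in wait_thresholds_s and consumed > wait_thresholds_s[event]
    }


# Hypothetical example data, in seconds.
events = {"cpu_wait": 4.0, "io_wait": 0.2, "idle": 30.0}
thresholds = {"cpu_wait": 1.0, "io_wait": 0.5}
exceeded = exceeded_wait_events(events, thresholds)
application_server_problem = bool(exceeded)
```

Returning the offending events, rather than a bare flag, keeps enough detail for the result output device to show which wait event triggered the determination.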
In summary, in the performance monitoring method provided by this embodiment of the present invention, the management server determines the application-side performance information and the storage-side performance information corresponding to each operation object and then analyzes them, so that it can determine from the analysis result whether the application server or the storage server has a problem. This solves the problem that monitoring goes wrong because it cannot be determined whether the application server or the storage server is at fault, and improves the accuracy of monitoring.
The application server collects the first corresponding relation and the second corresponding relation separately and merges them into the corresponding relation, so that the application-side performance information and the storage-side performance information of an operation object can be matched according to the corresponding relation. This solves the problem that the application-side performance information and the storage-side performance information cannot be matched, and improves the accuracy of analysis.
The management server compares the time delay difference with the first threshold and can thus automatically determine whether the application server has a problem. This solves the problem of low efficiency and low accuracy that arises when developers must compare the time delay difference with the first threshold themselves, and improves the efficiency and accuracy of determining the problem.
The management server compares the storage-side time delay with the second threshold and can thus automatically determine whether the storage server has a problem. This solves the problem of low efficiency and low accuracy that arises when developers must compare the storage-side time delay with the second threshold themselves, and improves the efficiency and accuracy of determining the problem.
Refer to Figure 11, which shows a structural block diagram of the performance monitoring apparatus provided by one embodiment of the present invention. The performance monitoring apparatus may be implemented as all or part of the management server by software, hardware, or a combination of both. The performance monitoring apparatus includes: a receiving unit 1110, a reading unit 1120, a determining unit 1130, and an analysis unit 1140.
The receiving unit 1110 is configured to implement the functions of steps 901 and 902;
The reading unit 1120 is configured to implement the functions of steps 903 and 904;
The determining unit 1130 is configured to implement the functions of steps 905 and 907;
The analysis unit 1140 is configured to implement the function of step 906.
In another optional embodiment, the receiving unit 1110 is configured to implement the functions of steps 1001 and 1002; the reading unit 1120 is configured to implement the functions of steps 1003 and 1004; the determining unit 1130 is configured to implement the functions of steps 1005 and 1008; and the analysis unit 1140 is configured to implement the functions of steps 1006 and 1007.
For related details, refer to the method embodiments above.
It should be noted that the receiving unit 1110 may be implemented by the processor of the management server executing a receiving module in the memory; the reading unit 1120 may be implemented by the processor of the management server executing a reading module in the memory; the determining unit 1130 may be implemented by the processor of the management server executing a determining module in the memory; and the analysis unit 1140 may be implemented by the processor of the management server executing an analysis module in the memory.
A person of ordinary skill in the art will understand that all or some of the steps of the above embodiments may be completed by hardware, or by a program instructing the related hardware. The program may be stored in a computer-readable storage medium, and the storage medium mentioned above may be a read-only memory, a magnetic disk, an optical disc, or the like.
The foregoing describes merely preferred embodiments of the present invention and is not intended to limit the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principles of the present invention shall fall within the protection scope of the present invention.

Claims (10)

1. A performance monitoring method, characterized in that the method is applied in a management server, the management server is connected to an application server and a storage server respectively, and the storage server provides data storage for the application server, the method comprising:
receiving application-side performance information of an operation object sent by the application server, and receiving storage-side performance information of the operation object sent by the storage server;
receiving a corresponding relation, sent by the application server, between a first mark and a second mark of the operation object;
reading the first mark of the operation object from the application-side performance information of each operation object;
reading the second mark of the operation object from the storage-side performance information of each operation object;
determining, according to the corresponding relation, the application-side performance information and the storage-side performance information corresponding to each operation object;
analyzing the application-side performance information and the storage-side performance information corresponding to each operation object;
determining, according to an analysis result, whether the application server or the storage server has a problem.
2. The method according to claim 1, characterized in that the first mark includes at least one of the name and the logical address of the operation object, and the second mark includes the physical address of the operation object;
the corresponding relation is obtained by the application server by obtaining a first corresponding relation between the name and the logical address of each operation object, obtaining a second corresponding relation between the name and the physical address of each operation object, and merging the first corresponding relation and the second corresponding relation that contain the same name.
3. The method according to claim 1, characterized in that the method further comprises:
receiving wait event information of the operation object sent by the application server; and
determining, according to the wait event information, whether the application server has a problem.
4. The method according to any one of claims 1 to 3, characterized in that, when the application-side performance information comprises information on an application-side delay and the storage-side performance information comprises information on a storage-side delay, the analyzing of the application-side performance information and the storage-side performance information corresponding to each operation object comprises:
subtracting the storage-side delay from the application-side delay to obtain a delay difference;
detecting whether the delay difference is greater than a first threshold; and
when the detection result is that the delay difference is greater than the first threshold, generating an analysis result indicating that the application server has a problem.
5. The method according to claim 4, characterized in that the analyzing of the application-side performance information and the storage-side performance information corresponding to each operation object further comprises:
detecting whether the storage-side delay is greater than a second threshold; and
when the detection result is that the storage-side delay is greater than the second threshold, generating an analysis result indicating that the storage server has a problem.
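The two threshold checks of claims 4 and 5 can be illustrated with a short sketch. The function name `analyze`, the millisecond unit, and the threshold values used below are assumptions chosen for the example; the claims do not specify concrete values.

```python
from typing import List


def analyze(
    app_delay_ms: float,
    storage_delay_ms: float,
    first_threshold_ms: float,
    second_threshold_ms: float,
) -> List[str]:
    """Return which servers the two threshold checks point to."""
    findings = []

    # Claim 4: delay added on the application side (application-side delay
    # minus storage-side delay) compared against the first threshold.
    delay_difference = app_delay_ms - storage_delay_ms
    if delay_difference > first_threshold_ms:
        findings.append("application_server")

    # Claim 5: storage-side delay compared against the second threshold.
    if storage_delay_ms > second_threshold_ms:
        findings.append("storage_server")

    return findings


# Hypothetical thresholds: 5 ms for the difference, 10 ms for storage.
slow_app = analyze(12.0, 3.0, 5.0, 10.0)
slow_storage = analyze(25.0, 22.0, 5.0, 10.0)
healthy = analyze(6.0, 3.0, 5.0, 10.0)
```

The two checks are independent, so a single operation object can yield both findings when the application-side overhead and the storage-side delay are both above their thresholds.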
6. A performance monitoring device, characterized in that the device is used in a management server, the management server is connected to an application server and to a storage server respectively, and the storage server provides data storage for the application server, the device comprising:
a receiving unit, configured to receive application-side performance information of operation objects sent by the application server, and to receive storage-side performance information of the operation objects sent by the storage server;
the receiving unit being further configured to receive a correspondence, sent by the application server, between a first identifier and a second identifier of each operation object;
a reading unit, configured to read the first identifier of the operation object from the application-side performance information of each operation object received by the receiving unit;
the reading unit being further configured to read the second identifier of the operation object from the storage-side performance information of each operation object received by the receiving unit;
a determining unit, configured to determine, according to the correspondence received by the receiving unit, the application-side performance information and the storage-side performance information corresponding to each operation object read by the reading unit;
an analysis unit, configured to analyze the application-side performance information and the storage-side performance information corresponding to each operation object determined by the determining unit; and
the determining unit being further configured to determine, according to the analysis result obtained by the analysis unit, whether the application server or the storage server has a problem.
7. The device according to claim 6, characterized in that the first identifier comprises at least one of a name and a logical address of the operation object, and the second identifier comprises a physical address of the operation object;
the correspondence is obtained by the application server by acquiring a first correspondence between the name and the logical address of each operation object, acquiring a second correspondence between the name and the physical address of each operation object, and merging the first correspondence and the second correspondence that contain the same name.
8. The device according to claim 6, characterized in that
the receiving unit is further configured to receive wait event information of the operation object sent by the application server; and
the determining unit is further configured to determine, according to the wait event information received by the receiving unit, whether the application server has a problem.
9. The device according to any one of claims 6 to 8, characterized in that, when the application-side performance information comprises information on an application-side delay and the storage-side performance information comprises information on a storage-side delay, the analysis unit is specifically configured to:
subtract the storage-side delay from the application-side delay to obtain a delay difference;
detect whether the delay difference is greater than a first threshold; and
when the detection result is that the delay difference is greater than the first threshold, generate an analysis result indicating that the application server has a problem.
10. The device according to claim 9, characterized in that the analysis unit is further configured to:
detect whether the storage-side delay is greater than a second threshold; and
when the detection result is that the storage-side delay is greater than the second threshold, generate an analysis result indicating that the storage server has a problem.
CN201511025843.8A 2015-12-30 2015-12-30 Method for monitoring performance and device Active CN106933718B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201511025843.8A CN106933718B (en) 2015-12-30 2015-12-30 Method for monitoring performance and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201511025843.8A CN106933718B (en) 2015-12-30 2015-12-30 Method for monitoring performance and device

Publications (2)

Publication Number Publication Date
CN106933718A (en) 2017-07-07
CN106933718B (en) 2019-11-26

Family

ID=59441148

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201511025843.8A Active CN106933718B (en) 2015-12-30 2015-12-30 Method for monitoring performance and device

Country Status (1)

Country Link
CN (1) CN106933718B (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101022363A (en) * 2007-03-23 2007-08-22 杭州华为三康技术有限公司 Network storage equipment fault protecting method and device
JP2010146154A (en) * 2008-12-17 2010-07-01 Mitsubishi Electric Corp Counter-fault means determination device and computer program and counter-fault means determination method
US20140165064A1 (en) * 2012-12-10 2014-06-12 Fujitsu Limited Processing method, processing apparatus, and recording medium
CN104767682A (en) * 2014-01-08 2015-07-08 腾讯科技(深圳)有限公司 Routing method and system as well as routing information distributing method and device

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111371646A (en) * 2020-02-28 2020-07-03 苏州浪潮智能科技有限公司 Detection method, detection device and detection equipment for performance of storage system
CN111756575A (en) * 2020-06-19 2020-10-09 星辰天合(北京)数据科技有限公司 Performance analysis method and device of storage server and electronic equipment
CN111756575B (en) * 2020-06-19 2023-08-11 北京星辰天合科技股份有限公司 Performance analysis method and device of storage server and electronic equipment

Also Published As

Publication number Publication date
CN106933718B (en) 2019-11-26

Similar Documents

Publication Publication Date Title
CN110908879B (en) Reporting method, reporting device, reporting terminal and recording medium of buried point data
CN103067297B (en) A kind of dynamic load balancing method based on resource consumption prediction and device
CN103200046B (en) The method and system of monitoring network element device performance
CN109586999A (en) A kind of container cloud platform condition monitoring early warning system, method and electronic equipment
CN106487574A (en) Automatic operating safeguards monitoring system
CN102567185B (en) Monitoring method of application server
CN103095819A (en) Data information pushing method and data information pushing system
CN107370806A (en) HTTP conditional codes monitoring method, device, storage medium and electronic equipment
CN102663298B (en) Safety online detecting system facing to terminal computers
US11256590B1 (en) Agent profiler to monitor activities and performance of software agents
CN102916854A (en) Traffic statistical method and device and proxy server
CN109936621A (en) Multi-page information push method, device, equipment and the storage medium of information security
CN107544832A (en) A kind of monitoring method, the device and system of virtual machine process
CN103490978A (en) Terminal, server and message monitoring method
CN110166529A (en) It keeps logging in state method, apparatus, equipment and storage medium
CN110213309A (en) A kind of method, equipment and the storage medium of binding relationship management
CN106685685A (en) Method and system for monitoring performance of exchange boards across safety subareas
CN109800133A (en) A kind of method, one-stop monitoring alarm platform and the system of unified monitoring alarm
CN109873714A (en) Cloud computing node configures update method and terminal device
CN108877188A (en) A kind of environment protection digital concurrently acquires and Multi net voting dissemination method and device
CN106933718A (en) Method for monitoring performance and device
US9043535B1 (en) Minimizing application response time
CN109559121A (en) Transaction path calls exception analysis method, device, equipment and readable storage medium storing program for executing
CN109189652A (en) A kind of acquisition method and system of close network terminal behavior data
CN110515938B (en) Data aggregation storage method, equipment and storage medium based on KAFKA message bus

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant