WO2015077385A2 - Performance monitoring to provide real or near real time remediation feedback - Google Patents

Performance monitoring to provide real or near real time remediation feedback Download PDF

Info

Publication number
WO2015077385A2
WO2015077385A2 PCT/US2014/066480 US2014066480W WO2015077385A2 WO 2015077385 A2 WO2015077385 A2 WO 2015077385A2 US 2014066480 W US2014066480 W US 2014066480W WO 2015077385 A2 WO2015077385 A2 WO 2015077385A2
Authority
WO
WIPO (PCT)
Prior art keywords
data
client
performance
tenant
isp
Prior art date
Application number
PCT/US2014/066480
Other languages
French (fr)
Other versions
WO2015077385A3 (en
Inventor
Cheng Zou
Dhanasekaran RAJU
Pravjit Tiwana
Olexiy KARPUS
Original Assignee
Microsoft Technology Licensing, Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Technology Licensing, Llc filed Critical Microsoft Technology Licensing, Llc
Priority to JP2016533584A priority Critical patent/JP2017500791A/en
Priority to RU2016119573A priority patent/RU2016119573A/en
Priority to EP14810075.3A priority patent/EP3072050A2/en
Priority to CN201480063665.5A priority patent/CN105765907A/en
Publication of WO2015077385A2 publication Critical patent/WO2015077385A2/en
Publication of WO2015077385A3 publication Critical patent/WO2015077385A3/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3466Performance evaluation by tracing or monitoring
    • G06F11/3495Performance evaluation by tracing or monitoring for systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/56Provisioning of proxy services
    • H04L67/566Grouping or aggregating service requests, e.g. for unified processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/3065Monitoring arrangements determined by the means or processing involved in reporting the monitored data
    • G06F11/3072Monitoring arrangements determined by the means or processing involved in reporting the monitored data where the reporting involves data filtering, e.g. pattern matching, time or event triggered, adaptive or policy-based reporting
    • G06F11/3082Monitoring arrangements determined by the means or processing involved in reporting the monitored data where the reporting involves data filtering, e.g. pattern matching, time or event triggered, adaptive or policy-based reporting the data filtering being achieved by aggregating or compressing the monitored data
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0631Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • H04L41/065Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis involving logical or physical relationship, e.g. grouping and hierarchies
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3409Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0654Management of faults, events, alarms or notifications using network fault recovery
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/34Signalling channels for network management communication
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Definitions

  • Bandwidth is one factor that affects speed of a network.
  • Latency is another factor that affects network speed and responsiveness. Latency may be described as delay that affects processing of network data. Network conditions, hardware and software limitations, and/or other factors may adversely affect a user's experience of some online application or service. With the emergence of cloud computing and datacenter services, it is imperative to provide timely service with minimal bottlenecks across hundreds of server computers and associated networking infrastructure serving millions of users worldwide.
  • Embodiments provide for monitoring of an online user experience and/or remediating performance issues, but are not so limited.
  • a computer-implemented method of an embodiment operates to receive, pre-aggregate, and aggregate client performance data as part of providing an end-to-end diagnostics monitoring and resolution service.
  • a system of an embodiment is configured to aggregate performance data of a plurality of client devices or systems as part of identifying latency issues at one or more of a tenant level, geographic location level, and/or service provider level. Other embodiments are included.
  • FIGURE 1 depicts an exemplary system that operates in part to provide real or near real time end user performance monitoring services.
  • FIGURE 2 is a flow diagram depicting an exemplary process of pre- aggregating and aggregating performance and/or other data.
  • FIGURE 3 is a block diagram depicting components of an exemplary end-to- end data processing pipeline.
  • FIGURE 4 is flow diagram depicting operations of an exemplary end-to-end process used as part of providing performance diagnostic analysis and/or issue remediation services.
  • FIGURE 5 is a block diagram illustrating an exemplary computing
  • FIGURES 6A-6B illustrate a mobile computing device with which
  • FIGURE 7 illustrates one embodiment of a system architecture for implementation of various embodiments.
  • FIGURE 1 depicts an exemplary system 100 that operates in part to provide real or near real time end user performance monitoring services, but is not so limited.
  • Components of the system 100 operate in part to use aggregated latency and/or other network data to mitigate and/or resolve network ecosystem issues.
  • components of the system 100 can operate to provide failure zone analysis and resolution information to tenants based on aggregations of performance data.
  • Components of the system 100 can be used to provide a real or near real time assessment of the usability of an online service as well as being able to identify or target failure zones to troubleshoot and/or correct any associated performance or user-experience problems.
  • the system 100 includes features that provide end user performance optics to consumers of an online service including quantifying real time tenant level optics, such as by enabling one or more designated persons of a customer with an ability to view performance or other metrics of a user base across any geographical location or locations.
  • components of the system 100 operate in part by collecting tenant level data to identify top latency data or other outliers for reporting or alerting within a defined location of interest. Equipped with an ability to focus at a geographic level can uncover issues specific to location, such as poor CDN performance, DNS resolution time, longer round trip times, etc.
  • geographic granularity based on a service provider allows for identifying issues at an Internet Service Provider (ISP) level.
  • ISP Internet Service Provider
  • components of the system 100 can operate to ascertain one or more failure zones for tenants as well as identify specific users having degraded experience.
  • the aggregation service 110 can use rules to generate an aggregated output 112 to generate a geographic- based latency map color coded by scale of communication latency.
  • the aggregation service 110 can use configured rules to generate an aggregated output 1 12 as part of debugging and isolating issues based on geographic, ISP, and/or other parameters as described below.
  • components of the system 100 operate to identify failure zones, such as by isolating an issue tied to a DNS resolver, ISP peering, network routing, non-optimal hosting locations, etc.
  • components of the system 100 can be used to assess or quantify a state of a user experience for one or more locations (e.g., region, country, county, etc.), one or more tenants, a selected tenant by geographic location or ISP, and/or for selected geographic location by ISP.
  • the system 100 operates in part to provide for debugging of latency or other data with additional breakdowns by: a client time, a network time, a server time, a CDN time, a connect time, etc.; identifying outlier data, such as a first number of tenants and ISPs by latency; generating historic trends on latency and other performance metrics; providing guidance data for effective edge and other server deployments; enabling pre-aggregating by configuring mailbox servers with geo-mapping capability; generating report data to gain insight into real user CDN interaction; supporting web access based and locally installed clients to reduce load times; etc.
  • different types of metrics or other data can be collected and provided to the system 100 for use in quantifying user experiences.
  • Components of the system 100 can operate as part of supporting use of an online service or application by proactively operating to identify specific users or user groups having a degraded experience. As described below, as part of assessing a performance state of an online service or application, quantitative comparisons can be made relative to one or more baseline experiences for a particular location or ISP. Establishing robust and up-to- date baselines allows for a more focused and confident response to performance related calls/emails and proactive aspect of identification of outliers can be used to have 360 degree loop with service consumers.
  • One embodiment of the system 100 comprises a service support communication infrastructure that enables troubleshooting and remedying performance or other issues related to a server component, a client component, and/or a network condition, such as network latency issues, DNS look up issues, Content Delivery Network (CDN) issues, etc.
  • data collection services comprise a decentralized architecture which partitions client data based in part on a datacenter location by processing raw client data for each server node including pre-aggregating raw data before uploading pre-aggregated data to one or more stores, such as a plurality of database servers for example, before final aggregations.
  • the aggregation service 1 10 can be configured as a separate or an integrated service running on one or multiple physical machines to globally aggregate the pre-aggregated data across multiple data stores based on a set of common and/or customized metrics.
  • pre-aggregating as part of collecting data at each node, processing time and use can be reduced due in part to the limited number of data points used with a final aggregation. As such, aggregated data can be generated in real or near real time.
  • the aggregation service 110 of one embodiment is configured to automatically aggregate latency and/or other performance data, including navigation and/or load timing data, to identify issues at different levels or granularities, such as a tenant level, a geographical or location level, and/or an ISP level as part of efficiently remediating any realized or potential issues.
  • the system 100 may include multiple server computers, including pre-aggregation servers, database servers, and/or aggregation servers, as well as client devices/systems that operate as part of an end-to-end computing architecture. It will be appreciated that servers may comprise one or more physical and/or virtual machines dependent upon the particular implementation.
  • components of the system 100 are configured to collect, pre-aggregate, aggregate, and/or analyze client information as part of providing real or near real time reporting to customers regarding the state of an application or network. Additional components and/or features can be added to the system 100 as needed. For example, based on an identified latency, a customer may use the feedback to deploy an additional edge server in their network. As described below, components of the system 100 may be used to ascertain different user experiences and/or network conditions across multiple networks and network types serving a client or consumer base.
  • server 102 receives information from one or more clients shown as input 104.
  • input 104 includes performance data associated with a client while using an online service or application.
  • raw performance data can be uploaded to server 102 for processing.
  • input 104 includes information pertaining to a client experience such as loading and navigating web resources, and/or server 102 comprises a server computer that supports the use of log files to store collected data.
  • a browser or other application running on a user device/system can use script code to collect information related to one or more of navigation timing parameters, resource and/or load timing parameters, and/or custom marker parameters which may be written to a server log file.
  • server 102 can be configured as a MICROSOFT EXCHANGE server to use one or more fault-tolerant, transaction-based databases to store information.
  • server 102 in addition to processing and memory resources, server 102 includes extensible diagnostic features that utilize a pre-aggregator 106 that operates in part on raw performance data included with input 104, but is not so limited.
  • the pre-aggregator 106 of an embodiment operates to parse client data stored in log files as part of extracting and mapping the client data to one or more mapping tables.
  • the pre-aggregator 106 operates to parse performance data stored in one or more log files to generate mappings, wherein the mappings are defined in part by transforming client IP address and logged client information to one or more of a geographical location (e.g., country/state), an ISP, and/or tenant global user identifier (GUID).
  • the pre-aggregator 106 is configured to group performance data by one or more of IP, location, ISP, and/or tenant GUID before storing the grouped information to store 108.
  • the pre-aggregator 106 can be configured to group performance data associated with client latency metrics by country/state, ISP, and/or tenant. If the logged data cannot be resolved to an ISP level, the pre-aggregator 106 can identify groups limited to country and/or tenant. It will be appreciated that country and ISP parameters can be determined according to client IP address.
  • the aggregation service 110 operates on the pre-aggregated output provided by pre-aggregator 106 to generate an aggregated output 112.
  • the functionality provided by the pre-aggregator 106 operates in part to increase an efficient use of processing and memory resources at the aggregation service 1 10 while also reducing power consumption since a smaller data set can be input to the aggregation service 110 to generate the aggregated output 112.
  • the aggregation service 1 10 of an embodiment comprises one or more server computers and complex aggregation code that operates to provide aggregated output 1 12.
  • an aggregated output 1 12 can be further processed to identify any potential failure zones and/or other issues that may be contributing to a user experience.
  • the aggregation service 110 of one embodiment aggregates pre- aggregated data across all databases to quantify one or more of tenant level, country level, and/or ISP level latencies associated with a particular application, service, or other component.
  • rules can be included with the aggregation service 110 to control processing of the pre-aggregated output to generate the aggregated output 112.
  • the aggregated output 1 12 provides focus including correlations, trends, baseline comparisons, and/or other quantified information tied to a use experience during execution of an application or an online service.
  • rules can be implemented that operate on pre-aggregated data to analyze performance based on an overall value for a region, such as by deriving the 75% percentile x and the standard deviation y for a given metric for North America. If the measurement for Mexico is greater than (x+y), it may cause escalation of a potential issue to engineering staff. Additional features are described further below.
  • complex communication architectures typically employ multiple hardware and/or software components including, but not limited to, server computers, networking components, and other components that enable communication and interaction by way of wired and/or wireless networks. While some embodiments have been described, various embodiments may be used with a number of computer configurations, including hand-held devices, multiprocessor systems, microprocessor-based or programmable consumer electronics, minicomputers, mainframe computers, etc. Various embodiments may be implemented in distributed computing environments using remote processing devices/systems that communicate over a one or more communications networks. In a distributed computing environment, program modules or code may be located in both local and remote memory. Various embodiments may be implemented as a process or method, a system, a device, article of manufacture, etc.
  • FIGURE 2 is a flow diagram depicting an exemplary process 200 of pre- aggregating and aggregating performance and/or other data as part of providing performance diagnostics and/or remediation services according to an embodiment.
  • the process 200 begins at 202 by receiving raw performance data.
  • the process 200 at 202 can operate using a server computer to receive client-centric performance data collected by a client as part of requesting an assessment of a state of an online service or application.
  • the process 200 at 202 operates to receive client performance data that includes navigation timing, page load timing, and/or other parameters to use when assessing health or user experience associated with an online service or application.
  • the process 200 operates to pre-aggregate the raw performance data.
  • the process 200 at 204 operates to pre-aggregate the raw performance data by parsing log files and mapping client IP addresses to one or more of tenant identifier, location identifier, and/or ISP identifier before uploading the pre-aggregated data to one or more databases for final aggregation operations.
  • the process 200 operates to aggregate the pre-aggregated data.
  • process 200 at 206 operates to aggregate the pre-aggregated data in part by generating an output of latency or other user experience quantifiers to identify issues at one or more of a tenant level, a location level, and/or ISP level.
  • the process 200 proceeds to 210 and uses the aggregated data for latency and/or other analysis. Otherwise, the process 200 returns to 206 and continues aggregation operations.
  • aggregated output can be used as part of remediating any identified issue by implementing contingency or other measures. While a certain number and order of operations are described for the exemplary flow of FIGURE 2, it will be appreciated that other numbers, combinations, and/or orders can be used according to desired implementations.
  • FIGURE 3 is a block diagram depicting components of an exemplary end-to- end data processing pipeline 300 that operate in part to provide user insights into aggregated data as part of identifying infrastructure, performance, network, or other issues that may be adversely affecting use of an online application or service.
  • an online service supporting cloud-based application services can include functionality to collect and quantify performance data or metrics in near real time including providing user scenario latencies and detailed breakdowns by collected metrics associated with one or more of client operational parameters, tenant parameters, IP parameters, location parameters, and/or ISP parameters.
  • Components of the pipeline 300 operate in part to aggregate, pivot, and/or store data at the tenant level, IP level, geographic location level, and/or an ISP level.
  • Components of the pipeline 300 operate in part to proactively monitor user experiences to reduce performance degradations while providing alerts and/or solutions to remediate end user performance issues.
  • a client 302 associated with a first tenant user and client 304 associated with a second tenant user are communicating with server 306.
  • log file 308 receives and stores collected data from clients 302 and 304.
  • the client 302 can be implemented as part of a browser application running on a user device system, wherein script code can be used to collect information associated with use of an online application or service, such as a page load time, a time to connect, or some other parameter for example.
  • the server 306 of one embodiment comprises a server computer dedicated to serving clients 302 and 304.
  • server 306 includes a diagnostics service that uses an IP mapper 310 and upload component 312 for an associated node.
  • the IP mapper 310 and upload component 312 operate in part to provide pre- aggregation services on the data of log file 308. As described above, a single component can be configured to perform the pre-aggregation services provided by these components.
  • the IP mapper 310 of an embodiment operates in part to parse log file 308 to extract and map logged performance data or metrics based on one or more of an IP address, a location, and/or ISP for each client or tenant. According to one embodiment, the IP mapper 310 operates in part to pre-aggregate and consolidate the client data by mapping a client IP address and performance or latency data to one or more of a geographic location (e.g., country/state), an ISP, and/or a tenant global user identifier (GUID).
  • a geographic location e.g., country/state
  • ISP Internet Protocol Security
  • GUID tenant global user identifier
  • the upload component 312 operates to upload the mapped data provided by the IP mapper 310 grouped by one or more of location, ISP, and/or tenant GUID to a dedicated database 314. If the logged data cannot be resolved to an ISP level, the pre-aggregation can include groupings limited to country and/or tenant. It will be appreciated that country and ISP parameters can be determined according to a client IP address.
  • components of server 306 are configured with complex programming code that operates to pre-aggregate collected client data in part by parsing the collected client data, such as by parsing performance data logs for example, and extracting user scenario, time of event, client IP, latency, tenant data and other detailed metrics based on the client information. Consequently, the server 306 is able to pre-aggregate data received from client as part of reducing the final aggregation load when quantifying latency and/or other performance issues.
  • the IP mapper 310 of an embodiment operates to map client IP addresses to a geographic location depending on the mapping granularity and/or a client IP to an associated ISP based on known or to be implemented IP ranges.
  • the server 306 includes analysis code that operates to parse based in part on a type of client and/or associated client data. For example, performance data of a web access client can be collected and routed to a log file of mailbox server serving the sessions, wherein the analysis code would be configured to parse the particular client information to understand a scenario, latency, and associated issues (e.g., slow navigation time, slow DNS time, etc.).
  • Parsing of an embodiment operates to transform client IP address and tenant information in the log files to country/state, ISP and/or tenant GUID.
  • parsing operations are performed in part using a derived mapping table generated from a generic public geo-mapping database.
  • An example data entry in a geo-mapping database for parsing may include:
  • the parsing operations applied by the IP mapper 310 of an embodiment result in the generation of a derived mapping table for IP to countries by scanning each data entry, sorting, and merging based on IP ranges and corresponding countries to yield:
  • a mapping table can include exemplary mapping ⁇ key,value ⁇ data.
  • the mapped data includes a key that is an integer value that represents a starting IP address and a value that is the country ISO code.
  • IP addresses between 16777216 and 16777472 belong to AU. By sorting the keys, the table can be compressed for loading into memory for quick look-up.
  • parsing operations applied by the IP mapper 310 of an embodiment result in the generation of a derived mapping table for an IP to ISP mapping as shown below (key is the same as above but the value is an ASN number of an ISP):
  • server 316 processes or pre-aggregates client data of clients 318 and 320 stored in log file 321 in part by using the IP mapper 322 and upload component 324 to process and upload pre- aggregated data to another dedicated database 326.
  • Dedicated databases 314 and 326 may or may not include more than one host computer.
  • the pipeline can include additional components, features, and functionality.
  • Server 328 processes client data of clients 330, 332, 334, and 336 stored in log file 337 in part by using the IP mapper 338 and upload component 340 to process and upload pre-aggregated data to dedicated database 326.
  • databases 314 and 326 are designed to handle the performance counters and metrics collected from various machines that may be networked to provide an online application or service. Since the end user performance data brings in additional pivots, a database schema can be used to support IP, geographic location, tenant, and/or ISP metrics and parameters.
  • server 306, server 316, and server 328 collect client data from a plurality of clients. For example, at the node level, server 306 can operate to pre-aggregate client data every 5 minutes using IP mapper 310 to transform the client data into predetermined pivots and the upload component 316 propagates the transformed data to database 314.
  • Aggregation service 342 aggregates the pre-aggregated data across databases 314 and 326 to determine one or more of tenant level latencies, country level latencies, and/or ISP level latencies associated with an online application or service, but is not so limited.
  • the aggregation service 342 operates on the pre-aggregated or transformed data to perform scope (Global and/or Site for example) level conversion on the node level data for end user metrics.
  • scope Global and/or Site for example
  • the aggregation service 342 has provided an aggregated output that includes quantified client performance data 346 associated with the first tenant and quantified client performance data 348 associated with the second tenant.
  • a number of sample counts can be used as a weighting factor to improve statistical accuracy of the quantified client performance data.
  • the aggregation service 342 can be configured to aggregate pre-aggregated data uploaded from one of more upload components at defined time intervals (e.g., run every 15 min., use for a sliding window of last 1 hour of data; run every 24 hours, use sliding window of last 24 hours of data, etc.).
  • the aggregation service 342 can also be configured to pivot or group, across one or more domain controllers, by geographic location, tenant, ISP per geographic location, tenant per geographic location, and/or scope per site level.
  • the aggregation service 342 operates in part to generate client scenario latency and other performance related statistics for quantifying navigation time, CDN time, authorization time, redirect time, etc.
  • the aggregation service 342 can provide statistical measures/values such as average, 75% percentile, 85% percentile, 95% percentile, etc.
  • the aggregation service 342 can also use dynamic bins that encompass a range of latencies with percentile values for latencies at 10th, 20th, 30th, 40th, 50th, 60th, 70th, 80th, 90th percentiles, and maximum.
  • Failure zone analyzer 350 operates in part using rules that are designed to identify certain segments or characteristics of the data aggregate using statistical measures or other latency quantifications.
  • the rules may be designed to identify different levels of performance (e.g., fair, poor, excellent, etc.) based on one or more quantitative measures, such as navigation time, load time, connect time, etc.
  • the rules are applied to the aggregated data according to the output from the aggregation service 342.
  • Exemplary rules are configurable according to each implementation. For example, rules may be based on an overall value for a region or ISP such as rules configured to prioritize consideration of certain metrics or measures over others.
  • Report generator 352 operates to generate report information for reporting and/or feedback communications as to the state of an application or service along with any specific recommendations for tenants having some identified issue that may need to be addressed.
  • report generator 352 can operate to dynamically generate a user insight report that lists the top number (e.g., 10) tenants for each geographic location having highest latencies or the top number of tenants having the highest latencies. While shown as integral components, it will be appreciated that failure zone analyzer 350 and report generator 352 can be configured as separate components.
  • pivots can be applied solely at the aggregation service 342, or in combination with pivots applied the server 306, server 316, and/or server 328.
  • the pipeline 300 of an embodiment uses performance markers as part of: reliably collecting client data; allowing segregation of successful and failed execution of scenario; allowing for filtering/segregation of monitoring data (e.g., probes); accurately marking the start and end of scenarios tied with user experience (e.g., navigation time, page load, page displayed, page interactive, etc.); and/or identifying and filling missing data to assist with detailed drill downs, such as time to complete authentication, time to download CDN resources, time to redirect to correct web-access server, etc.
  • monitoring data e.g., probes
  • accurately marking the start and end of scenarios tied with user experience e.g., navigation time, page load, page displayed, page interactive, etc.
  • identifying and filling missing data to assist with detailed drill downs, such as time to complete authentication, time to download CDN resources, time to redirect to correct web-access server, etc.
  • Navigation timing of one embodiment comprise calculated values based on each time stamp defined in the W3C Navigation Timing API.
  • the W3C Navigation Timing API introduces the performance timing interface allowing JAVASCRIPT mechanisms to provide complete client-side latency measurements within applications.
  • the interface can be used to measure a user's perceived page load time.
  • Resource timing markers of one embodiment are the calculated values based on each time stamp defined in the W3C Resource Timing API that defines an interface allowing JAVASCRIPT mechanisms to provide complete client-side latency measurements within applications.
  • the interface can be used to measure a user's perceived load time of a resource.
  • exemplary markers may include:
  • PLT Page load time
  • ALT The PLT time without authentication time, this key only appear when "type" is ALT (boot from application cache).
  • RDT The render time from web access finish retrieve session data until PLT end marker.
  • client raw data includes parameters including but not limited to:
  • log file 308 can include the following web-access navigation timing raw data associated with client 302 as:
  • Exemplary load timing raw data associated with client 302 as:
  • FIGURE 4 is flow diagram depicting operations of an exemplary end-to-end process 400 used as part of providing performance diagnostic analysis and/or issue remediation services according to an embodiment.
  • the process 400 at 402 operates to collect performance data using a client executing on an end-user device/system.
  • a client such as a browser or other application and scripting code (e.g., JAVASCRIPT code) collects client-centric performance data and/or requests performance diagnostic analysis services from one or more server computers associated with use an online of service or application.
  • the process 400 at 402 of one embodiment operates to collect raw performance data that includes navigation timing, page load timing, and/or other parameters indicative of latencies or other performance issues as part of assessing an end- user experience associated with an online service or application.
  • the process 400 at 404 operates to provide the raw performance data to a log file of a dedicated server computer.
  • the process 400 at 404 includes the use of a browser executing on a user device/system to upload a client IP address and collected performance data or some portion to one or more log files.
  • the process 400 operates to transform or map the logged performance data using the client IP address and mapping targets that include geographical location (e.g., country/state), ISP, and/or tenant GUID.
  • the process 400 at 406 can be configured to map logged client data to a plurality of mapping tables including a first mapping table that defines IP address to geographic location mappings for the logged client data and a second mapping table that defines IP address to ISP mappings for the logged client data.
  • the process 400 operates to upload the transformed data grouped by one or more of tenant GUID, geographic location, and/or ISP to one or more diagnostic service databases.
  • the process 400 at 410 operates to perform aggregation operations across the one or more databases to generate latency and/or other performance related aggregations for the online service or application.
  • the process 400 at 410 performs aggregation operations to determine one or more of tenant level, geographic location level, and/or ISP level latencies.
  • the process 400 at 412 uses one or more rules on the aggregated data to perform a failure zone analysis to identify one or more failure or potential failure zones. For example, the process 400 at 412 can use configured rules to vet whether a user experience is poor, satisfactory, or excellent based in part on trend or baseline comparisons across all countries and/or ISPs.
  • the process 400 operates to use the failure zone information as part of taking any corrective or mitigating action.
  • the process 400 at 414 can use failure zone analysis information to generate online reports that identify potential network and/or communication architecture modifications as part of reducing latency or other performance related issues. While a certain number and order of operations are described for the exemplary flow of FIGURE 4, it will be appreciated that other numbers, combinations, and/or orders can be used according to desired implementations.
  • the process 400 can be used in part to generate an electronic report that allows for viewing of different network metrics for an online email service to identify that users in a first location are spending longer time in CDN compared to rest of the countries in the associated region. A reviewer can then follow-up with a CDN provider in the first location to resolve the issue. Additionally, review of a geographic-ISP report for the first location reveals difference in latencies by ISP enabling ready identification of an increase in latency for one of the larger ISPs that may be contacted to inform and resolve the issue.
  • the process 400 can be used to generate an electronic report that includes download times by region to identify users of a particular region having maximum download time resulting in deploying of a new edge server to reduce the impact of user networks.
  • An updated report reveals a reduction in latencies for the particular region.
  • the process 400 can generate an electronic report that allows a particular tenant to display a trend view and determine that a latency increase occurred in the last few days as well as TCP connecting times increased by 500ms. Based on the report, an affected tenant can be contacted to identify issues with ISP peering with another location.
  • Suitable programming means include any means for directing a computer system or device to execute steps of a process or method, including for example, systems comprised of processing units and arithmetic-logic circuits coupled to computer memory, which systems have the capability of storing in computer memory, which computer memory includes electronic circuits configured to store data and program instructions or code.
  • An exemplary article of manufacture includes a computer program product useable with any suitable processing system. While a certain number and types of components are described above, it will be appreciated that other numbers and/or types and/or configurations can be included according to various embodiments. Accordingly, component functionality can be further divided and/or combined with other component functionalities according to desired implementations.
  • the term computer readable media as used herein can include computer storage media or computer storage. The computer storage of an embodiment stores program code or instructions that operate to perform some function. Computer storage media can include volatile and nonvolatile, removable and nonremovable media implemented in any method or technology for storage of information, such as computer readable instructions, data structures, program modules, etc.
  • Computer storage media may include, but is not limited to, RAM, ROM, electrically erasable read-only memory (EEPROM), flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store information and which can be accessed by a computing device. Any such computer storage media may be part of a device or system.
  • communication media may include wired media such as a wired network or direct- wired connection, and wireless media such as acoustic, RF, infrared, and other wireless media.
  • the embodiments and examples described herein are not intended to be limiting and other embodiments are available.
  • the components described above can be implemented as part of networked, distributed, and/or other computer-implemented environment.
  • the components can communicate via a wired, wireless, and/or a combination of communication networks.
  • Network components and/or couplings between components of can include any of a type, number, and/or combination of networks and the corresponding network components which include, but are not limited to, wide area networks (WANs), local area networks (LANs), metropolitan area networks (MANs), proprietary networks, backend networks, cellular networks, etc.
  • Client computing devices/systems and servers can be any type and/or combination of processor-based devices or systems. Additionally, server functionality can include many components and include other servers. Components of the computing environments described in the singular tense may include multiple instances of such components. While certain embodiments include software implementations, they are not so limited and encompass hardware, or mixed hardware/software solutions.
  • Terms used in the description generally describe a computer-related operational environment that includes hardware, software, firmware and/or other items.
  • a component can use processes using a processor, executable, and/or other code.
  • Exemplary components include an application, a server running on the application, and/or an electronic communication client coupled to a server for receiving communication items.
  • Computer resources can include processor and memory resources such as: digital signal processors, microprocessors, multi-core processors, etc. and memory components such as magnetic, optical, and/or other storage devices, smart memory, flash memory, etc.
  • Communication components can be used to communicate computer-readable information as part of transmitting, receiving, and/or rendering electronic communication items using a communication network or networks, such as the Internet for example. Other embodiments and configurations are included.
  • FIGURE 5 provides a brief, general description of a suitable computing environment in which embodiments be implemented. While described in the general context of program modules that execute in conjunction with program modules that run on an operating system on various types of computing devices/systems, those skilled in the art will recognize that the invention may also be implemented in combination with other types of computer devices/systems and program modules.
  • program modules include routines, programs, components, data structures, and other types of structures that perform particular tasks or implement particular abstract data types.
  • program modules may be located in both local and remote memory storage devices.
  • computer 2 comprises a general purpose server, desktop, laptop, handheld, or other type of computer capable of executing one or more application programs including an email application or other application that includes email functionality.
  • the computer 2 includes at least one central processing unit 8 ("CPU"), a system memory 12, including a random access memory 18 ("RAM”) and a read-only memory (“ROM”) 20, and a system bus 10 that couples the memory to the CPU 8.
  • CPU central processing unit
  • RAM random access memory
  • ROM read-only memory
  • the computer 2 further includes a mass storage device 14 for storing an operating system 24, application programs, and other program modules/resources 26.
  • the mass storage device 14 is connected to the CPU 8 through a mass storage controller (not shown) connected to the bus 10.
  • the mass storage device 14 and its associated computer-readable media provide non- volatile storage for the computer 2.
  • computer-readable media can be any available media that can be accessed or utilized by the computer 2.
  • the computer 2 may operate in a networked environment using logical connections to remote computers through a network 4, such as a local network, the Internet, etc. for example.
  • the computer 2 may connect to the network 4 through a network interface unit 16 connected to the bus 10. It should be appreciated that the network interface unit 16 may also be utilized to connect to other types of networks and remote computing systems.
  • the computer 2 may also include an input/output controller 22 for receiving and processing input from a number of other devices, including a keyboard, mouse, etc. (not shown). Similarly, an input/output controller 22 may provide output to a display screen, a printer, or other type of output device.
  • a number of program modules and data files may be stored in the mass storage device 14 and RAM 18 of the computer 2, including an operating system 24 suitable for controlling the operation of a networked personal computer, such as the WINDOWS operating systems from MICROSOFT CORPORATION of Redmond, Washington.
  • the mass storage device 14 and RAM 18 may also store one or more program modules.
  • the mass storage device 14 and the RAM 18 may store application programs, such as word processing, spreadsheet, drawing, e-mail, and other applications and/or program modules, etc.
  • FIGURES 6A-6B illustrate a mobile computing device 600, for example, a mobile telephone, a smart phone, a tablet personal computer, a laptop computer, and the like, with which embodiments may be practiced.
  • a mobile computing device 600 for example, a mobile telephone, a smart phone, a tablet personal computer, a laptop computer, and the like, with which embodiments may be practiced.
  • FIGURE 6A one embodiment of a mobile computing device 600 for implementing the embodiments is illustrated.
  • the mobile computing device 600 is a handheld computer having both input elements and output elements.
  • the mobile computing device 600 typically includes a display 605 and one or more input buttons 610 that allow the user to enter information into the mobile computing device 600.
  • the display 605 of the mobile computing device 600 may also function as an input device (e.g., a touch screen display). If included, an optional side input element 615 allows further user input.
  • the side input element 615 may be a rotary switch, a button, or any other type of manual input element.
  • mobile computing device 600 may incorporate more or less input elements.
  • the display 605 may not be a touch screen in some embodiments.
  • the mobile computing device 600 is a portable phone system, such as a cellular phone.
  • the mobile computing device 600 may also include an optional keypad 635.
  • Optional keypad 635 may be a physical keypad or a "soft" keypad generated on the touch screen display.
  • the output elements include the display 605 for showing a graphical user interface (GUI), a visual indicator 620 (e.g., a light emitting diode), and/or an audio transducer 625 (e.g., a speaker).
  • GUI graphical user interface
  • a visual indicator 620 e.g., a light emitting diode
  • an audio transducer 625 e.g., a speaker
  • the mobile computing device 600 incorporates a vibration transducer for providing the user with tactile feedback.
  • the mobile computing device 600 incorporates input and/or output ports, such as an audio input (e.g., a microphone jack), an audio output (e.g., a headphone jack), and a video output (e.g., a HDMI port) for sending signals to or receiving signals from an external device.
  • an audio input e.g., a microphone jack
  • an audio output e.g., a headphone jack
  • a video output e.g., a HDMI port
  • FIGURE 6B is a block diagram illustrating the architecture of one embodiment of a mobile computing device. That is, the mobile computing device 600 can incorporate a system (i.e., an architecture) 602 to implement some embodiments.
  • the system 602 is implemented as a "smart phone" capable of running one or more applications (e.g., browser, e-mail, calendaring, contact managers, messaging clients, games, and media clients/players).
  • the system 602 is integrated as a computing device, such as an integrated personal digital assistant (PDA) and wireless phone.
  • PDA personal digital assistant
  • One or more application programs 666 may be loaded into the memory 662 and run on or in association with the operating system 664.
  • Examples of the application programs include phone dialer programs, e-mail programs, personal information management (PIM) programs, word processing programs, spreadsheet programs, Internet browser programs, messaging programs, and so forth.
  • the system 602 also includes a non-volatile storage area 668 within the memory 662. The non-volatile storage area 668 may be used to store persistent information that should not be lost if the system 602 is powered down.
  • the application programs 666 may use and store information in the non-volatile storage area 668, such as e-mail or other messages used by an e-mail application, and the like.
  • a synchronization application (not shown) also resides on the system 602 and is programmed to interact with a corresponding synchronization application resident on a host computer to keep the information stored in the non-volatile storage area 668 synchronized with corresponding information stored at the host computer.
  • other applications may be loaded into the memory 662 and run on the mobile computing device 600.
  • the system 602 has a power supply 670, which may be implemented as one or more batteries.
  • the power supply 670 might further include an external power source, such as an AC adapter or a powered docking cradle that supplements or recharges the batteries.
  • the system 602 may also include a radio 672 that performs the function of transmitting and receiving radio frequency communications.
  • the radio 672 facilitates wireless connectivity between the system 602 and the "outside world," via a communications carrier or service provider. Transmissions to and from the radio 672 are conducted under control of the operating system 664. In other words, communications received by the radio 672 may be disseminated to the application programs 666 via the operating system 664, and vice versa.
  • the visual indicator 620 may be used to provide visual notifications and/or an audio interface 674 may be used for producing audible notifications via the audio transducer 625.
  • the visual indicator 620 is a light emitting diode (LED) and the audio transducer 625 is a speaker.
  • LED light emitting diode
  • the LED may be programmed to remain on indefinitely until the user takes action to indicate the powered-on status of the device.
  • the audio interface 674 is used to provide audible signals to and receive audible signals from the user.
  • the audio interface 674 may also be coupled to a microphone to receive audible input, such as to facilitate a telephone conversation.
  • the microphone may also serve as an audio sensor to facilitate control of notifications, as will be described below.
  • the system 602 may further include a video interface 676 that enables an operation of an on-board camera 630 to record still images, video stream, and the like.
  • a mobile computing device 600 implementing the system 602 may have additional features or functionality.
  • the mobile computing device 600 may also include additional data storage devices (removable and/or non-removable) such as, magnetic disks, optical disks, or tape. Such additional storage is illustrated in FIGURE 6B by the non-volatile storage area 668.
  • additional data storage devices removable and/or non-removable
  • FIGURE 6B Such additional storage is illustrated in FIGURE 6B by the non-volatile storage area 668.
  • Data/information generated or captured by the mobile computing device 600 and stored via the system 602 may be stored locally on the mobile computing device 600, as described above, or the data may be stored on any number of storage media that may be accessed by the device via the radio 672 or via a wired connection between the mobile computing device 600 and a separate computing device associated with the mobile computing device 600, for example, a server computer in a distributed computing network, such as the Internet.
  • a server computer in a distributed computing network such as the Internet.
  • data/information may be accessed via the mobile computing device 600 via the radio 672 or via a distributed computing network.
  • data/information may be readily transferred between computing devices for storage and use according to well-known data/information transfer and storage means, including electronic mail and collaborative data/information sharing systems.
  • FIGURE 7 illustrates one embodiment of a system architecture for implementing latency identification and remediation features.
  • Data processing information may be stored in different communication channels or storage types. For example, various information may be stored/accessed using a directory service 722, a web portal 724, a mailbox service 726, an instant messaging store 728, and/or a social networking site 730.
  • a server 720 may provide additional latency analysis and other features. As one example, the server 720 may provide rules that are used to distribute outbound email using a number of datacenter partitions over network 715, such as the Internet or other network(s) for example.
  • the client computing device may be implemented as a general computing device 702 and embodied in a personal computer, a tablet computing device 704, and/or a mobile computing device 706 (e.g., a smart phone). Any of these clients may use content from the store 716.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Computer Hardware Design (AREA)
  • Environmental & Geological Engineering (AREA)
  • Debugging And Monitoring (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
  • Computer And Data Communications (AREA)

Abstract

Embodiments provide for monitoring of an online user experience and/or remediating performance issues, but are not so limited. A computer-implemented method of an embodiment operates to receive, pre-aggregate, and aggregate client performance data as part of providing an end-to-end diagnostics monitoring and resolution service. A system of an embodiment is configured to aggregate performance data of a plurality of client devices or systems as part of identifying latency issues at one or more of a tenant level, geographic location level, and/or service provider level. Other embodiments are included.

Description

PERFORMANCE MONITORING TO PROVIDE REAL OR NEAR REAL TIME
REMEDIATION FEEDBACK
BACKGROUND
[0001] Many large and small scale businesses depend on some type of on online service as part of running a successful venture. Bandwidth is one factor that affects speed of a network. Latency is another factor that affects network speed and responsiveness. Latency may be described as delay that affects processing of network data. Network conditions, hardware and software limitations, and/or other factors may adversely affect a user's experience of some online application or service. With the emergence of cloud computing and datacenter services, it is imperative to provide timely service with minimal bottlenecks across hundreds of server computers and associated networking infrastructure serving millions of users worldwide.
[0002] One difficulty lies in the complexity associated with monitoring the health of one or more services over multiple geographic locations and multiple diverse components in real or near real time. System downtime and even small amounts of performance degradation can lead to additional man hours, cost, and machine overload, which may potentially affect a business' bottom line. Unfortunately, the current state of the art is deficient in providing performance monitoring and resolution systems that efficiently identify issues and provide robust solutions or feedback as quickly as possible.
SUMMARY
[0003] This summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended as an aid in determining the scope of the claimed subject matter.
[0004] Embodiments provide for monitoring of an online user experience and/or remediating performance issues, but are not so limited. A computer-implemented method of an embodiment operates to receive, pre-aggregate, and aggregate client performance data as part of providing an end-to-end diagnostics monitoring and resolution service. A system of an embodiment is configured to aggregate performance data of a plurality of client devices or systems as part of identifying latency issues at one or more of a tenant level, geographic location level, and/or service provider level. Other embodiments are included.
[0005] These and other features and advantages will be apparent from a reading of the following detailed description and a review of the associated drawings. It is to be understood that both the foregoing general description and the following detailed description are explanatory only and are not restrictive of the invention as claimed.
BRIEF DESCRIPTION OF THE DRAWINGS
[0006] FIGURE 1 depicts an exemplary system that operates in part to provide real or near real time end user performance monitoring services.
[0007] FIGURE 2 is a flow diagram depicting an exemplary process of pre- aggregating and aggregating performance and/or other data.
[0008] FIGURE 3 is a block diagram depicting components of an exemplary end-to- end data processing pipeline.
[0009] FIGURE 4 is flow diagram depicting operations of an exemplary end-to-end process used as part of providing performance diagnostic analysis and/or issue remediation services.
[0010] FIGURE 5 is a block diagram illustrating an exemplary computing
environment for implementation of various embodiments.
[0011] FIGURES 6A-6B illustrate a mobile computing device with which
embodiments may be practiced.
[0012] FIGURE 7 illustrates one embodiment of a system architecture for implementation of various embodiments.
DETAILED DESCRIPTION
[0013] FIGURE 1 depicts an exemplary system 100 that operates in part to provide real or near real time end user performance monitoring services, but is not so limited. Components of the system 100 operate in part to use aggregated latency and/or other network data to mitigate and/or resolve network ecosystem issues. As an example, as part of providing an online service, such as providing one or more office productivity applications and/or features of an application suite, components of the system 100 can operate to provide failure zone analysis and resolution information to tenants based on aggregations of performance data. Components of the system 100 can be used to provide a real or near real time assessment of the usability of an online service as well as being able to identify or target failure zones to troubleshoot and/or correct any associated performance or user-experience problems.
[0014] As described below, the system 100 includes features that provide end user performance optics to consumers of an online service including quantifying real time tenant level optics, such as by enabling one or more designated persons of a customer with an ability to view performance or other metrics of a user base across any geographical location or locations. For example, components of the system 100 operate in part by collecting tenant level data to identify top latency data or other outliers for reporting or alerting within a defined location of interest. Equipped with an ability to focus at a geographic level can uncover issues specific to location, such as poor CDN performance, DNS resolution time, longer round trip times, etc. Additionally, geographic granularity based on a service provider allows for identifying issues at an Internet Service Provider (ISP) level.
[0015] Correspondingly, consumers can use real or near real time feedback to identify users, tenants, and/or locations having degraded or otherwise deficient service experiences. As described briefly above, components of the system 100 can operate to ascertain one or more failure zones for tenants as well as identify specific users having degraded experience. For example, as part of monitoring an end user using an online email service, the aggregation service 110 can use rules to generate an aggregated output 112 to generate a geographic- based latency map color coded by scale of communication latency. The aggregation service 110 can use configured rules to generate an aggregated output 1 12 as part of debugging and isolating issues based on geographic, ISP, and/or other parameters as described below.
[0016] Correspondingly, components of the system 100 operate to identify failure zones, such as by isolating an issue tied to a DNS resolver, ISP peering, network routing, non-optimal hosting locations, etc. For example, components of the system 100 can be used to assess or quantify a state of a user experience for one or more locations (e.g., region, country, county, etc.), one or more tenants, a selected tenant by geographic location or ISP, and/or for selected geographic location by ISP. The system 100 operates in part to provide for debugging of latency or other data with additional breakdowns by: a client time, a network time, a server time, a CDN time, a connect time, etc.; identifying outlier data, such as a first number of tenants and ISPs by latency; generating historic trends on latency and other performance metrics; providing guidance data for effective edge and other server deployments; enabling pre-aggregating by configuring mailbox servers with geo-mapping capability; generating report data to gain insight into real user CDN interaction; supporting web access based and locally installed clients to reduce load times; etc. Depending on the client, different types of metrics or other data can be collected and provided to the system 100 for use in quantifying user experiences.
[0017] Components of the system 100 can operate as part of supporting use of an online service or application by proactively operating to identify specific users or user groups having a degraded experience. As described below, as part of assessing a performance state of an online service or application, quantitative comparisons can be made relative to one or more baseline experiences for a particular location or ISP. Establishing robust and up-to- date baselines allows for a more focused and confident response to performance related calls/emails and proactive aspect of identification of outliers can be used to have 360 degree loop with service consumers.
[0018] One embodiment of the system 100 comprises a service support communication infrastructure that enables troubleshooting and remedying performance or other issues related to a server component, a client component, and/or a network condition, such as network latency issues, DNS look up issues, Content Delivery Network (CDN) issues, etc. According to one embodiment, data collection services comprise a decentralized architecture which partitions client data based in part on a datacenter location by processing raw client data for each server node including pre-aggregating raw data before uploading pre-aggregated data to one or more stores, such as a plurality of database servers for example, before final aggregations.
[0019] Depending on the implementation, the aggregation service 1 10 can be configured as a separate or an integrated service running on one or multiple physical machines to globally aggregate the pre-aggregated data across multiple data stores based on a set of common and/or customized metrics. By pre-aggregating as part of collecting data at each node, processing time and use can be reduced due in part to the limited number of data points used with a final aggregation. As such, aggregated data can be generated in real or near real time. The aggregation service 110 of one embodiment is configured to automatically aggregate latency and/or other performance data, including navigation and/or load timing data, to identify issues at different levels or granularities, such as a tenant level, a geographical or location level, and/or an ISP level as part of efficiently remediating any realized or potential issues.
[0020] With continuing reference to FIGURE 1 , while a limited number of components are shown to describe aspects of the various embodiments, it will be appreciated that the embodiments are not so limited and other configurations are available. For example, while a single server 102 is shown, the system 100 may include multiple server computers, including pre-aggregation servers, database servers, and/or aggregation servers, as well as client devices/systems that operate as part of an end-to-end computing architecture. It will be appreciated that servers may comprise one or more physical and/or virtual machines dependent upon the particular implementation.
[0021] As described further below, components of the system 100 are configured to collect, pre-aggregate, aggregate, and/or analyze client information as part of providing real or near real time reporting to customers regarding the state of an application or network. Additional components and/or features can be added to the system 100 as needed. For example, based on an identified latency, a customer may use the feedback to deploy an additional edge server in their network. As described below, components of the system 100 may be used to ascertain different user experiences and/or network conditions across multiple networks and network types serving a client or consumer base.
[0022] As shown in FIGURE 1 , server 102 receives information from one or more clients shown as input 104. According to an embodiment, input 104 includes performance data associated with a client while using an online service or application. For example, raw performance data can be uploaded to server 102 for processing. In one embodiment, input 104 includes information pertaining to a client experience such as loading and navigating web resources, and/or server 102 comprises a server computer that supports the use of log files to store collected data. In one embodiment, a browser or other application running on a user device/system can use script code to collect information related to one or more of navigation timing parameters, resource and/or load timing parameters, and/or custom marker parameters which may be written to a server log file. For example, server 102 can be configured as a MICROSOFT EXCHANGE server to use one or more fault-tolerant, transaction-based databases to store information.
[0023] According to an embodiment, in addition to processing and memory resources, server 102 includes extensible diagnostic features that utilize a pre-aggregator 106 that operates in part on raw performance data included with input 104, but is not so limited. The pre-aggregator 106 of an embodiment operates to parse client data stored in log files as part of extracting and mapping the client data to one or more mapping tables. In one embodiment, the pre-aggregator 106 operates to parse performance data stored in one or more log files to generate mappings, wherein the mappings are defined in part by transforming client IP address and logged client information to one or more of a geographical location (e.g., country/state), an ISP, and/or tenant global user identifier (GUID).
[0024] The pre-aggregator 106 is configured to group performance data by one or more of IP, location, ISP, and/or tenant GUID before storing the grouped information to store 108. For example, the pre-aggregator 106 can be configured to group performance data associated with client latency metrics by country/state, ISP, and/or tenant. If the logged data cannot be resolved to an ISP level, the pre-aggregator 106 can identify groups limited to country and/or tenant. It will be appreciated that country and ISP parameters can be determined according to client IP address.
[0025] As shown, the aggregation service 110 operates on the pre-aggregated output provided by pre-aggregator 106 to generate an aggregated output 112. The functionality provided by the pre-aggregator 106 operates in part to increase an efficient use of processing and memory resources at the aggregation service 1 10 while also reducing power consumption since a smaller data set can be input to the aggregation service 110 to generate the aggregated output 112. The aggregation service 1 10 of an embodiment comprises one or more server computers and complex aggregation code that operates to provide aggregated output 1 12. As described in more detail below, an aggregated output 1 12 can be further processed to identify any potential failure zones and/or other issues that may be contributing to a user experience. The aggregation service 110 of one embodiment aggregates pre- aggregated data across all databases to quantify one or more of tenant level, country level, and/or ISP level latencies associated with a particular application, service, or other component.
[0026] As described below, rules can be included with the aggregation service 110 to control processing of the pre-aggregated output to generate the aggregated output 112. Based on different rule types, the aggregated output 1 12 provides focus including correlations, trends, baseline comparisons, and/or other quantified information tied to a use experience during execution of an application or an online service. For example, rules can be implemented that operate on pre-aggregated data to analyze performance based on an overall value for a region, such as by deriving the 75% percentile x and the standard deviation y for a given metric for North America. If the measurement for Mexico is greater than (x+y), it may cause escalation of a potential issue to engineering staff. Additional features are described further below.
[0027] It will be appreciated that complex communication architectures typically employ multiple hardware and/or software components including, but not limited to, server computers, networking components, and other components that enable communication and interaction by way of wired and/or wireless networks. While some embodiments have been described, various embodiments may be used with a number of computer configurations, including hand-held devices, multiprocessor systems, microprocessor-based or programmable consumer electronics, minicomputers, mainframe computers, etc. Various embodiments may be implemented in distributed computing environments using remote processing devices/systems that communicate over a one or more communications networks. In a distributed computing environment, program modules or code may be located in both local and remote memory. Various embodiments may be implemented as a process or method, a system, a device, article of manufacture, etc.
[0028] FIGURE 2 is a flow diagram depicting an exemplary process 200 of pre- aggregating and aggregating performance and/or other data as part of providing performance diagnostics and/or remediation services according to an embodiment. The process 200 begins at 202 by receiving raw performance data. For example, the process 200 at 202 can operate using a server computer to receive client-centric performance data collected by a client as part of requesting an assessment of a state of an online service or application. In one embodiment, the process 200 at 202 operates to receive client performance data that includes navigation timing, page load timing, and/or other parameters to use when assessing health or user experience associated with an online service or application.
[0029] At 204, the process 200 operates to pre-aggregate the raw performance data. In one embodiment, the process 200 at 204 operates to pre-aggregate the raw performance data by parsing log files and mapping client IP addresses to one or more of tenant identifier, location identifier, and/or ISP identifier before uploading the pre-aggregated data to one or more databases for final aggregation operations. At 206, the process 200 operates to aggregate the pre-aggregated data. In one embodiment, process 200 at 206 operates to aggregate the pre-aggregated data in part by generating an output of latency or other user experience quantifiers to identify issues at one or more of a tenant level, a location level, and/or ISP level.
[0030] If there are no further aggregation operations at 208 the process 200 proceeds to 210 and uses the aggregated data for latency and/or other analysis. Otherwise, the process 200 returns to 206 and continues aggregation operations. As described above and further below, aggregated output can be used as part of remediating any identified issue by implementing contingency or other measures. While a certain number and order of operations are described for the exemplary flow of FIGURE 2, it will be appreciated that other numbers, combinations, and/or orders can be used according to desired implementations.
[0031] FIGURE 3 is a block diagram depicting components of an exemplary end-to- end data processing pipeline 300 that operate in part to provide user insights into aggregated data as part of identifying infrastructure, performance, network, or other issues that may be adversely affecting use of an online application or service. For example, an online service supporting cloud-based application services can include functionality to collect and quantify performance data or metrics in near real time including providing user scenario latencies and detailed breakdowns by collected metrics associated with one or more of client operational parameters, tenant parameters, IP parameters, location parameters, and/or ISP parameters. Components of the pipeline 300 operate in part to aggregate, pivot, and/or store data at the tenant level, IP level, geographic location level, and/or an ISP level. Components of the pipeline 300 operate in part to proactively monitor user experiences to reduce performance degradations while providing alerts and/or solutions to remediate end user performance issues.
[0032] As shown in FIGURE 3, a client 302 associated with a first tenant user and client 304 associated with a second tenant user are communicating with server 306. As shown, log file 308 receives and stores collected data from clients 302 and 304. In one embodiment, the client 302 can be implemented as part of a browser application running on a user device system, wherein script code can be used to collect information associated with use of an online application or service, such as a page load time, a time to connect, or some other parameter for example. The server 306 of one embodiment comprises a server computer dedicated to serving clients 302 and 304. According to an embodiment, server 306 includes a diagnostics service that uses an IP mapper 310 and upload component 312 for an associated node.
[0033] The IP mapper 310 and upload component 312 operate in part to provide pre- aggregation services on the data of log file 308. As described above, a single component can be configured to perform the pre-aggregation services provided by these components. The IP mapper 310 of an embodiment operates in part to parse log file 308 to extract and map logged performance data or metrics based on one or more of an IP address, a location, and/or ISP for each client or tenant. According to one embodiment, the IP mapper 310 operates in part to pre-aggregate and consolidate the client data by mapping a client IP address and performance or latency data to one or more of a geographic location (e.g., country/state), an ISP, and/or a tenant global user identifier (GUID). The upload component 312 operates to upload the mapped data provided by the IP mapper 310 grouped by one or more of location, ISP, and/or tenant GUID to a dedicated database 314. If the logged data cannot be resolved to an ISP level, the pre-aggregation can include groupings limited to country and/or tenant. It will be appreciated that country and ISP parameters can be determined according to a client IP address. [0034] With continuing reference to FIGURE 3, components of server 306 are configured with complex programming code that operates to pre-aggregate collected client data in part by parsing the collected client data, such as by parsing performance data logs for example, and extracting user scenario, time of event, client IP, latency, tenant data and other detailed metrics based on the client information. Consequently, the server 306 is able to pre-aggregate data received from client as part of reducing the final aggregation load when quantifying latency and/or other performance issues.
[0035] The IP mapper 310 of an embodiment operates to map client IP addresses to a geographic location depending on the mapping granularity and/or a client IP to an associated ISP based on known or to be implemented IP ranges. The server 306 includes analysis code that operates to parse based in part on a type of client and/or associated client data. For example, performance data of a web access client can be collected and routed to a log file of mailbox server serving the sessions, wherein the analysis code would be configured to parse the particular client information to understand a scenario, latency, and associated issues (e.g., slow navigation time, slow DNS time, etc.).
[0036] Parsing of an embodiment operates to transform client IP address and tenant information in the log files to country/state, ISP and/or tenant GUID. In one embodiment, parsing operations are performed in part using a derived mapping table generated from a generic public geo-mapping database.
[0037] An example data entry in a geo-mapping database for parsing may include:
[0038] StartIP|EndIP|CIDR|Continent|Country|Country_IS02|CountryConfidence|Re gion|State|State_CF|City|CityConfidence|Postal_Code|
[0039] 16777472| 16778239|24|asia|china|cn|8||beijingshi|73|beijing|5| 100000|0|8|39.9 1176055| 1 16.3792325|0|0|0|unknown||none|False|0|0|0| 1307256208|0|RT_Unknown
[0040] 16778240116779263|24|oceania|australia|au|8||victoria|74|melbourne|5|3000|0| l o|-
37.8132| 144.963|0|0|0|unknown||none|False|56203|7482486|440| 1312156419| 1312378472| RT Unknown
[0041] The parsing operations applied by the IP mapper 310 of an embodiment result in the generation of a derived mapping table for IP to Countries by scanning each data entry, sorting, and merging based on IP ranges and corresponding countries to yield:
[0042] 16777216,au
[0043] 16777472,cn
[0044] 16778240,au [0045] 16779264,cn
[0046] 16781312,jp
[0047] 16785408,cn
[0048] 16793600,jp
[0049] 16809984,th
[0050] 16842752,cn
[0051] A mapping table can include exemplary mapping {key,value} data. As shown above, the mapped data includes a key that is an integer value that represents a starting IP address and a value that is the country ISO code. In the above mapping data, IP addresses between 16777216 and 16777472 belong to AU. By sorting the keys, the table can be compressed for loading into memory for quick look-up.
[0052] Similarly, parsing operations applied by the IP mapper 310 of an embodiment result in the generation of a derived mapping table for an IP to ISP mapping as shown below (key is the same as above but the value is an ASN number of an ISP):
[0053] 174981 12, 18313
[0054] 17514496,38091
[0055] 17522688,38669
[0056] 17530880, 17839
[0057] 17563648, 18245
[0058] With continuing reference to FIGURE 3, and continuing the example, server 316 processes or pre-aggregates client data of clients 318 and 320 stored in log file 321 in part by using the IP mapper 322 and upload component 324 to process and upload pre- aggregated data to another dedicated database 326. Dedicated databases 314 and 326 may or may not include more than one host computer. Moreover, while certain numbers and types of components are shown, it will be appreciated that the pipeline can include additional components, features, and functionality. Server 328 processes client data of clients 330, 332, 334, and 336 stored in log file 337 in part by using the IP mapper 338 and upload component 340 to process and upload pre-aggregated data to dedicated database 326.
[0059] In an embodiment, databases 314 and 326 are designed to handle the performance counters and metrics collected from various machines that may be networked to provide an online application or service. Since the end user performance data brings in additional pivots, a database schema can be used to support IP, geographic location, tenant, and/or ISP metrics and parameters. In one embodiment, server 306, server 316, and server 328 collect client data from a plurality of clients. For example, at the node level, server 306 can operate to pre-aggregate client data every 5 minutes using IP mapper 310 to transform the client data into predetermined pivots and the upload component 316 propagates the transformed data to database 314.
[0060] Aggregation service 342 aggregates the pre-aggregated data across databases 314 and 326 to determine one or more of tenant level latencies, country level latencies, and/or ISP level latencies associated with an online application or service, but is not so limited. For example, the aggregation service 342 operates on the pre-aggregated or transformed data to perform scope (Global and/or Site for example) level conversion on the node level data for end user metrics. As shown by example in FIGURE 3, the aggregation service 342 has provided an aggregated output that includes quantified client performance data 346 associated with the first tenant and quantified client performance data 348 associated with the second tenant. A number of sample counts can be used as a weighting factor to improve statistical accuracy of the quantified client performance data.
[0061] The aggregation service 342 can be configured to aggregate pre-aggregated data uploaded from one of more upload components at defined time intervals (e.g., run every 15 min., use for a sliding window of last 1 hour of data; run every 24 hours, use sliding window of last 24 hours of data, etc.). The aggregation service 342 can also be configured to pivot or group, across one or more domain controllers, by geographic location, tenant, ISP per geographic location, tenant per geographic location, and/or scope per site level. The aggregation service 342 operates in part to generate client scenario latency and other performance related statistics for quantifying navigation time, CDN time, authorization time, redirect time, etc. For example, the aggregation service 342 can provide statistical measures/values such as average, 75% percentile, 85% percentile, 95% percentile, etc. The aggregation service 342 can also use dynamic bins that encompass a range of latencies with percentile values for latencies at 10th, 20th, 30th, 40th, 50th, 60th, 70th, 80th, 90th percentiles, and maximum.
[0062] Failure zone analyzer 350 operates in part using rules that are designed to identify certain segments or characteristics of the data aggregate using statistical measures or other latency quantifications. For example, the rules may be designed to identify different levels of performance (e.g., fair, poor, excellent, etc.) based on one or more quantitative measures, such as navigation time, load time, connect time, etc. The rules are applied to the aggregated data according to the output from the aggregation service 342. Exemplary rules are configurable according to each implementation. For example, rules may be based on an overall value for a region or ISP such as rules configured to prioritize consideration of certain metrics or measures over others.
[0063] Report generator 352 operates to generate report information for reporting and/or feedback communications as to the state of an application or service along with any specific recommendations for tenants having some identified issue that may need to be addressed. For example, report generator 352 can operate to dynamically generate a user insight report that lists the top number (e.g., 10) tenants for each geographic location having highest latencies or the top number of tenants having the highest latencies. While shown as integral components, it will be appreciated that failure zone analyzer 350 and report generator 352 can be configured as separate components. In an alternative embodiment, pivots can be applied solely at the aggregation service 342, or in combination with pivots applied the server 306, server 316, and/or server 328.
[0064] The pipeline 300 of an embodiment uses performance markers as part of: reliably collecting client data; allowing segregation of successful and failed execution of scenario; allowing for filtering/segregation of monitoring data (e.g., probes); accurately marking the start and end of scenarios tied with user experience (e.g., navigation time, page load, page displayed, page interactive, etc.); and/or identifying and filling missing data to assist with detailed drill downs, such as time to complete authentication, time to download CDN resources, time to redirect to correct web-access server, etc.
[0065] Navigation timing of one embodiment comprise calculated values based on each time stamp defined in the W3C Navigation Timing API. To address the need for complete information on user experience, the W3C Navigation Timing API introduces the performance timing interface allowing JAVASCRIPT mechanisms to provide complete client-side latency measurements within applications. The interface can be used to measure a user's perceived page load time. Resource timing markers of one embodiment are the calculated values based on each time stamp defined in the W3C Resource Timing API that defines an interface allowing JAVASCRIPT mechanisms to provide complete client-side latency measurements within applications. The interface can be used to measure a user's perceived load time of a resource.
[0066] The Table below provides exemplary markers, marker calculations, and the associated descriptions in accordance with one embodiment.
Figure imgf000015_0001
[0067] Other exemplary markers may include:
[0068] Page load time (PLT) - The PLT time without authentication time, this key only appear when "type" is PLT (boot from no-cache or browser cache).
[0069] ALT - The PLT time without authentication time, this key only appear when "type" is ALT (boot from application cache). [0070] RDT - The render time from web access finish retrieve session data until PLT end marker.
[0071] For the examples below, client raw data includes parameters including but not limited to:
[0072] Redirect Count (RC);
[0073] Redirect Time (RT);
[0074] Fetch Time (FT);
[0075] Domain Lookup Time (DN);
[0076] Connect Time (CT);
[0077] Secure Connect Time (ST);
[0078] Request Time (RQ);
[0079] Response Time (RS);
[0080] Total Response Time (TR);
[0081] Dom Load Time (DL); and
[0082] Total Navigation Time (NV).
[0083] As an example log file 308 can include the following web-access navigation timing raw data associated with client 302 as:
[0084] 20XX -01- 09T00:08: 12.304Z,W3CNavTimeTestBox,PerfNavTime,S:mg=«Tenant
ID»;S:ts=20XX -01-09T00:08:03.860;
[0085] S:UC=5f8a321a877591c42b7;I32:ds=132;I32:DC=l;S:Mowa=0;S:ip=<PII> IP Address</PII>;
[0086] S:tg=D73DD084-BF81-4F05-A0D0-B8599C0444D0;S:user=<PII>Username like userl @contoso.com<PII>;
[0087] S:cbld=15.0.609.0;S:BuildType=DEBUG;
[0088] S:URI=«Server
URI»;S:FT=12;S:DN=0;S:CT=0;S:RQ=0;S:RS=10;S:UL=5;S:NV=5000;S:DL=2000;
[0089] S:D1=1078;S:D2=1760;
[0090] S:DE=5;S:PL=2;S:RC=0;S:NT=1.
[0091] And navigation timing raw data associated with client 304 as:
[0092] 20XX -01- 09T00:08: 12.304Z,W3CNavTimeTestBox,Perf avTime,S:mg=«Tenant
ID»;S:ts=20XX -01- 09T00:08:04.860;S:UC=f8a321a877591c42b7;I32:ds=132;I32:DC=l ;S:Mowa=0;S:ip=<P IMP Address</PII>;
[0093] S:tg=D73DD084-BF81-4F05-A0D0-B8599C0444D0;S:user=<PII> Useraame like userl @contoso.com</PII>;
[0094] S:cbld=15.0.609.0;S:BuildType=DEBUG;
[0095] S:URI=«Server
URI»;S:FT=20;S:DN=1 ;S:CT=10;S:RQ=10;S:RS=10;S:UL=15;S:NV=6000;S:DL=4000
[0096] S:D1=2156;S:D2=3000;
[0097] S:DE=10;S:PL=3;S:RC=2;S:NT=1.
[0098] Exemplary load timing raw data associated with client 302 as:
[0099] 20XX -05-
30T08:02: 12.304Z,ClientLoadTimeTestBox,CalculatedClientLoadTime,
[00100] S:ts=20XX -05-30T08:02: 16.20XX 727Z;S:UC=41 Ie478fdfef403c9a28clc3ffaa0317;
[00101] S:ip=<PII>IP Address</PII>;S:tg=la3ba9c6-00d3-4c2e-9862-f08a05al lflf;
[00102] S:PLT=7000;S:RDT=4000;S:RT=18;S:DN=0;S:CT=0;S:RQ=1 188;S:RS=2;S: SDN=0;S:SCT=10;S:SRQ=1800;
[00103] S:SRS=300;S:R1DN=0;S:R1CT=200;S:R1 ST=100;S:R1RQ=50;S:R1RS=10; S:R2DN=0;S:R2CT=8;S:R2ST=0;
[00104] S:R2RQ=50;S:R2RS=200;S:brn=MSIE;S:brv=10;
[00105] And, load timing raw data associated with client 304 as:
[00106] 20XX-05-
30T08:02: 12.304Z,ClientLoadTimeTestBox,CalculatedClientLoadTime,
[00107] S:ts=20XX -05-30T08:03: 16.20XX 727Z;S :UC=412e478fdfef403c9a28c 1 c3ffaa0317;
[00108] S:ip=<PII>IP Address</PII>;S:tg=la3ba9c6-00d3-4c2e-9862- fD8a05al lflf;S:PLT=8000;S:RT=18;S:DN=0;S:CT=0;S:RQ=1 188;S:RS=2;S:SDN=100;S :SCT=50;S:SRQ=1600;
[00109] S:SRS=400;S:R1DN=0;S:R1CT=600;S:R1 ST=300;S:R1RQ=90;S:R1RS=50; S :R2DN=0;S :R2CT= 16;S :R2ST=0;
[00110] S:R2RQ=0;S:R2RS=400;S:brn=Chrome;S:brv=27. [00111] Using the exemplary client data, the Table below shows exemplary output from aggregation service 342 aggregating user performance data by tenant and by country as follows.
Sample Tenant Aggregates
Figure imgf000018_0001
Sample Country Aggregates
Figure imgf000018_0002
23:0 00:0 00:0
5 5 5
09/1 09/1 09/1 US 18.3 1035 55 70 153 64
11 8/ 8/ OWA W3C 3 3 3 1 7
20X 20X 20X Navigation
X X X Timing\Conn
23: 1 00: 1 00: 1 ect Time
0 0 0
[00112] FIGURE 4 is flow diagram depicting operations of an exemplary end-to-end process 400 used as part of providing performance diagnostic analysis and/or issue remediation services according to an embodiment. The process 400 at 402 operates to collect performance data using a client executing on an end-user device/system. For example, at 402, a client such as a browser or other application and scripting code (e.g., JAVASCRIPT code) collects client-centric performance data and/or requests performance diagnostic analysis services from one or more server computers associated with use an online of service or application. The process 400 at 402 of one embodiment operates to collect raw performance data that includes navigation timing, page load timing, and/or other parameters indicative of latencies or other performance issues as part of assessing an end- user experience associated with an online service or application.
[00113] The process 400 at 404 operates to provide the raw performance data to a log file of a dedicated server computer. For example, the process 400 at 404 includes the use of a browser executing on a user device/system to upload a client IP address and collected performance data or some portion to one or more log files. At 406, the process 400 operates to transform or map the logged performance data using the client IP address and mapping targets that include geographical location (e.g., country/state), ISP, and/or tenant GUID. For example, the process 400 at 406 can be configured to map logged client data to a plurality of mapping tables including a first mapping table that defines IP address to geographic location mappings for the logged client data and a second mapping table that defines IP address to ISP mappings for the logged client data.
[00114] At 408, the process 400 operates to upload the transformed data grouped by one or more of tenant GUID, geographic location, and/or ISP to one or more diagnostic service databases. The process 400 at 410 operates to perform aggregation operations across the one or more databases to generate latency and/or other performance related aggregations for the online service or application. In one embodiment, the process 400 at 410 performs aggregation operations to determine one or more of tenant level, geographic location level, and/or ISP level latencies.
[00115] The process 400 at 412 uses one or more rules on the aggregated data to perform a failure zone analysis to identify one or more failure or potential failure zones. For example, the process 400 at 412 can use configured rules to vet whether a user experience is poor, satisfactory, or excellent based in part on trend or baseline comparisons across all countries and/or ISPs. At 414, the process 400 operates to use the failure zone information as part of taking any corrective or mitigating action. For example, the process 400 at 414 can use failure zone analysis information to generate online reports that identify potential network and/or communication architecture modifications as part of reducing latency or other performance related issues. While a certain number and order of operations are described for the exemplary flow of FIGURE 4, it will be appreciated that other numbers, combinations, and/or orders can be used according to desired implementations.
[00116] For example, the process 400 can be used in part to generate an electronic report that allows for viewing of different network metrics for an online email service to identify that users in a first location are spending longer time in CDN compared to rest of the countries in the associated region. A reviewer can then follow-up with a CDN provider in the first location to resolve the issue. Additionally, review of a geographic-ISP report for the first location reveals difference in latencies by ISP enabling ready identification of an increase in latency for one of the larger ISPs that may be contacted to inform and resolve the issue.
[00117] As yet another example, as part of an edge server deployment, the process 400 can be used to generate an electronic report that includes download times by region to identify users of a particular region having maximum download time resulting in deploying of a new edge server to reduce the impact of user networks. An updated report reveals a reduction in latencies for the particular region. As another example, of reducing identifying latencies, the process 400 can generate an electronic report that allows a particular tenant to display a trend view and determine that a latency increase occurred in the last few days as well as TCP connecting times increased by 500ms. Based on the report, an affected tenant can be contacted to identify issues with ISP peering with another location.
[00118] It will be appreciated that various features described herein can be implemented as part of a processor-driven environment including hardware and software components. Also, while certain embodiments and examples are described above for illustrative purposes, other embodiments are included and available, and the described embodiments should not be used to limit the claims. Suitable programming means include any means for directing a computer system or device to execute steps of a process or method, including for example, systems comprised of processing units and arithmetic-logic circuits coupled to computer memory, which systems have the capability of storing in computer memory, which computer memory includes electronic circuits configured to store data and program instructions or code.
[00119] An exemplary article of manufacture includes a computer program product useable with any suitable processing system. While a certain number and types of components are described above, it will be appreciated that other numbers and/or types and/or configurations can be included according to various embodiments. Accordingly, component functionality can be further divided and/or combined with other component functionalities according to desired implementations. The term computer readable media as used herein can include computer storage media or computer storage. The computer storage of an embodiment stores program code or instructions that operate to perform some function. Computer storage media can include volatile and nonvolatile, removable and nonremovable media implemented in any method or technology for storage of information, such as computer readable instructions, data structures, program modules, etc.
[00120] System memory, removable storage, and non-removable storage are all computer storage media examples (i.e., memory storage.). Computer storage media may include, but is not limited to, RAM, ROM, electrically erasable read-only memory (EEPROM), flash memory or other memory technology, CD-ROM, digital versatile disks (DVD) or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or any other medium which can be used to store information and which can be accessed by a computing device. Any such computer storage media may be part of a device or system. By way of example, and not limitation, communication media may include wired media such as a wired network or direct- wired connection, and wireless media such as acoustic, RF, infrared, and other wireless media.
[00121] The embodiments and examples described herein are not intended to be limiting and other embodiments are available. Moreover, the components described above can be implemented as part of networked, distributed, and/or other computer-implemented environment. The components can communicate via a wired, wireless, and/or a combination of communication networks. Network components and/or couplings between components of can include any of a type, number, and/or combination of networks and the corresponding network components which include, but are not limited to, wide area networks (WANs), local area networks (LANs), metropolitan area networks (MANs), proprietary networks, backend networks, cellular networks, etc.
[00122] Client computing devices/systems and servers can be any type and/or combination of processor-based devices or systems. Additionally, server functionality can include many components and include other servers. Components of the computing environments described in the singular tense may include multiple instances of such components. While certain embodiments include software implementations, they are not so limited and encompass hardware, or mixed hardware/software solutions.
[00123] Terms used in the description, such as component, module, system, device, cloud, network, and other terminology, generally describe a computer-related operational environment that includes hardware, software, firmware and/or other items. A component can use processes using a processor, executable, and/or other code. Exemplary components include an application, a server running on the application, and/or an electronic communication client coupled to a server for receiving communication items. Computer resources can include processor and memory resources such as: digital signal processors, microprocessors, multi-core processors, etc. and memory components such as magnetic, optical, and/or other storage devices, smart memory, flash memory, etc. Communication components can be used to communicate computer-readable information as part of transmitting, receiving, and/or rendering electronic communication items using a communication network or networks, such as the Internet for example. Other embodiments and configurations are included.
[00124] Referring now to FIGURE 5, the following provides a brief, general description of a suitable computing environment in which embodiments be implemented. While described in the general context of program modules that execute in conjunction with program modules that run on an operating system on various types of computing devices/systems, those skilled in the art will recognize that the invention may also be implemented in combination with other types of computer devices/systems and program modules.
[00125] Generally, program modules include routines, programs, components, data structures, and other types of structures that perform particular tasks or implement particular abstract data types. Moreover, those skilled in the art will appreciate that the invention may be practiced with other computer system configurations, including hand-held devices, multiprocessor systems, microprocessor-based or programmable consumer electronics, minicomputers, mainframe computers, and the like. The invention may also be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network. In a distributed computing environment, program modules may be located in both local and remote memory storage devices.
[00126] As shown in FIGURE 5, computer 2 comprises a general purpose server, desktop, laptop, handheld, or other type of computer capable of executing one or more application programs including an email application or other application that includes email functionality. The computer 2 includes at least one central processing unit 8 ("CPU"), a system memory 12, including a random access memory 18 ("RAM") and a read-only memory ("ROM") 20, and a system bus 10 that couples the memory to the CPU 8. A basic input/output system containing the basic routines that help to transfer information between elements within the computer, such as during startup, is stored in the ROM 20. The computer 2 further includes a mass storage device 14 for storing an operating system 24, application programs, and other program modules/resources 26.
[00127] The mass storage device 14 is connected to the CPU 8 through a mass storage controller (not shown) connected to the bus 10. The mass storage device 14 and its associated computer-readable media provide non- volatile storage for the computer 2. Although the description of computer-readable media contained herein refers to a mass storage device, such as a hard disk or CD-ROM drive, it should be appreciated by those skilled in the art that computer-readable media can be any available media that can be accessed or utilized by the computer 2.
[00128] According to various embodiments, the computer 2 may operate in a networked environment using logical connections to remote computers through a network 4, such as a local network, the Internet, etc. for example. The computer 2 may connect to the network 4 through a network interface unit 16 connected to the bus 10. It should be appreciated that the network interface unit 16 may also be utilized to connect to other types of networks and remote computing systems. The computer 2 may also include an input/output controller 22 for receiving and processing input from a number of other devices, including a keyboard, mouse, etc. (not shown). Similarly, an input/output controller 22 may provide output to a display screen, a printer, or other type of output device.
[00129] As mentioned briefly above, a number of program modules and data files may be stored in the mass storage device 14 and RAM 18 of the computer 2, including an operating system 24 suitable for controlling the operation of a networked personal computer, such as the WINDOWS operating systems from MICROSOFT CORPORATION of Redmond, Washington. The mass storage device 14 and RAM 18 may also store one or more program modules. In particular, the mass storage device 14 and the RAM 18 may store application programs, such as word processing, spreadsheet, drawing, e-mail, and other applications and/or program modules, etc.
[00130] FIGURES 6A-6B illustrate a mobile computing device 600, for example, a mobile telephone, a smart phone, a tablet personal computer, a laptop computer, and the like, with which embodiments may be practiced. With reference to FIGURE 6A, one embodiment of a mobile computing device 600 for implementing the embodiments is illustrated. In a basic configuration, the mobile computing device 600 is a handheld computer having both input elements and output elements.
[00131] The mobile computing device 600 typically includes a display 605 and one or more input buttons 610 that allow the user to enter information into the mobile computing device 600. The display 605 of the mobile computing device 600 may also function as an input device (e.g., a touch screen display). If included, an optional side input element 615 allows further user input. The side input element 615 may be a rotary switch, a button, or any other type of manual input element. In alternative embodiments, mobile computing device 600 may incorporate more or less input elements. For example, the display 605 may not be a touch screen in some embodiments. In yet another alternative embodiment, the mobile computing device 600 is a portable phone system, such as a cellular phone.
[00132] The mobile computing device 600 may also include an optional keypad 635. Optional keypad 635 may be a physical keypad or a "soft" keypad generated on the touch screen display. In various embodiments, the output elements include the display 605 for showing a graphical user interface (GUI), a visual indicator 620 (e.g., a light emitting diode), and/or an audio transducer 625 (e.g., a speaker). In some embodiments, the mobile computing device 600 incorporates a vibration transducer for providing the user with tactile feedback. In yet another embodiment, the mobile computing device 600 incorporates input and/or output ports, such as an audio input (e.g., a microphone jack), an audio output (e.g., a headphone jack), and a video output (e.g., a HDMI port) for sending signals to or receiving signals from an external device.
[00133] FIGURE 6B is a block diagram illustrating the architecture of one embodiment of a mobile computing device. That is, the mobile computing device 600 can incorporate a system (i.e., an architecture) 602 to implement some embodiments. In one embodiment, the system 602 is implemented as a "smart phone" capable of running one or more applications (e.g., browser, e-mail, calendaring, contact managers, messaging clients, games, and media clients/players). In some embodiments, the system 602 is integrated as a computing device, such as an integrated personal digital assistant (PDA) and wireless phone.
[00134] One or more application programs 666, including a notes application, may be loaded into the memory 662 and run on or in association with the operating system 664. Examples of the application programs include phone dialer programs, e-mail programs, personal information management (PIM) programs, word processing programs, spreadsheet programs, Internet browser programs, messaging programs, and so forth. The system 602 also includes a non-volatile storage area 668 within the memory 662. The non-volatile storage area 668 may be used to store persistent information that should not be lost if the system 602 is powered down.
[00135] The application programs 666 may use and store information in the non-volatile storage area 668, such as e-mail or other messages used by an e-mail application, and the like. A synchronization application (not shown) also resides on the system 602 and is programmed to interact with a corresponding synchronization application resident on a host computer to keep the information stored in the non-volatile storage area 668 synchronized with corresponding information stored at the host computer. As should be appreciated, other applications may be loaded into the memory 662 and run on the mobile computing device 600.
[00136] The system 602 has a power supply 670, which may be implemented as one or more batteries. The power supply 670 might further include an external power source, such as an AC adapter or a powered docking cradle that supplements or recharges the batteries. The system 602 may also include a radio 672 that performs the function of transmitting and receiving radio frequency communications. The radio 672 facilitates wireless connectivity between the system 602 and the "outside world," via a communications carrier or service provider. Transmissions to and from the radio 672 are conducted under control of the operating system 664. In other words, communications received by the radio 672 may be disseminated to the application programs 666 via the operating system 664, and vice versa.
[00137] The visual indicator 620 may be used to provide visual notifications and/or an audio interface 674 may be used for producing audible notifications via the audio transducer 625. In the illustrated embodiment, the visual indicator 620 is a light emitting diode (LED) and the audio transducer 625 is a speaker. These devices may be directly coupled to the power supply 670 so that when activated, they remain on for a duration dictated by the notification mechanism even though the processor 660 and other components might shut down for conserving battery power. The LED may be programmed to remain on indefinitely until the user takes action to indicate the powered-on status of the device.
[00138] The audio interface 674 is used to provide audible signals to and receive audible signals from the user. For example, in addition to being coupled to the audio transducer 625, the audio interface 674 may also be coupled to a microphone to receive audible input, such as to facilitate a telephone conversation. In accordance with embodiments, the microphone may also serve as an audio sensor to facilitate control of notifications, as will be described below. The system 602 may further include a video interface 676 that enables an operation of an on-board camera 630 to record still images, video stream, and the like. A mobile computing device 600 implementing the system 602 may have additional features or functionality. For example, the mobile computing device 600 may also include additional data storage devices (removable and/or non-removable) such as, magnetic disks, optical disks, or tape. Such additional storage is illustrated in FIGURE 6B by the non-volatile storage area 668.
[00139] Data/information generated or captured by the mobile computing device 600 and stored via the system 602 may be stored locally on the mobile computing device 600, as described above, or the data may be stored on any number of storage media that may be accessed by the device via the radio 672 or via a wired connection between the mobile computing device 600 and a separate computing device associated with the mobile computing device 600, for example, a server computer in a distributed computing network, such as the Internet. As should be appreciated such data/information may be accessed via the mobile computing device 600 via the radio 672 or via a distributed computing network. Similarly, such data/information may be readily transferred between computing devices for storage and use according to well-known data/information transfer and storage means, including electronic mail and collaborative data/information sharing systems.
[00140] FIGURE 7 illustrates one embodiment of a system architecture for implementing latency identification and remediation features. Data processing information may be stored in different communication channels or storage types. For example, various information may be stored/accessed using a directory service 722, a web portal 724, a mailbox service 726, an instant messaging store 728, and/or a social networking site 730. A server 720 may provide additional latency analysis and other features. As one example, the server 720 may provide rules that are used to distribute outbound email using a number of datacenter partitions over network 715, such as the Internet or other network(s) for example. By way of example, the client computing device may be implemented as a general computing device 702 and embodied in a personal computer, a tablet computing device 704, and/or a mobile computing device 706 (e.g., a smart phone). Any of these clients may use content from the store 716.
[00141] Embodiments, for example, are described above with reference to block diagrams and/or operational illustrations of methods, systems, computer program products, etc. The functions/acts noted in the blocks may occur out of the order as shown in any flowchart. For example, two blocks shown in succession may in fact be executed substantially concurrently or the blocks may sometimes be executed in the reverse order, depending upon the functionality/acts involved.
[00142] The description and illustration of one or more embodiments provided in this application are not intended to limit or restrict the scope of the invention as claimed in any way. The embodiments, examples, and details provided in this application are considered sufficient to convey possession and enable others to make and use the best mode of claimed invention. The claimed invention should not be construed as being limited to any embodiment, example, or detail provided in this application. Regardless of whether shown and described in combination or separately, the various features (both structural and methodological) are intended to be selectively included or omitted to produce an embodiment with a particular set of features. Having been provided with the description and illustration of the present application, one skilled in the art may envision variations, modifications, and alternate embodiments falling within the spirit of the broader aspects of the general inventive concept embodied in this application that do not depart from the broader scope of the claimed invention.
[00143] It should be appreciated that various embodiments can be implemented (1) as a sequence of computer implemented acts or program modules running on a computing system and/or (2) as interconnected machine logic circuits or circuit modules within the computing system. The implementation is a matter of choice dependent on the performance requirements of the computing system implementing the invention. Accordingly, logical operations including related algorithms can be referred to variously as operations, structural devices, acts or modules. It will be recognized by one skilled in the art that these operations, structural devices, acts and modules may be implemented in software, firmware, special purpose digital logic, and any combination thereof without deviating from the spirit and scope of the present invention as recited within the claims set forth herein. [00144] Although the invention has been described in connection with various exemplary embodiments, those of ordinary skill in the art will understand that many modifications can be made thereto within the scope of the claims that follow. Accordingly, it is not intended that the scope of the invention in any way be limited by the above description, but instead be determined entirely by reference to the claims that follow.

Claims

1. A system configured to :
receive user performance data from a plurality of clients as part of analyzing a state of an online service or application;
pre-aggregate the user performance data of the plurality of clients in part using a client Internet Protocol (IP) address and tenant information associated with the performance data to provide mapped data that includes mappings between client IP addresses and one or more of a location parameter, a service provider parameter, and tenant globally unique identifier (GUID) parameter; and
aggregate the mapped data in part to generate aggregated data to identify one or more of a tenant level issue, a location level issue, and an ISP level issue.
2. The system of claim 1, further configured to collect client data at each node to reduce processing time by limiting of a number of data points used with final aggregation operations.
3. The system of claim 1 , further configured to apply a number of rules to the aggregated data as part of performing failure zone analysis.
4. The system of claim 3, further configured to provide a report associated with mitigating or resolving a performance issue for one or more tenants.
5. The system of claim 1 , further configured to generate one or more mapping tables using one or more key-value pairs, wherein a first key-value pair comprises a key comprising an integer that represents a starting IP address and a value for the key is a country code parameter.
6. The system of claim 5, further configured to generate the one or more mapping tables using the one or more key-value pairs, wherein a second key-value pair comprises a key comprising an integer that represents a starting IP address and a value for the key is an autonomous system number (ASN) number associated with an ISP.
7. The system of claim 1, further configured to provide aggregation services by pulling client performance data globally and aggregating based on a set of common or customized metrics.
8. An article of manufacture configured with instructions that operate to provide aggregation features by:
receiving client data including navigation timing and load timing metrics; transforming the client data to mapped data using one or more mapping tables;
uploading the mapping tables and mapped data to one or more databases; and
aggregating the mapped data across the one or more databases to quantify one or more tenant level latencies, location level latencies, and ISP level latencies.
9. A method comprising:
collecting performance metrics for a plurality of clients, wherein the performance metrics are associated with a state of an online service or application;
pre-aggregating the performance metrics of the plurality of clients to provide transformed data in part by generating mappings associated with an IP address to country mapping and an IP address to ISP mapping; and
aggregating the transformed data to provide aggregated data and identify one or more of tenant level latencies, location level latencies, and ISP level latencies.
10. The method of claim 9, further comprising resolving a latency-related issue based on the failure zone analysis.
PCT/US2014/066480 2013-11-22 2014-11-20 Performance monitoring to provide real or near real time remediation feedback WO2015077385A2 (en)

Priority Applications (4)

Application Number Priority Date Filing Date Title
JP2016533584A JP2017500791A (en) 2013-11-22 2014-11-20 Performance monitoring that provides real-time or near real-time improvement feedback
RU2016119573A RU2016119573A (en) 2013-11-22 2014-11-20 PERFORMANCE MONITORING TO REALIZE REAL OR ALMOST REAL TIME CORRECTION
EP14810075.3A EP3072050A2 (en) 2013-11-22 2014-11-20 Performance monitoring to provide real or near real time remediation feedback
CN201480063665.5A CN105765907A (en) 2013-11-22 2014-11-20 Performance monitoring to provide real or near real time remediation feedback

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US14/087,413 US20150149609A1 (en) 2013-11-22 2013-11-22 Performance monitoring to provide real or near real time remediation feedback
US14/087,413 2013-11-22

Publications (2)

Publication Number Publication Date
WO2015077385A2 true WO2015077385A2 (en) 2015-05-28
WO2015077385A3 WO2015077385A3 (en) 2015-08-20

Family

ID=52021441

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2014/066480 WO2015077385A2 (en) 2013-11-22 2014-11-20 Performance monitoring to provide real or near real time remediation feedback

Country Status (6)

Country Link
US (2) US20150149609A1 (en)
EP (1) EP3072050A2 (en)
JP (1) JP2017500791A (en)
CN (1) CN105765907A (en)
RU (1) RU2016119573A (en)
WO (1) WO2015077385A2 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109906592A (en) * 2016-11-07 2019-06-18 华为技术有限公司 Monitor the system and method for chip property
EP3436951A4 (en) * 2016-03-29 2019-11-20 Anritsu Company Systems and methods for measuring effective customer impact of network problems in real-time using streaming analytics
EP4002800A3 (en) * 2020-11-17 2022-08-03 Citrix Systems Inc. Systems and methods for detection of degradation of a virtual desktop environment
US11467911B2 (en) 2020-11-17 2022-10-11 Citrix Systems, Inc. Systems and methods for detection of degradation of a virtual desktop environment
EP3864516B1 (en) * 2018-11-19 2022-12-21 Microsoft Technology Licensing, LLC Veto-based model for measuring product health

Families Citing this family (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9800567B2 (en) * 2014-03-31 2017-10-24 Sap Se Authentication of network nodes
US10003492B2 (en) * 2015-02-24 2018-06-19 CENX, Inc. Systems and methods for managing data related to network elements from multiple sources
US10567246B2 (en) 2015-12-15 2020-02-18 At&T Intellectual Property I, L.P. Processing performance data of a content delivery network
US10355872B2 (en) * 2016-02-05 2019-07-16 Prysm, Inc Techniques for a collaboration server network connection indicator
US20170346909A1 (en) * 2016-05-31 2017-11-30 Linkedin Corporation Client-side bottleneck analysis using real user monitoring data
CN106656666B (en) * 2016-12-13 2020-05-22 中国联合网络通信集团有限公司 Method and device for acquiring first screen time of webpage
US10666515B2 (en) * 2017-01-17 2020-05-26 International Business Machines Corporation Control of activities executed by endpoints based on conditions involving aggregated parameters
US10680933B2 (en) 2017-02-02 2020-06-09 Microsoft Technology Licensing, Llc Electronic mail system routing control
US10581954B2 (en) * 2017-03-29 2020-03-03 Palantir Technologies Inc. Metric collection and aggregation for distributed software services
US10482000B2 (en) 2017-04-24 2019-11-19 Microsoft Technology Licensing, Llc Machine learned decision guidance for alerts originating from monitoring systems
CN107122448A (en) * 2017-04-25 2017-09-01 广州市诚毅科技软件开发有限公司 A kind of intelligent display method and device of the estimated response time of front end page request
US10951462B1 (en) * 2017-04-27 2021-03-16 8X8, Inc. Fault isolation in data communications centers
US11645131B2 (en) * 2017-06-16 2023-05-09 Cisco Technology, Inc. Distributed fault code aggregation across application centric dimensions
US10476946B2 (en) * 2017-07-27 2019-11-12 Citrix Systems, Inc. Heuristics for selecting nearest zone based on ICA RTT and network latency
US10698756B1 (en) 2017-12-15 2020-06-30 Palantir Technologies Inc. Linking related events for various devices and services in computer log files on a centralized server
US10824497B2 (en) * 2018-08-29 2020-11-03 Oracle International Corporation Enhanced identification of computer performance anomalies based on computer performance logs
CN111475429B (en) * 2019-01-24 2023-08-29 爱思开海力士有限公司 memory access method
US11068333B2 (en) 2019-06-24 2021-07-20 Bank Of America Corporation Defect analysis and remediation tool
CN110493075B (en) * 2019-08-01 2021-06-25 京信通信系统(中国)有限公司 Method, device and system for monitoring online duration of equipment
US11558271B2 (en) * 2019-09-04 2023-01-17 Cisco Technology, Inc. System and method of comparing time periods before and after a network temporal event
US10924334B1 (en) * 2019-09-12 2021-02-16 Salesforce.Com, Inc. Monitoring distributed systems with auto-remediation
US11799741B2 (en) * 2019-10-29 2023-10-24 Fannie Mae Systems and methods for enterprise information technology (IT) monitoring
US11012326B1 (en) * 2019-12-17 2021-05-18 CloudFit Software, LLC Monitoring user experience using data blocks for secure data access
US10877867B1 (en) 2019-12-17 2020-12-29 CloudFit Software, LLC Monitoring user experience for cloud-based services
US11379442B2 (en) 2020-01-07 2022-07-05 Bank Of America Corporation Self-learning database issue remediation tool
ZA202100191B (en) 2020-01-20 2023-12-20 EXFO Solutions SAS Method and device for estimating a number of distinct subscribers of a telecommunication network impacted by network issues
JP7285798B2 (en) * 2020-03-09 2023-06-02 株式会社日立製作所 Performance analysis device, performance analysis method, and performance analysis program
US11546408B2 (en) 2020-11-02 2023-01-03 Microsoft Technology Licensing, Llc Client-side measurement of computer network conditions
US20220357968A1 (en) * 2021-05-07 2022-11-10 Citrix Systems, Inc. Heuristic Policy Recommendations in a Virtual Environment
US12086160B2 (en) 2021-09-23 2024-09-10 Oracle International Corporation Analyzing performance of resource systems that process requests for particular datasets
US12038816B2 (en) * 2021-09-24 2024-07-16 Salesforce, Inc. Determining insights related to performance bottlenecks in a multi-tenant database system preliminary class
US12088347B2 (en) * 2022-04-22 2024-09-10 Bank Of America Corporation Intelligent monitoring and repair of network services using log feeds provided over Li-Fi networks
US20230385173A1 (en) * 2022-05-27 2023-11-30 Microsoft Technology Licensing, Llc Real-time report generation

Family Cites Families (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6738933B2 (en) * 2001-05-09 2004-05-18 Mercury Interactive Corporation Root cause analysis of server system performance degradations
JP2007158623A (en) * 2005-12-02 2007-06-21 Matsushita Electric Ind Co Ltd Method of monitoring quality of video distribution service and terminal
CN101548268B (en) * 2006-10-05 2014-05-21 瓦拉泰克有限公司 Advanced contention detection
CN101192227B (en) * 2006-11-30 2011-05-25 阿里巴巴集团控股有限公司 Log file analytical method and system based on distributed type computing network
DE602006015827D1 (en) * 2006-12-08 2010-09-09 Ubs Ag Method and apparatus for detecting the IP address of a computer and related location information
JP5158189B2 (en) * 2008-03-07 2013-03-06 日本電気株式会社 Mail receiving device
US20090245114A1 (en) * 2008-04-01 2009-10-01 Jayanth Vijayaraghavan Methods for collecting and analyzing network performance data
KR101940815B1 (en) * 2009-01-28 2019-01-21 헤드워터 리서치 엘엘씨 Quality of service for device assisted services
US9203913B1 (en) * 2009-07-20 2015-12-01 Conviva Inc. Monitoring the performance of a content player
US9021362B2 (en) * 2010-07-19 2015-04-28 Soasta, Inc. Real-time analytics of web performance using actual user measurements
CN102291594B (en) * 2011-08-25 2015-05-20 中国电信股份有限公司上海信息网络部 IP network video quality detecting and evaluating system and method
US8452871B2 (en) * 2011-08-27 2013-05-28 At&T Intellectual Property I, L.P. Passive and comprehensive hierarchical anomaly detection system and method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
None

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3436951A4 (en) * 2016-03-29 2019-11-20 Anritsu Company Systems and methods for measuring effective customer impact of network problems in real-time using streaming analytics
US10686681B2 (en) 2016-03-29 2020-06-16 Anritsu Company Systems and methods for measuring effective customer impact of network problems in real-time using streaming analytics
CN109906592A (en) * 2016-11-07 2019-06-18 华为技术有限公司 Monitor the system and method for chip property
EP3526933A4 (en) * 2016-11-07 2019-08-21 Huawei Technologies Co., Ltd. System and methods for monitoring performance of slices
CN109906592B (en) * 2016-11-07 2020-10-09 华为技术有限公司 System and method for monitoring slicing performance
US10827366B2 (en) 2016-11-07 2020-11-03 Huawei Technologies Co., Ltd. System and methods for monitoring performance of slices
EP3864516B1 (en) * 2018-11-19 2022-12-21 Microsoft Technology Licensing, LLC Veto-based model for measuring product health
EP4002800A3 (en) * 2020-11-17 2022-08-03 Citrix Systems Inc. Systems and methods for detection of degradation of a virtual desktop environment
US11467911B2 (en) 2020-11-17 2022-10-11 Citrix Systems, Inc. Systems and methods for detection of degradation of a virtual desktop environment
US12001287B2 (en) 2020-11-17 2024-06-04 Citrix Systems, Inc. Systems and methods for detection of degradation of a virtual desktop environment

Also Published As

Publication number Publication date
US20150149609A1 (en) 2015-05-28
EP3072050A2 (en) 2016-09-28
JP2017500791A (en) 2017-01-05
US20180027088A1 (en) 2018-01-25
RU2016119573A (en) 2017-11-23
RU2016119573A3 (en) 2018-08-10
WO2015077385A3 (en) 2015-08-20
CN105765907A (en) 2016-07-13

Similar Documents

Publication Publication Date Title
US20180027088A1 (en) Performance monitoring to provide real or near real time remediation feedback
US11042474B2 (en) Scheduled tests for endpoint agents
US10644962B2 (en) Continuous monitoring for performance evaluation of service interfaces
US10951489B2 (en) SLA compliance determination with real user monitoring
US20160070691A1 (en) Method and system for auto-populating electronic forms
EP3864516B1 (en) Veto-based model for measuring product health
US20150067147A1 (en) Group server performance correction via actions to server subset
JP7105356B2 (en) Mapping Entity to Account
US20100198649A1 (en) Role tailored dashboards and scorecards in a portal solution that integrates retrieved metrics across an enterprise
US10275338B2 (en) Automated system for fixing and debugging software deployed to customers
US11522765B2 (en) Auto discovery of network proxies
US20160321906A1 (en) Alert management within a network based virtual collaborative space
CN114208125A (en) Network problem node identification using traceroute aggregation
US8819704B1 (en) Personalized availability characterization of online application services
US12047839B2 (en) Out of box user performance journey monitoring
WO2015148238A1 (en) End user performance analysis
US20140344418A1 (en) Dynamic configuration analysis
WO2021021267A1 (en) Scheduled tests for endpoint agents
US20160226719A1 (en) Network based virtual collaborative problem solving space
US10425452B2 (en) Identifying changes in multiple resources related to a problem
US20170223136A1 (en) Any Web Page Reporting and Capture
US8689058B2 (en) Centralized service outage communication
US11328369B2 (en) Network liquidity to engagement mapping
US10505894B2 (en) Active and passive method to perform IP to name resolution in organizational environments
US10838950B2 (en) Dynamic review cadence for intellectual capital

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14810075

Country of ref document: EP

Kind code of ref document: A2

DPE1 Request for preliminary examination filed after expiration of 19th month from priority date (pct application filed from 20040101)
REEP Request for entry into the european phase

Ref document number: 2014810075

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2014810075

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2016119573

Country of ref document: RU

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: 2016533584

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

REG Reference to national code

Ref country code: BR

Ref legal event code: B01A

Ref document number: 112016011022

Country of ref document: BR

ENP Entry into the national phase

Ref document number: 112016011022

Country of ref document: BR

Kind code of ref document: A2

Effective date: 20160516