WO2009127639A1 - Query processing visualization system and method of visualizing query processing - Google Patents

Query processing visualization system and method of visualizing query processing Download PDF

Info

Publication number
WO2009127639A1
WO2009127639A1 PCT/EP2009/054425 EP2009054425W WO2009127639A1 WO 2009127639 A1 WO2009127639 A1 WO 2009127639A1 EP 2009054425 W EP2009054425 W EP 2009054425W WO 2009127639 A1 WO2009127639 A1 WO 2009127639A1
Authority
WO
WIPO (PCT)
Prior art keywords
query
query processing
handler
logging
presenting
Prior art date
Application number
PCT/EP2009/054425
Other languages
French (fr)
Inventor
Soufiane Azizi
Vladimir Mordvinov
Original Assignee
International Business Machines Corporation
Ibm United Kingdom Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from CA002629922A external-priority patent/CA2629922A1/en
Priority claimed from US12/104,108 external-priority patent/US8122050B2/en
Application filed by International Business Machines Corporation, Ibm United Kingdom Limited filed Critical International Business Machines Corporation
Priority to CN2009801133489A priority Critical patent/CN102007476A/en
Priority to JP2011504438A priority patent/JP5349581B2/en
Publication of WO2009127639A1 publication Critical patent/WO2009127639A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/36Preventing errors by testing or debugging software
    • G06F11/3604Software analysis for verifying properties of programs
    • G06F11/3612Software analysis for verifying properties of programs by runtime analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3466Performance evaluation by tracing or monitoring
    • G06F11/3476Data logging

Definitions

  • the present invention relates to a query processing visualization system and method of visualizing query processing.
  • data stores for storing business data, such as financial data and operational data.
  • various data analyzing applications are proposed. Those data analyzing applications provide various views or reports of data to users.
  • the data analyzing applications have query engines that access the data stores to obtain desired data.
  • Some data analyzing applications have Online
  • OLAP Analytical Processing
  • United States patent application publication No. US 2006/0294076 Al published December 28, 2006 which is hereby incorporated by reference, proposes use of a query framework system that processes a query having a coordination planner and multiple query service providers.
  • the query framework system processes a query and generates a query execution plan, along which the query is executed using relevant query service providers.
  • issues arise during the query execution in order to debug the issues, a user needs to understand the query processing details. Therefore, it is desirable to provide a tool that provides visual representation of the query processing which aids users to attend to query processing issues.
  • the present invention uses a query processing visualization interface that can provide specifically formatted views of query processing.
  • a query processing visualization system comprising a visualization module for visualizing logging information of query processing events occurring during processing of a client query requesting data from one or more data sources.
  • the visualization module comprises a logging information analyzer, an event sequence formatter and a method call presentation handler.
  • the logging information analyzer is provided for analyzing the logging information to identify dependency of the query processing events.
  • the event sequence formatter is provided for formatting visual presentation components representing the logging information in a hierarchy to reflect the dependency of the query processing events.
  • the method call presentation handler is provided for presenting visual presentation components in a context of the hierarchy of the visual components as formatted by the event sequence formatter.
  • a method of visualizing query processing comprises the steps of analyzing logging information of query processing events occurring during processing of a client query requesting data from one or more data sources so as to identify dependency of the query processing events, formatting visual presentation components representing the logging information in a hierarchy to reflect the dependency of the query processing events, and presenting the visual presentation components in a context of the hierarchy of the visual components as formatted by the formatting step.
  • a computer readable medium containing computer executable instructions that can be read and executed by a computer for caring out a method of visualizing query processing.
  • the method comprises the steps of analyzing logging information of query processing events occurring during processing of a client query requesting data from one or more data sources so as to identify dependency of the query processing events, formatting visual presentation components representing the logging information in a hierarchy to reflect the dependency of the query processing events, and presenting the visual presentation components in a context of the hierarchy of the visual components as formatted by the formatting step.
  • the present invention provides a query processing visualization system comprising a visualization module for visualizing logging information of query processing events occurring during processing of a client query requesting data from one or more data sources, the visualization module comprising: a logging information analyzer for analyzing the logging information to identify dependency of the query processing events; an event sequence formatter for formatting visual presentation components representing the logging information in a hierarchy to reflect the dependency of the query processing events; and a method call presentation handler for presenting the visual presentation components in a context of the hierarchy of the visual components as formatted by the event sequence formatter.
  • the present invention provides a query processing visualization system wherein the logging information analyzer further comprises analysing logging information of method calls corresponding to main query processing events and identifying dependency of the method calls; the event sequence formatter further comprises formatting the visual presentation components in a hierarchy to reflect the dependency of the method calls; and the method call presentation handler further comprises presenting the visual presentation components that represents method calls in a context of the hierarchy of the visual presentation components.
  • the present invention provides a query processing visualization system wherein the logging information analyzer further comprises analysing the logging information to identify recursive dependency of the method calls; the event sequence formatter further comprises a recursive dependency handler for formatting the visual presentation components representing recursively called method calls as collapsible components; and the method call presentation handler further comprises presenting the collapsible components so that the methods at a given calling depth are visually presented.
  • the logging information analyzer further comprises analysing the logging information to identify recursive dependency of the method calls
  • the event sequence formatter further comprises a recursive dependency handler for formatting the visual presentation components representing recursively called method calls as collapsible components
  • the method call presentation handler further comprises presenting the collapsible components so that the methods at a given calling depth are visually presented.
  • the present invention provides a query processing visualization system wherein the event sequence formatter further comprises a transforming method call handler for opening a tree of macro operations representing a method that has transformed a structure of the tree.
  • the present invention provides a query processing visualization system wherein: the logging information analyzer further comprises extracting logging information including event process information of a method call; and the method call presentation handler further comprises a process information adder for adding the process information to a visual component representing the method call.
  • the present invention provides a query processing visualization system wherein the process information comprises an elapse time and/or memory consumption of the method call.
  • the present invention provides a query processing visualization system wherein the process information adder highlights the process information in the presentation.
  • the present invention provides a query processing visualization system wherein the method call presentation handler further comprises a component label handler for labelling a presentation component representing a method call by a name of the method, name of a container object and name of a service provider that provides a processing service of the method.
  • the method call presentation handler further comprises a component label handler for labelling a presentation component representing a method call by a name of the method, name of a container object and name of a service provider that provides a processing service of the method.
  • the present invention provides a query processing visualization system wherein the logging information analyzer further comprises extracting logging information including parameter information of a method call; and the method call presentation handler further comprises a parameter information adder for adding to a visual component representing the method call an access to the parameter information of the method call.
  • the present invention provides a query processing visualization system wherein the parameter information adder further comprises adding an input parameter access, an output parameter access, and/or a return parameter access.
  • the present invention provides a query processing visualization system wherein the parameter information handler further comprises a macro operation tree handler for presenting a tree of macro operations that represents a parameter containing a query framework query, the query framework query being executable by one or more query processing service providers of a query framework that processes the client query.
  • a macro operation tree handler for presenting a tree of macro operations that represents a parameter containing a query framework query, the query framework query being executable by one or more query processing service providers of a query framework that processes the client query.
  • the present invention provides a query processing visualization system wherein the macro operations include a query result definition, query and/or provider query.
  • the macro operations include a query result definition, query and/or provider query.
  • the present invention provides a query processing visualization system wherein the method call presentation handler comprises an execute option handler for allowing execution of the query framework query and for presenting a view containing query results of the execution.
  • the method call presentation handler comprises an execute option handler for allowing execution of the query framework query and for presenting a view containing query results of the execution.
  • the present invention provides a query processing visualization system wherein the macro operation tree handler comprises a macro operation details handler for exposing a structure of the macro operations that completes definition of the macro operations in a predetermined format.
  • the present invention provides a query processing visualization system wherein the logging information analyzer comprises extracting logging information of a query execution plan which is generated as a result of a query planning process, the query execution plan comprising macro operations that are executable by query processing service providers; the event sequence formatter further comprises formatting visual presentation components representing the executable macro operations to represent the query execution plan; and the method call presentation handler further comprises presenting the visual presentation components for validation of the execution plan.
  • the present invention provides a query processing visualization system further comprising a logging module comprising: an event capturer for capturing query processing events occurring during the processing of the client query; a method call identifier for identifying, as method calls, main query processing events among the captured query processing events; and a logging element handler for logging each method call as being represented by a logging element.
  • a logging module comprising: an event capturer for capturing query processing events occurring during the processing of the client query; a method call identifier for identifying, as method calls, main query processing events among the captured query processing events; and a logging element handler for logging each method call as being represented by a logging element.
  • the present invention provides a method of visualizing query processing, the method comprising the steps of: analyzing logging information of query processing events occurring during processing of a client query requesting data from one or more data sources so as to identify dependency of the query processing events; formatting visual presentation components representing the logging information in a hierarchy to reflect the dependency of the query processing events; and presenting the visual presentation components in a context of the hierarchy of the visual components as formatted by the formatting step.
  • the present invention provides a method wherein the analyzing step further comprises analysing logging information of method calls corresponding to main query processing events and identifying dependency of the method calls; the formatting step further comprising formatting the visual presentation components in a hierarchy to reflect the dependency of the method calls; and the presenting step further comprising presenting the visual presentation components that represents method calls in a context of the hierarchy of the visual components.
  • the present invention provides a method wherein the analyzing step further comprises identifying recursive dependency of the method calls; the formatting step further comprising formatting the visual presentation components representing recursively called method calls as collapsible components; and the presenting step further comprising presenting the collapsible components so that the methods at a given calling depth are visually presented.
  • the present invention provides a method wherein the presenting step further comprises opening a tree of macro operations representing a method that has transformed a structure of the tree.
  • the present invention provides a method wherein the analyzing step further comprises extracting logging information including event process information of a method call; and the presenting step further comprises adding the process information to a visual component representing the method call.
  • the present invention provides a method wherein the presenting step further comprises adding an elapse time and/or memory consumption of the method call.
  • the present invention provides a method wherein the formatting step further comprises labelling a presentation component representing a method call by a name of method, name of container object and name of a service provider that provides a processing service of the method.
  • the present invention provides a method wherein: the analyzing step further comprises extracting logging information comprising parameter information of a method call; and the presenting step further comprises adding to a visual component representing the method call an access to the parameter information of the method call.
  • the present invention provides a method wherein the parameter information adding step further comprises presenting a tree of macro operations that represents a parameter containing a query framework query, the query framework query being executable by one or more query processing service providers of a query framework that processes the client query.
  • the present invention provides a method wherein the macro operation tree presenting step further comprises provides an execute option for allowing execution of the query framework query and for presenting a view containing query results of the execution.
  • the present invention provides a method wherein the analyzing step further comprises obtaining logging information of a query execution plan which is generated as a result of a query planning process, the query execution plan comprising of macro operations that are executable by query processing service providers; the formatting step further comprises formatting visual presentation components representing the executable macro operations to represent the query execution plan; and the presenting step further comprising presenting the visual presentation components for validation of the execution plan.
  • the present invention provides a method further comprising the step of obtaining the logging information by: capturing query processing events occurring during the processing of the client query; identifying, as method calls, main query processing events among the captured query processing events; and logging each method call as being represented by a logging element.
  • the present invention provides a computer program product loadable into the internal memory of a digital computer, comprising software code portions for performing, when said product is run on a computer, to carry out the invention as described above.
  • Figure 1 is a block diagram showing a query processing visualization system in accordance with a preferred embodiment of the present invention
  • Figure 2 is a block diagram showing an example of a query framework system
  • Figure 3 is a diagram showing an example of a query tree
  • Figure 4 is a diagram showing an example of a query framework system
  • Figure 5 is a diagram showing an embodiment of the query process visualization system
  • Figure 6 is a diagram showing an embodiment of a logging module of the query processing visualization system
  • Figure 7 is an example of a query processing log
  • Figure 8 is a diagram showing an embodiment of a visualization module of the query processing visualization system
  • Figure 9 is an example of a high level representation of a formatted query processing log
  • Figure 10 is a diagram showing an embodiment of a method call presentation handler
  • Figure 11 is an example of a visualized method call
  • Figure 12 is an example of a XML representation of a parameter
  • Figure 13 is an example of a tree of macro operations
  • Figure 14 is an example of a view of query results of a query framework query
  • Figure 15 is an example showing opening of a macro operation tree
  • Figure 16 is an example of a tree pattern of macro operations
  • Figure 17 is another example of a tree pattern of macro operations
  • Figure 18 is another example of a tree pattern of macro operations
  • Figure 19 is an example of macro operation details
  • Figure 20 is another example of macro operation details
  • Figure 21 is another example of macro operation details
  • Figure 22 is an example of a view of a query execution plan
  • Figure 23 is a diagram showing an example of a flow of a query execution
  • Figure 24 is an example of a validation request
  • Figure 25 is an example of a translated validation request
  • Figure 26 is an example of a validation request command report
  • Figure 27 is an example of a validation request response
  • Figure 28 is an example of a query information string.
  • a query processing visualization system 100 in accordance with an embodiment of the application is described.
  • the query processing visualization system 100 is suitably used in an open system of loosely coupled query processing components.
  • the query processing visualization system 100 works with a query framework system 10 that is used in a report server 20.
  • the query framework system 10 is provided to receive user queries from a data analyzing system 30 and process the received user requests to retrieve requested data from one or more data sources 40, using a metadata model 22.
  • These systems may be suitably used in server-client environments.
  • the query analyzing system 30 is an application system that provides various views of data in the data sources 40 to allow users to analyze the data.
  • the query analyzing system 20 When a user requests a view of data, the query analyzing system 20 generates a user request.
  • the query analyzing system 20 may use a metadata model 22 that contains metadata of the data sources
  • the user request is in a query language that the query analyzing system 20 uses to issue the user request.
  • the query framework system 10 intercepts user requests generated by the data analyzing system 30. It processes and executes the user requests to retrieve desired data from the data sources 40.
  • the query framework system 10 has multiple query processing components 12.
  • Query processing components 12 include a set of query operation providers 15 and a coordination planner 16.
  • Query processing components 12 share a common interface
  • Query processing components 12 are pluggable.
  • Each query operation provider 15 is capable of performing a specific operation on queries, as further exemplified below.
  • three query operation providers 15 are shown for the purpose of illustration. There may be more or fewer query operation providers in the query framework system 10.
  • the query framework system 10 uses a query framework (QF) query 50 as exemplified in Figure 3.
  • QF query 50 plays the role of a query specification that the query operation providers 15 use to communicate to each other and to the (RQP) 72, OLAP planner query provider (OQP) 74, and vendor query (VQ) planner query provider 76.
  • Service providers 80 provide local query operations, and generate provider queries on top of query blocks associated with other components.
  • the query framework system 10 has a local tabular operation provider 82, local execution provider 84 and a multicube join provider 86.
  • Query transformation providers 90 are responsible for preprocessing of a QF query for the consumption of the transformed query by other query operation providers.
  • the query framework system 10 has a canonical query result definition provider 92, query refinement provider 94, query decomposition provider 96 and query optimization provider 98.
  • the query processing visualization system 100 provides visualization of the query processing.
  • the query processing includes the query planning and query execution.
  • the query processing visualization system 100 comprises a visualization module 140.
  • the query processing visualization system 100 works with a logging module 120 that is provided outside the query processing visualization system 100.
  • the logging module 120 may be part of the query processing visualization system 100.
  • the logging module 120 obtains logging information of query processing events occurring during processing of a client query requesting data.
  • the query framework system 10 has a logging functionality and a logging information storage storing logging information
  • the logging module 120 may obtain the logging information from the logging information storage of the query framework system 10.
  • Figure 6 shows an embodiment where the logging module 120 provides logging functionality.
  • the logging module 120 has an event capturer 122, a method call identifier 124 and a logging element handler 126.
  • the event capture 122 captures query processing events occurring in the query framework system 10.
  • the event capturer 122 has a parameter capturer 132 for capturing logging information of parameters of method calls.
  • the method call identifier 124 identifies main events captured as calls to interface methods.
  • the method call identifier 124 has a recursive method call identifier 134 for identifying recursive method calls.
  • the logging element handler 126 logs in Query Framework log files to reflect the events captured.
  • the logging element handler 126 has a child element handler 136 for handling logging in children elements.
  • the structure of the log is driven by the sequence events occurring in the query framework system 10.
  • the logging element handler 126 writes some information that should be logged into the log right away, so that logging does not require any information to be kept in memory that waits to be logged at the right time.
  • the logging element handler 126 writes some information, e.g., the content of method output parameters of a method, after the method is finished, even though the execution of the method call may lead to other events to occur in the query framework system 10 that should be logged.
  • the logging elements handler 126 logs information in the XML format.
  • the logging elements handler 126 logs information such that each Query Framework interface method call is represented by an XML element.
  • the parameter capturer 132 captures input and output parameters of a method, and the child element handler 136 logs the parameters in its child elements of the method.
  • the structure of the XML format represents the event dependency. Most calls to other Query Framework interface methods are recursive, i.e. they call other methods as part of their implementation.
  • the recursive method call identifier 134 identifies recursive method calls, and the child element handler 136 logs the recursive method calls as children of the method element.
  • Figure 7 shows an example of part of a query processing event log for a method call as logged by the logging module 120 or the query framework system 10. It is typical that even for a single call, the amount of captured log data is significant and overwhelming for a user to analyze as it is.
  • the visualization module 140 visualizes the information in the query processing event log. It functions as a tool that enables the top down analysis of the logging information, helps identifying the portion of the log file responsible for a given transformation, aggregates corresponding pieces of logging data (e.g., input and output parameters) together, reflects the sequence of the provider invocation during query planning and execution phases, and/or maximizes the value of the logging information.
  • Figure 8 shows an embodiment of the visualization module 140 which comprises a logging information analyzer 150, an event sequence formatter 160 and a method call presentation handler 180.
  • the logging information analyzer 150 analyzes the logging information of the query processing events. It identifies dependency of the query processing events, or method calls. The logging information analyzer 150 can extracts information of recursive dependency of the method calls, event process information, and/or parameter information of a method call.
  • the event sequence formatter 160 formats visual presentation components representing the logging information in a hierarchy to reflect the dependency of the method calls. It presents visual components that represent recursively called methods as collapsible components so that the methods at a given calling depth or methods belonging to a given method call sub- tree are visually presented.
  • the event sequence formatter 160 has a recursive dependency handler 164 for formatting the visual presentation components representing the method calls in a hierarchy that reflects the recursive dependency.
  • the event sequence formatter may also have a handler 166 of transforming method calls that provides ability to open macro operation trees for methods that have transformed the structure of these trees. In general case, the transforming method call handler 166 may compare the content of the input parameter of the prepare call with the output parameter.
  • the transforming method call handler 166 may use that as an indication that a transformation took place. In addition to that for optimization purposes, providers are expected to return an empty response if no transformation was applied. The transforming method call handler 166 may use those empty responses to detect indication for the visualization logic that no transformation took place. Once applied, opening of macro operation trees clearly demonstrates the logic of the query planning sequence with the sequence of QueryPrepare method calls where the visualization picks only those method calls that have actually changed the query structure and returned in the response parameter.
  • the method call presentation handler 180 presents components of a method call logging information in the context of the hierarchy of visual components as formatted by the event sequence formatter 160.
  • Figure 9 shows an example of a high level representation of the logging information in a hierarchy reflecting the dependency of method calls. This format is suitably used to address the needs of logging data analysis.
  • the high level representation shows only method calls.
  • the recursive dependency of the calls of a method is reflected in the hierarchy of the visual components corresponding to the method.
  • the visual presentation components representing recursively called methods can be collapsed or opened so that only methods at a desired calling depth are shown.
  • Each visual presentation component representing each method is visually identified by its name, the name of container object and the name of the provider.
  • Figure 10 shows an embodiment of the method call presentation handler 180 in which the method call presentation handler 180 has a process information adder 182 for adding to a visual component representing a method call process information of the method call.
  • the process information may be statistic information of processing of a method call, e.g., an elapse time and/or memory consumption of the method call.
  • the process information adder 182 may highlight the process information in the presentation, so that it is easy to focus on this information whenever performance of the query framework system 10 is being investigated.
  • the method call presentation handler 180 may also have a component label handler 184 for labeling a method call presentation component representing a method call by a name of method, name of container object, and/or name of a provider that provides a processing service of the method
  • the method call presentation handler 180 has a parameter information adder 190.
  • the parameter information adder 190 adds to a visual component representing a method call an access to parameter information of the method call.
  • the parameter information may include information of input parameters, output parameters and/or return parameters.
  • the visualization of a call method also acts as a staring point for further analysis of its input and output parameters.
  • the parameter information adder 182 may add an input parameter access, an output parameter access, and/or a return parameter access.
  • the parameter information adder 182 may show access to only non-empty parameter values. When one of these accesses is selected, the associated parameter value is presented.
  • Figure 11 shows an example of a presentation of a method call with an access to parameter information.
  • This visualization of a method call acts as a starting point for further analysis of its input and output parameters.
  • clicking on a parameter type opens the values passed to a given method call.
  • the parameters can have an XML representation.
  • the method call presentation handler 180 may also have a macro operation tree handler 192.
  • the macro operation tree handler 192 presents a tree of macro operations that represents a parameter containing a QF query.
  • a parameter value may be a QF query.
  • the query framework system 10 considers a query as a combination of macro operations.
  • macro operations There are only few types of the macro operations, including operations expressed with SQL query which is the responsibility of the relational query provider (RQP), operations expressed with multidimensional expression (MDX) query which is the responsibility of OLAP query provider (OQP), operations of a dynamic cube construction which can be characterized by the structure of a cube and is the responsibility of Cube Build Provider, and number of local processing operations such as reporter mode stitching operation, zero suppression operation, and/or master-detail operation.
  • a parameter of a method call may contain a query framework query, which is a combination of macro operations.
  • the method call presentation handler 180 can also show a parameter containing a QF query as a tree of macro operations.
  • a tree of macro operations may include nodes representing a query result definition (QRD), query and/or provider query.
  • Figure 13 shows an example of a presentation of such a tree of macro operations.
  • Nodes of the tree of macro operations include QRD and query in this example.
  • Nodes in a tree of macro operations may be presented differently, e.g., colored differently, according to the operation type.
  • the representation of macro operations exemplified in Figure 13 may be chosen to be default for parameters values containing QF query as it provides an efficient high level description of the request content.
  • This view can demonstrate the transformation of a tree of logical macro operations including QRD and queries into a tree of physical operations, i.e., provider queries.
  • the method call presentation handler 180 may have an execute option handler 194 that allows execution of the query framework query and for presenting a view containing query results of the execution.
  • Figure 13 also shows an example of an "execute" option that initiates execution of given QF query and switches to a view containing the query results once they are ready, as exemplified in Figure 14. Viewing query results allows to understand further the semantics of queries passed to a given component. Another important application of this functionality is ability to verify intermediate results of physical operations in the execution plan.
  • visual inspection of a log file with opened blocks of macro operation tree transformations also lets a user identify the main query transformation patterns that a given request contains.
  • the macro operation tree pattern shown in Figure 16 indicates that a dynamic cube is populated from multiple data streams.
  • the pattern shown in Figure 17 is an example of an indicator of optimization of the tabular query populating a dynamic cube.
  • Another example is the pattern shown in Figure 18, which indicates that the query has a reporter mode operation.
  • a query execution plan is the final result of a query planning process.
  • a QF query representing the query execution plan consists of only physical macro operations that can be executed by query operation providers.
  • the sequence of these operations in the macro operation tree determines the sequence of their invocation at the execution stage and the data flow from one operation to another.
  • Physical macro operations are expressed with provider queries. Every provider query is associated with a certain provider responsible for the operation execution.
  • the macro operation tree contains only providerQuery nodes.
  • the main operations in this tree are relational query provider (RQP) operation based on SQL and OLAP query provider (OQP) operation based primarily on multidimensional expression (MDX).
  • RQP relational query provider
  • OQP OLAP query provider
  • MDX multidimensional expression
  • the visualization system 100 assists the user to understand and analyze the macro operations without the need of understanding of the query framework system 10.
  • the macro operation tree handler 192 may also have a macro operation details handler 198 that exposes this structure in a format that can be understood by wide variety of potential users.
  • Figures 19-21 show examples of formats that the macro operation details handler 198 uses to represent details of RQP, OQP and CubeBuildProvider nodes.
  • the visualization system 100 presents RQP operation details by SQL views, e.g., Cognos SQL, native SQL and the query framework query views, as exemplified in Figure 19.
  • the visualization system 100 presents a CubeBuildProvider operation by the structure of a dynamic cube built for a given query, as exemplified in Figure 20.
  • the visualization system 100 presents OQP operation details by generated native MDX, as exemplified in Figure 21.
  • the macro operation details handler 198 may provide other views, e.g., the view representing the result of a given operations to be able to check the data flows from one operation to another.
  • the visualization system 100 provides a query execution plan view that can be also suitably used by report authors. Exposing an execution plan to report authors may provide a significant insight of the nature of operations involved in a query/report processing, their performance and amount of data consumed at any given point of a query execution. Report authors are typically unfamiliar with the notion of query framework providers which requires understanding of internals of the query framework system 10 or query engine architecture.
  • the visualization system 100 provides a view of the query execution plan that is represented through more widely known notions, e.g., SQL query specification, MDX query specification, and Cube creation operation.
  • the visualization system 100 marks macro operations using their main responsibility, or in other words gives a high level title or label for each macro operation that is intuitive for a report author.
  • Figure 22 shows an example of a view with such high level titles.
  • Figure 23 is a diagram showing an example of a flow for requesting a view of an execution plan by a report author.
  • the report author requests to view an execution plan in a data analyzing system 30.
  • the data analyzing system 30 sends a validation request 200 to a report server 20.
  • the report server 20 sends the translated XML API validation request 202 to the query framework system 10.
  • the query framework system 10 processes the request and generates an execution plan, and returns to the report server 20 an XML API response 204 including the execution plan.
  • the report server 20 translates it to a validation response 206 and returns it to the data analyzing system 30, which sends the execution plan 208 to the visualization module 140 of the query processing visualization system 100 for presenting it to the report author.
  • the data analyzing system 30 uses a validation request, expressed in the format of a SOAP request.
  • the report server 20 translates the validation request 200 into a QF XML API request or command 202, as shown in Figure 25, requesting messages at the "information" level of severity and also number of the request properties.
  • the set of request properties does not include the execution property of a request supported by QF XML API shown in Figure 26.
  • the query framework system 10 processes the validation request and generates a QF XML
  • the report server 20 received it and translates it to a SOAP validation response 206 to the validation request asking for query information contains the details under ⁇ querylnfo> element, as shown in Figure 27, where QUERY INFO is the string of encoded XML.
  • the execution plan is part of this query info XML, as shown in Figure 28.
  • the data analyzing system 30 receives the SOAP validation response 206 including the execution plan in XML in the original format of a tree of provider queries.
  • This execution plan XML 208 is sent to the visualization module 140 of the query processing visualization system 100 that formats the execution plan XML and presents the formatted execution plan as described above.
  • the visualization module 140 also provides query plan analysis functionality by supporting the dynamic properties of the execution plan visualization: folding and unfolding operation trees, opening the node property views and initiating the execution of the plan sub-trees.
  • the query processing visualization system of the present invention may be implemented by any hardware, software or a combination of hardware and software having the above described functions.
  • the software code either in its entirety or a part thereof, may be stored in a computer readable memory.
  • a computer data signal representing the software code which may be embedded in a carrier wave may be transmitted via a communication network.
  • Such a computer readable memory and a computer data signal are also within the scope of the present invention, as well as the hardware, software and the combination thereof.

Abstract

A query processing visualization system comprises a visualization module. The visualization module visualizes logging information of query processing events occurring during processing of a client query requesting data from one or more data sources by analyzing the logging information to identify dependency of the query processing events, formatting visual presentation components representing the logging information in a hierarchy to reflect the dependency of the query processing events, and presenting the visual presentation components in the context of the hierarchy of the visual components as formatted.

Description

QUERY PROCESSING VISUALIZATION SYSTEM AND METHOD OF VISUALIZING QUERY PROCESSING
FIELD OF INVENTION
The present invention relates to a query processing visualization system and method of visualizing query processing.
BACKGROUND OF THE INVENTION
Many organizations use data stores for storing business data, such as financial data and operational data. In order to assist business users to examine their data, various data analyzing applications are proposed. Those data analyzing applications provide various views or reports of data to users. The data analyzing applications have query engines that access the data stores to obtain desired data. Some data analyzing applications have Online
Analytical Processing (OLAP) engines to provide multidimensional views of data.
Those existing query engines and OLAP engines use components of the engines to obtain desired data, and do not allow for external components to be involved into the internal logic of query processing. Thus, these engines cannot reuse or share functionality with other components.
In order to facilitate reuse of the planning logic compiling all query operation provider actions in a single execution plan, United States patent application publication No. US 2006/0294076 Al published December 28, 2006, which is hereby incorporated by reference, proposes use of a query framework system that processes a query having a coordination planner and multiple query service providers. The query framework system processes a query and generates a query execution plan, along which the query is executed using relevant query service providers. When issues arise during the query execution, in order to debug the issues, a user needs to understand the query processing details. Therefore, it is desirable to provide a tool that provides visual representation of the query processing which aids users to attend to query processing issues.
SUMMARY OF THE INVENTION
The present invention uses a query processing visualization interface that can provide specifically formatted views of query processing.
It is an object of the invention to provide an improved query processing visualization system and method of visualizing query processing.
According to an aspect of the present invention there is provided a query processing visualization system comprising a visualization module for visualizing logging information of query processing events occurring during processing of a client query requesting data from one or more data sources. The visualization module comprises a logging information analyzer, an event sequence formatter and a method call presentation handler. The logging information analyzer is provided for analyzing the logging information to identify dependency of the query processing events. The event sequence formatter is provided for formatting visual presentation components representing the logging information in a hierarchy to reflect the dependency of the query processing events. The method call presentation handler is provided for presenting visual presentation components in a context of the hierarchy of the visual components as formatted by the event sequence formatter.
According to another aspect of the present invention there is provided a method of visualizing query processing. The method comprises the steps of analyzing logging information of query processing events occurring during processing of a client query requesting data from one or more data sources so as to identify dependency of the query processing events, formatting visual presentation components representing the logging information in a hierarchy to reflect the dependency of the query processing events, and presenting the visual presentation components in a context of the hierarchy of the visual components as formatted by the formatting step. According to another aspect of the present invention there is provided a computer readable medium containing computer executable instructions that can be read and executed by a computer for caring out a method of visualizing query processing. The method comprises the steps of analyzing logging information of query processing events occurring during processing of a client query requesting data from one or more data sources so as to identify dependency of the query processing events, formatting visual presentation components representing the logging information in a hierarchy to reflect the dependency of the query processing events, and presenting the visual presentation components in a context of the hierarchy of the visual components as formatted by the formatting step.
Viewed from a first aspect, the present invention provides a query processing visualization system comprising a visualization module for visualizing logging information of query processing events occurring during processing of a client query requesting data from one or more data sources, the visualization module comprising: a logging information analyzer for analyzing the logging information to identify dependency of the query processing events; an event sequence formatter for formatting visual presentation components representing the logging information in a hierarchy to reflect the dependency of the query processing events; and a method call presentation handler for presenting the visual presentation components in a context of the hierarchy of the visual components as formatted by the event sequence formatter.
Preferably, the present invention provides a query processing visualization system wherein the logging information analyzer further comprises analysing logging information of method calls corresponding to main query processing events and identifying dependency of the method calls; the event sequence formatter further comprises formatting the visual presentation components in a hierarchy to reflect the dependency of the method calls; and the method call presentation handler further comprises presenting the visual presentation components that represents method calls in a context of the hierarchy of the visual presentation components.
Preferably, the present invention provides a query processing visualization system wherein the logging information analyzer further comprises analysing the logging information to identify recursive dependency of the method calls; the event sequence formatter further comprises a recursive dependency handler for formatting the visual presentation components representing recursively called method calls as collapsible components; and the method call presentation handler further comprises presenting the collapsible components so that the methods at a given calling depth are visually presented.
Preferably, the present invention provides a query processing visualization system wherein the event sequence formatter further comprises a transforming method call handler for opening a tree of macro operations representing a method that has transformed a structure of the tree.
Preferably, the present invention provides a query processing visualization system wherein: the logging information analyzer further comprises extracting logging information including event process information of a method call; and the method call presentation handler further comprises a process information adder for adding the process information to a visual component representing the method call.
Preferably, the present invention provides a query processing visualization system wherein the process information comprises an elapse time and/or memory consumption of the method call.
Preferably, the present invention provides a query processing visualization system wherein the process information adder highlights the process information in the presentation.
Preferably, the present invention provides a query processing visualization system wherein the method call presentation handler further comprises a component label handler for labelling a presentation component representing a method call by a name of the method, name of a container object and name of a service provider that provides a processing service of the method.
Preferably, the present invention provides a query processing visualization system wherein the logging information analyzer further comprises extracting logging information including parameter information of a method call; and the method call presentation handler further comprises a parameter information adder for adding to a visual component representing the method call an access to the parameter information of the method call.
Preferably, the present invention provides a query processing visualization system wherein the parameter information adder further comprises adding an input parameter access, an output parameter access, and/or a return parameter access.
Preferably, the present invention provides a query processing visualization system wherein the parameter information handler further comprises a macro operation tree handler for presenting a tree of macro operations that represents a parameter containing a query framework query, the query framework query being executable by one or more query processing service providers of a query framework that processes the client query.
Preferably, the present invention provides a query processing visualization system wherein the macro operations include a query result definition, query and/or provider query.
Preferably, the present invention provides a query processing visualization system wherein the method call presentation handler comprises an execute option handler for allowing execution of the query framework query and for presenting a view containing query results of the execution.
Preferably, the present invention provides a query processing visualization system wherein the macro operation tree handler comprises a macro operation details handler for exposing a structure of the macro operations that completes definition of the macro operations in a predetermined format.
Preferably, the present invention provides a query processing visualization system wherein the logging information analyzer comprises extracting logging information of a query execution plan which is generated as a result of a query planning process, the query execution plan comprising macro operations that are executable by query processing service providers; the event sequence formatter further comprises formatting visual presentation components representing the executable macro operations to represent the query execution plan; and the method call presentation handler further comprises presenting the visual presentation components for validation of the execution plan.
Preferably, the present invention provides a query processing visualization system further comprising a logging module comprising: an event capturer for capturing query processing events occurring during the processing of the client query; a method call identifier for identifying, as method calls, main query processing events among the captured query processing events; and a logging element handler for logging each method call as being represented by a logging element.
Viewed from another aspect the present invention provides a method of visualizing query processing, the method comprising the steps of: analyzing logging information of query processing events occurring during processing of a client query requesting data from one or more data sources so as to identify dependency of the query processing events; formatting visual presentation components representing the logging information in a hierarchy to reflect the dependency of the query processing events; and presenting the visual presentation components in a context of the hierarchy of the visual components as formatted by the formatting step.
Preferably, the present invention provides a method wherein the analyzing step further comprises analysing logging information of method calls corresponding to main query processing events and identifying dependency of the method calls; the formatting step further comprising formatting the visual presentation components in a hierarchy to reflect the dependency of the method calls; and the presenting step further comprising presenting the visual presentation components that represents method calls in a context of the hierarchy of the visual components.
Preferably, the present invention provides a method wherein the analyzing step further comprises identifying recursive dependency of the method calls; the formatting step further comprising formatting the visual presentation components representing recursively called method calls as collapsible components; and the presenting step further comprising presenting the collapsible components so that the methods at a given calling depth are visually presented.
Preferably, the present invention provides a method wherein the presenting step further comprises opening a tree of macro operations representing a method that has transformed a structure of the tree.
Preferably, the present invention provides a method wherein the analyzing step further comprises extracting logging information including event process information of a method call; and the presenting step further comprises adding the process information to a visual component representing the method call.
Preferably, the present invention provides a method wherein the presenting step further comprises adding an elapse time and/or memory consumption of the method call.
Preferably, the present invention provides a method wherein the formatting step further comprises labelling a presentation component representing a method call by a name of method, name of container object and name of a service provider that provides a processing service of the method.
Preferably, the present invention provides a method wherein: the analyzing step further comprises extracting logging information comprising parameter information of a method call; and the presenting step further comprises adding to a visual component representing the method call an access to the parameter information of the method call.
Preferably, the present invention provides a method wherein the parameter information adding step further comprises presenting a tree of macro operations that represents a parameter containing a query framework query, the query framework query being executable by one or more query processing service providers of a query framework that processes the client query. Preferably, the present invention provides a method wherein the macro operation tree presenting step further comprises provides an execute option for allowing execution of the query framework query and for presenting a view containing query results of the execution.
Preferably, the present invention provides a method wherein the analyzing step further comprises obtaining logging information of a query execution plan which is generated as a result of a query planning process, the query execution plan comprising of macro operations that are executable by query processing service providers; the formatting step further comprises formatting visual presentation components representing the executable macro operations to represent the query execution plan; and the presenting step further comprising presenting the visual presentation components for validation of the execution plan.
Preferably, the present invention provides a method further comprising the step of obtaining the logging information by: capturing query processing events occurring during the processing of the client query; identifying, as method calls, main query processing events among the captured query processing events; and logging each method call as being represented by a logging element.
Viewed from another aspect, the present invention provides a computer program product loadable into the internal memory of a digital computer, comprising software code portions for performing, when said product is run on a computer, to carry out the invention as described above.
BRIEF DESCRIPTION OF THE DRAWINGS
Embodiments of the invention are described below in detail, by way of example only, with reference to the accompanying drawings in which:
Figure 1 is a block diagram showing a query processing visualization system in accordance with a preferred embodiment of the present invention; Figure 2 is a block diagram showing an example of a query framework system;
Figure 3 is a diagram showing an example of a query tree;
Figure 4 is a diagram showing an example of a query framework system; Figure 5 is a diagram showing an embodiment of the query process visualization system;
Figure 6 is a diagram showing an embodiment of a logging module of the query processing visualization system;
Figure 7 is an example of a query processing log; Figure 8 is a diagram showing an embodiment of a visualization module of the query processing visualization system;
Figure 9 is an example of a high level representation of a formatted query processing log;
Figure 10 is a diagram showing an embodiment of a method call presentation handler;
Figure 11 is an example of a visualized method call; Figure 12 is an example of a XML representation of a parameter;
Figure 13 is an example of a tree of macro operations;
Figure 14 is an example of a view of query results of a query framework query;
Figure 15 is an example showing opening of a macro operation tree;
Figure 16 is an example of a tree pattern of macro operations; Figure 17 is another example of a tree pattern of macro operations;
Figure 18 is another example of a tree pattern of macro operations;
Figure 19 is an example of macro operation details;
Figure 20 is another example of macro operation details;
Figure 21 is another example of macro operation details; Figure 22 is an example of a view of a query execution plan;
Figure 23 is a diagram showing an example of a flow of a query execution;
Figure 24 is an example of a validation request;
Figure 25 is an example of a translated validation request;
Figure 26 is an example of a validation request command report; Figure 27 is an example of a validation request response; and
Figure 28 is an example of a query information string.
DETAILED DESCRIPTION
Referring to Figure 1, a query processing visualization system 100 in accordance with an embodiment of the application is described. The query processing visualization system 100 is suitably used in an open system of loosely coupled query processing components. In the embodiment shown in Figure 1, the query processing visualization system 100 works with a query framework system 10 that is used in a report server 20. The query framework system 10 is provided to receive user queries from a data analyzing system 30 and process the received user requests to retrieve requested data from one or more data sources 40, using a metadata model 22. These systems may be suitably used in server-client environments.
The query analyzing system 30 is an application system that provides various views of data in the data sources 40 to allow users to analyze the data. When a user requests a view of data, the query analyzing system 20 generates a user request. To generate user requests, the query analyzing system 20 may use a metadata model 22 that contains metadata of the data sources
40. The user request is in a query language that the query analyzing system 20 uses to issue the user request.
The query framework system 10 intercepts user requests generated by the data analyzing system 30. It processes and executes the user requests to retrieve desired data from the data sources 40.
As shown in Figure 2, the query framework system 10 has multiple query processing components 12. Query processing components 12 include a set of query operation providers 15 and a coordination planner 16. Query processing components 12 share a common interface
14 and a common query language of the query framework system 10. Query processing components 12 are pluggable.
Each query operation provider 15 is capable of performing a specific operation on queries, as further exemplified below. In Figure 2, three query operation providers 15 are shown for the purpose of illustration. There may be more or fewer query operation providers in the query framework system 10.
The query framework system 10 uses a query framework (QF) query 50 as exemplified in Figure 3. A QF query 50 plays the role of a query specification that the query operation providers 15 use to communicate to each other and to the (RQP) 72, OLAP planner query provider (OQP) 74, and vendor query (VQ) planner query provider 76. Service providers 80 provide local query operations, and generate provider queries on top of query blocks associated with other components. In this example, the query framework system 10 has a local tabular operation provider 82, local execution provider 84 and a multicube join provider 86. Query transformation providers 90 are responsible for preprocessing of a QF query for the consumption of the transformed query by other query operation providers. In this example, the query framework system 10 has a canonical query result definition provider 92, query refinement provider 94, query decomposition provider 96 and query optimization provider 98.
The query processing visualization system 100 provides visualization of the query processing.
The query processing includes the query planning and query execution.
As shown in Figure 5, the query processing visualization system 100 comprises a visualization module 140. In this embodiment, the query processing visualization system 100 works with a logging module 120 that is provided outside the query processing visualization system 100. In a different embodiment, the logging module 120 may be part of the query processing visualization system 100.
The logging module 120 obtains logging information of query processing events occurring during processing of a client query requesting data. In an embodiment where the query framework system 10 has a logging functionality and a logging information storage storing logging information, the logging module 120 may obtain the logging information from the logging information storage of the query framework system 10.
Figure 6 shows an embodiment where the logging module 120 provides logging functionality.
In this embodiment, the logging module 120 has an event capturer 122, a method call identifier 124 and a logging element handler 126. The event capture 122 captures query processing events occurring in the query framework system 10. The event capturer 122 has a parameter capturer 132 for capturing logging information of parameters of method calls. The method call identifier 124 identifies main events captured as calls to interface methods. The method call identifier 124 has a recursive method call identifier 134 for identifying recursive method calls. The logging element handler 126 logs in Query Framework log files to reflect the events captured. The logging element handler 126 has a child element handler 136 for handling logging in children elements.
The structure of the log is driven by the sequence events occurring in the query framework system 10. The logging element handler 126 writes some information that should be logged into the log right away, so that logging does not require any information to be kept in memory that waits to be logged at the right time. On the other hand, the logging element handler 126 writes some information, e.g., the content of method output parameters of a method, after the method is finished, even though the execution of the method call may lead to other events to occur in the query framework system 10 that should be logged.
In an embodiment, the logging elements handler 126 logs information in the XML format. The logging elements handler 126 logs information such that each Query Framework interface method call is represented by an XML element. The parameter capturer 132 captures input and output parameters of a method, and the child element handler 136 logs the parameters in its child elements of the method. The structure of the XML format represents the event dependency. Most calls to other Query Framework interface methods are recursive, i.e. they call other methods as part of their implementation. The recursive method call identifier 134 identifies recursive method calls, and the child element handler 136 logs the recursive method calls as children of the method element.
Figure 7 shows an example of part of a query processing event log for a method call as logged by the logging module 120 or the query framework system 10. It is typical that even for a single call, the amount of captured log data is significant and overwhelming for a user to analyze as it is.
The visualization module 140 visualizes the information in the query processing event log. It functions as a tool that enables the top down analysis of the logging information, helps identifying the portion of the log file responsible for a given transformation, aggregates corresponding pieces of logging data (e.g., input and output parameters) together, reflects the sequence of the provider invocation during query planning and execution phases, and/or maximizes the value of the logging information. Figure 8 shows an embodiment of the visualization module 140 which comprises a logging information analyzer 150, an event sequence formatter 160 and a method call presentation handler 180.
The logging information analyzer 150 analyzes the logging information of the query processing events. It identifies dependency of the query processing events, or method calls. The logging information analyzer 150 can extracts information of recursive dependency of the method calls, event process information, and/or parameter information of a method call.
The event sequence formatter 160 formats visual presentation components representing the logging information in a hierarchy to reflect the dependency of the method calls. It presents visual components that represent recursively called methods as collapsible components so that the methods at a given calling depth or methods belonging to a given method call sub- tree are visually presented. The event sequence formatter 160 has a recursive dependency handler 164 for formatting the visual presentation components representing the method calls in a hierarchy that reflects the recursive dependency. The event sequence formatter may also have a handler 166 of transforming method calls that provides ability to open macro operation trees for methods that have transformed the structure of these trees. In general case, the transforming method call handler 166 may compare the content of the input parameter of the prepare call with the output parameter. If these parameters are different, the transforming method call handler 166 may use that as an indication that a transformation took place. In addition to that for optimization purposes, providers are expected to return an empty response if no transformation was applied. The transforming method call handler 166 may use those empty responses to detect indication for the visualization logic that no transformation took place. Once applied, opening of macro operation trees clearly demonstrates the logic of the query planning sequence with the sequence of QueryPrepare method calls where the visualization picks only those method calls that have actually changed the query structure and returned in the response parameter. The method call presentation handler 180 presents components of a method call logging information in the context of the hierarchy of visual components as formatted by the event sequence formatter 160.
Figure 9 shows an example of a high level representation of the logging information in a hierarchy reflecting the dependency of method calls. This format is suitably used to address the needs of logging data analysis. The high level representation shows only method calls. The recursive dependency of the calls of a method is reflected in the hierarchy of the visual components corresponding to the method. The visual presentation components representing recursively called methods can be collapsed or opened so that only methods at a desired calling depth are shown. Each visual presentation component representing each method is visually identified by its name, the name of container object and the name of the provider.
Figure 10 shows an embodiment of the method call presentation handler 180 in which the method call presentation handler 180 has a process information adder 182 for adding to a visual component representing a method call process information of the method call. The process information may be statistic information of processing of a method call, e.g., an elapse time and/or memory consumption of the method call. The process information adder 182 may highlight the process information in the presentation, so that it is easy to focus on this information whenever performance of the query framework system 10 is being investigated.
As shown in Figure 10, the method call presentation handler 180 may also have a component label handler 184 for labeling a method call presentation component representing a method call by a name of method, name of container object, and/or name of a provider that provides a processing service of the method
As shown in Figure 10, the method call presentation handler 180 has a parameter information adder 190. The parameter information adder 190 adds to a visual component representing a method call an access to parameter information of the method call. The parameter information may include information of input parameters, output parameters and/or return parameters. Thus, the visualization of a call method also acts as a staring point for further analysis of its input and output parameters. The parameter information adder 182 may add an input parameter access, an output parameter access, and/or a return parameter access. The parameter information adder 182 may show access to only non-empty parameter values. When one of these accesses is selected, the associated parameter value is presented.
Figure 11 shows an example of a presentation of a method call with an access to parameter information. This visualization of a method call acts as a starting point for further analysis of its input and output parameters. As shown in Figure 12, clicking on a parameter type opens the values passed to a given method call. The parameters can have an XML representation.
As shown in Figure 10, the method call presentation handler 180 may also have a macro operation tree handler 192. The macro operation tree handler 192 presents a tree of macro operations that represents a parameter containing a QF query.
A parameter value may be a QF query. As described above referring to Figure 3, the query framework system 10 considers a query as a combination of macro operations. There are only few types of the macro operations, including operations expressed with SQL query which is the responsibility of the relational query provider (RQP), operations expressed with multidimensional expression (MDX) query which is the responsibility of OLAP query provider (OQP), operations of a dynamic cube construction which can be characterized by the structure of a cube and is the responsibility of Cube Build Provider, and number of local processing operations such as reporter mode stitching operation, zero suppression operation, and/or master-detail operation. A parameter of a method call may contain a query framework query, which is a combination of macro operations.
Using the macro operation tree handler 192, the method call presentation handler 180 can also show a parameter containing a QF query as a tree of macro operations. A tree of macro operations may include nodes representing a query result definition (QRD), query and/or provider query. Figure 13 shows an example of a presentation of such a tree of macro operations. Nodes of the tree of macro operations include QRD and query in this example.
Nodes in a tree of macro operations may be presented differently, e.g., colored differently, according to the operation type. The representation of macro operations exemplified in Figure 13 may be chosen to be default for parameters values containing QF query as it provides an efficient high level description of the request content. This view can demonstrate the transformation of a tree of logical macro operations including QRD and queries into a tree of physical operations, i.e., provider queries.
As further shown in Figure 10, the method call presentation handler 180 may have an execute option handler 194 that allows execution of the query framework query and for presenting a view containing query results of the execution. [0042] Figure 13 also shows an example of an "execute" option that initiates execution of given QF query and switches to a view containing the query results once they are ready, as exemplified in Figure 14. Viewing query results allows to understand further the semantics of queries passed to a given component. Another important application of this functionality is ability to verify intermediate results of physical operations in the execution plan.
As exemplified in Figure 15, visual inspection of a log file with opened blocks of macro operation tree transformations also lets a user identify the main query transformation patterns that a given request contains. For example the macro operation tree pattern shown in Figure 16 indicates that a dynamic cube is populated from multiple data streams. The pattern shown in Figure 17 is an example of an indicator of optimization of the tabular query populating a dynamic cube. Another example is the pattern shown in Figure 18, which indicates that the query has a reporter mode operation.
Once a query is planned, a query execution plan is the final result of a query planning process.
At this point a QF query representing the query execution plan consists of only physical macro operations that can be executed by query operation providers. The sequence of these operations in the macro operation tree determines the sequence of their invocation at the execution stage and the data flow from one operation to another. Physical macro operations are expressed with provider queries. Every provider query is associated with a certain provider responsible for the operation execution.
Thus, once a query is planned, the macro operation tree contains only providerQuery nodes.
In an embodiment where the strategy of query planning is to push as much operations to underlying databases as possible, the main operations in this tree are relational query provider (RQP) operation based on SQL and OLAP query provider (OQP) operation based primarily on multidimensional expression (MDX). On its own these operations can be complex and contain a combination of operations that require better understanding. By presenting these operations as a macro operation tree, the visualization system 100 assists the user to understand and analyze the macro operations without the need of understanding of the query framework system 10.
In general, most of macro operations have some internal structure that actually completes their definition. The macro operation tree handler 192 may also have a macro operation details handler 198 that exposes this structure in a format that can be understood by wide variety of potential users. For example, Figures 19-21 show examples of formats that the macro operation details handler 198 uses to represent details of RQP, OQP and CubeBuildProvider nodes. The visualization system 100 presents RQP operation details by SQL views, e.g., Cognos SQL, native SQL and the query framework query views, as exemplified in Figure 19.
The visualization system 100 presents a CubeBuildProvider operation by the structure of a dynamic cube built for a given query, as exemplified in Figure 20. The visualization system 100 presents OQP operation details by generated native MDX, as exemplified in Figure 21. The macro operation details handler 198 may provide other views, e.g., the view representing the result of a given operations to be able to check the data flows from one operation to another.
The visualization system 100 provides a query execution plan view that can be also suitably used by report authors. Exposing an execution plan to report authors may provide a significant insight of the nature of operations involved in a query/report processing, their performance and amount of data consumed at any given point of a query execution. Report authors are typically unfamiliar with the notion of query framework providers which requires understanding of internals of the query framework system 10 or query engine architecture. The visualization system 100 provides a view of the query execution plan that is represented through more widely known notions, e.g., SQL query specification, MDX query specification, and Cube creation operation. Hence, rather than using the notion of provider query in the execution plan visualization, the visualization system 100 marks macro operations using their main responsibility, or in other words gives a high level title or label for each macro operation that is intuitive for a report author. Figure 22 shows an example of a view with such high level titles.
Figure 23 is a diagram showing an example of a flow for requesting a view of an execution plan by a report author. The report author requests to view an execution plan in a data analyzing system 30. The data analyzing system 30 sends a validation request 200 to a report server 20. The report server 20 sends the translated XML API validation request 202 to the query framework system 10. The query framework system 10 processes the request and generates an execution plan, and returns to the report server 20 an XML API response 204 including the execution plan. The report server 20 translates it to a validation response 206 and returns it to the data analyzing system 30, which sends the execution plan 208 to the visualization module 140 of the query processing visualization system 100 for presenting it to the report author.
For example, the data analyzing system 30 uses a validation request, expressed in the format of a SOAP request. To indicate the request for the execution plan property, the data analyzing system 30 issues a SOAP validation request or command 200 with requestProperty= ExecutionPlan. The report server 20 translates the validation request 200 into a QF XML API request or command 202, as shown in Figure 25, requesting messages at the "information" level of severity and also number of the request properties. The QF XML API request 202 includes <property name="QFExecutionPlan>. The set of request properties does not include the execution property of a request supported by QF XML API shown in Figure 26.
The query framework system 10 processes the validation request and generates a QF XML
API response 204 including <property name="QFExecutionPlan>. The report server 20 received it and translates it to a SOAP validation response 206 to the validation request asking for query information contains the details under <querylnfo> element, as shown in Figure 27, where QUERY INFO is the string of encoded XML. The execution plan is part of this query info XML, as shown in Figure 28. The data analyzing system 30 receives the SOAP validation response 206 including the execution plan in XML in the original format of a tree of provider queries. This execution plan XML 208 is sent to the visualization module 140 of the query processing visualization system 100 that formats the execution plan XML and presents the formatted execution plan as described above. The visualization module 140 also provides query plan analysis functionality by supporting the dynamic properties of the execution plan visualization: folding and unfolding operation trees, opening the node property views and initiating the execution of the plan sub-trees.
The query processing visualization system of the present invention may be implemented by any hardware, software or a combination of hardware and software having the above described functions. The software code, either in its entirety or a part thereof, may be stored in a computer readable memory. Further, a computer data signal representing the software code which may be embedded in a carrier wave may be transmitted via a communication network. Such a computer readable memory and a computer data signal are also within the scope of the present invention, as well as the hardware, software and the combination thereof.
The present invention has been described with regard to one or more embodiments. However, it will be apparent to persons skilled in the art that a number of variations and modifications can be made without departing from the scope of the invention as defined in the claims.

Claims

1. A query processing visualization system comprising: a visualization module for visualizing logging information of query processing events occurring during processing of a client query requesting data from one or more data sources, the visualization module comprising: a logging information analyzer for analyzing the logging information to identify dependency of the query processing events; an event sequence formatter for formatting visual presentation components representing the logging information in a hierarchy to reflect the dependency of the query processing events; and a method call presentation handler for presenting the visual presentation components in a context of the hierarchy of the visual components as formatted by the event sequence formatter.
2. The query processing visualization system as claimed in claim 1, wherein: the logging information analyzer further comprises analysing logging information of method calls corresponding to main query processing events and identifying dependency of the method calls; the event sequence formatter further comprises formatting the visual presentation components in a hierarchy to reflect the dependency of the method calls; and the method call presentation handler further comprises presenting the visual presentation components that represents method calls in a context of the hierarchy of the visual presentation components.
3. The query processing visualization system as claimed in claim 2, wherein: the logging information analyzer further comprises analysing the logging information to identify recursive dependency of the method calls; the event sequence formatter further comprises a recursive dependency handler for formatting the visual presentation components representing recursively called method calls as collapsible components; and the method call presentation handler further comprises presenting the collapsible components so that the methods at a given calling depth are visually presented.
4. The query processing visualization system as claimed in claim 2, wherein the event sequence formatter further comprises a transforming method call handler for opening a tree of macro operations representing a method that has transformed a structure of the tree.
5. The query processing visualization system as claimed in claim 2, wherein: the logging information analyzer further comprises extracting logging information including event process information of a method call; and the method call presentation handler further comprises a process information adder for adding the process information to a visual component representing the method call.
6. The query processing visualization system as claimed in claim 5, wherein the process information comprises an elapse time and/or memory consumption of the method call.
7. The query processing visualization system as claimed in claim 5, wherein the process information adder highlights the process information in the presentation.
8. The query processing visualization system as claimed in claim 2, wherein the method call presentation handler further comprises a component label handler for labelling a presentation component representing a method call by a name of the method, name of a container object and name of a service provider that provides a processing service of the method.
9. The query processing visualization system as claimed in claim 2, wherein: the logging information analyzer further comprises extracting logging information including parameter information of a method call; and the method call presentation handler further comprises a parameter information adder for adding to a visual component representing the method call an access to the parameter information of the method call.
10. The query processing visualization system as claimed in claim 9, wherein the parameter information adder further comprises adding an input parameter access, an output parameter access, and/or a return parameter access.
11. The query processing visualization system as claimed in claim 9, wherein the parameter information handler further comprises a macro operation tree handler for presenting a tree of macro operations that represents a parameter containing a query framework query, the query framework query being executable by one or more query processing service providers of a query framework that processes the client query.
12. The query processing visualization system as claimed in claim 11 , wherein the macro operations include a query result definition, query and/or provider query.
13. The query processing visualization system as claimed in claim 11, wherein the method call presentation handler comprises an execute option handler for allowing execution of the query framework query and for presenting a view containing query results of the execution.
14. The query processing visualization system as claimed in claim 11, wherein the macro operation tree handler comprises a macro operation details handler for exposing a structure of the macro operations that completes definition of the macro operations in a predetermined format.
15. The query processing visualization system as claimed in claim 1, wherein the logging information analyzer comprises extracting logging information of a query execution plan which is generated as a result of a query planning process, the query execution plan comprising macro operations that are executable by query processing service providers; the event sequence formatter further comprises formatting visual presentation components representing the executable macro operations to represent the query execution plan; and the method call presentation handler further comprises presenting the visual presentation components for validation of the execution plan.
16. The query processing visualization system as claimed in claim 1 further comprising a logging module that comprises: an event capturer for capturing query processing events occurring during the processing of the client query; a method call identifier for identifying, as method calls, main query processing events among the captured query processing events; and a logging element handler for logging each method call as being represented by a logging element.
17. A method of visualizing query processing, the method comprising the steps of: analyzing logging information of query processing events occurring during processing of a client query requesting data from one or more data sources so as to identify dependency of the query processing events; formatting visual presentation components representing the logging information in a hierarchy to reflect the dependency of the query processing events; and presenting the visual presentation components in a context of the hierarchy of the visual components as formatted by the formatting step.
18. The method as claimed in claim 17, wherein: the analyzing step further comprises analysing logging information of method calls corresponding to main query processing events and identifying dependency of the method calls; the formatting step further comprising formating the visual presentation components in a hierarchy to reflect the dependency of the method calls; and the presenting step further comprising presenting the visual presentation components that represents method calls in a context of the hierarchy of the visual components.
19. The method as claimed in claim 18, wherein: the analyzing step further comprises identifying recursive dependency of the method calls; the formatting step further comprising formatting the visual presentation components representing recursively called method calls as collapsible components; and the presenting step further comprising presenting the collapsible components so that the methods at a given calling depth are visually presented.
20. The method as claimed in claim 18, wherein the presenting step further comprises opening a tree of macro operations representing a method that has transformed a structure of the tree.
21. The method as claimed in claim 18, wherein: the analyzing step further comprises extracting logging information including event process information of a method call; and the presenting step further comprises adding the process information to a visual component representing the method call.
22. The method as claimed in claim 21, wherein the presenting step further comprises adding an elapse time and/or memory consumption of the method call.
23. The method as claimed in claim 18, wherein the formatting step further comprises labelling a presentation component representing a method call by a name of method, name of container object and name of a service provider that provides a processing service of the method.
24. The method as claimed in claim 18, wherein: the analyzing step further comprises extracting logging information comprising parameter information of a method call; and the presenting step further comprises adding to a visual component representing the method call an access to the parameter information of the method call.
25. The method as claimed in claim 24, wherein the parameter information adding step further comprises presenting a tree of macro operations that represents a parameter containing a query framework query, the query framework query being executable by one or more query processing service providers of a query framework that processes the client query.
26. The method as claimed in claim 25, wherein the macro operation tree presenting step further comprises provides an execute option for allowing execution of the query framework query and for presenting a view containing query results of the execution.
27. The method as claimed in claim 17, wherein: the analyzing step further comprises obtaining logging information of a query execution plan which is generated as a result of a query planning process, the query execution plan comprising macro operations that are executable by query processing service providers; the formatting step further comprises formatting visual presentation components representing the executable macro operations to represent the query execution plan; and the presenting step further comprising presenting the visual presentation components for validation of the execution plan.
28. The method as claimed in claim 17 further comprising the step of obtaining the logging information by: capturing query processing events occurring during the processing of the client query; identifying, as method calls, main query processing events among the captured query processing events; and logging each method call as being represented by a logging element.
29. A computer program product loadable into the internal memory of a digital computer, comprising software code portions for performing, when said product is run on a computer, to carry out the invention as claimed in claims 17 to 28.
PCT/EP2009/054425 2008-04-16 2009-04-15 Query processing visualization system and method of visualizing query processing WO2009127639A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN2009801133489A CN102007476A (en) 2008-04-16 2009-04-15 Query processing visualization system and method of visualizing query processing
JP2011504438A JP5349581B2 (en) 2008-04-16 2009-04-15 Query processing visualizing system, method for visualizing query processing, and computer program

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
CA002629922A CA2629922A1 (en) 2008-04-16 2008-04-16 Query processing visualization system and method of visualizing query processing
US12/104,108 US8122050B2 (en) 2008-04-16 2008-04-16 Query processing visualization system and method of visualizing query processing
US12/104,108 2008-04-16
CA2629922 2008-04-16

Publications (1)

Publication Number Publication Date
WO2009127639A1 true WO2009127639A1 (en) 2009-10-22

Family

ID=40786528

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2009/054425 WO2009127639A1 (en) 2008-04-16 2009-04-15 Query processing visualization system and method of visualizing query processing

Country Status (4)

Country Link
JP (1) JP5349581B2 (en)
KR (1) KR20100135789A (en)
CN (1) CN102007476A (en)
WO (1) WO2009127639A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2013513143A (en) * 2009-12-07 2013-04-18 インターナショナル・ビジネス・マシーンズ・コーポレーション Method, system, and computer program for automatic generation of query lineage
WO2020165642A1 (en) * 2019-02-12 2020-08-20 Shivashankar Vidya Devanahalli System and method for visualization of system generated logs
CN114201602A (en) * 2022-02-15 2022-03-18 支付宝(杭州)信息技术有限公司 Method and apparatus for generating natural language statements for visual data analysis

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9665620B2 (en) 2010-01-15 2017-05-30 Ab Initio Technology Llc Managing data queries
US9116955B2 (en) * 2011-05-02 2015-08-25 Ab Initio Technology Llc Managing data queries
CN102789488B (en) * 2012-06-29 2016-05-04 用友网络科技股份有限公司 Data query treatment system and data query processing method
US10417281B2 (en) 2015-02-18 2019-09-17 Ab Initio Technology Llc Querying a data source on a network
CN107729365B (en) * 2017-09-07 2020-02-07 中国科学院自动化研究所 Materialized view selection and optimization method and device
US11093223B2 (en) 2019-07-18 2021-08-17 Ab Initio Technology Llc Automatically converting a program written in a procedural programming language into a dataflow graph and related systems and methods
CN111177249B (en) * 2019-12-10 2022-05-17 浙江大学 Multi-data-source data visualization method and device based on federal learning thought
KR102640115B1 (en) * 2023-05-19 2024-02-23 쿠팡 주식회사 Operating method for electronic apparatus for providing information and electronic apparatus supporting thereof

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH03260742A (en) * 1990-03-12 1991-11-20 Hitachi Ltd Program trace display method
CA2019327C (en) * 1990-06-19 2000-02-29 Peter K.L. Shum User inquiry facility for data processing systems
US6096095A (en) * 1998-06-04 2000-08-01 Microsoft Corporation Producing persistent representations of complex data structures
US7136868B2 (en) * 2003-06-06 2006-11-14 Microsoft Corporation Database object script generation method and system
US7231375B2 (en) * 2003-10-10 2007-06-12 Microsoft Corporation Computer aided query to task mapping
US7219093B2 (en) * 2003-12-02 2007-05-15 Proclarity Corporation System and method for generating custom hierarchies in an analytical data structure
US20060085401A1 (en) * 2004-10-20 2006-04-20 Microsoft Corporation Analyzing operational and other data from search system or the like
CA2518902C (en) * 2005-06-23 2010-11-30 Cognos Incorporated System and method for query planning and execution

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
DE PAUW W ET AL: "Execution patterns in object-oriented visualization", PROCEEDINGS OF THE FOURTH USENIX CONFERENCE ON OBJECT-ORIENTED TECHNOLOGIES AND SYSTEMS. (COOTS) USENIX ASSOC BERKELEY, CA, USA, 1998, pages 219 - 234, XP002534202, ISBN: 1-880446-93-6, Retrieved from the Internet <URL:http://sel.ist.osaka-u.ac.jp/~mnktsts/study/papers/mnktsts/PauwCOOTS1998.pdf> [retrieved on 20090625] *
G. KAPFHAMMER: "A COMPREHENSIVE FRAMEWORK FOR TESTING DATABASE-CENTRIC SOFTWARE APPLICATIONS", PHD THESIS, UNIVERSITY OF PITTSBURG, April 2007 (2007-04-01), pages 1 - 253, XP002534200 *
MIKE BRAYSHAW: "An Architecture for Visualizing the Execution of Parallel Logic Programs", PROC. OF THE TWELFTH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI 91), vol. 2, 24 August 1991 (1991-08-24) - 30 August 1991 (1991-08-30), pages 870 - 876, XP002534201, Retrieved from the Internet <URL:http://ijcai.org/Past%20Proceedings/IJCAI-91-VOL2/PDF/040.pdf> [retrieved on 20090625] *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2013513143A (en) * 2009-12-07 2013-04-18 インターナショナル・ビジネス・マシーンズ・コーポレーション Method, system, and computer program for automatic generation of query lineage
WO2020165642A1 (en) * 2019-02-12 2020-08-20 Shivashankar Vidya Devanahalli System and method for visualization of system generated logs
CN114201602A (en) * 2022-02-15 2022-03-18 支付宝(杭州)信息技术有限公司 Method and apparatus for generating natural language statements for visual data analysis
CN114201602B (en) * 2022-02-15 2022-05-06 支付宝(杭州)信息技术有限公司 Method and apparatus for generating natural language statements for visual data analysis

Also Published As

Publication number Publication date
JP2011517823A (en) 2011-06-16
KR20100135789A (en) 2010-12-27
CN102007476A (en) 2011-04-06
JP5349581B2 (en) 2013-11-20

Similar Documents

Publication Publication Date Title
US8122050B2 (en) Query processing visualization system and method of visualizing query processing
WO2009127639A1 (en) Query processing visualization system and method of visualizing query processing
US8712965B2 (en) Dynamic report mapping apparatus to physical data source when creating report definitions for information technology service management reporting for peruse of report definition transparency and reuse
KR101665152B1 (en) Extending collaboration capabilities to external data
CA2249386C (en) Computer system and computer implemented process for representing software system descriptions and for generating executable computer programs and computer system configurations from software system descriptions
US9495429B2 (en) Automatic synthesis and presentation of OLAP cubes from semantically enriched data sources
US20050234894A1 (en) Techniques for maintaining collections of generated web forms that are hyperlinked by subject
US7315853B2 (en) Service-oriented architecture for accessing reports in legacy systems
US8140545B2 (en) Data organization and evaluation using a two-topology configuration
US7676589B2 (en) Automatic generation of portlets for visualizing data by exploiting object relationships
US9251222B2 (en) Abstracted dynamic report definition generation for use within information technology infrastructure
Monroe et al. Style-based reuse for software architectures
US7596550B2 (en) System and method for query planning and execution
US20030105745A1 (en) Text-file based relational database
Frischmuth et al. Ontowiki–an authoring, publication and visualization interface for the data web
US20140181154A1 (en) Generating information models in an in-memory database system
US8171451B2 (en) Providing reports as web services
US10915585B2 (en) Data store interface that facilitates distribution of application functionality across a multi-tier client-server architecture
US20070268822A1 (en) Conformance control module
US7818328B2 (en) API for obtaining unambiguous representation of objects in a relational database
US20120158416A1 (en) Web-service based generation of business objects
Klerkx et al. How to share and reuse learning resources: the ARIADNE experience
JP3914081B2 (en) Access authority setting method and structured document management system
WO2006126990A1 (en) Pattern query language
CA2629922A1 (en) Query processing visualization system and method of visualizing query processing

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200980113348.9

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 09733228

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2011504438

Country of ref document: JP

ENP Entry into the national phase

Ref document number: 20107022581

Country of ref document: KR

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 09733228

Country of ref document: EP

Kind code of ref document: A1