US20220188084A1 - Call Graph Enhancement Using Stitching Algorithm - Google Patents

Call Graph Enhancement Using Stitching Algorithm Download PDF

Info

Publication number
US20220188084A1
US20220188084A1 US17/121,496 US202017121496A US2022188084A1 US 20220188084 A1 US20220188084 A1 US 20220188084A1 US 202017121496 A US202017121496 A US 202017121496A US 2022188084 A1 US2022188084 A1 US 2022188084A1
Authority
US
United States
Prior art keywords
nodes
call graph
graph
called
callee
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
US17/121,496
Other versions
US11379198B1 (en
Inventor
Amitabh Goswami
Amrit Shankar Dutta Dutta
Abhishek Hondad
Alok Kumar
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SAP SE
Original Assignee
SAP SE
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SAP SE filed Critical SAP SE
Priority to US17/121,496 priority Critical patent/US11379198B1/en
Assigned to SAP SE reassignment SAP SE ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: DUTTA, AMRIT SHANKAR, GOSWAMI, AMITABH, KUMAR, ALOK, HONDAD, ABHISHEK
Publication of US20220188084A1 publication Critical patent/US20220188084A1/en
Application granted granted Critical
Publication of US11379198B1 publication Critical patent/US11379198B1/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/40Transformation of program code
    • G06F8/41Compilation
    • G06F8/43Checking; Contextual analysis
    • G06F8/433Dependency analysis; Data or control flow analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/901Indexing; Data structures therefor; Storage structures
    • G06F16/9024Graphs; Linked lists
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/30Creation or generation of source code
    • G06F8/36Software reuse
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/40Transformation of program code
    • G06F8/41Compilation
    • G06F8/42Syntactic analysis
    • G06F8/427Parsing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F8/00Arrangements for software engineering
    • G06F8/40Transformation of program code
    • G06F8/41Compilation
    • G06F8/44Encoding
    • G06F8/443Optimisation

Definitions

  • the subject matter described herein is directed to the enhancement of call graphs using stitching algorithms which can identify software application structure which can, in turn, be used as part of a software development process that efficiently address changes to a code base after it has been modified.
  • test cases which are applied to the code base in the code repository. These test cases can, in some cases, require significant computing resources and, additionally take, a large amount of time to execute. Execution of unnecessary test cases further delays deployment of the code base and diminishes developer productivity.
  • a code base is parsed to identify methods encapsulated therein. Thereafter, a call graph is generated based on the parsing using a graph generation technique.
  • the call graph is a directed call graph comprising a plurality of nodes characterizing the identified methods. It can then be determined, based on one or more design patterns used to generate the code base, that at least a portion of the nodes generated call graph are disconnected nodes. At least two of the disconnected nodes are then connected using a stitching algorithm to result in a modified call graph.
  • Data characterizing the modified call graph can then be provided (e.g., displayed in a graphical user interface, stored in a database, loaded into memory, transmitted to a remote computing device, etc.).
  • Edges between nodes of the directed call graph can be unidirectional edges connecting method nodes that correspond to invocations by a calling method to a callee method in the code base.
  • the connecting can use a second graph generation technique that differs from the graph generation technique.
  • At least a portion of the design patterns are non-standard and are not recognized by the graph generation technique.
  • a set of called method nodes in the call graph to be called in runtime that depend on such design pattern can be found.
  • a set of callee method nodes that are hierarchically associated with the set of called methods that depend on such design pattern can be found.
  • Directed edges can be added to the set of callee method nodes from the set of called method nodes if both sets contain at least one node.
  • the operations can advance to a next design pattern if edge/edges are added between callee-called nodes. If the set of called method nodes and the set of callee method nodes are both empty, the operations can advance to a next design pattern.
  • the utilized design patterns can dictate when nodes are deemed to be disconnected. For example, certain nodes can be identified as being disconnected based on a first design pattern while such certain nodes are not identified as being disconnected based on a second design pattern.
  • Non-transitory computer program products i.e., physically embodied computer program products
  • store instructions which when executed by one or more data processors of one or more computing systems, cause at least one data processor to perform operations herein.
  • computer systems are also described that may include one or more data processors and memory coupled to the one or more data processors.
  • the memory may temporarily or permanently store instructions that cause at least one processor to perform one or more of the operations described herein.
  • methods can be implemented by one or more data processors either within a single computing system or distributed among two or more computing systems.
  • Such computing systems can be connected and can exchange data and/or commands or other instructions or the like via one or more connections, including but not limited to a connection over a network (e.g., the Internet, a wireless wide area network, a local area network, a wide area network, a wired network, or the like), via a direct connection between one or more of the multiple computing systems, etc.
  • a network e.g., the Internet, a wireless wide area network, a local area network, a wide area network, a wired network, or the like
  • a direct connection between one or more of the multiple computing systems etc.
  • the current subject matter is advantageous in that it provides enhanced techniques for enhancing call graphs which would otherwise have disconnected nodes.
  • FIG. 1 is a diagram illustrating an architecture for identifying test cases implicated by changes in a code base since a last commit;
  • FIG. 2 is a diagram illustrating an example methods call graph
  • FIG. 3 is a diagram illustrating a methods call graph to which a stitching algorithm generates an edge between two disconnected nodes
  • FIG. 4 is a diagram illustrating an expanded method call graph with method nodes and test case nodes
  • FIG. 5 is a process flow diagram illustrating the generation of a call graph using a stitching algorithm.
  • FIG. 6 is a diagram illustrating a computing device for implementing aspects of the current subject matter.
  • the current subject matter is directed to enhanced techniques for enhancing a call graph which would otherwise have disconnected nodes that do not adequately represent the structure and relationships of a code base being modeled.
  • the current subject matter can be used, for example, in connection with a platform providing predictive test case coverage such as provided in U.S. Pat. Ser. No. ______ (attorney docket number 530198-999446/200517US01) filed concurrently herewith and entitled “Predictive Test Coverage”, the contents of which are hereby fully incorporated by reference.
  • the call graphs can be used for a variety of aspects of a software development process including testing a code base by identifying a minimal amount of test cases that need to be executed in response to an update or other change to the code base (especially as applied to versioning updates to a continually evolving code base).
  • Such techniques leverage a directed graph that defines the interrelationships between methods, and additionally, specifies coverage of each test case.
  • a stitching algorithm can be used to selectively add edges to the directed call graph.
  • test case is a set of actions to be executed to verify a particular feature or functionality of a software application.
  • a test case can encapsulate a specification of the inputs, execution conditions, testing procedure, and expected results that define a single test to be executed to achieve a particular software testing objective in relation to a code base being developed.
  • a test case typically comprises an identification, a summary/description (which can be searched), steps forming part of the test, and other related metadata/contextual data associated with the test case.
  • Each test case can, in some variations, comprise one or more test scripts comprising code required to execution the actions specified by the test case.
  • FIG. 1 is a diagram 100 illustrating a sample architecture for implementing aspects of the current subject matter.
  • the architecture can include components such as a code change analyzer 110 , an anomaly handler 120 , a profiler 130 . These components 110 , 120 , 130 can be software modules/code executed by one or more computing devices (either local or distributed).
  • the components 110 , 120 , 130 can also be in communication with a code repository 140 which can take various forms including, a dedicated database or a software-as-a-service cloud-based code development management platform (e.g., GITHUB).
  • the profiler 130 can generate and save relevant data from source code, byte code and test coverage reports in the filesystem.
  • the code change analyzer 110 provides the test case recommendation depending on the code base changes and data collected by the profiler 130 .
  • the architecture can include a graph database 150 which can store the directed call graphs as provided herein.
  • the anomaly handler 120 can be used to handle less frequent situations such as a code change implicating more than 30% of the test cases or a code change having a number of files above a pre-defined threshold (e.g., five or more files). In such cases, a user can manually specify which test cases to execute or other exceptional handling can be performed with regard to the corresponding code changes. In other words, the anomaly handler 120 can handle deviations or takes action depending on a certain rule being triggered to fine tune the overall system.
  • the anomaly handler 120 can execute a stitching algorithm as described below.
  • the anomaly handler 120 can additionally run a complete test suite (i) in cases in which the number of implicated test cases is above a pre-defined threshold (e.g., 20%, etc.); (ii) against code changes of specific classes or call graphs; (iii) to a change-set crossing a predefined threshold in terms of number of methods or lines changed; or (iv) to non JAVA-based changes such as ‘xml’/build files.
  • the anomaly handler 120 can also log certain actions/results and generate alerts/messages that indicate, for example, that no test cases were found in response to a particular change set (i.e., a change to the code base).
  • the anomaly handler 120 can also identify impacted tests based on class rather than based on method when only the import statement is added/modified and/or when a class is moved. Further, the anomaly handler 120 can provide that a predefined set of critical tests are executed when there are white space changes and the like.
  • the profiler 130 can maintain or access information about the code base in the code repository 140 including changes (sometimes referred to as deltas) which occur in between versions or other user-defined commits.
  • the profiler 130 can also generate a call graph (as described in further detail below) which can be a bi-directional nth level call graph. Using the call graph, the profiler 130 can also generate the coverage per test case. Coverage, in this regard, refers to which methods forming part of the code base are affected by a particular test case.
  • the code change analyzer 110 can parse a commit provided by a revision control tool using a code parser 112 in connection with a code analyzer 114 to identify deltas of the code base in the code repository 140 .
  • the code change analyzer 110 provides the file(s) that are modified along with the method(s). This set of files and its method are passed to the code analyzer 114 module for further analysis.
  • the code analyzer 114 can return the changes in terms of classes, methods or lines.
  • the delta changes can be characterized in terms of changes to methods.
  • different granularities of changes can be characterized such as on a line-by-line basis. In such cases, the nodes of a directed graph would represent lines of code.
  • the code forming the code base can take various forms and the code change analyzer 110 can use varying parsers as may be required by the particular code type.
  • an abstract syntax tree representation (“AST”) of the code base at the time of the commit and an AST of the code base after the changes can be compared to identify changes to methods, lines, and the like.
  • AST abstract syntax tree representation
  • a JAVA parser can be used to identify or otherwise characterize changes to the code base.
  • a source code parser parses the source code to create some form of internal representation.
  • the source code parser takes source code as input data and builds a data structure in the form of a parse tree, AST(Abstract Syntax Tree) or other hierarchical structure, giving a structural representation of the input source code.
  • AST provides detailed information about all attributes/elements of JAVA source code.
  • This representation can provide the package name of source code, import statements, type of source code (class/interface), internal attributes of class/interface (e.g., methods, constructors or field declaration), as well as start line and end line for each attribute. All of this information can be used to precisely identify source code modification for any change set.
  • JGIT library can alternatively or additionally be used to analyze source code as well as code repository commits (e.g., GIT Commits when the code repository 140 is GITHUB or a compatible repository) as opposed to the AST representation of the changes to the code.
  • the code change analyzer 110 can be used to provide byte code analysis metrics by analyzing compiled artifacts (e.g., JAR files, etc.) in order to generate a call graph so that, for given delta changes, the minimal number of required test cases can be determined (and then later executed).
  • the code change analyzer 110 which can be a static code analyzer, can identify changed methods in the code based which can be transformed as nodes in a disconnected graph. After the code analyzer 110 establishes relations between methods in the code base, the whole codebase can be represented as a graph with many disconnected components.
  • the nodes in the graph can each represent a method in the target codebase whereas each method is defined through node properties. Fully qualified class name, method name, and a parameters list are properties which can uniquely identify each method.
  • the whole codebase is merely a disconnected graph of unique methods.
  • the code analyzer 110 can also output caller and callee relationships among the methods. These relations among the methods can be represented as directed edges between the method nodes.
  • the relations which are not explicitly discovered by the code analyzer 110 can then be transformed into edges by providing specific algorithms depending on the yet to be discovered design patterns. Such specific algorithms can then be implemented as one or more stitching algorithms to enhance the call graph with missing connections.
  • Call graphs are directed graphs representing control flow relationships amongst the methods of an application (associated with the code base).
  • the nodes of the graph are the methods, while an edge from node ‘node1’ to node ‘node2’ indicates that method ‘node1’ invokes method ‘node2’.
  • the call graph can be generated by converting the source code forming part of the code base into an AST representation.
  • FIG. 2 is a diagram 200 illustrating a sample call graph in which a plurality of nodes corresponds to different methods of the code base (Method1 . . . Method17).
  • the edges connecting the nodes are defined by the caller and callee relationships (method calling another method) amongst such nodes as specified in the byte code.
  • one variation is to analyze the commit (e.g., git commit) since a last (i.e., most recent commit) to identify a set of methods that changes.
  • a call graph can then be accessed to determine all related methods that should be probed as potentially being covered by a test case.
  • the full result set of recommended test cases can be provided in various manners including loading into memory, storing in physical persistence, displaying a graphical user interface, transmitted to a remote computing system, and the like.
  • the call graph can be stored in the graph database 150 .
  • a graph database 150 refers to a database that uses graph structures for semantic queries for nodes, edges, and properties to represent and store data.
  • the graph database 150 can provide more rapid response time and can take various forms.
  • the call graph may initially have disconnected nodes.
  • Such idiosyncrasies can appear in the code base due to implementation of, for example, non-standard design patterns, JAVA reflections, runtime weaving, etc. These disconnects can be problematic in that interrelations amongst the method nodes may not be adequately captured (and thus not all implicated test cases may be identified).
  • a stitching algorithm can be utilized to add edges to disconnected method nodes in the call graph of FIG. 3 such as between Method8 and Method6 (indicating that Method8 is called by Method6 and as such, changes to Method8 should include test cases covering Method6).
  • Pseudocode for the stitching algorithm can be as follows.
  • ABC is a popular design pattern.
  • ABC is invoked by instantiating an ABC object and passing it to an ABC handler.
  • ABC handler can try to find the implementation by appending ‘Impl’ at the end of the ABC object name and discover it in specific packages.
  • the code analyzer 110 in this situation, cannot create a complete call graph out of it.
  • the stitching algorithm can find an immediate caller of ABC constructor and can stitches it to the method node corresponding to the ABC implementer in the call graph. Now, the ABC implementation becomes the participant in the end to end call graph.
  • each method can be represented as a method node in the call graph, connected with other nodes by edges named as ‘called by’. These edges are unidirectional as a call only occurs in a single direction.
  • the call graph (containing only method nodes) can be supplemented to represent test cases by way of different type of nodes, namely test case nodes, that have a strong relationship with the method nodes mentioned earlier.
  • test case nodes can be connected to the method nodes by edges named as ‘covered by’. ‘Covered by’ in this regard, refers to the test case associated with such node involving the method connected via the edge.
  • the coverage analyzer 136 forming part of the profiler 130 can be used to characterize the relationship between test cases and methods.
  • the coverage analyzer 136 can individually run all of the test cases specified for the code base in question and find which methods are covered by such tests.
  • one or more of the test cases can be executed using tools along with the code coverage report for every test-case individually.
  • the generated report can be parsed to get the methods corresponding to the test-case and this relationship can then be inserted into the graph database 150 .
  • FIG. 5 is a diagram 500 in which, at 510 , a code base is parsed to identify methods encapsulated therein. Thereafter, at 520 , a call graph is generated that is based on the parsing using a graph generation technique.
  • the call graph is a directed call graph comprising a plurality of nodes characterizing the identified methods. It is then determined, at 530 and based on one or more design patterns used to generate the code base, that at least a portion of the nodes generated call graph are disconnected nodes. At least two of the disconnected nodes are then connected, at 540 , using a stitching algorithm to result in a modified call graph (i.e., an enhanced call graph). Data is later provided, at 550 , that characterizes the modified call graph.
  • a modified call graph i.e., an enhanced call graph
  • FIG. 6 is a diagram 600 illustrating a sample computing device architecture for implementing various aspects described herein.
  • a bus 604 can serve as the information highway interconnecting the other illustrated components of the hardware.
  • a processing system 608 labeled CPU (central processing unit) e.g., one or more computer processors/data processors at a given computer or at multiple computers
  • CPU central processing unit
  • a non-transitory processor-readable storage medium such as read only memory (ROM) 612 and random access memory (RAM) 616 , can be in communication with the processing system 608 and can include one or more programming instructions for the operations specified here.
  • program instructions can be stored on a non-transitory computer-readable storage medium such as a magnetic disk, optical disk, recordable memory device, flash memory, or other physical storage medium.
  • a disk controller 648 can interface with one or more optional disk drives to the system bus 604 .
  • These disk drives can be external or internal floppy disk drives such as 660 , external or internal CD-ROM, CD-R, CD-RW or DVD, or solid state drives such as 652 , or external or internal hard drives 656 .
  • these various disk drives 652 , 656 , 660 and disk controllers are optional devices.
  • the system bus 604 can also include at least one communication port 620 to allow for communication with external devices either physically connected to the computing system or available externally through a wired or wireless network.
  • the at least one communication port 620 includes or otherwise comprises a network interface.
  • a computing device having a display device 640 (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information obtained from the bus 604 via a display interface 614 to the user and an input device 632 such as keyboard and/or a pointing device (e.g., a mouse or a trackball) and/or a touchscreen by which the user can provide input to the computer.
  • a display device 640 e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor
  • an input device 632 such as keyboard and/or a pointing device (e.g., a mouse or a trackball) and/or a touchscreen by which the user can provide input to the computer.
  • input devices 632 can be used to provide for interaction with a user as well; for example, feedback provided to the user can be any form of sensory feedback (e.g., visual feedback, auditory feedback by way of a microphone 636 , or tactile feedback); and input from the user can be received in any form, including acoustic, speech, or tactile input.
  • the input device 632 and the microphone 636 can be coupled to and convey information via the bus 604 by way of an input device interface 628 .
  • Other computing devices such as dedicated servers, can omit one or more of the display 640 and display interface 614 , the input device 632 , the microphone 636 , and input device interface 628 .
  • One or more aspects or features of the subject matter described herein can be realized in digital electronic circuitry, integrated circuitry, specially designed application specific integrated circuits (ASICs), field programmable gate arrays (FPGAs) computer hardware, firmware, software, and/or combinations thereof.
  • ASICs application specific integrated circuits
  • FPGAs field programmable gate arrays
  • These various aspects or features can include implementation in one or more computer programs that are executable and/or interpretable on a programmable system including at least one programmable processor, which can be special or general purpose, coupled to receive data and instructions from, and to transmit data and instructions to, a storage system, at least one input device, and at least one output device.
  • the programmable system or computing system may include clients and servers.
  • a client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.
  • machine-readable signal refers to any signal used to provide machine instructions and/or data to a programmable processor.
  • the machine-readable medium can store such machine instructions non-transitorily, such as for example as would a non-transient solid-state memory or a magnetic hard drive or any equivalent storage medium.
  • the machine-readable medium can alternatively or additionally store such machine instructions in a transient manner, such as for example as would a processor cache or other random access memory associated with one or more physical processor cores.
  • phrases such as “at least one of” or “one or more of” may occur followed by a conjunctive list of elements or features.
  • the term “and/or” may also occur in a list of two or more elements or features. Unless otherwise implicitly or explicitly contradicted by the context in which it is used, such a phrase is intended to mean any of the listed elements or features individually or any of the recited elements or features in combination with any of the other recited elements or features.
  • the phrases “at least one of A and B;” “one or more of A and B;” and “A and/or B” are each intended to mean “A alone, B alone, or A and B together.”
  • a similar interpretation is also intended for lists including three or more items.
  • the phrases “at least one of A, B, and C;” “one or more of A, B, and C;” and “A, B, and/or C” are each intended to mean “A alone, B alone, C alone, A and B together, A and C together, B and C together, or A and B and C together.”
  • use of the term “based on,” above and in the claims is intended to mean, “based at least in part on,” such that an unrecited feature or element is also permissible.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Debugging And Monitoring (AREA)
  • Stored Programmes (AREA)

Abstract

A code base is parsed to identify methods encapsulated therein. Thereafter, a call graph is generated based on the parsing using a graph generation technique. The call graph is a directed call graph comprising a plurality of nodes characterizing the identified methods. It can then be determined, based on one or more design patterns used to generate the code base, that at least a portion of the nodes generated call graph are disconnected nodes. At least two of the disconnected nodes are then connected using a stitching algorithm to result in a modified call graph. Data characterizing the modified call graph can then be provided (e.g., displayed in a graphical user interface, stored in a database, loaded into memory, transmitted to a remote computing device, etc.).

Description

    TECHNICAL FIELD
  • The subject matter described herein is directed to the enhancement of call graphs using stitching algorithms which can identify software application structure which can, in turn, be used as part of a software development process that efficiently address changes to a code base after it has been modified.
  • BACKGROUND
  • Software development is often an iterative process during which code is generated on a local computing device or file share system, the generated code is committed to a code repository, and the code based in the code repository is then tested to identify errors. In some cases, the testing is performed using test cases which are applied to the code base in the code repository. These test cases can, in some cases, require significant computing resources and, additionally take, a large amount of time to execute. Execution of unnecessary test cases further delays deployment of the code base and diminishes developer productivity.
  • SUMMARY
  • In a first aspect, a code base is parsed to identify methods encapsulated therein. Thereafter, a call graph is generated based on the parsing using a graph generation technique. The call graph is a directed call graph comprising a plurality of nodes characterizing the identified methods. It can then be determined, based on one or more design patterns used to generate the code base, that at least a portion of the nodes generated call graph are disconnected nodes. At least two of the disconnected nodes are then connected using a stitching algorithm to result in a modified call graph. Data characterizing the modified call graph can then be provided (e.g., displayed in a graphical user interface, stored in a database, loaded into memory, transmitted to a remote computing device, etc.).
  • Edges between nodes of the directed call graph can be unidirectional edges connecting method nodes that correspond to invocations by a calling method to a callee method in the code base.
  • The connecting can use a second graph generation technique that differs from the graph generation technique.
  • At least a portion of the design patterns are non-standard and are not recognized by the graph generation technique.
  • For each design pattern used in the code base, a set of called method nodes in the call graph to be called in runtime that depend on such design pattern can be found. In addition, a set of callee method nodes that are hierarchically associated with the set of called methods that depend on such design pattern can be found. Directed edges can be added to the set of callee method nodes from the set of called method nodes if both sets contain at least one node. The operations can advance to a next design pattern if edge/edges are added between callee-called nodes. If the set of called method nodes and the set of callee method nodes are both empty, the operations can advance to a next design pattern.
  • The utilized design patterns can dictate when nodes are deemed to be disconnected. For example, certain nodes can be identified as being disconnected based on a first design pattern while such certain nodes are not identified as being disconnected based on a second design pattern.
  • Non-transitory computer program products (i.e., physically embodied computer program products) are also described that store instructions, which when executed by one or more data processors of one or more computing systems, cause at least one data processor to perform operations herein. Similarly, computer systems are also described that may include one or more data processors and memory coupled to the one or more data processors. The memory may temporarily or permanently store instructions that cause at least one processor to perform one or more of the operations described herein. In addition, methods can be implemented by one or more data processors either within a single computing system or distributed among two or more computing systems. Such computing systems can be connected and can exchange data and/or commands or other instructions or the like via one or more connections, including but not limited to a connection over a network (e.g., the Internet, a wireless wide area network, a local area network, a wide area network, a wired network, or the like), via a direct connection between one or more of the multiple computing systems, etc.
  • The subject matter described herein provides many technical advantages. For example, the current subject matter is advantageous in that it provides enhanced techniques for enhancing call graphs which would otherwise have disconnected nodes.
  • The details of one or more variations of the subject matter described herein are set forth in the accompanying drawings and the description below. Other features and advantages of the subject matter described herein will be apparent from the description and drawings, and from the claims.
  • DESCRIPTION OF DRAWINGS
  • FIG. 1 is a diagram illustrating an architecture for identifying test cases implicated by changes in a code base since a last commit;
  • FIG. 2 is a diagram illustrating an example methods call graph;
  • FIG. 3 is a diagram illustrating a methods call graph to which a stitching algorithm generates an edge between two disconnected nodes;
  • FIG. 4 is a diagram illustrating an expanded method call graph with method nodes and test case nodes;
  • FIG. 5 is a process flow diagram illustrating the generation of a call graph using a stitching algorithm; and
  • FIG. 6 is a diagram illustrating a computing device for implementing aspects of the current subject matter.
  • Like reference symbols in the various drawings indicate like elements.
  • DETAILED DESCRIPTION
  • The current subject matter is directed to enhanced techniques for enhancing a call graph which would otherwise have disconnected nodes that do not adequately represent the structure and relationships of a code base being modeled. The current subject matter can be used, for example, in connection with a platform providing predictive test case coverage such as provided in U.S. Pat. Ser. No. ______ (attorney docket number 530198-999446/200517US01) filed concurrently herewith and entitled “Predictive Test Coverage”, the contents of which are hereby fully incorporated by reference.
  • The call graphs can be used for a variety of aspects of a software development process including testing a code base by identifying a minimal amount of test cases that need to be executed in response to an update or other change to the code base (especially as applied to versioning updates to a continually evolving code base). Such techniques leverage a directed graph that defines the interrelationships between methods, and additionally, specifies coverage of each test case. In some cases, a stitching algorithm can be used to selectively add edges to the directed call graph.
  • A test case, in this regard, is a set of actions to be executed to verify a particular feature or functionality of a software application. In particular, a test case can encapsulate a specification of the inputs, execution conditions, testing procedure, and expected results that define a single test to be executed to achieve a particular software testing objective in relation to a code base being developed. A test case typically comprises an identification, a summary/description (which can be searched), steps forming part of the test, and other related metadata/contextual data associated with the test case. Each test case can, in some variations, comprise one or more test scripts comprising code required to execution the actions specified by the test case.
  • FIG. 1 is a diagram 100 illustrating a sample architecture for implementing aspects of the current subject matter. The architecture can include components such as a code change analyzer 110, an anomaly handler 120, a profiler 130. These components 110, 120, 130 can be software modules/code executed by one or more computing devices (either local or distributed).
  • The components 110, 120, 130 can also be in communication with a code repository 140 which can take various forms including, a dedicated database or a software-as-a-service cloud-based code development management platform (e.g., GITHUB). The profiler 130 can generate and save relevant data from source code, byte code and test coverage reports in the filesystem. The code change analyzer 110 provides the test case recommendation depending on the code base changes and data collected by the profiler 130. Further, the architecture can include a graph database 150 which can store the directed call graphs as provided herein.
  • The anomaly handler 120 can be used to handle less frequent situations such as a code change implicating more than 30% of the test cases or a code change having a number of files above a pre-defined threshold (e.g., five or more files). In such cases, a user can manually specify which test cases to execute or other exceptional handling can be performed with regard to the corresponding code changes. In other words, the anomaly handler 120 can handle deviations or takes action depending on a certain rule being triggered to fine tune the overall system. The anomaly handler 120 can execute a stitching algorithm as described below. The anomaly handler 120 can additionally run a complete test suite (i) in cases in which the number of implicated test cases is above a pre-defined threshold (e.g., 20%, etc.); (ii) against code changes of specific classes or call graphs; (iii) to a change-set crossing a predefined threshold in terms of number of methods or lines changed; or (iv) to non JAVA-based changes such as ‘xml’/build files. The anomaly handler 120 can also log certain actions/results and generate alerts/messages that indicate, for example, that no test cases were found in response to a particular change set (i.e., a change to the code base). The anomaly handler 120 can also identify impacted tests based on class rather than based on method when only the import statement is added/modified and/or when a class is moved. Further, the anomaly handler 120 can provide that a predefined set of critical tests are executed when there are white space changes and the like.
  • The profiler 130 can maintain or access information about the code base in the code repository 140 including changes (sometimes referred to as deltas) which occur in between versions or other user-defined commits. The profiler 130 can also generate a call graph (as described in further detail below) which can be a bi-directional nth level call graph. Using the call graph, the profiler 130 can also generate the coverage per test case. Coverage, in this regard, refers to which methods forming part of the code base are affected by a particular test case.
  • The code change analyzer 110 can parse a commit provided by a revision control tool using a code parser 112 in connection with a code analyzer 114 to identify deltas of the code base in the code repository 140. The code change analyzer 110 provides the file(s) that are modified along with the method(s). This set of files and its method are passed to the code analyzer 114 module for further analysis. The code analyzer 114 can return the changes in terms of classes, methods or lines. In some variations, the delta changes can be characterized in terms of changes to methods. In other variations, different granularities of changes can be characterized such as on a line-by-line basis. In such cases, the nodes of a directed graph would represent lines of code.
  • The code forming the code base can take various forms and the code change analyzer 110 can use varying parsers as may be required by the particular code type. In some variations, an abstract syntax tree representation (“AST”) of the code base at the time of the commit and an AST of the code base after the changes can be compared to identify changes to methods, lines, and the like. For example, when the code base in the code repository 140 comprises JAVA source code, a JAVA parser can be used to identify or otherwise characterize changes to the code base.
  • A source code parser parses the source code to create some form of internal representation. The source code parser takes source code as input data and builds a data structure in the form of a parse tree, AST(Abstract Syntax Tree) or other hierarchical structure, giving a structural representation of the input source code. AST provides detailed information about all attributes/elements of JAVA source code. This representation can provide the package name of source code, import statements, type of source code (class/interface), internal attributes of class/interface (e.g., methods, constructors or field declaration), as well as start line and end line for each attribute. All of this information can be used to precisely identify source code modification for any change set. In some variations, JGIT library can alternatively or additionally be used to analyze source code as well as code repository commits (e.g., GIT Commits when the code repository 140 is GITHUB or a compatible repository) as opposed to the AST representation of the changes to the code.
  • The code change analyzer 110 can be used to provide byte code analysis metrics by analyzing compiled artifacts (e.g., JAR files, etc.) in order to generate a call graph so that, for given delta changes, the minimal number of required test cases can be determined (and then later executed). The code change analyzer 110 which can be a static code analyzer, can identify changed methods in the code based which can be transformed as nodes in a disconnected graph. After the code analyzer 110 establishes relations between methods in the code base, the whole codebase can be represented as a graph with many disconnected components. The nodes in the graph can each represent a method in the target codebase whereas each method is defined through node properties. Fully qualified class name, method name, and a parameters list are properties which can uniquely identify each method. Before the code analyzer 110 runs, the whole codebase is merely a disconnected graph of unique methods. The code analyzer 110 can also output caller and callee relationships among the methods. These relations among the methods can be represented as directed edges between the method nodes. The relations which are not explicitly discovered by the code analyzer 110 can then be transformed into edges by providing specific algorithms depending on the yet to be discovered design patterns. Such specific algorithms can then be implemented as one or more stitching algorithms to enhance the call graph with missing connections.
  • Call graphs, in this context, are directed graphs representing control flow relationships amongst the methods of an application (associated with the code base). The nodes of the graph are the methods, while an edge from node ‘node1’ to node ‘node2’ indicates that method ‘node1’ invokes method ‘node2’. The call graph can be generated by converting the source code forming part of the code base into an AST representation.
  • FIG. 2 is a diagram 200 illustrating a sample call graph in which a plurality of nodes corresponds to different methods of the code base (Method1 . . . Method17). The edges connecting the nodes are defined by the caller and callee relationships (method calling another method) amongst such nodes as specified in the byte code. By tracing the complete call sequence of the methods in the code base, these methods exhibit the structure of a graph, even the self-loop is completely legal.
  • As mentioned above, one variation is to analyze the commit (e.g., git commit) since a last (i.e., most recent commit) to identify a set of methods that changes. A call graph can then be accessed to determine all related methods that should be probed as potentially being covered by a test case. The full result set of recommended test cases can be provided in various manners including loading into memory, storing in physical persistence, displaying a graphical user interface, transmitted to a remote computing system, and the like.
  • In some cases, the call graph can be stored in the graph database 150. A graph database 150, in this regard, refers to a database that uses graph structures for semantic queries for nodes, edges, and properties to represent and store data. The graph database 150 can provide more rapid response time and can take various forms.
  • In some cases, as illustrated in diagram 300 of FIG. 3, the call graph may initially have disconnected nodes. Such idiosyncrasies can appear in the code base due to implementation of, for example, non-standard design patterns, JAVA reflections, runtime weaving, etc. These disconnects can be problematic in that interrelations amongst the method nodes may not be adequately captured (and thus not all implicated test cases may be identified). A stitching algorithm can be utilized to add edges to disconnected method nodes in the call graph of FIG. 3 such as between Method8 and Method6 (indicating that Method8 is called by Method6 and as such, changes to Method8 should include test cases covering Method6).
  • The stitching algorithm used to connect disconnected method nodes. Pseudocode for the stitching algorithm can be as follows.
  • Algorithm:
  • Initialize:
  • Set D:={All standard and non-standard design patterns which are not discoverable by static code analyzer}
  • Repeat for each Design Pattern D1 belonging to Set D
      • 1) Set CALLED:=Find out the set of methods nodes which should be called in runtime depending on D1.
      • 2) Set CALLEE:=Find out the set of methods nodes which should be calling above methods in runtime depending on D1.
      • 3) IF both the set CALLED and CALLEE is non-empty go to step 4, otherwise CONTINUE.
      • 4) Add directed edges to the set of nodes identified by CALLEE from the set of nodes identified by CALLED.
  • For example, with a particular exemplary application XYZ, ABC is a popular design pattern. ABC is invoked by instantiating an ABC object and passing it to an ABC handler. ABC handler can try to find the implementation by appending ‘Impl’ at the end of the ABC object name and discover it in specific packages. The code analyzer 110, in this situation, cannot create a complete call graph out of it. Thus, the stitching algorithm can find an immediate caller of ABC constructor and can stitches it to the method node corresponding to the ABC implementer in the call graph. Now, the ABC implementation becomes the participant in the end to end call graph.
  • With reference to diagram 400 of FIG. 4, each method can be represented as a method node in the call graph, connected with other nodes by edges named as ‘called by’. These edges are unidirectional as a call only occurs in a single direction. The call graph (containing only method nodes) can be supplemented to represent test cases by way of different type of nodes, namely test case nodes, that have a strong relationship with the method nodes mentioned earlier. As illustrated, these test case nodes can be connected to the method nodes by edges named as ‘covered by’. ‘Covered by’ in this regard, refers to the test case associated with such node involving the method connected via the edge.
  • The coverage analyzer 136 forming part of the profiler 130 can be used to characterize the relationship between test cases and methods. The coverage analyzer 136 can individually run all of the test cases specified for the code base in question and find which methods are covered by such tests. As an example, one or more of the test cases can be executed using tools along with the code coverage report for every test-case individually. The generated report can be parsed to get the methods corresponding to the test-case and this relationship can then be inserted into the graph database 150.
  • FIG. 5 is a diagram 500 in which, at 510, a code base is parsed to identify methods encapsulated therein. Thereafter, at 520, a call graph is generated that is based on the parsing using a graph generation technique. The call graph is a directed call graph comprising a plurality of nodes characterizing the identified methods. It is then determined, at 530 and based on one or more design patterns used to generate the code base, that at least a portion of the nodes generated call graph are disconnected nodes. At least two of the disconnected nodes are then connected, at 540, using a stitching algorithm to result in a modified call graph (i.e., an enhanced call graph). Data is later provided, at 550, that characterizes the modified call graph.
  • FIG. 6 is a diagram 600 illustrating a sample computing device architecture for implementing various aspects described herein. A bus 604 can serve as the information highway interconnecting the other illustrated components of the hardware. A processing system 608 labeled CPU (central processing unit) (e.g., one or more computer processors/data processors at a given computer or at multiple computers), can perform calculations and logic operations required to execute a program. A non-transitory processor-readable storage medium, such as read only memory (ROM) 612 and random access memory (RAM) 616, can be in communication with the processing system 608 and can include one or more programming instructions for the operations specified here. Optionally, program instructions can be stored on a non-transitory computer-readable storage medium such as a magnetic disk, optical disk, recordable memory device, flash memory, or other physical storage medium.
  • In one example, a disk controller 648 can interface with one or more optional disk drives to the system bus 604. These disk drives can be external or internal floppy disk drives such as 660, external or internal CD-ROM, CD-R, CD-RW or DVD, or solid state drives such as 652, or external or internal hard drives 656. As indicated previously, these various disk drives 652, 656, 660 and disk controllers are optional devices. The system bus 604 can also include at least one communication port 620 to allow for communication with external devices either physically connected to the computing system or available externally through a wired or wireless network. In some cases, the at least one communication port 620 includes or otherwise comprises a network interface.
  • To provide for interaction with a user, the subject matter described herein can be implemented on a computing device having a display device 640 (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information obtained from the bus 604 via a display interface 614 to the user and an input device 632 such as keyboard and/or a pointing device (e.g., a mouse or a trackball) and/or a touchscreen by which the user can provide input to the computer. Other kinds of input devices 632 can be used to provide for interaction with a user as well; for example, feedback provided to the user can be any form of sensory feedback (e.g., visual feedback, auditory feedback by way of a microphone 636, or tactile feedback); and input from the user can be received in any form, including acoustic, speech, or tactile input. The input device 632 and the microphone 636 can be coupled to and convey information via the bus 604 by way of an input device interface 628. Other computing devices, such as dedicated servers, can omit one or more of the display 640 and display interface 614, the input device 632, the microphone 636, and input device interface 628.
  • One or more aspects or features of the subject matter described herein can be realized in digital electronic circuitry, integrated circuitry, specially designed application specific integrated circuits (ASICs), field programmable gate arrays (FPGAs) computer hardware, firmware, software, and/or combinations thereof. These various aspects or features can include implementation in one or more computer programs that are executable and/or interpretable on a programmable system including at least one programmable processor, which can be special or general purpose, coupled to receive data and instructions from, and to transmit data and instructions to, a storage system, at least one input device, and at least one output device. The programmable system or computing system may include clients and servers. A client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.
  • These computer programs, which can also be referred to as programs, software, software applications, applications, components, or code, include machine instructions for a programmable processor, and can be implemented in a high-level procedural language, an object-oriented programming language, a functional programming language, a logical programming language, and/or in assembly/machine language. As used herein, the term “machine-readable medium” refers to any computer program product, apparatus and/or device, such as for example magnetic discs, optical disks, memory, and Programmable Logic Devices (PLDs), used to provide machine instructions and/or data to a programmable processor, including a machine-readable medium that receives machine instructions as a machine-readable signal. The term “machine-readable signal” refers to any signal used to provide machine instructions and/or data to a programmable processor. The machine-readable medium can store such machine instructions non-transitorily, such as for example as would a non-transient solid-state memory or a magnetic hard drive or any equivalent storage medium. The machine-readable medium can alternatively or additionally store such machine instructions in a transient manner, such as for example as would a processor cache or other random access memory associated with one or more physical processor cores.
  • In the descriptions above and in the claims, phrases such as “at least one of” or “one or more of” may occur followed by a conjunctive list of elements or features. The term “and/or” may also occur in a list of two or more elements or features. Unless otherwise implicitly or explicitly contradicted by the context in which it is used, such a phrase is intended to mean any of the listed elements or features individually or any of the recited elements or features in combination with any of the other recited elements or features. For example, the phrases “at least one of A and B;” “one or more of A and B;” and “A and/or B” are each intended to mean “A alone, B alone, or A and B together.” A similar interpretation is also intended for lists including three or more items. For example, the phrases “at least one of A, B, and C;” “one or more of A, B, and C;” and “A, B, and/or C” are each intended to mean “A alone, B alone, C alone, A and B together, A and C together, B and C together, or A and B and C together.” In addition, use of the term “based on,” above and in the claims is intended to mean, “based at least in part on,” such that an unrecited feature or element is also permissible.
  • The subject matter described herein can be embodied in systems, apparatus, methods, and/or articles depending on the desired configuration. The implementations set forth in the foregoing description do not represent all implementations consistent with the subject matter described herein. Instead, they are merely some examples consistent with aspects related to the described subject matter. Although a few variations have been described in detail above, other modifications or additions are possible. In particular, further features and/or variations can be provided in addition to those set forth herein. For example, the implementations described above can be directed to various combinations and subcombinations of the disclosed features and/or combinations and subcombinations of several further features disclosed above. In addition, the logic flows depicted in the accompanying figures and/or described herein do not necessarily require the particular order shown, or sequential order, to achieve desirable results. Other implementations may be within the scope of the following claims.

Claims (17)

1. A computer-implemented method comprising:
parsing a code base to identify methods encapsulated therein;
generating a call graph based on the parsing using a graph generation technique, the call graph being a directed call graph comprising a plurality of nodes characterizing the identified methods;
determining, based on one or more design patterns used to generate the code base, that at least a portion of the nodes generated call graph are disconnected nodes, wherein at least a portion of the design patterns are non-standard and are not recognized by the graph generation technique;
connecting at least two of the disconnected nodes using a stitching algorithm to result in a modified call graph, wherein the stitching algorithm performs operations comprising:
for each of the one or more design patterns:
find a set of called method nodes in the call graph to be called in runtime that depend on such design pattern;
find a set of callee method nodes that are hierarchically associated with the set of called methods that depend on such design pattern; and
add directed edges to the set of callee method nodes from the set of called method nodes if both sets contain at least one node;
advance to a next design pattern if at least one edge is added between a callee and a called node; and
advance to the next design pattern if the set of called method nodes and the set of callee method nodes are both empty; and
providing data characterizing the modified call graph.
2. The method of claim 1, wherein edges between nodes of the directed call graph are unidirectional edges connecting method nodes corresponding to invocations by a calling method to a callee method in the code base.
3. The method of claim 1, wherein the connecting uses a second graph generation technique that differs from the graph generation technique.
4. The method of claim 1, wherein the providing data characterizing the modified call graph comprises: causing the data characterizing the modified call graph in a graphical user interface, loading the modified call graph into memory, storing data characterizing the modified call graph in physical persistence, or transmitting the modified call graph to a remote computing device.
5-6. (canceled)
7. The method of claim 1, wherein certain nodes are identified as being disconnected based on a first design pattern and such certain nodes are not identified as being disconnected based on a second design pattern.
8. A system comprising:
at least one data processor; and
memory storing instructions which, when executed by the at least one data processor, result in operations comprising:
parsing a code base to identify methods encapsulated therein;
generating a call graph based on the parsing using a graph generation technique, the call graph being a directed call graph comprising a plurality of nodes characterizing the identified methods;
determining, based on one or more design patterns used to generate the code base, that at least a portion of the nodes generated call graph are disconnected nodes, wherein at least a portion of the design patterns are non-standard and are not recognized by the graph generation technique;
connecting at least two of the disconnected nodes using a stitching algorithm to result in a modified call graph, wherein the stitching algorithm performs operations comprising:
for each of the one or more design patterns:
find a set of called method nodes in the call graph to be called in runtime that depend on such design pattern;
find a set of callee method nodes that are hierarchically associated with the set of called methods that depend on such design pattern; and
add directed edges to the set of callee method nodes from the set of called method nodes if both sets contain at least one node;
advance to a next design pattern if at least one edge is added between a callee and a called node; and
advance to the next design pattern if the set of called method nodes and the set of callee method nodes are both empty; and
providing data characterizing the modified call graph.
9. The system of claim 8, wherein edges between nodes of the directed call graph are unidirectional edges connecting method nodes corresponding to invocations by a calling method to a callee method in the code base.
10. The system of claim 8, wherein the connecting uses a second graph generation technique that differs from the graph generation technique.
11. The system of claim 8, wherein the providing data characterizing the modified call graph comprises: causing the data characterizing the modified call graph in a graphical user interface, loading the modified call graph into memory, storing data characterizing the modified call graph in physical persistence, or transmitting the modified call graph to a remote computing device.
12-13. (canceled)
14. The system of claim 8, wherein certain nodes are identified as being disconnected based on a first design pattern and such certain nodes are not identified as being disconnected based on a second design pattern.
15. A non-transitory computer program product storing instructions which, when executed by at least one computing device, result in operations comprising:
parsing a code base to identify methods encapsulated therein;
generating a call graph based on the parsing using a graph generation technique, the call graph being a directed call graph comprising a plurality of nodes characterizing the identified methods;
determining, based on one or more design patterns used to generate the code base, that at least a portion of the nodes generated call graph are disconnected nodes, wherein at least a portion of the design patterns are non-standard and are not recognized by the graph generation technique;
connecting at least two of the disconnected nodes using a stitching algorithm to result in a modified call graph, wherein the stitching algorithm performs operations comprising:
for each of the one or more design patterns:
find a set of called method nodes in the call graph to be called in runtime that depend on such design pattern;
find a set of callee method nodes that are hierarchically associated with the set of called methods that depend on such design pattern; and
add directed edges to the set of callee method nodes from the set of called method nodes if both sets contain at least one node;
advance to a next design pattern if at least one edge is added between a callee and a called node; and
advance to the next design pattern if the set of called method nodes and the set of callee method nodes are both empty; and
providing data characterizing the modified call graph.
16. The computer program product of claim 15, wherein edges between nodes of the directed call graph are unidirectional edges connecting method nodes corresponding to invocations by a calling method to a callee method in the code base.
17. The computer program product of claim 15, wherein the connecting uses a second graph generation technique that differs from the graph generation technique.
18. The computer program product of claim 15, wherein the providing data characterizing the modified call graph comprises: causing the data characterizing the modified call graph in a graphical user interface, loading the modified call graph into memory, storing data characterizing the modified call graph in physical persistence, or transmitting the modified call graph to a remote computing device.
19-20. (canceled)
US17/121,496 2020-12-14 2020-12-14 Call graph enhancement using stitching algorithm Active US11379198B1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US17/121,496 US11379198B1 (en) 2020-12-14 2020-12-14 Call graph enhancement using stitching algorithm

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US17/121,496 US11379198B1 (en) 2020-12-14 2020-12-14 Call graph enhancement using stitching algorithm

Publications (2)

Publication Number Publication Date
US20220188084A1 true US20220188084A1 (en) 2022-06-16
US11379198B1 US11379198B1 (en) 2022-07-05

Family

ID=81941437

Family Applications (1)

Application Number Title Priority Date Filing Date
US17/121,496 Active US11379198B1 (en) 2020-12-14 2020-12-14 Call graph enhancement using stitching algorithm

Country Status (1)

Country Link
US (1) US11379198B1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20220012163A1 (en) * 2021-09-23 2022-01-13 Intel Corporation Methods, systems, articles of manufacture and apparatus to detect code defects
US20220250560A1 (en) * 2018-12-20 2022-08-11 Commissariat A L'energie Atomique Et Aux Energies Alternatives System for formally supervising communications
US11537372B2 (en) * 2020-03-25 2022-12-27 ManyCore Corporation Generating compilable machine code programs from dynamic language code

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3966518B2 (en) * 2004-03-12 2007-08-29 インターナショナル・ビジネス・マシーンズ・コーポレーション Program analysis apparatus, analysis method thereof, and program
US20060168565A1 (en) 2005-01-24 2006-07-27 International Business Machines Corporation Method and system for change classification
US8250556B1 (en) * 2007-02-07 2012-08-21 Tilera Corporation Distributing parallelism for parallel processing architectures
US8984485B2 (en) 2013-05-01 2015-03-17 International Business Machines Corporation Analysis of source code changes
US9501382B2 (en) * 2013-06-25 2016-11-22 Purdue Research Foundation Systems and methods of detecting power bugs
US9703552B2 (en) 2014-12-18 2017-07-11 International Business Machines Corporation Assertions based on recently changed code
US9720813B2 (en) 2015-08-13 2017-08-01 Ca, Inc. Method and apparatus for recommending regression tests
US10423408B2 (en) 2016-06-02 2019-09-24 International Business Machines Corporation Identifying and isolating library code in software applications
US10353810B2 (en) 2016-10-04 2019-07-16 Sap Se Software testing with minimized test suite

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20220250560A1 (en) * 2018-12-20 2022-08-11 Commissariat A L'energie Atomique Et Aux Energies Alternatives System for formally supervising communications
US11537372B2 (en) * 2020-03-25 2022-12-27 ManyCore Corporation Generating compilable machine code programs from dynamic language code
US20220012163A1 (en) * 2021-09-23 2022-01-13 Intel Corporation Methods, systems, articles of manufacture and apparatus to detect code defects
US11704226B2 (en) * 2021-09-23 2023-07-18 Intel Corporation Methods, systems, articles of manufacture and apparatus to detect code defects

Also Published As

Publication number Publication date
US11379198B1 (en) 2022-07-05

Similar Documents

Publication Publication Date Title
US11379198B1 (en) Call graph enhancement using stitching algorithm
US8694966B2 (en) Identifying test cases to be run after changes to modules of a software application
US9208057B2 (en) Efficient model checking technique for finding software defects
US8166464B2 (en) Analysis and detection of soft hang responsiveness program errors
US8468391B2 (en) Utilizing log event ontology to deliver user role specific solutions for problem determination
US10489274B2 (en) Using emulation to disassociate verification from stimulus in functional test
US9880832B2 (en) Software patch evaluator
US20120304153A1 (en) Context-sensitive analysis framework using value flows
US8832125B2 (en) Extensible event-driven log analysis framework
US9336128B2 (en) Method and system for code analysis using symbolic types
US10528457B2 (en) Automated unit testing in a mainframe environment
US8924932B2 (en) Using stack data and source code to rank program changes
US9311077B2 (en) Identification of code changes using language syntax and changeset data
US8732664B2 (en) Document serialization and comparison via object model
US11487645B2 (en) Predictive test case coverage
US20050055605A1 (en) Activating assertions and breakpoints
US11734159B2 (en) Ranking test cases specific to changes in software code
US11526429B1 (en) Identifying critical methods and critical paths in software code
CN112650526B (en) Method, device, electronic equipment and medium for detecting version consistency
US20220334955A1 (en) Automated unit testing in a mainframe cics environment
US20110246967A1 (en) Methods and systems for automation framework extensibility
Lavoie et al. A case study of TTCN-3 test scripts clone analysis in an industrial telecommunication setting
CN114968817A (en) Method, device, equipment and storage medium for evaluating code change influence range
US9465723B2 (en) Systems and/or methods for monitoring live software
Kua Unit Testing

Legal Events

Date Code Title Description
FEPP Fee payment procedure

Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

AS Assignment

Owner name: SAP SE, GERMANY

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:GOSWAMI, AMITABH;DUTTA, AMRIT SHANKAR;KUMAR, ALOK;AND OTHERS;SIGNING DATES FROM 20201208 TO 20201209;REEL/FRAME:054659/0001

STCF Information on status: patent grant

Free format text: PATENTED CASE