CN111324480A - Large host transaction fault positioning system and method - Google Patents

Large host transaction fault positioning system and method Download PDF

Info

Publication number
CN111324480A
CN111324480A CN202010111536.6A CN202010111536A CN111324480A CN 111324480 A CN111324480 A CN 111324480A CN 202010111536 A CN202010111536 A CN 202010111536A CN 111324480 A CN111324480 A CN 111324480A
Authority
CN
China
Prior art keywords
transaction
transaction data
link information
host
fault
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010111536.6A
Other languages
Chinese (zh)
Other versions
CN111324480B (en
Inventor
谢小舜
李志军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Industrial and Commercial Bank of China Ltd ICBC
Original Assignee
Industrial and Commercial Bank of China Ltd ICBC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Industrial and Commercial Bank of China Ltd ICBC filed Critical Industrial and Commercial Bank of China Ltd ICBC
Priority to CN202010111536.6A priority Critical patent/CN111324480B/en
Publication of CN111324480A publication Critical patent/CN111324480A/en
Application granted granted Critical
Publication of CN111324480B publication Critical patent/CN111324480B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0706Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment
    • G06F11/0715Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment in a system implementing multitasking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/079Root cause analysis, i.e. error or fault diagnosis

Abstract

The invention provides a large host transaction fault positioning system and a method, wherein the system comprises an IBM host end and a platform end; the IBM host side comprises a data processing device; the data processing device is used for acquiring transaction data packets of host transaction and CICS middleware, and extracting transaction data containing transaction states from the transaction data packets; generating link information according to the program calling state of the transaction data at each node; generating a fault identifier in the transaction state of each node according to the transaction data; inputting the transaction data, the fault identification and the link information into a host queue and then sending the transaction data, the fault identification and the link information to a platform end; the platform end is used for receiving and storing the transaction data, the fault identification and the link information which are sent by the IBM host end; and calling corresponding transaction data, fault identification and the link information according to the received task request of the host transaction troubleshooting processing, and displaying and outputting the transaction data, the fault identification and the link information.

Description

Large host transaction fault positioning system and method
Technical Field
The invention relates to the technical field of software fault positioning, in particular to a method for quickly positioning business faults and tracing scenes of an IBM mainframe.
Background
The main frame (Mainframe) is a Mainframe computer developed by international business machines corporation (hereinafter, IBM corporation), which uses special hardware devices and system software, and has the characteristics of high stability and safety. IBM corporation provides a CEDF/CEDX tracking tool in CICS for tracking troubleshooting of mainframe online transaction failures. The CEDF/CEDX tracking tool can basically realize the analysis and positioning of faults, but in the actual use process, the related operation involves a plurality of steps and complex operation, and the corresponding operation can be completed only by a certain host technical level; meanwhile, the calling links of the transaction and the program are complicated and complicated, the CEDF/CEDX is very difficult to analyze, the positioning analysis of the fault is very time-consuming, the problem solving timeliness of the production system of the financial institution is very important, the low efficiency of the positioning analysis of the problem can cause that the financial institution can not provide the service for a long time, and the loss is caused to the financial institution and the client thereof; in addition, problems need to be reproduced when the CEDF/CEDX or existing online transaction tracking tools are used, and in many scenes of online transactions of large hosts, the problems often cannot be reproduced regardless of production, test and development environments, so that the fault analysis of the host transactions depends on the technical level and experience of technicians, and the dynamic link of the transactions is long, relates to multiple applications, relates to numerous development, test and operation and maintenance personnel, and has higher communication cost. Although the open platform has a relatively perfect error reporting and tracking tool, the open platform is completely different from the system mechanism of the IBM host and cannot be moved to a large host for use.
Disclosure of Invention
The invention aims to provide a system and a method for realizing rapid positioning of large-scale host transaction faults and scene traceability on an open platform, so as to simplify the steps of tracking, positioning and analyzing large-scale host online transaction faults through a friendly and easy-to-use visual operation mode.
In order to achieve the above object, the present invention provides a large mainframe transaction fault location system, which comprises an IBM mainframe and a platform; the IBM host side comprises a data processing device; the data processing device is used for acquiring transaction data packets of host transaction and CICS middleware, and extracting transaction data containing transaction states from the transaction data packets; generating link information according to the program calling state of the transaction data at each node; generating a fault identifier in the transaction state of each node according to the transaction data; inputting the transaction data, the fault identification and the link information into a host queue and then sending the transaction data, the fault identification and the link information to a platform end; the platform end is used for receiving and storing the transaction data, the fault identification and the link information which are sent by the IBM host end; and calling corresponding transaction data, fault identification and the link information according to the received task request of the host transaction troubleshooting processing, and displaying and outputting the transaction data, the fault identification and the link information.
In the system for positioning transaction faults of the large-scale host, preferably, the data processing device comprises an acquisition module, a filtering module, an analysis processing module and a host queue read-write module; the acquisition module is arranged at an exit program between the host transaction and the CICS middleware and acquires transaction data packets running in the CICS middleware through an assembly program; the filtering module is used for extracting and obtaining transaction data containing transaction states from the transaction data packet; the analysis processing module is used for obtaining program calling states of the transaction data at each node according to the transaction data packet and generating link information; generating a fault identifier according to the transaction state of each node of the transaction data; and the host queue read-write module is used for inputting the transaction data, the fault identification and the link information into a host queue and then sending the transaction data, the fault identification and the link information to a platform end through an REXX program and a socket.
In the system for positioning transaction faults of the large-scale host, preferably, the platform end comprises a storage module, a task scheduling device and a page end; the storage module is used for receiving the transaction data, the fault identifier and the link information sent by the IBM host end and writing the transaction data, the fault identifier and the link information into a non-relational database; the task scheduling device is used for receiving a task request input by a user and used for host transaction troubleshooting processing, and calling corresponding transaction data, fault identification and link information from the non-relational database according to the type of the task request; analyzing according to the transaction data, the fault identification and the link information to obtain a processing result, and transmitting the processing result to a page end through a JSON standard format; and the page end displays and outputs the received processing result.
In the system for positioning transaction failure of a large-scale host, preferably, the storage module includes a storage monitoring unit, a storage control unit and a non-relational database; the storage monitoring unit is used for monitoring the running state of the non-relational database and the space state of the disk, and generating alarm information when the running state and the space state do not accord with preset rules; the storage control unit is used for backing up the data files in the preset access frequency or access period in the non-relational database to a preset storage according to the access frequency or access date of the data files in the non-relational database; calling a corresponding data file in a preset memory according to the calling instruction of the task scheduling device and loading the data file to the non-relational database; the non-relational database is used for storing the transaction data, the fault identification and the link information.
In the system for positioning transaction faults of the large-scale host, preferably, the task scheduling device comprises an input analysis module, a task request module, a task distribution module, a logic processing module and a task tracking module; the input analysis module is used for receiving a task request of host transaction troubleshooting processing, analyzing and obtaining a corresponding environment name, a transaction code, a start time and an end time according to the task request and providing the environment name, the transaction code, the start time and the end time to the logic processing module; the task request module is used for responding to the received request instruction and providing a request type corresponding to the request instruction to the task distribution module; the task distribution module is used for providing a corresponding interface program for processing according to the request type; the logic processing module is used for storing a plurality of preset interface programs, and the interface programs are used for processing the corresponding request instructions according to the request instructions and the corresponding environment names, transaction codes, starting time and ending time and according to preset rules to generate processing results; and the task tracking module is used for transmitting the processing result to a page end through a JSON standard format.
In the system for positioning transaction failure of a large-scale host, preferably, the page end includes an input module and a display module; the input module is used for generating a task request according to user input information and providing the task request to the task scheduling device; the display module is used for displaying the processing result provided by the task scheduling device.
The invention also provides a large host transaction fault positioning method, which comprises the following steps: the IBM host terminal collects transaction data packets of host transaction and CICS middleware, and extracts transaction data containing transaction states from the transaction data packets; generating link information according to the program calling state of the transaction data at each node, and generating a fault identifier according to the transaction state of the transaction data at each node; inputting the transaction data, the fault identification and the link information into a host queue and then sending the transaction data, the fault identification and the link information to a platform end; the platform end receives and saves the transaction data, the fault identification and the link information which are sent by the IBM host end; and calling corresponding transaction data, fault identification and the link information according to the received task request for host transaction troubleshooting processing, and displaying and outputting the transaction data, the fault identification and the link information.
In the method for positioning transaction faults of the large-scale host, preferably, an IBM host collects transaction data packets of host transactions and CICS middleware, and extracts transaction data containing transaction states from the transaction data packets; inputting the transaction data, the fault identifier and the link information into a host queue and then sending the transaction data, the fault identifier and the link information to a platform end comprises: collecting a transaction data packet running in a CICS middleware at an exit program between a host transaction and the CICS middleware through an assembler, and extracting transaction data containing a transaction state from the transaction data packet; acquiring program calling states of the transaction data at each node according to the transaction data packet and generating link information; generating a fault identifier in the transaction state of each node according to the transaction data; and after the transaction data, the fault identification and the link information are input into a host queue, the transaction data, the fault identification and the link information are sent to a platform end through an REXX program and a socket.
In the method for positioning transaction failure of the mainframe, preferably, the transaction data, the failure identifier and the link information sent by the IBM mainframe are received and stored; the steps of calling corresponding transaction data, fault identification and the link information according to the received task request of the host transaction troubleshooting processing, displaying and outputting comprise: receiving the transaction data, the fault identifier and the link information sent by the IBM host end, and writing the transaction data, the fault identifier and the link information into a non-relational database; receiving a task request of host transaction troubleshooting processing input by a user, and calling corresponding transaction data, a fault identifier and link information from the non-relational database according to the type of the task request; analyzing according to the transaction data, the fault identification and the link information to obtain a processing result, and transmitting the processing result to a page end through a JSON standard format; and the page end displays and outputs the received processing result.
In the above method for positioning transaction failure of a mainframe, preferably, the receiving the transaction data, the failure identifier, and the link information sent by the IBM mainframe, and writing the transaction data, the failure identifier, and the link information into a non-relational database further includes: monitoring the running state of the non-relational database and the space state of a disk, and generating alarm information when the running state and the space state do not accord with a preset rule; backing up the data files in the preset access frequency or access period in the non-relational database to a preset memory according to the access frequency or access date of the data files in the non-relational database; and calling a corresponding data file from a preset memory according to the calling instruction of the task scheduling device and loading the data file to the non-relational database.
In the method for positioning transaction faults of the large-scale host, preferably, a task request for host transaction fault troubleshooting processing input by a user is received, and corresponding transaction data, fault identification and link information are called from the non-relational database according to the type of the task request; analyzing and obtaining a processing result according to the transaction data, the fault identification and the link information, and transmitting the processing result to a page end through a JSON standard format, wherein the processing result comprises the following steps: storing a plurality of preset interface programs in a logic processing module; receiving a task request of host transaction troubleshooting processing, analyzing and obtaining a corresponding environment name, a transaction code, a start time and an end time according to the task request, and providing the environment name, the transaction code, the start time and the end time to the logic processing module; the logic processing module matches a corresponding preset interface program according to the request type; the interface program processes the corresponding request instruction according to a preset rule and generates a processing result according to the request instruction and the corresponding environment name, transaction code, start time and end time; and transmitting the processing result to a page end through a JSON standard format.
The invention also provides an electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the method when executing the computer program.
The present invention also provides a computer-readable storage medium storing a computer program for executing the above method.
The invention has the beneficial technical effects that: through a friendly and easy-to-use visual operation mode, the steps of tracking, positioning and analyzing online transaction faults of the large host are simplified, the efficiency of communication and problem solving of technical personnel is greatly improved, and the limitation that a user needs to have higher host technical capability in the traditional CEDF/CEDX tracking mode is broken through. Meanwhile, all transaction faults are kept on site, a service scene does not need to be reproduced, and faults occurring in any time period can be quickly positioned and visually displayed.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the invention and together with the description serve to explain the principles of the invention. In the drawings:
fig. 1A is a schematic structural diagram of a mainframe transaction fault location system according to an embodiment of the present invention;
fig. 1B is a schematic application structure diagram of a mainframe transaction fault location system according to an embodiment of the present invention;
FIG. 2 is a block diagram of data processing functions provided in accordance with an embodiment of the present invention;
FIG. 3 is a block diagram of a memory function provided in an embodiment of the present invention;
FIG. 4 is a functional block diagram of a task scheduling system according to an embodiment of the present invention;
FIG. 5 is a block diagram of a front-end function of a WEB page according to an embodiment of the present invention;
fig. 6 is a schematic flow chart illustrating a method for positioning a transaction fault of a mainframe according to an embodiment of the present invention;
fig. 7 is a flowchart illustrating an acquisition and storage process of a mainframe transaction fault location method according to an embodiment of the present invention;
fig. 8 is an analysis flow diagram illustrating a method for locating a transaction fault of a mainframe according to an embodiment of the present invention;
fig. 9 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
The following detailed description of the embodiments of the present invention will be provided with reference to the drawings and examples, so that how to apply the technical means to solve the technical problems and achieve the technical effects can be fully understood and implemented. It should be noted that, unless otherwise specified, the embodiments and features of the embodiments of the present invention may be combined with each other, and the technical solutions formed are within the scope of the present invention.
Additionally, the steps illustrated in the flow charts of the figures may be performed in a computer system such as a set of computer-executable instructions and, although a logical order is illustrated in the flow charts, in some cases, the steps illustrated or described may be performed in an order different than here.
Referring to fig. 1A, a mainframe transaction fault location system provided by the present invention includes an IBM mainframe 100 and a platform 200; the IBM host side 100 comprises a data processing apparatus 004; the data processing device 004 is used for collecting transaction data packets of host transaction and CICS middleware, and extracting transaction data containing transaction states from the transaction data packets; generating link information according to the program calling state of the transaction data at each node; generating a fault identifier in the transaction state of each node according to the transaction data; inputting the transaction data, the fault identification and the link information into a host queue and then sending the transaction data, the fault identification and the link information to a platform end; the platform end 200 is configured to receive and store the transaction data, the fault identifier and the link information sent by the IBM host end; and calling corresponding transaction data, fault identification and the link information according to the received task request of the host transaction troubleshooting processing, and displaying and outputting the transaction data, the fault identification and the link information. In actual work, the IBM host side: by deploying a data processing module which is developed by using a mainframe assembly and has extremely high execution efficiency at an exit program between a host transaction data packet and a CICS middleware, particularly, when an application program is adopted to execute CICS API instructions, a system provides corresponding exits, users customize the corresponding exit programs, and the application program can call the corresponding exit programs when executing the CICS APIs. When the transaction is started, the exit program applies for a memory in the CICS through GETMAIN, and stores the memory address in the TWA, and other exit programs of the TASK subsequently acquire the memory address through the TWA and store the transaction link information therein; when the cross-partition calling occurs, the exit program transmits the transaction link data to the target partition through the transaction-level shared memory Channel, and when the cross-partition calling returns, the transaction link data is transmitted back through the transaction-level shared memory Channel. The data processing module acquires and stores transaction data in real time through the memories, and writes acquired transaction link information into the TDQ queue when transaction is finished, so that efficient data acquisition, filtering, preliminary analysis and queue reading and writing are realized. The platform collapse transmission of the transaction link data message is realized through the REXX program and the socket; opening a platform end: the software system at the open platform end is developed through java programming, and a series of functions such as transaction data receiving, data warehousing, storage monitoring, storage control, data archiving, historical data file query, input analysis, task distribution, logic processing, task storage, task tracking, interactive communication, result visualization, log registration and query are realized.
Referring to fig. 2, in an embodiment of the present invention, the data processing apparatus, i.e. the data processing system 004 in the figure, includes an acquisition module 401, a filtering module 402, an analysis processing module 403, and a host queue read/write module 404; the acquisition module 401 is arranged at an exit program between the host transaction and the CICS middleware, and acquires transaction data packets running in the CICS middleware through an assembler; the filtering module 402 is configured to extract and obtain transaction data including a transaction status from the transaction data packet; the analysis processing module 403 is configured to obtain a program calling state of the transaction data at each node according to the transaction data packet and generate link information; generating a fault identifier according to the transaction state of each node of the transaction data; the host queue read-write module 404 is configured to send the transaction data, the fault identifier, and the link information to a platform end through an REXX program and a socket after inputting the transaction data, the fault identifier, and the link information into a host queue. In actual work, the acquisition module 401 is a core program, because the concurrency of host transactions is great, a large number of transactions from a gateway to a CICS at the same time, and the acquisition of data cannot affect the performance of the transactions, the acquisition module 401 adopts a large-machine assembler for development, and deploys an exit program between the host transactions and the CICS middleware, which is a necessary path, so as to finish the efficient acquisition of transaction data; a filtering module 402, which performs preliminary filtering on the collected transaction data and only retains useful data; the analysis processing module 403 determines the logic and fault conditions of the transaction link, performs necessary processing on the data (e.g., generating a hash value to facilitate the matching of subsequent links, transcoding and formatting the communication area to facilitate subsequent processing and reading, etc.), and in addition, in combination with the filtering module 402, provides a more detailed filtering rule for the filtering module (e.g., filtering and intercepting repeated links due to program loop or other reasons, etc.); the host queue read-write module 404 uses the REXX program and the socket to realize the transmission of the host transaction data to the platform-side broken platform.
Referring to fig. 1B, in an embodiment of the invention, the platform end 200 includes a storage module 007, a task scheduler 008, and a page end 009; the storage module 007 is configured to receive the transaction data, the fault identifier and the link information sent by the IBM host 100, and write the transaction data, the fault identifier and the link information into a non-relational database; the task scheduling device 008 is configured to receive a task request for host transaction troubleshooting processing input by a user, and to invoke corresponding transaction data, a fault identifier, and link information from the non-relational database according to a type of the task request; analyzing according to the transaction data, the fault identification and the link information to obtain a processing result, and transmitting the processing result to a page end 009 through a JSON standard format; the page end 009 displays and outputs the received processing result; further, the platform end 200 further includes a log module 010 and a platform communication module 006, the platform communication module 006 communicates with the platform end through a host communication module 005 of the IBM host end, and the IBM host end may include a host transaction 001, a CICS middleware portal program 002, a CICS003 and a data processing system 004.
Referring to fig. 3, in the above embodiment, the memory module 007 includes a memory monitoring unit 701, a memory control unit 702, and a non-relational database 703; the storage monitoring unit 701 is configured to monitor an operation state of the non-relational database and a space state of a disk, and generate alarm information when the operation state and the space state do not meet a preset rule; the storage control unit 702 is configured to backup data files in a predetermined access frequency or access period in the non-relational database 703 to a predetermined storage according to the access frequency or access date of the data files in the non-relational database 703; calling a corresponding data file in a preset memory according to the calling instruction of the task scheduling device and loading the data file to the non-relational database; the non-relational database 703 is used to store the transaction data, the fault identifier, and the link information. In practice, the storage module 007 may include a storage monitoring unit 701, a storage control unit 702, a non-relational database 703, and a data file 704. The storage monitoring unit 701 continuously detects the availability of the database, the disk space and the like, and if the unavailability of the database or the disk space reaching a preset threshold value is detected, an alarm is triggered, and an email is automatically sent to notify operation and maintenance personnel; the storage control unit 702 controls the writing of the non-relational database 703 and the data file 704, and the control strategy is to store the fresh date data with high use frequency by using the database, while the use frequency of the data which is more long-term is very low, the data is periodically stripped from the database to the backup file for storage every day, and the backup file corresponding to the date is loaded to the database by using the storage control unit when the data needs to be used.
The task scheduling system 008 is a set of programs that normally run on the platform server side, and is used for centralized management of task processing and transaction link logic analysis processing. The task scheduling function is encapsulated through development of an MVC (Model, View and Controller) architecture mode, an external service interface of JSON (Java Server object) standard format input and output is provided, a third party can directly call the external service interface, and rapid positioning and automatic analysis of the transaction fault of the host are realized. Referring to fig. 4, in an embodiment of the present invention, the task scheduling device, i.e., the task scheduling system 008 in the figure, includes an input parsing module 801, a task request module 802, a task distributing module 803, a logic processing module 804, and a task tracking module 806; the input analysis module 801 is configured to receive a task request for host transaction troubleshooting processing, analyze the task request to obtain a corresponding environment name, a transaction code, a start time, and an end time, and provide the environment name, the transaction code, the start time, and the end time to the logic processing module; the task request module 802 is configured to respond to the received request instruction and provide a request type corresponding to the request instruction to the task distribution module; the task distribution module 803 is configured to provide a corresponding interface program according to the request type for processing; the logic processing module 804 is configured to store multiple preset interface programs, where the interface programs are configured to process the corresponding request instruction according to a preset rule and generate a processing result according to the request instruction and the corresponding environment name, transaction code, start time, and end time; the task tracking module 806 is configured to transmit the processing result to a page end in a JSON standard format. In actual work, the page end 009 includes a front end of a WEB page and a back end of the WEB page, and the task scheduling system 008 includes an input parsing module 801, a task request module 802, a task distribution module 803, a logic processing module 804, a task storage module 805, a task tracking module 806, and a communication module 807. The input analysis module 801 analyzes the environment name, the transaction code, the start time and the end time of the task request by reading the data packet input by the front end, and provides the information to the subsequent processing module; the task request module 802 is configured to respond to a request of a user and provide a request type to the task distribution module 803; the task distribution module 803 assigns a corresponding interface program to process according to the request type provided by the task request module 802; a logic processing module 804, which includes interface programs with various functions, and each interface program completes corresponding logic function processing; the task storage module 805 is mainly used for storing the transaction states and task processing results of input information, fault location, problem analysis and the like received by the WEB front end so as to be used for task processing and tracking; the task tracking module 806 realizes the full-flow tracking management after task input, including task distribution, task processing, task termination, and returns the result to the front end of the WEB page; the communication module 807 is used for network connection and data transmission with the front end of the WEB page.
Referring to fig. 5, the front end of the WEB page is connected to the task scheduling system 008 through the communication module 902, and provides an input and operation interface for the user, and is used for displaying the processing result. In an embodiment of the present invention, the page end 009 includes an input module 901 and a display module 903; the input module 901 is configured to generate a task request according to user input information, and provide the task request to the task scheduling device; the display module 903 is configured to display a processing result provided by the task scheduling apparatus. Specifically, in actual work, the page end 009 may further include a display module 903, the input module 901 is configured to receive user input, where the input items include an environment name, a transaction code, a start time, an end time, and the like, and the input module also includes operation buttons for querying, canceling, and the like; the communication module 902 realizes the network connection and data transmission between the page end 009 and the task scheduling system 008; the display module 903 presents the results returned by the task scheduling system 008 to the user at the front end of the page.
Referring to fig. 6, the present invention further provides a method for positioning a transaction failure of a mainframe, including:
s601, an IBM host terminal collects transaction data packets of host transaction and CICS middleware, and extracts transaction data containing transaction states from the transaction data packets;
s602, generating link information according to the program calling state of the transaction data at each node, and generating a fault identifier according to the transaction state of the transaction data at each node;
s603, the transaction data, the fault identification and the link information are input into a host queue and then are sent to a platform end;
s604, the platform end receives and saves the transaction data, the fault identification and the link information sent by the IBM host end;
s605 calls corresponding transaction data, fault identification and the link information according to the received task request of the host transaction troubleshooting process, and displays and outputs the transaction data, the fault identification and the link information.
Specifically, in the above embodiment, steps S601 to S603 may include: collecting a transaction data packet running in a CICS middleware at an exit program between a host transaction and the CICS middleware through an assembler, and extracting transaction data containing a transaction state from the transaction data packet; acquiring program calling states of the transaction data at each node according to the transaction data packet and generating link information; generating a fault identifier in the transaction state of each node according to the transaction data; and after the transaction data, the fault identification and the link information are input into a host queue, the transaction data, the fault identification and the link information are sent to a platform end through an REXX program and a socket. The transaction data, the fault identification and the link information sent by the IBM host are received and stored; the steps of calling corresponding transaction data, fault identification and the link information according to the received task request of the host transaction troubleshooting processing, displaying and outputting comprise: receiving the transaction data, the fault identifier and the link information sent by the IBM host end, and writing the transaction data, the fault identifier and the link information into a non-relational database; receiving a task request of host transaction troubleshooting processing input by a user, and calling corresponding transaction data, a fault identifier and link information from the non-relational database according to the type of the task request; analyzing according to the transaction data, the fault identification and the link information to obtain a processing result, and transmitting the processing result to a page end through a JSON standard format; and the page end displays and outputs the received processing result.
In the above embodiment, receiving the transaction data, the fault identifier, and the link information sent by the IBM host, and writing the transaction data, the fault identifier, and the link information into a non-relational database further includes: monitoring the running state of the non-relational database and the space state of a disk, and generating alarm information when the running state and the space state do not accord with a preset rule; backing up the data files in the preset access frequency or access period in the non-relational database to a preset memory according to the access frequency or access date of the data files in the non-relational database; and calling a corresponding data file from a preset memory according to the calling instruction of the task scheduling device and loading the data file to the non-relational database.
In one embodiment of the invention, a task request of host transaction troubleshooting processing input by a user is received, and corresponding transaction data, a fault identifier and link information are called in the non-relational database according to the type of the task request; analyzing and obtaining a processing result according to the transaction data, the fault identifier and the link information, and transmitting the processing result to a page end through a JSON standard format may include: storing a plurality of preset interface programs in a logic processing module; receiving a task request of host transaction troubleshooting processing, analyzing and obtaining a corresponding environment name, a transaction code, a start time and an end time according to the task request, and providing the environment name, the transaction code, the start time and the end time to the logic processing module; the logic processing module matches a corresponding preset interface program according to the request type; the interface program processes the corresponding request instruction according to a preset rule and generates a processing result according to the request instruction and the corresponding environment name, transaction code, start time and end time; and transmitting the processing result to a page end through a JSON standard format. The preset interface program can be written and stored in advance by a worker according to actual needs, can be directly called when the preset interface program is used in subsequent actual needs, and analyzes specific fault conditions by adopting the environment name, the transaction code, the start time and the end time obtained by analysis without adopting the existing environment reconstruction, so that the technical threshold of fault positioning is greatly reduced.
The invention solves the problems of high technical capability level requirement, high experience requirement, multiple and complicated operation steps and low fault positioning and analysis efficiency of technical personnel in the online transaction troubleshooting process of the large host, and provides a simple and easy-to-use system and method for quickly positioning the transaction fault of the large host and automatically assisting in analysis; meanwhile, the business scene of the transaction fault does not need to reappear, and the fault occurring in any time period can be quickly positioned and visually displayed. The invention has the following specific advantages: by the invention, the user can complete the quick positioning of the host transaction fault only by filling in simple input items such as environment name, transaction name, time period and the like on the front-end webpage without depending on a large-scale host middleware technology and a host transaction tracking technology. By the invention, the business scene of the transaction fault does not need to reappear, and the fault occurring in any time period can be quickly positioned and visually displayed. The fault auxiliary analysis function designed by the invention greatly improves the timeliness of fault analysis and solution. The invention can develop a universal interface, provides direct calling for third-party application, and meets the requirement of third-party application for positioning and analyzing faults.
Referring to fig. 7 again, in combination with the above embodiments, the host transaction data collection and storage process of the present invention may include the following steps:
s101, developing an efficient data acquisition module by large-scale machine assembly, deploying the efficient data acquisition module at an exit program between host transaction and CICS middleware, and acquiring host transaction data;
and S102, filtering the data by the data acquisition module, and extracting and supplementing relevant useful information.
Step S103, transaction data, link information, communication area information and the like are output to the host queue.
Step S104, using REXX language development program to obtain host queue data, processing format, and sending to platform side through socket;
step S105: the timing task at the platform side scans and receives the transaction data sent by the host side, and then writes the transaction data into the NoSQL platform database;
step S106: the platform database keeps the transaction data of the latest period of time, and the overdue transaction data are automatically transferred to the data file for storage.
Referring to fig. 7 and 8 again, the specific steps of the host transaction fault location and analysis processing flow may include the following:
step S201: the task scheduling system receives and responds to a host transaction troubleshooting processing task request, and provides the request type for the task distribution module to process;
step S202: after the task distribution module processes the received request, the request type is judged and distributed to a corresponding task processing interface program;
step S203: the interface program receives the request parameters, acquires transaction data from a non-relational platform database NoSQL and carries out logic processing;
step S204: after the interface program completes the logic processing, the processing result data is transmitted to a WEB page background program through a JSON standard format;
step S205: the page background program reconstructs data according to the open source Echarts chart plug-in format, and then stores the data to each variable and transmits the data to the front end of the page;
step S206: and after receiving the processing result data of the background program, the front end of the WEB page calls an open source Echarts chart plug-in to finish the visual display of information such as a transaction link, error information, an automatic analysis result, a communication area and the like.
The invention also provides an electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the method when executing the computer program.
The present invention also provides a computer-readable storage medium storing a computer program for executing the above method.
As shown in fig. 9, the electronic device may further include: communication module 110, input unit 120, audio processing unit 130, display 160, power supply 170. It is noted that the electronic device 600 does not necessarily include all of the components shown in FIG. 9; furthermore, the electronic device 600 may also comprise components not shown in fig. 9, which may be referred to in the prior art.
As shown in fig. 9, a central processing unit 900, sometimes referred to as a controller or operational control, may include a microprocessor or other processor device and/or logic device, the central processing unit 900 receiving input and controlling operation of the various components of the electronic device 600.
The memory 140 may be, for example, one or more of a buffer, a flash memory, a hard drive, a removable media, a volatile memory, a non-volatile memory, or other suitable device. The information relating to the failure may be stored, and a program for executing the information may be stored. And the central processor 900 may execute the program stored in the memory 140 to realize information storage or processing, etc.
The input unit 120 provides input to the central processor 900. The input unit 120 is, for example, a key or a touch input device. The power supply 170 is used to provide power to the electronic device 600. The display 160 is used to display an object to be displayed, such as an image or a character. The display may be, for example, an LCD display, but is not limited thereto.
The memory 140 may be a solid state memory such as Read Only Memory (ROM), Random Access Memory (RAM), a SIM card, or the like. There may also be a memory that holds information even when power is off, can be selectively erased, and is provided with more data, an example of which is sometimes called an EPROM or the like. The memory 140 may also be some other type of device. Memory 140 includes buffer memory 141 (sometimes referred to as a buffer). The memory 140 may include an application/function storage section 142, and the application/function storage section 142 is used to store application programs and function programs or a flow for executing the operation of the electronic device 600 by the central processor 900.
The memory 140 may also include a data store 143, the data store 143 for storing data, such as contacts, digital data, pictures, sounds, and/or any other data used by the electronic device. The driver storage portion 144 of the memory 140 may include various drivers of the electronic device for communication functions and/or for performing other functions of the electronic device (e.g., messaging application, address book application, etc.).
The communication module 110 is a transmitter/receiver 110 that transmits and receives signals via an antenna 111. The communication module (transmitter/receiver) 110 is coupled to the central processor 900 to provide an input signal and receive an output signal, which may be the same as in the case of a conventional mobile communication terminal.
Based on different communication technologies, a plurality of communication modules 110, such as a cellular network module, a bluetooth module, and/or a wireless local area network module, may be provided in the same electronic device. The communication module (transmitter/receiver) 110 is also coupled to a speaker 131 and a microphone 132 via an audio processor 130 to provide audio output via the speaker 131 and receive audio input from the microphone 132 to implement general telecommunications functions. Audio processor 130 may include any suitable buffers, decoders, amplifiers and so forth. In addition, the audio processor 130 is also coupled to the central processor 900, thereby enabling recording locally through the microphone 132 and playing locally stored sounds through the speaker 131.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
The above-mentioned embodiments are intended to illustrate the objects, technical solutions and advantages of the present invention in further detail, and it should be understood that the above-mentioned embodiments are only exemplary embodiments of the present invention, and are not intended to limit the scope of the present invention, and any modifications, equivalent substitutions, improvements and the like made within the spirit and principle of the present invention should be included in the scope of the present invention.

Claims (13)

1. A large host transaction fault positioning system is characterized by comprising an IBM host end and a platform end;
the IBM host side comprises a data processing device;
the data processing device is used for acquiring transaction data packets of host transaction and CICS middleware, and extracting transaction data containing transaction states from the transaction data packets; generating link information according to the program calling state of the transaction data at each node; generating a fault identifier in the transaction state of each node according to the transaction data; inputting the transaction data, the fault identification and the link information into a host queue and then sending the transaction data, the fault identification and the link information to a platform end;
the platform end is used for receiving and storing the transaction data, the fault identification and the link information which are sent by the IBM host end; and calling corresponding transaction data, fault identification and the link information according to the received task request of the host transaction troubleshooting processing, and displaying and outputting the transaction data, the fault identification and the link information.
2. The large host transaction fault location system of claim 1, wherein the data processing device comprises an acquisition module, a filtering module, an analysis processing module, and a host queue read-write module;
the acquisition module is arranged at an exit program between the host transaction and the CICS middleware and acquires transaction data packets running in the CICS middleware through an assembly program;
the filtering module is used for extracting and obtaining transaction data containing transaction states from the transaction data packet;
the analysis processing module is used for obtaining program calling states of the transaction data at each node according to the transaction data packet and generating link information; generating a fault identifier according to the transaction state of each node of the transaction data;
and the host queue read-write module is used for inputting the transaction data, the fault identification and the link information into a host queue and then sending the transaction data, the fault identification and the link information to a platform end through an REXX program and a socket.
3. The mainframe transaction fault location system of claim 1, wherein the platform side comprises a storage module, a task scheduling device and a page side;
the storage module is used for receiving the transaction data, the fault identifier and the link information sent by the IBM host end and writing the transaction data, the fault identifier and the link information into a non-relational database;
the task scheduling device is used for receiving a task request input by a user and used for host transaction troubleshooting processing, and calling corresponding transaction data, fault identification and link information from the non-relational database according to the type of the task request; analyzing according to the transaction data, the fault identification and the link information to obtain a processing result, and transmitting the processing result to a page end through a JSON standard format;
and the page end displays and outputs the received processing result.
4. The mainframe transaction fault location system of claim 3 wherein the storage module comprises a storage monitoring unit, a storage control unit and a non-relational database;
the storage monitoring unit is used for monitoring the running state of the non-relational database and the space state of the disk, and generating alarm information when the running state and the space state do not accord with preset rules;
the storage control unit is used for backing up the data files in the preset access frequency or access period in the non-relational database to a preset storage according to the access frequency or access date of the data files in the non-relational database; calling a corresponding data file in a preset memory according to the calling instruction of the task scheduling device and loading the data file to the non-relational database;
the non-relational database is used for storing the transaction data, the fault identification and the link information.
5. The mainframe transaction fault location system of claim 3, wherein the task scheduling device comprises an input parsing module, a task request module, a task distribution module, a logic processing module, and a task tracking module;
the input analysis module is used for receiving a task request of host transaction troubleshooting processing, analyzing and obtaining a corresponding environment name, a transaction code, a start time and an end time according to the task request and providing the environment name, the transaction code, the start time and the end time to the logic processing module;
the task request module is used for responding to the received request instruction and providing a request type corresponding to the request instruction to the task distribution module;
the task distribution module is used for providing a corresponding interface program for processing according to the request type;
the logic processing module is used for storing a plurality of preset interface programs, and the interface programs are used for processing the corresponding request instructions according to the request instructions and the corresponding environment names, transaction codes, starting time and ending time and according to preset rules to generate processing results;
and the task tracking module is used for transmitting the processing result to a page end through a JSON standard format.
6. The mainframe transaction fault location system of claim 3 wherein the page side includes an input module and a display module;
the input module is used for generating a task request according to user input information and providing the task request to the task scheduling device;
the display module is used for displaying the processing result provided by the task scheduling device.
7. A method for locating a transaction fault of a mainframe, the method comprising:
the IBM host terminal collects transaction data packets of host transaction and CICS middleware, and extracts transaction data containing transaction states from the transaction data packets;
generating link information according to the program calling state of the transaction data at each node, and generating a fault identifier according to the transaction state of the transaction data at each node;
inputting the transaction data, the fault identification and the link information into a host queue and then sending the transaction data, the fault identification and the link information to a platform end;
the platform end receives and saves the transaction data, the fault identification and the link information which are sent by the IBM host end;
and calling corresponding transaction data, fault identification and the link information according to the received task request for host transaction troubleshooting processing, and displaying and outputting the transaction data, the fault identification and the link information.
8. The large mainframe transaction fault location method according to claim 7, wherein the IBM mainframe collects transaction data packets of mainframe transaction and CICS middleware, and extracts transaction data including transaction status from the transaction data packets; inputting the transaction data, the fault identifier and the link information into a host queue and then sending the transaction data, the fault identifier and the link information to a platform end comprises:
collecting a transaction data packet running in a CICS middleware at an exit program between a host transaction and the CICS middleware through an assembler, and extracting transaction data containing a transaction state from the transaction data packet;
acquiring program calling states of the transaction data at each node according to the transaction data packet and generating link information;
generating a fault identifier in the transaction state of each node according to the transaction data;
and after the transaction data, the fault identification and the link information are input into a host queue, the transaction data, the fault identification and the link information are sent to a platform end through an REXX program and a socket.
9. The mainframe transaction fault location method according to claim 7, wherein the transaction data, the fault identifier and the link information sent by the IBM mainframe are received and saved; the steps of calling corresponding transaction data, fault identification and the link information according to the received task request of the host transaction troubleshooting processing, displaying and outputting comprise:
receiving the transaction data, the fault identifier and the link information sent by the IBM host end, and writing the transaction data, the fault identifier and the link information into a non-relational database;
receiving a task request of host transaction troubleshooting processing input by a user, and calling corresponding transaction data, a fault identifier and link information from the non-relational database according to the type of the task request;
analyzing according to the transaction data, the fault identification and the link information to obtain a processing result, and transmitting the processing result to a page end through a JSON standard format;
and the page end displays and outputs the received processing result.
10. The mainframe transaction fault location method of claim 9, wherein receiving the transaction data, the fault identifier, and the link information sent by the IBM mainframe, and writing the transaction data, the fault identifier, and the link information into a non-relational database further comprises:
monitoring the running state of the non-relational database and the space state of a disk, and generating alarm information when the running state and the space state do not accord with a preset rule;
backing up the data files in the preset access frequency or access period in the non-relational database to a preset memory according to the access frequency or access date of the data files in the non-relational database;
and calling a corresponding data file from a preset memory according to the calling instruction of the task scheduling device and loading the data file to the non-relational database.
11. The large host transaction fault location method according to claim 9, wherein a task request for host transaction fault troubleshooting processing input by a user is received, and corresponding transaction data, fault identification and link information are retrieved from the non-relational database according to the type of the task request; analyzing and obtaining a processing result according to the transaction data, the fault identification and the link information, and transmitting the processing result to a page end through a JSON standard format, wherein the processing result comprises the following steps:
storing a plurality of preset interface programs in a logic processing module;
receiving a task request of host transaction troubleshooting processing, analyzing and obtaining a corresponding environment name, a transaction code, a start time and an end time according to the task request, and providing the environment name, the transaction code, the start time and the end time to the logic processing module;
the logic processing module matches a corresponding preset interface program according to the request type;
the interface program processes the corresponding request instruction according to a preset rule and generates a processing result according to the request instruction and the corresponding environment name, transaction code, start time and end time;
and transmitting the processing result to a page end through a JSON standard format.
12. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the method of any one of claims 7 to 11 when executing the computer program.
13. A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program for executing the method of any one of claims 7 to 11.
CN202010111536.6A 2020-02-24 2020-02-24 Large-scale host transaction fault positioning system and method Active CN111324480B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010111536.6A CN111324480B (en) 2020-02-24 2020-02-24 Large-scale host transaction fault positioning system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010111536.6A CN111324480B (en) 2020-02-24 2020-02-24 Large-scale host transaction fault positioning system and method

Publications (2)

Publication Number Publication Date
CN111324480A true CN111324480A (en) 2020-06-23
CN111324480B CN111324480B (en) 2023-07-25

Family

ID=71165208

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010111536.6A Active CN111324480B (en) 2020-02-24 2020-02-24 Large-scale host transaction fault positioning system and method

Country Status (1)

Country Link
CN (1) CN111324480B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112035550A (en) * 2020-09-01 2020-12-04 中国银行股份有限公司 Fault positioning method and device
CN112181697A (en) * 2020-09-21 2021-01-05 厦门南讯股份有限公司 Method and system for rapidly analyzing problems

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6140691A (en) * 1984-07-31 1986-02-26 オムロン株式会社 Abnormal processing for transaction processor
WO2005101214A1 (en) * 2004-04-19 2005-10-27 Oneempower Pte Ltd Transaction system
CN104572351A (en) * 2014-12-23 2015-04-29 中国工商银行股份有限公司 Data recovery system and method based on host system
CN106681909A (en) * 2016-12-02 2017-05-17 中国工商银行股份有限公司 Online transaction fault locating method and device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6140691A (en) * 1984-07-31 1986-02-26 オムロン株式会社 Abnormal processing for transaction processor
WO2005101214A1 (en) * 2004-04-19 2005-10-27 Oneempower Pte Ltd Transaction system
CN104572351A (en) * 2014-12-23 2015-04-29 中国工商银行股份有限公司 Data recovery system and method based on host system
CN106681909A (en) * 2016-12-02 2017-05-17 中国工商银行股份有限公司 Online transaction fault locating method and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
陆坤: "大型主机系统性能可视化监控与管理平台" *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112035550A (en) * 2020-09-01 2020-12-04 中国银行股份有限公司 Fault positioning method and device
CN112181697A (en) * 2020-09-21 2021-01-05 厦门南讯股份有限公司 Method and system for rapidly analyzing problems

Also Published As

Publication number Publication date
CN111324480B (en) 2023-07-25

Similar Documents

Publication Publication Date Title
CN110908875B (en) Inspection method and device based on operation terminal
CN110941563B (en) Automatic script testing method and device
CN110990228A (en) Data interface monitoring method and device
CN112463634B (en) Software testing method and device under micro-service architecture
CN111324480B (en) Large-scale host transaction fault positioning system and method
CN114003451B (en) Interface testing method, device, system and medium
CN112905460B (en) Device and method for simulating three-party receipt by automatic interface test
CN113760611A (en) System site switching method and device, electronic equipment and storage medium
CN111897738B (en) Automatic testing method and device based on atomic service
CN113360413A (en) Code testing method and device
CN111078506A (en) Business data batch running task monitoring method and device
CN112948195A (en) Interface testing method and device, electronic equipment and storage medium
CN112101810A (en) Risk event control method, device and system
CN112102058A (en) Management system parameter configuration method and device
CN113037839B (en) Distributed batch framework communication system and method
CN112988603B (en) Big data test case generation method and device
CN110022244B (en) Method and apparatus for transmitting information
CN113556269B (en) SAP client performance test method, system, electronic equipment and storage medium
CN111538660B (en) Method and device for counting coverage rate of program codes in cloud container
CN114070902B (en) Message processing method and device based on DELPHI
CN112564953B (en) Method, device and equipment for managing remote equipment of office
CN111158744B (en) Cross-platform heterogeneous data integration method and device
CN115203015A (en) Interface full-flow integrated management system
CN110191141B (en) Service calling information processing method and device and computer system
CN116820973A (en) Automatic test method and system based on pseudo code

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant