CN111324480B - Large-scale host transaction fault positioning system and method - Google Patents

Large-scale host transaction fault positioning system and method Download PDF

Info

Publication number
CN111324480B
CN111324480B CN202010111536.6A CN202010111536A CN111324480B CN 111324480 B CN111324480 B CN 111324480B CN 202010111536 A CN202010111536 A CN 202010111536A CN 111324480 B CN111324480 B CN 111324480B
Authority
CN
China
Prior art keywords
transaction
transaction data
host
link information
fault
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010111536.6A
Other languages
Chinese (zh)
Other versions
CN111324480A (en
Inventor
谢小舜
李志军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Industrial and Commercial Bank of China Ltd ICBC
Original Assignee
Industrial and Commercial Bank of China Ltd ICBC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Industrial and Commercial Bank of China Ltd ICBC filed Critical Industrial and Commercial Bank of China Ltd ICBC
Priority to CN202010111536.6A priority Critical patent/CN111324480B/en
Publication of CN111324480A publication Critical patent/CN111324480A/en
Application granted granted Critical
Publication of CN111324480B publication Critical patent/CN111324480B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/0706Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment
    • G06F11/0715Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation the processing taking place on a specific hardware platform or in a specific software environment in a system implementing multitasking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/0703Error or fault processing not based on redundancy, i.e. by taking additional measures to deal with the error or fault not making use of redundancy in operation, in hardware, or in data representation
    • G06F11/079Root cause analysis, i.e. error or fault diagnosis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Debugging And Monitoring (AREA)

Abstract

The invention provides a large-scale host transaction fault positioning system and a method, wherein the system comprises an IBM host end and a platform end; the IBM host side comprises a data processing device; the data processing device is used for collecting transaction data packets of host transaction and CICS middleware, and extracting transaction data containing transaction states from the transaction data packets; generating link information according to the program call state of each node of the transaction data; generating a fault identifier according to the transaction state of each node of the transaction data; inputting the transaction data, the fault identification and the link information into a host queue and then sending the transaction data, the fault identification and the link information to a platform end; the platform end is used for receiving and storing the transaction data, the fault identification and the link information sent by the IBM host end; and according to the received task request of the host transaction fault checking process, the corresponding transaction data, fault identification and the link information are called, and displayed and output.

Description

Large-scale host transaction fault positioning system and method
Technical Field
The invention relates to the technical field of software fault positioning, in particular to a method for rapidly positioning transaction faults and tracing scenes of an IBM large-scale host.
Background
The host (Mainframe) is a large computer developed by International Business machines corporation (IBM corporation for short), which uses special hardware devices and system software, and has the characteristics of high stability and safety, the host is a main stream product purchased and used by large financial institutions such as major commercial banks, insurance companies, securities companies and the like, and numerous businesses of the financial institutions such as account opening, deposit and withdrawal, accounting and the like of the banks are completed through online transaction programs running on the host CICS middleware. IBM corporation provides a CEDF/CEDX tracking tool in CICS for the tracking and troubleshooting of host online transaction failures. The CEDF/CEDX tracking tool can basically realize the analysis and positioning of faults, but in the actual use process, related operations involve multiple steps and complex operations, and a certain level of host technology is needed to complete the corresponding operations; meanwhile, the transaction and the procedure are complicated in calling links, the CEDF/CEDX is difficult to analyze, the fault positioning analysis is time-consuming, the problem solving timeliness of a financial institution production system is important, the problem positioning analysis efficiency is low, the financial institution can not provide service for the outside for a long time, and the financial institution and the customer thereof are lost; in addition, the use of CEDF/CEDX or some existing online transaction tracking tools requires the reproduction of problems, but in many situations of online transactions using large hosts, whether in production, test or development environments, there are often cases where problems cannot be reproduced, which causes failure analysis of host transactions to depend on technical skill level and experience of technicians, and dynamic links of transactions are long, involve multiple applications, involve numerous development, test and operation staff, and have large communication costs. Although the open platform has a perfect error reporting tracking tool, the open platform is completely different from the system mechanism of the IBM host and cannot be moved to a large-scale host for use.
Disclosure of Invention
The invention aims to provide a system and a method for realizing quick positioning and scene traceability of transaction faults of a large host by an open platform, so that steps of tracking, positioning and analyzing the transaction faults of the large host on line are simplified through a friendly and easy-to-use visual operation mode.
In order to achieve the above purpose, the present invention provides a large-scale host transaction fault location system, which comprises an IBM host end and a platform end; the IBM host side comprises a data processing device; the data processing device is used for collecting transaction data packets of host transaction and CICS middleware, and extracting transaction data containing transaction states from the transaction data packets; generating link information according to the program call state of each node of the transaction data; generating a fault identifier according to the transaction state of each node of the transaction data; inputting the transaction data, the fault identification and the link information into a host queue and then sending the transaction data, the fault identification and the link information to a platform end; the platform end is used for receiving and storing the transaction data, the fault identification and the link information sent by the IBM host end; and according to the received task request of the host transaction fault checking process, the corresponding transaction data, fault identification and the link information are called, and displayed and output.
In the above large-scale host transaction fault positioning system, preferably, the data processing device includes an acquisition module, a filtering module, an analysis processing module and a host queue read-write module; the acquisition module is arranged at an exit program between host transaction and CICS middleware, and acquires transaction data packets running in the CICS middleware through an assembler; the filtering module is used for extracting and obtaining transaction data containing transaction states from the transaction data packet; the analysis processing module is used for obtaining the program call state of the transaction data at each node according to the transaction data packet and generating link information; generating fault identification according to the transaction state of each node of the transaction data; the host queue read-write module is used for inputting the transaction data, the fault identification and the link information into a host queue and then sending the transaction data, the fault identification and the link information to a platform end through REXX programs and socket sockets.
In the above large-scale host transaction fault positioning system, preferably, the platform end includes a storage module, a task scheduling device and a page end; the storage module is used for receiving the transaction data, the fault identification and the link information sent by the IBM host end and writing the transaction data, the fault identification and the link information into a non-relational database; the task scheduling device is used for receiving a task request of the host transaction fault troubleshooting processing input by a user, and according to the type of the task request, the task scheduling device is used for scheduling corresponding transaction data, fault identification and link information in the non-relational database; according to the transaction data, the fault identification and the link information analysis, a processing result is obtained, and the processing result is transmitted to a page end through a JSON standard format; and the page end displays and outputs the received processing result.
In the above large-scale host transaction fault location system, preferably, the storage module includes a storage monitoring unit, a storage control unit and a non-relational database; the storage monitoring unit is used for monitoring the running state of the non-relational database and the space state of the disk, and generating alarm information when the running state and the space state do not accord with a preset rule; the storage control unit is used for backing up the data files in the predetermined access frequency or access period in the non-relational database to a predetermined memory according to the access frequency or access date of the data files in the non-relational database; according to the calling instruction of the task scheduling device, the corresponding data file is called in a preset memory and is loaded to the non-relational database; the non-relational database is configured to store the transaction data, the fault identification, and the link information.
In the above large-scale host transaction fault positioning system, preferably, the task scheduling device includes an input analysis module, a task request module, a task distribution module, a logic processing module and a task tracking module; the input analysis module is used for receiving a task request of the host transaction fault troubleshooting process, analyzing and obtaining a corresponding environment name, transaction codes, starting time and ending time according to the task request and providing the environment name, the transaction codes, the starting time and the ending time to the logic processing module; the task request module is used for responding to the received request instruction and providing a request type corresponding to the request instruction to the task distribution module; the task distribution module is used for providing a corresponding interface program for processing according to the request type; the logic processing module is used for storing a plurality of preset interface programs, and the interface programs are used for processing the corresponding request instructions according to preset rules and generating processing results according to the request instructions, the corresponding environment names, the corresponding transaction codes, the corresponding starting time and the corresponding ending time; the task tracking module is used for transmitting the processing result to a page end through a JSON standard format.
In the above large-scale host transaction fault positioning system, preferably, the page end includes an input module and a display module; the input module is used for generating a task request according to user input information and providing the task request to the task scheduling device; the display module is used for displaying the processing result provided by the task scheduling device.
The invention also provides a large-scale host transaction fault positioning method, which comprises the following steps: the method comprises the steps that an IBM host side collects transaction data packets of host transaction and CICS middleware, and transaction data containing transaction states are extracted from the transaction data packets; generating link information according to the program call state of the transaction data in each node, and generating fault identification according to the transaction state of the transaction data in each node; inputting the transaction data, the fault identification and the link information into a host queue and then sending the transaction data, the fault identification and the link information to a platform end; the platform end receives and stores the transaction data, the fault identification and the link information sent by the IBM host end; and according to the received task request of the host transaction fault checking process, the corresponding transaction data, fault identification and the link information are called, and displayed and output.
In the above large-scale host transaction fault positioning method, preferably, the IBM host side collects transaction data packets of host transaction and CICS middleware, and extracts transaction data including a transaction state from the transaction data packets; inputting the transaction data, the fault identification and the link information into a host queue and then sending the transaction data, the fault identification and the link information to a platform end comprises the following steps: collecting a transaction data packet of the CICS middleware running at an exit program between host transaction and the CICS middleware through an assembler, and extracting transaction data containing a transaction state from the transaction data packet; acquiring program call states of the transaction data at all nodes according to the transaction data packet and generating link information; generating a fault identifier according to the transaction state of each node of the transaction data; and after the transaction data, the fault identification and the link information are input into a host queue, the transaction data, the fault identification and the link information are sent to a platform end through a REXX program and a socket.
In the above large-scale host transaction fault positioning method, preferably, the transaction data, the fault identifier and the link information sent by the IBM host side are received and stored; and according to the received task request of the host transaction fault check processing, the corresponding transaction data, fault identification and the link information are called, and the display output comprises: receiving the transaction data, the fault identification and the link information sent by the IBM host end, and writing the transaction data, the fault identification and the link information into a non-relational database; receiving a task request of host transaction fault troubleshooting processing input by a user, and calling corresponding transaction data, fault identification and link information from the non-relational database according to the type of the task request; according to the transaction data, the fault identification and the link information analysis, a processing result is obtained, and the processing result is transmitted to a page end through a JSON standard format; and the page end displays and outputs the received processing result.
In the foregoing large-scale host transaction fault location method, preferably, receiving the transaction data, the fault identifier and the link information sent by the IBM host, and writing the transaction data, the fault identifier and the link information into the non-relational database further includes: monitoring the running state of the non-relational database and the space state of a disk, and generating alarm information when the running state and the space state do not accord with a preset rule; according to the access frequency or the access date of the data files in the non-relational database, backing up the data files in the predetermined access frequency or the access period in the non-relational database to a predetermined memory; and according to the calling instruction of the task scheduling device, calling the corresponding data file in a preset memory and loading the data file into the non-relational database.
In the above large-scale host transaction fault positioning method, preferably, a task request of host transaction fault troubleshooting processing input by a user is received, and corresponding transaction data, fault identification and link information are called from the non-relational database according to the type of the task request; analyzing and obtaining a processing result according to the transaction data, the fault identification and the link information, and transmitting the processing result to a page end through a JSON standard format comprises the following steps: storing a plurality of preset interface programs in a logic processing module; receiving a task request of host transaction fault troubleshooting processing, analyzing and obtaining a corresponding environment name, transaction codes, start time and end time according to the task request, and providing the environment name, the transaction codes, the start time and the end time to the logic processing module; the logic processing module matches a corresponding preset interface program according to the request type; the interface program processes the corresponding request instruction according to a preset rule and generates a processing result according to the request instruction, the corresponding environment name, the transaction code, the starting time and the ending time; and transmitting the processing result to a page end through a JSON standard format.
The invention also provides an electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the above method when executing the computer program.
The present invention also provides a computer readable storage medium storing a computer program for executing the above method.
The beneficial technical effects of the invention are as follows: the method simplifies the steps of online transaction fault tracking, positioning and analyzing of the large host through a friendly and easy-to-use visual operation mode, greatly improves the efficiency of technician communication and problem solving, and breaks through the limitation that the traditional CEDF/CEDX tracking mode has higher host technical capability for users. Meanwhile, all transaction faults remain on site, business scenes do not need to be reproduced, and faults occurring in any time period can be rapidly positioned and intuitively displayed.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this application, illustrate and together with the description serve to explain the invention. In the drawings:
FIG. 1A is a schematic diagram of a large-scale host transaction fault location system according to an embodiment of the present invention;
FIG. 1B is a schematic diagram of an application architecture of a large-scale host transaction fault location system according to an embodiment of the present invention;
FIG. 2 is a block diagram of a data processing function according to an embodiment of the present invention;
FIG. 3 is a block diagram of a memory function according to an embodiment of the present invention;
FIG. 4 is a functional block diagram of a task scheduling system according to an embodiment of the present invention;
FIG. 5 is a block diagram of a front-end function of a WEB page according to an embodiment of the present invention;
FIG. 6 is a flowchart of a method for locating a transaction fault of a mainframe according to an embodiment of the present invention;
FIG. 7 is a flow chart of the method for locating a transaction fault of a mainframe according to an embodiment of the present invention;
FIG. 8 is a schematic diagram of an analysis flow chart of a large-scale host transaction fault location method according to an embodiment of the invention;
fig. 9 is a schematic structural diagram of an electronic device according to an embodiment of the invention.
Detailed Description
The following will describe embodiments of the present invention in detail with reference to the drawings and examples, thereby solving the technical problems by applying technical means to the present invention, and realizing the technical effects can be fully understood and implemented accordingly. It should be noted that, as long as no conflict is formed, each embodiment of the present invention and each feature of each embodiment may be combined with each other, and the formed technical solutions are all within the protection scope of the present invention.
Additionally, the steps illustrated in the flowcharts of the figures may be performed in a computer system such as a set of computer executable instructions, and although a logical order is illustrated in the flowcharts, in some cases the steps illustrated or described may be performed in an order other than that herein.
Referring to fig. 1A, the present invention provides a large-scale host transaction fault location system, which includes an IBM host 100 and a platform 200; the IBM host side 100 comprises data processing means 004; the data processing device 004 is used for collecting transaction data packets of host transaction and CICS middleware, and extracting transaction data containing transaction states from the transaction data packets; generating link information according to the program call state of each node of the transaction data; generating a fault identifier according to the transaction state of each node of the transaction data; inputting the transaction data, the fault identification and the link information into a host queue and then sending the transaction data, the fault identification and the link information to a platform end; the platform end 200 is configured to receive and store the transaction data, the fault identifier, and the link information sent by the IBM host end; and according to the received task request of the host transaction fault checking process, the corresponding transaction data, fault identification and the link information are called, and displayed and output. In actual operation, the IBM host side: by deploying a data processing module with extremely high execution efficiency, which is developed by using a computer assembly, at the outlet program between the host transaction data packet and the CICS middleware, specifically, the system provides corresponding outlets when application programs execute CICS API instructions, users customize the corresponding outlet programs, and the application programs call the corresponding outlet programs when executing the CICS APIs. When a transaction is started, an export program applies for a memory in CICS through GETMAIN, and stores a memory address in TWA, and then other export programs of the TASK acquire the memory address through TWA, and store transaction link information; when the cross-partition call occurs, the export program transmits the transaction link data to the target partition through the transaction-level shared memory Channel, and when the collapse partition returns, the export program transmits the transaction link data back through the transaction-level shared memory Channel. The data processing module acquires and saves transaction data in real time through the memories, and writes the acquired transaction link information into the TDQ queue when the transaction is finished, so that efficient acquisition, filtering, preliminary analysis and queue reading and writing of the data are realized. Realizing the collapse platform transmission of the transaction link data message through REXX program and socket; open platform end: the open platform end software system is developed through java programming, and a series of functions such as transaction data receiving, data warehousing, storage monitoring, storage control, data archiving, historical data file inquiring, input analysis, task distributing, logic processing, task storing, task tracking, interactive communication, result visualization, log registering and inquiring are realized.
Referring to fig. 2, in an embodiment of the present invention, the data processing apparatus, namely a data processing system 004 in the figure, includes an acquisition module 401, a filtering module 402, an analysis processing module 403, and a host queue read-write module 404; the acquisition module 401 is arranged at an exit program between host transaction and CICS middleware, and acquires transaction data packets running in the CICS middleware through an assembler; the filtering module 402 is configured to extract transaction data including a transaction status from the transaction data packet; the analysis processing module 403 is configured to obtain a program call state of the transaction data at each node according to the transaction data packet and generate link information; generating fault identification according to the transaction state of each node of the transaction data; the host queue read-write module 404 is configured to input the transaction data, the fault identifier, and the link information into a host queue, and then send the transaction data, the fault identifier, and the link information to a platform end through a REXX program and a socket. In actual work, the collection module 401 is a core program, because the concurrency of host transactions is great, a large number of transactions are carried out from the gateway to the CICS at the same time, and the collection of data cannot influence the performance of the transactions, so the collection module 401 adopts a necessary path of development of a host assembler, namely an exit program deployed between the host transactions and the CICS middleware, and the efficient collection of transaction data is completed; the filtering module 402 performs preliminary filtering on the collected transaction data, and only retains useful data; the analysis processing module 403 determines that the transaction link logic and the fault condition are the same, and performs necessary processing on the data (such as generating a hash value, facilitating the carding of the corresponding relationship of the subsequent link, transcoding and formatting the communication area, facilitating the subsequent processing and reading, etc.), and in addition, in combination with the filtering module 402, provides a finer filtering rule for the filtering module (such as performing filtering and cutting-off processing on the repeated link due to program dead-loop or other reasons, etc.); the host queue read-write module 404 uses REXX programs and socket sockets to implement the collapsed platform transmission of host transaction data to the platform side.
Referring to fig. 1B, in an embodiment of the present invention, the platform 200 includes a storage module 007, a task scheduler 008 and a page 009; the storage module 007 is configured to receive the transaction data, the fault identifier and the link information sent by the IBM host 100, and write the transaction data, the fault identifier and the link information into a non-relational database; the task scheduling device 008 is configured to receive a task request for host transaction fault troubleshooting processing input by a user, and call corresponding transaction data, fault identifiers and link information from the non-relational database according to a type of the task request; according to the transaction data, the fault identification and the link information analysis, a processing result is obtained, and the processing result is transmitted to a page end 009 through a JSON standard format; the page end 009 displays and outputs the received processing result; further, the platform end 200 further comprises a log module 010 and a platform communication module 006, wherein the platform communication module 006 communicates with the host communication module 005 of the IBM host end, and the IBM host end may comprise a host transaction 001, a cics middleware portal 002, a cics003, and a data processing system 004.
Referring to fig. 3, in the above embodiment, the storage module 007 includes a storage monitoring unit 701, a storage control unit 702, and a non-relational database 703; the storage monitoring unit 701 is configured to monitor an operation state of the non-relational database and a space state of a disk, and generate alarm information when the operation state and the space state do not conform to a preset rule; the storage control unit 702 is configured to backup, according to the access frequency or the access date of the data files in the non-relational database 703, the data files in a predetermined access frequency or access period in the non-relational database 703 to a predetermined memory; according to the calling instruction of the task scheduling device, the corresponding data file is called in a preset memory and is loaded to the non-relational database; the non-relational database 703 is used to store the transaction data, the fault identification, and the link information. In actual operation, the storage module 007 may include a storage monitoring unit 701, a storage control unit 702, a non-relational database 703, and a data file 704. The storage monitoring unit 701 continuously detects the availability of the database, the disk space and the like, and if the database is detected to be unavailable or the disk space reaches a preset threshold value, an alarm is triggered, and an email is automatically sent to inform operation and maintenance personnel; the storage control unit 702 controls writing of the non-relational database 703 and the data file 704, and the control strategy is that the database is used for storing fresh date data with high use frequency, the data with relatively long use frequency is small, the data is periodically stripped from the database to the backup file storage every day, and the backup file with corresponding date is loaded into the database by the storage control unit when the data is required to be used, so that the performance of the database is ensured and the overall storage cost is controlled.
Task scheduling system 008 is a set of programs normally running on the platform server side for centralized management of task processing and transaction link logic analysis processing. Through MVC architecture pattern (Model, view, controller) development, the task scheduling function is encapsulated, an external service interface of JSON standard format input and output is provided, and a third party can directly call, so that the quick positioning and automatic analysis of the transaction faults of the host are realized. Referring to fig. 4, in an embodiment of the present invention, the task scheduling device, i.e. the task scheduling system 008 in the figure, includes an input parsing module 801, a task requesting module 802, a task distributing module 803, a logic processing module 804 and a task tracking module 806; the input analysis module 801 is configured to receive a task request of a host transaction fault troubleshooting process, obtain a corresponding environment name, a transaction code, a start time and an end time according to the task request, and provide the environment name, the transaction code, the start time and the end time to the logic processing module; the task request module 802 is configured to respond to the received request instruction, and provide a request type corresponding to the request instruction to the task distribution module; the task distribution module 803 is configured to provide a corresponding interface program for processing according to the request type; the logic processing module 804 is configured to store a plurality of preset interface programs, where the interface programs are configured to process the corresponding request instruction according to a preset rule according to the request instruction and the corresponding environment name, transaction code, start time and end time, and generate a processing result; the task tracking module 806 is configured to transmit the processing result to a page end through JSON standard format. In actual operation, the page end 009 comprises a WEB page front end and a WEB page back end, and the task scheduling system 008 includes an input parsing module 801, a task requesting module 802, a task distributing module 803, a logic processing module 804, a task storage module 805, a task tracking module 806 and a communication module 807. The input analysis module 801 analyzes the environment name, the transaction code, the start time and the end time of the task request by reading the data packet input by the front end, and provides information to the subsequent processing module; the task request module 802 is configured to provide a request type to the task distribution module 803 in response to a request of a user; the task distribution module 803 assigns a corresponding interface program to process according to the request type provided by the task request module 802; a logic processing module 804 containing interface programs of various functions, each of which completes corresponding logic function processing; the task storage module 805 is mainly configured to store transaction states and task processing results, such as input information received by the WEB front end, fault location, and problem analysis, so as to be used for task processing and tracking; the task tracking module 806 realizes the whole flow tracking management after task input, including task distribution, task processing, task ending, and returns the result to the front end of the WEB page; the communication module 807 is used for network connection and data transmission with the front end of the WEB page.
Referring to fig. 5, the front end of the WEB page is connected to the task scheduling system 008 through the communication module 902, and provides an input and operation interface for the user, and is used for displaying the processing result. In an embodiment of the present invention, the page 009 includes an input module 901 and a display module 903; the input module 901 is configured to generate a task request according to user input information, and provide the task request to the task scheduling device; the display module 903 is configured to display a processing result provided by the task scheduling device. Specifically, in actual work, the page end 009 may further include a display module 903, where the input module 901 is configured to receive user input, and the input items include an environment name, a transaction code, a start time, an end time, and the like, and the input module also includes operation buttons for querying, canceling, and the like; the communication module 902 realizes the network connection and data transmission between the page end 009 and the task scheduling system 008; the display module 903 displays the result returned by the task scheduling system 008 to the user at the front end of the page.
Referring to fig. 6, the present invention further provides a method for locating a transaction fault of a large host, where the method includes:
s601, an IBM host terminal collects transaction data packets of host transaction and CICS middleware, and extracts transaction data containing transaction states from the transaction data packets;
S602, generating link information according to the program call state of the transaction data at each node, and generating a fault identifier according to the transaction state of the transaction data at each node;
s603, inputting the transaction data, the fault identification and the link information into a host queue and then sending the transaction data, the fault identification and the link information to a platform end;
s604, the platform end receives and stores the transaction data, the fault identification and the link information sent by the IBM host end;
s605, corresponding transaction data, fault identification and the link information are called according to the received task request of the host transaction fault checking process, and the corresponding transaction data, the fault identification and the link information are displayed and output.
Specifically, in the above embodiment, the steps S601 to S603 may include: collecting a transaction data packet of the CICS middleware running at an exit program between host transaction and the CICS middleware through an assembler, and extracting transaction data containing a transaction state from the transaction data packet; acquiring program call states of the transaction data at all nodes according to the transaction data packet and generating link information; generating a fault identifier according to the transaction state of each node of the transaction data; and after the transaction data, the fault identification and the link information are input into a host queue, the transaction data, the fault identification and the link information are sent to a platform end through a REXX program and a socket. Receiving and storing the transaction data, the fault identification and the link information sent by the IBM host; and according to the received task request of the host transaction fault check processing, the corresponding transaction data, fault identification and the link information are called, and the display output comprises: receiving the transaction data, the fault identification and the link information sent by the IBM host end, and writing the transaction data, the fault identification and the link information into a non-relational database; receiving a task request of host transaction fault troubleshooting processing input by a user, and calling corresponding transaction data, fault identification and link information from the non-relational database according to the type of the task request; according to the transaction data, the fault identification and the link information analysis, a processing result is obtained, and the processing result is transmitted to a page end through a JSON standard format; and the page end displays and outputs the received processing result.
In the above embodiment, receiving the transaction data, the failure identifier, and the link information sent by the IBM host, and writing the transaction data, the failure identifier, and the link information into the non-relational database further includes: monitoring the running state of the non-relational database and the space state of a disk, and generating alarm information when the running state and the space state do not accord with a preset rule; according to the access frequency or the access date of the data files in the non-relational database, backing up the data files in the predetermined access frequency or the access period in the non-relational database to a predetermined memory; and according to the calling instruction of the task scheduling device, calling the corresponding data file in a preset memory and loading the data file into the non-relational database.
In an embodiment of the invention, a task request of host transaction fault troubleshooting processing input by a user is received, and corresponding transaction data, fault identification and link information are called from the non-relational database according to the type of the task request; according to the transaction data, the fault identification and the link information analysis, obtaining a processing result, and transmitting the processing result to a page end through a JSON standard format may include: storing a plurality of preset interface programs in a logic processing module; receiving a task request of host transaction fault troubleshooting processing, analyzing and obtaining a corresponding environment name, transaction codes, start time and end time according to the task request, and providing the environment name, the transaction codes, the start time and the end time to the logic processing module; the logic processing module matches a corresponding preset interface program according to the request type; the interface program processes the corresponding request instruction according to a preset rule and generates a processing result according to the request instruction, the corresponding environment name, the transaction code, the starting time and the ending time; and transmitting the processing result to a page end through a JSON standard format. The preset interface program can be written and stored in advance by staff according to actual needs, can be directly called when the subsequent actual needs are used, analyzes specific fault conditions by adopting the environment name, the transaction code, the starting time and the ending time which are obtained through analysis, does not need to adopt the existing environment reconstruction, and greatly reduces the technical threshold of fault positioning.
The invention solves the problems of high technical capability level requirement, high experience requirement, more operation steps, complexity and lower fault positioning and analysis efficiency of technicians in the online transaction fault investigation process of the large-scale host, and provides a system and a method for quickly positioning and automatically assisting in analysis of the transaction fault of the large-scale host, which are simple and easy to use; meanwhile, the business scene of the transaction fault does not need to be reproduced, and the fault occurring in any time period can be rapidly positioned and intuitively displayed. The invention has the following specific advantages: according to the invention, a user can complete quick positioning of the host transaction faults by only filling simple input items such as environment names, transaction names and time periods in the front-end webpage, and the large-scale host middleware technology and host transaction tracking technology are not required to be relied on. According to the invention, the business scene of transaction faults does not need to be reproduced, and faults occurring in any time period can be rapidly positioned and intuitively displayed. The fault auxiliary analysis function designed by the invention greatly improves the timeliness of fault analysis and solution. The invention can develop a universal interface, provide direct calling for the third party application and meet the requirement of the third party application on positioning and analyzing faults.
Referring to fig. 7 again, in combination with the above embodiment, the host transaction data collection and storage process of the present invention may include the following steps:
Step S101, a high-efficiency data acquisition module is compiled and developed by a host computer, and is deployed at an exit program between a host computer transaction and CICS middleware to acquire host computer transaction data;
in step S102, the data acquisition module filters the data and extracts and supplements relevant useful information.
Step S103, outputting the transaction data, the link information, the communication area information and the like to a host queue.
Step S104, developing a program by REXX language, acquiring host queue data, performing format processing, and then transmitting the host queue data to a platform side through a socket;
step S105: the timing task of the platform side scans and receives transaction data sent by the host side and then writes the transaction data into the NoSQL platform database;
step S106: the platform database retains transaction data of the last period of time, and expired transaction data is automatically migrated to the data file storage.
Referring to fig. 7 and 8, the specific steps of the host transaction fault locating and analyzing process flow may include the following steps:
step S201: the task scheduling system receives and responds to a task request processed by the transaction fault check of the host computer, and provides a request type for the task distribution module to process;
step S202: after the task distribution module processes the received request, judging the type of the request and distributing the request to a corresponding task processing interface program;
Step S203: the interface program receives the request parameters, acquires transaction data from a non-relational platform database NoSQL and carries out logic processing;
step S204: after the interface program finishes logic processing, transmitting the processing result data to a WEB page background program through a JSON standard format;
step S205: the page background program reconstructs data according to the open source Echarts chart plug-in format, and then stores the data into variables and transmits the variables to the front end of the page;
step S206: and after receiving the processing result data of the background program, the front end of the WEB page calls an open source Echarts chart plug-in to complete visual display of information such as transaction links, error reporting information, automatic analysis results, communication areas and the like.
The invention also provides an electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the above method when executing the computer program.
The present invention also provides a computer readable storage medium storing a computer program for executing the above method.
As shown in fig. 9, the electronic device may further include: a communication module 110, an input unit 120, an audio processing unit 130, a display 160, a power supply 170. It is noted that the electronic device 600 need not include all of the components shown in fig. 9; in addition, the electronic device 600 may further include components not shown in fig. 9, to which reference is made to the related art.
As shown in fig. 9, the central processor 900, sometimes also referred to as a controller or operational control, may include a microprocessor or other processor device and/or logic device, which central processor 900 receives inputs and controls the operation of the various components of the electronic device 600.
The memory 140 may be, for example, one or more of a buffer, a flash memory, a hard drive, a removable media, a volatile memory, a non-volatile memory, or other suitable device. The information about failure may be stored, and a program for executing the information may be stored. And the central processor 900 may execute the program stored in the memory 140 to implement information storage or processing, etc.
The input unit 120 provides input to the central processor 900. The input unit 120 is, for example, a key or a touch input device. The power supply 170 is used to provide power to the electronic device 600. The display 160 is used for displaying display objects such as images and characters. The display may be, for example, but not limited to, an LCD display.
The memory 140 may be a solid state memory such as Read Only Memory (ROM), random Access Memory (RAM), SIM card, or the like. But also a memory which holds information even when powered down, can be selectively erased and provided with further data, an example of which is sometimes referred to as EPROM or the like. Memory 140 may also be some other type of device. Memory 140 includes a buffer memory 141 (sometimes referred to as a buffer). The memory 140 may include an application/function storage 142, the application/function storage 142 for storing application programs and function programs or a flow for executing operations of the electronic device 600 by the central processor 900.
The memory 140 may also include a data store 143, the data store 143 for storing data, such as contacts, digital data, pictures, sounds, and/or any other data used by the electronic device. The driver storage 144 of the memory 140 may include various drivers of the electronic device for communication functions and/or for performing other functions of the electronic device (e.g., messaging applications, address book applications, etc.).
The communication module 110 is a transmitter/receiver 110 that transmits and receives signals via an antenna 111. The communication module (transmitter/receiver) 110 is coupled to the central processor 900 to provide an input signal and receive an output signal, which may be the same as in the case of a conventional mobile communication terminal.
Based on different communication technologies, a plurality of communication modules 110, such as a cellular network module, a bluetooth module, and/or a wireless local area network module, etc., may be provided in the same electronic device. The communication module (transmitter/receiver) 110 is also coupled to a speaker 131 and a microphone 132 via an audio processor 130 to provide audio output via the speaker 131 and to receive audio input from the microphone 132 to implement usual telecommunication functions. The audio processor 130 may include any suitable buffers, decoders, amplifiers and so forth. In addition, the audio processor 130 is also coupled to the central processor 900 so that sound can be recorded locally through the microphone 132 and so that sound stored locally can be played through the speaker 131.
It will be appreciated by those skilled in the art that embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flowchart illustrations and/or block diagrams, and combinations of flows and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
The foregoing description of the embodiments has been provided for the purpose of illustrating the general principles of the invention, and is not meant to limit the scope of the invention, but to limit the invention to the particular embodiments, and any modifications, equivalents, improvements, etc. that fall within the spirit and principles of the invention are intended to be included within the scope of the invention.

Claims (11)

1. A large mainframe transaction fault location system, the system comprising an IBM mainframe side and a platform side;
the IBM host side comprises a data processing device;
the data processing device is used for collecting transaction data packets of host transaction and CICS middleware, and extracting transaction data containing transaction states from the transaction data packets; generating link information according to the program call state of each node of the transaction data; generating a fault identifier according to the transaction state of each node of the transaction data; inputting the transaction data, the fault identification and the link information into a host queue and then sending the transaction data, the fault identification and the link information to a platform end;
the platform end is used for receiving and storing the transaction data, the fault identification and the link information sent by the IBM host end; and according to the task request of the received host transaction fault check processing, the corresponding transaction data, fault identification and the link information are called, displayed and output;
the data processing device comprises an acquisition module, a filtering module, an analysis processing module and a host queue read-write module;
the acquisition module is arranged at an exit program between host transaction and CICS middleware, and acquires transaction data packets running in the CICS middleware through an assembler;
The filtering module is used for extracting and obtaining transaction data containing transaction states from the transaction data packet;
the analysis processing module is used for obtaining the program call state of the transaction data at each node according to the transaction data packet and generating link information; generating fault identification according to the transaction state of each node of the transaction data;
the host queue read-write module is used for inputting the transaction data, the fault identification and the link information into a host queue and then sending the transaction data, the fault identification and the link information to a platform end through REXX programs and socket sockets.
2. The large host transaction fault location system of claim 1, wherein the platform side comprises a storage module, a task scheduling device and a page side;
the storage module is used for receiving the transaction data, the fault identification and the link information sent by the IBM host end and writing the transaction data, the fault identification and the link information into a non-relational database;
the task scheduling device is used for receiving a task request of the host transaction fault troubleshooting processing input by a user, and according to the type of the task request, the task scheduling device is used for scheduling corresponding transaction data, fault identification and link information in the non-relational database; according to the transaction data, the fault identification and the link information analysis, a processing result is obtained, and the processing result is transmitted to a page end through a JSON standard format;
And the page end displays and outputs the received processing result.
3. The large host transaction fault location system of claim 2, wherein the storage module comprises a storage monitoring unit, a storage control unit, and a non-relational database;
the storage monitoring unit is used for monitoring the running state of the non-relational database and the space state of the disk, and generating alarm information when the running state and the space state do not accord with a preset rule;
the storage control unit is used for backing up the data files in the predetermined access frequency or access period in the non-relational database to a predetermined memory according to the access frequency or access date of the data files in the non-relational database; according to the calling instruction of the task scheduling device, the corresponding data file is called in a preset memory and is loaded to the non-relational database;
the non-relational database is configured to store the transaction data, the fault identification, and the link information.
4. The large-scale host transaction fault location system of claim 2, wherein the task scheduling device comprises an input parsing module, a task request module, a task distribution module, a logic processing module, and a task tracking module;
The input analysis module is used for receiving a task request of the host transaction fault troubleshooting process, analyzing and obtaining a corresponding environment name, transaction codes, starting time and ending time according to the task request and providing the environment name, the transaction codes, the starting time and the ending time to the logic processing module;
the task request module is used for responding to the received request instruction and providing a request type corresponding to the request instruction to the task distribution module;
the task distribution module is used for providing a corresponding interface program for processing according to the request type;
the logic processing module is used for storing a plurality of preset interface programs, and the interface programs are used for processing the corresponding request instructions according to preset rules and generating processing results according to the request instructions, the corresponding environment names, the corresponding transaction codes, the corresponding starting time and the corresponding ending time;
the task tracking module is used for transmitting the processing result to a page end through a JSON standard format.
5. The large host transaction fault location system of claim 2, wherein the page side comprises an input module and a display module;
the input module is used for generating a task request according to user input information and providing the task request to the task scheduling device;
The display module is used for displaying the processing result provided by the task scheduling device.
6. A method for locating a transaction fault of a large host, the method comprising:
the method comprises the steps that an IBM host side collects transaction data packets of host transaction and CICS middleware, and transaction data containing transaction states are extracted from the transaction data packets;
generating link information according to the program call state of the transaction data in each node, and generating fault identification according to the transaction state of the transaction data in each node;
inputting the transaction data, the fault identification and the link information into a host queue and then sending the transaction data, the fault identification and the link information to a platform end;
the platform end receives and stores the transaction data, the fault identification and the link information sent by the IBM host end;
according to the received task request of the host transaction fault check processing, corresponding transaction data, fault identification and the link information are called, displayed and output;
the method comprises the steps that an IBM host side collects transaction data packets of host transaction and CICS middleware, and transaction data containing transaction states are extracted from the transaction data packets; inputting the transaction data, the fault identification and the link information into a host queue and then sending the transaction data, the fault identification and the link information to a platform end comprises the following steps:
Collecting a transaction data packet of the CICS middleware running at an exit program between host transaction and the CICS middleware through an assembler, and extracting transaction data containing a transaction state from the transaction data packet;
acquiring program call states of the transaction data at all nodes according to the transaction data packet and generating link information;
generating a fault identifier according to the transaction state of each node of the transaction data;
and after the transaction data, the fault identification and the link information are input into a host queue, the transaction data, the fault identification and the link information are sent to a platform end through a REXX program and a socket.
7. The method for locating a transaction fault of a mainframe according to claim 6, wherein the transaction data, the fault identification and the link information transmitted from the IBM mainframe side are received and stored; and according to the received task request of the host transaction fault check processing, the corresponding transaction data, fault identification and the link information are called, and the display output comprises:
receiving the transaction data, the fault identification and the link information sent by the IBM host end, and writing the transaction data, the fault identification and the link information into a non-relational database;
Receiving a task request of host transaction fault troubleshooting processing input by a user, and calling corresponding transaction data, fault identification and link information from the non-relational database according to the type of the task request;
according to the transaction data, the fault identification and the link information analysis, a processing result is obtained, and the processing result is transmitted to a page end through a JSON standard format;
and the page end displays and outputs the received processing result.
8. The method of claim 7, wherein receiving the transaction data, the failure identification, and the link information sent by the IBM host, writing the transaction data, the failure identification, and the link information to a non-relational database further comprises:
monitoring the running state of the non-relational database and the space state of a disk, and generating alarm information when the running state and the space state do not accord with a preset rule;
according to the access frequency or the access date of the data files in the non-relational database, backing up the data files in the predetermined access frequency or the access period in the non-relational database to a predetermined memory;
And according to the calling instruction of the task scheduling device, calling the corresponding data file in the preset memory and loading the data file into the non-relational database.
9. The method for locating a transaction fault of a large host according to claim 7, wherein a task request of a host transaction fault troubleshooting process input by a user is received, and corresponding transaction data, fault identification and link information are called from the non-relational database according to the type of the task request; analyzing and obtaining a processing result according to the transaction data, the fault identification and the link information, and transmitting the processing result to a page end through a JSON standard format comprises the following steps:
storing a plurality of preset interface programs in a logic processing module;
receiving a task request of host transaction fault troubleshooting processing, analyzing and obtaining a corresponding environment name, transaction codes, start time and end time according to the task request, and providing the environment name, the transaction codes, the start time and the end time to the logic processing module;
the logic processing module matches a corresponding preset interface program according to the request type;
the interface program processes the corresponding request instruction according to a preset rule and generates a processing result according to the request instruction, the corresponding environment name, the transaction code, the starting time and the ending time;
And transmitting the processing result to a page end through a JSON standard format.
10. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the method of any of claims 6 to 9 when executing the computer program.
11. A computer readable storage medium, characterized in that the computer readable storage medium stores a computer program for executing the method of any one of claims 6 to 9.
CN202010111536.6A 2020-02-24 2020-02-24 Large-scale host transaction fault positioning system and method Active CN111324480B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010111536.6A CN111324480B (en) 2020-02-24 2020-02-24 Large-scale host transaction fault positioning system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010111536.6A CN111324480B (en) 2020-02-24 2020-02-24 Large-scale host transaction fault positioning system and method

Publications (2)

Publication Number Publication Date
CN111324480A CN111324480A (en) 2020-06-23
CN111324480B true CN111324480B (en) 2023-07-25

Family

ID=71165208

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010111536.6A Active CN111324480B (en) 2020-02-24 2020-02-24 Large-scale host transaction fault positioning system and method

Country Status (1)

Country Link
CN (1) CN111324480B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112035550A (en) * 2020-09-01 2020-12-04 中国银行股份有限公司 Fault positioning method and device
CN112181697A (en) * 2020-09-21 2021-01-05 厦门南讯股份有限公司 Method and system for rapidly analyzing problems
CN112784133B (en) * 2021-01-22 2024-05-17 中信银行股份有限公司 Link visualization method, device, equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6140691A (en) * 1984-07-31 1986-02-26 オムロン株式会社 Abnormal processing for transaction processor
WO2005101214A1 (en) * 2004-04-19 2005-10-27 Oneempower Pte Ltd Transaction system
CN104572351A (en) * 2014-12-23 2015-04-29 中国工商银行股份有限公司 Data recovery system and method based on host system
CN106681909A (en) * 2016-12-02 2017-05-17 中国工商银行股份有限公司 Online transaction fault locating method and device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6140691A (en) * 1984-07-31 1986-02-26 オムロン株式会社 Abnormal processing for transaction processor
WO2005101214A1 (en) * 2004-04-19 2005-10-27 Oneempower Pte Ltd Transaction system
CN104572351A (en) * 2014-12-23 2015-04-29 中国工商银行股份有限公司 Data recovery system and method based on host system
CN106681909A (en) * 2016-12-02 2017-05-17 中国工商银行股份有限公司 Online transaction fault locating method and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
陆坤.大型主机系统性能可视化监控与管理平台.实验室科学.2017,第第20卷卷(第第20卷期),第2.1、2.2、3.2节. *

Also Published As

Publication number Publication date
CN111324480A (en) 2020-06-23

Similar Documents

Publication Publication Date Title
CN111324480B (en) Large-scale host transaction fault positioning system and method
CN100412802C (en) Planned computer problem diagnosis and solvement and its automatic report and update
CN102831052B (en) Test exemple automation generating apparatus and method
CN108076098A (en) A kind of method for processing business and system
CN111061696B (en) Method and device for analyzing transaction message log
CN111625535B (en) Method, device and storage medium for realizing business data association
CN113242159B (en) Application access relation determining method and device
CN106815089A (en) Malfunction elimination method and device based on daily record
CN111897738B (en) Automatic testing method and device based on atomic service
CN116204438A (en) Test case generation method, automatic test method and related device
CN113760611B (en) System site switching method and device, electronic equipment and storage medium
CN109753530B (en) Data processing method and device, medium and terminal thereof
CN111078506A (en) Business data batch running task monitoring method and device
CN114895879B (en) Management system design scheme determining method, device, equipment and storage medium
CN112785201B (en) Heterogeneous system quasi-real-time high-reliability interaction system and method
CN114490402A (en) Method and device for checking validity of message field
CN112948195A (en) Interface testing method and device, electronic equipment and storage medium
CN112965873A (en) Page processing method and device, storage medium and processor
CN112102058A (en) Management system parameter configuration method and device
CN113037839B (en) Distributed batch framework communication system and method
CN111538660B (en) Method and device for counting coverage rate of program codes in cloud container
CN112116467A (en) Parallel operation method and system for heterogeneous counter
CN115277887A (en) Message content sending and processing method and device
CN117291517A (en) Method and device for constructing approval process
CN115840699A (en) Method and tool for realizing multi-bank electronic bill business simulation

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant