CN110490451A - Task data management-control method, device and computer equipment based on hadoop - Google Patents

Task data management-control method, device and computer equipment based on hadoop Download PDF

Info

Publication number
CN110490451A
CN110490451A CN201910754460.6A CN201910754460A CN110490451A CN 110490451 A CN110490451 A CN 110490451A CN 201910754460 A CN201910754460 A CN 201910754460A CN 110490451 A CN110490451 A CN 110490451A
Authority
CN
China
Prior art keywords
task
goal
call parameter
title
hadoop
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910754460.6A
Other languages
Chinese (zh)
Inventor
金婕
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ping An Property and Casualty Insurance Company of China Ltd
Original Assignee
Ping An Property and Casualty Insurance Company of China Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Property and Casualty Insurance Company of China Ltd filed Critical Ping An Property and Casualty Insurance Company of China Ltd
Priority to CN201910754460.6A priority Critical patent/CN110490451A/en
Publication of CN110490451A publication Critical patent/CN110490451A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/18File system types
    • G06F16/1805Append-only file systems, e.g. using logs or journals to store data
    • G06F16/1815Journaling file systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/18File system types
    • G06F16/182Distributed file systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/3332Query translation
    • G06F16/3334Selection or weighting of terms from queries, including natural language queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0633Workflow analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/04Manufacturing
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Abstract

The invention discloses a kind of task data management-control method based on hadoop identifies goal task title therein and goal task call parameter this method comprises: obtaining the source information of the task process log of first task management platform;By the preamble task names and preceding sequence task call parameter progress uniformity comparison in each task record in the goal task title and goal task call parameter and preset front and back sequence task implementation list;When the record of the first task present in the front and back sequence task implementation list is with the task names of the goal task and consistent goal task call parameter, by the first task record in corresponding execute instruction of rear sequence task be sent to the second task management platform.The present invention also provides a kind of task data control device, computer equipment and computer readable storage medium based on hadoop.The present invention is capable of real-time, efficient, the accurate control of across task management platform realization task data.

Description

Task data management-control method, device and computer equipment based on hadoop
Technical field
The present invention relates to flow monitoring technical field more particularly to a kind of task data management-control method based on hadoop, Device, computer equipment and computer readable storage medium.
Background technique
Currently, much can be used for carrying out production task controlling managerial role management platform is applied to the daily of enterprise In production.Certainly, production tasks many for production task and different may execute on different task management platforms, Accordingly, it may be desirable to which to manage and control different production task data respectively preset to complete jointly for multiple tasks management platform Production task.For example, existing OOZIE task management platform and LINKDO task management platform are all more commonly used in production task The management of data, but not being provided between the two task management platforms interacts the task data of two platforms Interface, so, when production task decomposite come preceding sequence task and follow-up work be separately dispensed into OOZIE task management platform and It, can be pretty troublesome to task data management is carried out between the two task management platforms after LINKDO task management platform.For example, The preceding sequence task that a task management platform executes often is occurred in which because there is exception, and after relying on the preceding sequence task Sequence task cannot but learn preamble task execution exception, then follow-up work will be behaved according to normal time, this Sample will generate task data can not true synchronization, task action result be easy deviation the problems such as.Therefore, in the prior art, right It can not achieve the effect that in the control that across task management platform carries out task data accurate, synchronous.
Summary of the invention
In view of this, the present invention proposes that a kind of user is set based on task data management-control method, device, the computer of hadoop Standby and computer readable storage medium, can obtain first task management platform task process log source information, then into Row identification, to identify goal task title and goal task call parameter;Then by the goal task title and target Preamble task names in each task record in task call parameter and preset front and back sequence task implementation list and before Sequence task call parameter carries out uniformity comparison;In the record of the first task present in the front and back sequence task implementation list It, will when preamble task names and preceding sequence task call parameter are with the goal task title and consistent goal task call parameter Corresponding execute instruction of rear sequence task in the first task record is sent to the second task management platform.Pass through this side Formula, can across task management platform realize real-time, efficient, the accurate control of task data.
Firstly, to achieve the above object, the present invention provides a kind of task data management-control method based on hadoop, the side Method includes:
Obtain the source information of the task process log of first task management platform;To the source information of the task process log It is identified, identifies goal task title and goal task call parameter;By the goal task title and goal task The preamble task names and preamble times in each task record in call parameter and preset front and back sequence task implementation list It don't fail to parameter progress uniformity comparison;Preamble in the record of the first task present in the front and back sequence task implementation list It, will be described when task names and preceding sequence task call parameter are with the goal task title and consistent goal task call parameter Corresponding execute instruction of rear sequence task in first task record is sent to the second task management platform.
Optionally, the source information to the task process log identifies, identifies goal task title and mesh The step of mark task call parameter includes: that the source information of the task process log is converted into log text information;To described Log text information carries out sentence segmentation, obtains at least one experience table sentence;To each experience table language Sentence carries out text identification, identifies the goal task title and goal task call parameter.
Optionally, described the step of carrying out text identification to each experience table sentence includes: according to preset Goal task name keys identify each experience table sentence, to find out including the goal task name The first task of title executes record sentence;Record sentence is executed to the first task according to preset call parameter format to carry out Identification, to identify the corresponding goal task call parameter of the goal task title.
Optionally, the call parameter includes time and operating status, the call parameter format include time format and Operating status format.
In addition, to achieve the above object, the present invention also provides a kind of task data control device based on hadoop is described Device includes:
Module is obtained, the source information of the task process log for obtaining first task management platform;Identification module is used for The source information of the task process log is identified, identifies goal task title and goal task call parameter;Judgement Module, for will be in the goal task title and goal task call parameter and preset front and back sequence task implementation list Preamble task names and preceding sequence task call parameter in each task record carry out uniformity comparison;Sending module is used for Preamble task names and preceding sequence task necessity ginseng in the record of the first task present in the front and back sequence task implementation list When number is with the goal task title and consistent goal task call parameter, by the rear sequence task in first task record Corresponding execute instruction is sent to the second task management platform.
Optionally, the identification module is also used to: the source information of the task process log is converted into log text envelope Breath;Sentence segmentation is carried out to the log text information, obtains at least one experience table sentence;Each task is held Row record sentence carries out text identification, identifies the goal task title and goal task call parameter.
Optionally, the identification module is also used to: being held according to preset goal task name keys to each task Row record sentence is identified, so that finding out the first task including the goal task title executes record sentence;According to pre- If call parameter format to the first task execute record sentence identify, to identify the goal task title Corresponding goal task call parameter.
Optionally, the call parameter includes time and operating status, the call parameter format include time format and Operating status format.
Further, the present invention also proposes a kind of computer equipment, and the computer equipment includes memory, processor, The computer program that can be run on the processor is stored on the memory, the computer program is by the processor It realizes when execution such as the step of the above-mentioned task data management-control method based on hadoop.
Further, to achieve the above object, the present invention also provides a kind of computer readable storage medium, the computers Readable storage medium storing program for executing is stored with computer program, and the computer program can be executed by least one processor so that it is described extremely A few processor is executed such as the step of the above-mentioned task data management-control method based on hadoop.
Compared to the prior art, task data management-control method, device, the computer proposed by the invention based on hadoop Equipment and computer readable storage medium can obtain the source information of the task process log of first task management platform, then It is identified, to identify goal task title and goal task call parameter;Then by the goal task title and mesh Preamble task names in each task record in mark task call parameter and preset front and back sequence task implementation list and Preceding sequence task call parameter carries out uniformity comparison;In the record of the first task present in the front and back sequence task implementation list Preamble task names and preceding sequence task call parameter with the goal task title and consistent goal task call parameter when, The second task management platform is sent by corresponding execute instruction of rear sequence task in first task record.Pass through this side Formula, can across task management platform realize real-time, efficient, the accurate control of task data.
Detailed description of the invention
Fig. 1 is the schematic diagram of the optional hardware structure of computer equipment one of the present invention;
Fig. 2 is the program module schematic diagram of one embodiment of task data control device the present invention is based on hadoop;
Fig. 3 is the flow diagram of one embodiment of task data management-control method the present invention is based on hadoop.
Appended drawing reference:
The object of the invention is realized, the embodiments will be further described with reference to the accompanying drawings for functional characteristics and advantage.
Specific embodiment
In order to make the objectives, technical solutions, and advantages of the present invention clearer, with reference to the accompanying drawings and embodiments, right The present invention is further elaborated.It should be appreciated that described herein, specific examples are only used to explain the present invention, not For limiting the present invention.Based on the embodiments of the present invention, those of ordinary skill in the art are not before making creative work Every other embodiment obtained is put, shall fall within the protection scope of the present invention.
It should be noted that the description for being related to " first ", " second " etc. in the present invention is used for description purposes only, and cannot It is interpreted as its relative importance of indication or suggestion or implicitly indicates the quantity of indicated technical characteristic.Define as a result, " the One ", the feature of " second " can explicitly or implicitly include at least one of the features.In addition, the skill between each embodiment Art scheme can be combined with each other, but must be based on can be realized by those of ordinary skill in the art, when technical solution Will be understood that the combination of this technical solution is not present in conjunction with there is conflicting or cannot achieve when, also not the present invention claims Protection scope within.
As shown in fig.1, being the schematic diagram of the optional hardware structure of computer equipment 1 one of the present invention.
In the present embodiment, the computer equipment 1 may include, but be not limited only to, and company can be in communication with each other by system bus Connect memory 11, processor 12, network interface 13.
The computer equipment 1 connects network (Fig. 1 is not marked) by network interface 13, arrives other ends by network connection End equipment such as mobile terminal (Mobile Terminal), user equipment (User Equipment, UE), the end PC and other Business management platform etc..The network can be intranet (Intranet), internet (Internet), global system for mobile telecommunications System (Global System of Mobile communication, GSM), wideband code division multiple access (Wideband Code Division Multiple Access, WCDMA), 4G network, 5G network, bluetooth (Bluetooth), Wi-Fi, speech path network Deng wirelessly or non-wirelessly network.
It should be pointed out that Fig. 1 illustrates only the computer equipment 1 with component 11-13, it should be understood that simultaneously All components shown realistic are not applied, the implementation that can be substituted is more or less component.
Wherein, the memory 11 includes at least a type of readable storage medium storing program for executing, and the readable storage medium storing program for executing includes Flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory etc.), random access storage device (RAM), it is static with Machine accesses memory (SRAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), programmable Read memory (PROM), magnetic storage, disk, CD etc..In some embodiments, the memory 11 can be the meter Calculate the internal storage unit of machine equipment 1, such as the hard disk or memory of the computer equipment 1.In further embodiments, described to deposit Reservoir 11 is also possible to the External memory equipment of the computer equipment 1, such as the plug-in type that the computer equipment 1 is equipped with is hard Disk, intelligent memory card (Smart Media Card, SMC), secure digital (Secure Digital, SD) card, flash card (Flash Card) etc..Certainly, the memory 11 can also both include the internal storage unit of the computer equipment 1 or wrap Include its External memory equipment.In the present embodiment, the memory 11 is installed on the behaviour of the computer equipment 1 commonly used in storage Make system and types of applications software, such as the program code etc. of the task data control device 200 based on hadoop.In addition, institute Stating memory 11 can be also used for temporarily storing the Various types of data that has exported or will export.
The processor 12 can be in some embodiments central processing unit (Central Processing Unit, CPU), controller, microcontroller, microprocessor or other data processing chips.The processor 12 is commonly used in the control meter The overall operation of machine equipment 1 is calculated, such as executes data interaction or the relevant control of communication and processing etc..In the present embodiment, institute Processor 12 is stated for running the program code stored in the memory 11 or processing data, for example, operation it is described based on Task data control device 200 of hadoop etc..
The network interface 13 may include radio network interface or wired network interface, which is commonly used in The computer equipment 1 and other terminal devices such as mobile terminal, user equipment, the end PC and other task management platforms etc. Between establish communication connection.
In the present embodiment, is installed in the computer equipment 1 and run the task data control device based on hadoop When 200, when the task data control device 200 based on hadoop is run, enough obtains and take first task management platform Task process log, is then identified, to identify goal task title and goal task call parameter;Then by the mesh It marks in each task record in task names and goal task call parameter and preset front and back sequence task implementation list Preamble task names and preceding sequence task call parameter carry out uniformity comparison;Exist when in the front and back sequence task implementation list First task record in preamble task names and preceding sequence task call parameter and the goal task title and target appoint Don't fail to parameter it is consistent when, by the first task record in corresponding execute instruction of rear sequence task be sent to the second task pipe Platform.In this way, can across task management platform realize real-time, efficient, the accurate control of task data.
So far, oneself is through describing the application environment of each embodiment of the present invention and the hardware configuration and function of relevant device in detail Energy.In the following, above-mentioned application environment and relevant device will be based on, each embodiment of the invention is proposed.
Firstly, the present invention proposes a kind of task data control device 200 based on hadoop.
As shown in fig.2, being the program module of 200 1 embodiment of task data control device the present invention is based on hadoop Figure.
In the present embodiment, the task data control device 200 based on hadoop is stored in storage including a series of Each reality of the present invention may be implemented when the computer program instructions are executed by processor 12 in computer program instructions on device 11 Apply the task data control function based on hadoop of example.In some embodiments, it is based on the computer program instructions each section The specific operation realized, the task data control device 200 based on hadoop can be divided into one or more modules. For example, the task data control device 200 based on hadoop, which can be divided into, obtains module 201, identification in Fig. 2 Module 202, judgment module 203 and sending module 204.Wherein:
The acquisition module 201, the source information of the task process log for obtaining first task management platform.
Hadoop is the software frame that distributed treatment can be carried out to mass data, so, it can be based on Hadoop technology establishes the multiple task management platform based on hadoop in the computer equipment 1 to carry out task data Corporate management.In the present embodiment, the computer equipment 1, which is connected to, manages platform and the second task including at least first task The multiple tasks for managing platform manage platform, due to programming between the first task management platform and the second task management platform The difference of language or the difference of data format, so that the interaction of data cannot be carried out directly.
Specifically, the computer equipment 1 is connect by managing platform with the first task, and periodically obtains institute State the source information for the real-time task process log that first task management platform prints.In the present embodiment, task management is flat The operating system of platform can be monitored and record to all tasks of self-operating, then the task process of outputting standard format The source information of log, such as the experience table including elements such as task names, time, execution states, and it is stored in operation Subdirectory under catalogue where system, such as task daily record file.It is of course also possible to which monitoring is appointed in the instruction according to user Business classification, the element of monitoring are defined.Therefore, the first task management platform can be preset in the mistake of the task of execution Cheng Zhong prints task process log in real time, updates and stores under preset catalogue.Then, the acquisition module 201 Then according to preset period, such as 30s/ times, under the catalogue of the store tasks process log of first task management platform Obtain the source information of newest task process log.In the present embodiment, the computer equipment 1 is set as possessing described first The log read permission of task management platform, therefore, the acquisition module 201 then can read institute according to the preset period The catalogue file that first task management platform is stored with task process log is stated, and by preset name, for example " XX month XX -appoints The task execution log of business process log " is copied and is received.
The identification module 202 identifies goal task for identifying to the source information of the task process log Title and goal task call parameter.
Specifically, source information of the identification module 202 first by the task process log is converted into log text information, Then sentence segmentation is carried out to the log text information, obtains at least one experience table sentence, then appoint to each Business executes record sentence and carries out text identification, identifies the goal task title and goal task call parameter.Wherein, described Identification module 202 identifies each experience table sentence according to preset goal task name keys, thus It finds out the first task including the goal task title and executes record sentence, then according to preset call parameter format to institute It states first task execution record sentence to be identified, to identify the corresponding goal task necessity ginseng of the goal task title Number.
In the present embodiment, not due to the programming language used or data format of different task management platforms Together.Therefore, format of the identification module 202 first by the task process log of first task management platform printing is converted to Text formatting.For example, the task process log of the first task management platform printing is JAVA format, then the identification mould The task process log of the JAVA format is then converted into text according to preset JAVA- text formatting crossover tool by block 202 Format, so that the source information of the task process log to be converted to the text information of the task process log.Then, institute Identification module 202 is stated to be identified according to text information of the preset goal task title to the task process log.At this In embodiment, all task process before the task process log includes the first task management platform record are believed Breath, therefore the task process log can be identified, to identify the progress information of the goal task.Firstly, The text information of the task process log is carried out sentence segmentation by the identification module 202, wherein sentence segmentation is mainly to be The text information of the task process log is divided into short sentence, therefore can be according to identifying the task process log The punctuate of text information carries out short sentence cutting, to obtain the sentence of each task process log, in the present embodiment, sentence Cutting is mainly the ascii code value for identifying the punctuation mark in the task process log, then carries out sentence segmentation, such as When recognizing ascii code value " 0X2E ", then it is judged as fullstop, carries out line feed to come out the sentence segmentation.Then basis The task names keyword of the preset goal task is compared with each text sentence in the task process log It is right, to obtain the sentence including the goal task name keys.In the present embodiment, the task name of the goal task Claiming keyword is directly the task names of the goal task.It certainly, in other embodiments, can also be according to task names Including distinctive word, word or phrase, appoint using the distinctive word of the task names of the goal task, word or phrase as described The keyword for title of being engaged in.That is, can be according to the goal task name keys, to identify the task process The corresponding sentence of the goal task title in log in text information.
After the corresponding sentence of the goal task title in identifying the task process log in text information, Then further identification includes included by each experience table of the goal task title to the identification module 202 Call parameter.In the present embodiment, the call parameter of the experience table includes: time and operating status.Therefore, may be used Time and operating parameter in each sentence will identify that the task process log come.Specifically, identification process packet Include: according to time format, for example " year-month-day-when-point-second " identifies the time in task execution sentence;According to operation shape It is corresponding that state format such as " operating status: ffff " identifies the goal task title in the task process log sentence Execution state.In this way, so that it may know the call parameter in each experience table of the goal task It does not come out.
The judgment module 203, for by the goal task title and goal task call parameter and it is preset before The preamble task names in each task record in postorder task execution inventory are consistent with preceding sequence task call parameter progress Property compare.
Specifically, the computer equipment 1 presets front and back sequence task implementation list, and the front and back sequence task executes clear Single includes the dependence of any one preceding sequence task and corresponding follow-up work in production process, and the dependence includes holding First task the management platform, preamble task names, preamble execution status of task of sequence task before carrying, and sequence task after carrying Second task management platform, follow-up work title and rear sequence task is corresponding executes instruction.Therefore, in the identification module After 202 pairs of experience tables carry out preamble task names and the identification of corresponding call parameter, the judgment module 203 then may be used With goal task title described further and goal task call parameter with it is every in preset front and back sequence task implementation list Preamble task names and preceding sequence task call parameter in one task record carry out uniformity comparison.In the present embodiment, institute Judgment module 203 is stated according to the goal task title in the experience table in the front and back sequence task implementation list In searched, the goal task title is found out according to the mode that text compares, when finding out the goal task conduct When preamble task names, then time and operating status of the goal task in the task process log are further searched for out Information is also to identify in such a way that text compares for time and running state information.In the present embodiment, when described Judgment module 203 finds out the goal task when storing in the front and back sequence task implementation list as preceding sequence task, then By in the task process log the corresponding operating status of the goal task and the front and back sequence task implementation list in make Correspondence preamble execution status of task for the goal task of preceding sequence task is compared, and when consistent, then judges the mesh Preceding sequence task of the mark task as the front and back sequence task implementation list, and executed completion, then it should further execute this The corresponding follow-up work of preceding sequence task.In other embodiments, sequence task is also before each in the front and back sequence task implementation list Including execute the time, for example, preceding sequence task when being executed between reach T hour after, then operating status reaches F, then it is assumed that this Preceding sequence task normally completes;If executing the time not reaching T hours, and operating status reaches F, then it is assumed that the preceding sequence task is different Often.Then follow-up work can just only be executed after preceding sequence task normally completes.Therefore, the multiple task management system can also The execution time of the goal task in the task process log is judged.
The sending module 204, for working as in the record of first task present in the front and back sequence task implementation list It, will when preamble task names and preceding sequence task call parameter are with the goal task title and consistent goal task call parameter Corresponding execute instruction of rear sequence task in the first task record is sent to the second task management platform.
Specifically, when the judgment module 203 judges goal task title described in the experience table and institute Stating running state information is respectively corresponding preamble task names and preamble task execution in the front and back sequence task implementation list When state, then it is assumed that the preceding sequence task has been completed.Therefore, the sending module 204 then can be according to the front and back sequence task Corresponding execute instruction of rear sequence task in first task record is sent the second task management platform by implementation list.Cause This, the second task management platform can then start to execute the rear sequence task.
It will be recalled from above that the computer equipment 1 can obtain the source of the task process log of first task management platform Then information is identified, to identify goal task title and goal task call parameter;Then by the goal task name Claim and goal task call parameter is appointed with the preamble in each task record in preset front and back sequence task implementation list Title of being engaged in and preceding sequence task call parameter carry out uniformity comparison;It is first present in the front and back sequence task implementation list Preamble task names and preceding sequence task call parameter and the goal task title and goal task necessity in business record are joined When number is consistent, the second task management platform is sent by corresponding execute instruction of rear sequence task in first task record. In this way, can across task management platform realize real-time, efficient, the accurate control of task data.
In addition, the present invention also proposes a kind of task data management-control method based on hadoop, the method is applied to calculate Machine equipment.
As shown in fig.3, being the flow diagram of one embodiment of task data management-control method the present invention is based on hadoop. In the present embodiment, the execution sequence of the step in flow chart shown in Fig. 3 can change according to different requirements, Mou Xiebu Suddenly it can be omitted.
Step S500 obtains the source information of the task process log of first task management platform.
Hadoop is the software frame that distributed treatment can be carried out to mass data, so, it can be based on Hadoop technology establishes the multiple task management platform based on hadoop in the computer equipment to carry out task data Corporate management.In the present embodiment, the computer equipment, which is connected to, manages platform and the second task including at least first task The multiple tasks for managing platform manage platform, due to programming between the first task management platform and the second task management platform The difference of language or the difference of data format, so that the interaction of data cannot be carried out directly.
Specifically, the computer equipment is connect by managing platform with the first task, and periodically obtains institute State the source information for the real-time task process log that first task management platform prints.In the present embodiment, task management is flat The operating system of platform can be monitored and record to all tasks of self-operating, then the task process of outputting standard format The source information of log, such as the experience table including elements such as task names, time, execution states, and it is stored in operation Subdirectory under catalogue where system, such as task daily record file.It is of course also possible to which monitoring is appointed in the instruction according to user Business classification, the element of monitoring are defined.Therefore, the first task management platform can be preset in the mistake of the task of execution Cheng Zhong prints task process log in real time, updates and stores under preset catalogue.Then, the computer equipment Then according to preset period, such as 30s/ times, under the catalogue of the store tasks process log of first task management platform Obtain the source information of newest task process log.In the present embodiment, the computer equipment is set as possessing described first The log read permission of task management platform, therefore, the computer equipment then can read institute according to the preset period The catalogue file that first task management platform is stored with task process log is stated, and by preset name, for example " XX month XX -appoints The task execution log of business process log " is copied and is received.
Step S502 identifies the source information of the task process log, identifies goal task title and target Task call parameter.
Specifically, source information of the computer equipment first by the task process log is converted into log text information, Then sentence segmentation is carried out to the log text information, obtains at least one experience table sentence, then appoint to each Business executes record sentence and carries out text identification, identifies the goal task title and goal task call parameter.Wherein, described Computer equipment identifies each experience table sentence according to preset goal task name keys, to look for The first task including the goal task title executes record sentence out, then according to preset call parameter format to described First task executes record sentence and is identified, to identify the corresponding goal task necessity ginseng of the goal task title Number.
In the present embodiment, not due to the programming language used or data format of different task management platforms Together.Therefore, format of the computer equipment first by the task process log of first task management platform printing is converted to Text formatting.For example, the task process log of the first task management platform printing is JAVA format, then the computer The task process log of the JAVA format is then converted into text lattice according to preset JAVA- text formatting crossover tool by equipment Formula, so that the source information of the task process log to be converted to the text information of the task process log.Then, described Computer equipment is identified according to text information of the preset goal task title to the task process log.In this implementation In example, the task process log includes all task process information before the first task management platform records, Therefore the task process log can be identified, to identify the progress information of the goal task.Firstly, described The text information of the task process log is carried out sentence segmentation by computer equipment, wherein sentence segmentation will be primarily to will The text information of the task process log is divided into short sentence, therefore can be according to the text for identifying the task process log The punctuate of information carries out short sentence cutting, to obtain the sentence of each task process log, in the present embodiment, sentence segmentation The ascii code value for mainly identifying the punctuation mark in the task process log, then carries out sentence segmentation, for example work as knowledge When being clipped to ascii code value " 0X2E ", then it is judged as fullstop, carries out line feed to come out the sentence segmentation.Then according to default The task names keyword of the goal task be compared with each text sentence in the task process log, from And obtain the sentence including the goal task name keys.In the present embodiment, the task names of the goal task are closed Key word is directly the task names of the goal task.Certainly, in other embodiments, can also include according to task names Distinctive word, word or phrase, using the distinctive word of the task names of the goal task, word or phrase as the task name The keyword of title.That is, can be according to the goal task name keys, to identify the task process log The corresponding sentence of the goal task title in middle text information.
After the corresponding sentence of the goal task title in identifying the task process log in text information, Then further identification includes included by each experience table of the goal task title to the computer equipment Call parameter.In the present embodiment, the call parameter of the experience table includes: time and operating status.Therefore, may be used Time and operating parameter in each sentence will identify that the task process log come.Specifically, identification process packet Include: according to time format, for example " year-month-day-when-point-second " identifies the time in task execution sentence;According to operation shape It is corresponding that state format such as " operating status: ffff " identifies the goal task title in the task process log sentence Execution state.In this way, so that it may know the call parameter in each experience table of the goal task It does not come out.
Step S504 executes the goal task title and goal task call parameter and preset front and back sequence task The preamble task names and preceding sequence task call parameter progress uniformity comparison in each task record in inventory.
Specifically, the computer equipment presets front and back sequence task implementation list, and the front and back sequence task executes clear Single includes the dependence of any one preceding sequence task and corresponding follow-up work in production process, and the dependence includes holding First task the management platform, preamble task names, preamble execution status of task of sequence task before carrying, and sequence task after carrying Second task management platform, follow-up work title and rear sequence task is corresponding executes instruction.Therefore, in the computer equipment After carrying out preamble task names and the identification of corresponding call parameter to experience table, then it can be appointed with target described further Before in each task record in title of being engaged in and goal task call parameter and preset front and back sequence task implementation list Sequence task title and preceding sequence task call parameter carry out uniformity comparison.In the present embodiment, the computer equipment is according to institute The goal task title stated in experience table is searched in the front and back sequence task implementation list, according to text The mode of comparison finds out the goal task title, when finding out the goal task as preamble task names, then into One step finds out time and running state information of the goal task in the task process log, for time and operation Status information is also to be identified in such a way that text compares.In the present embodiment, when the computer equipment finds out institute It, then will be in the task process log when stating goal task and storing in the front and back sequence task implementation list as preceding sequence task The corresponding operating status of the goal task and the front and back sequence task implementation list in the target as preceding sequence task The correspondence preamble execution status of task of task is compared, and when consistent, then judges the goal task as the front and back sequence The preceding sequence task of task execution inventory, and executed completion, then it is subsequent corresponding should further to execute the preceding sequence task Business.In other embodiments, each preceding sequence task further includes executing the time in the front and back sequence task implementation list, for example, preceding Sequence task when being executed between reach T hours after, then operating status reaches F, then it is assumed that the preceding sequence task normally completes;If The execution time does not reach T hours, and operating status reaches F, then it is assumed that the preamble task abnormity.Then only in preceding sequence task Follow-up work can be just executed after normally completing.Therefore, the multiple task management system can also be in the task process log Execution time of the goal task judged.
Step S504, the preamble task names in the record of the first task present in the front and back sequence task implementation list When with preceding sequence task call parameter with the goal task title and consistent goal task call parameter, by the first task Corresponding execute instruction of rear sequence task in record is sent to the second task management platform.
Specifically, when the computer equipment judges goal task title described in the experience table and described Running state information is respectively corresponding preamble task names and preamble task execution shape in the front and back sequence task implementation list When state, then it is assumed that the preceding sequence task has been completed.Therefore, the computer equipment can then be executed according to the front and back sequence task Corresponding execute instruction of rear sequence task in first task record is sent the second task management platform by inventory.Therefore, The second task management platform can then start to execute the rear sequence task.
The task data management-control method based on hadoop that the present embodiment is proposed can obtain first task management platform Task process log source information, then identified, to identify goal task title and goal task call parameter;It connects By each in the goal task title and goal task call parameter and preset front and back sequence task implementation list Preamble task names and preceding sequence task call parameter in task record carry out uniformity comparison;When the front and back sequence task executes Preamble task names and preceding sequence task call parameter and the goal task title in the record of first task present in inventory And goal task call parameter it is consistent when, by the first task record in corresponding execute instruction of rear sequence task be sent to Second task management platform.In this way, can across task management platform realize the real-time, efficient, accurate of task data Control.
The serial number of the above embodiments of the invention is only for description, does not represent the advantages or disadvantages of the embodiments.
Through the above description of the embodiments, those skilled in the art can be understood that above-described embodiment side Method can be realized by means of software and necessary general hardware platform, naturally it is also possible to by hardware, but in many cases The former is more preferably embodiment.Based on this understanding, technical solution of the present invention substantially in other words does the prior art The part contributed out can be embodied in the form of software products, which is stored in a storage medium In (such as ROM/RAM, magnetic disk, CD), including some instructions are used so that a terminal device (can be mobile phone, computer, clothes Business device, air conditioner or the network equipment etc.) execute method described in each embodiment of the present invention.
The above is only a preferred embodiment of the present invention, is not intended to limit the scope of the invention, all to utilize this hair Equivalent structure or equivalent flow shift made by bright specification and accompanying drawing content is applied directly or indirectly in other relevant skills Art field, is included within the scope of the present invention.

Claims (10)

1. a kind of task data management-control method based on hadoop, which is characterized in that the method includes the steps:
Obtain the source information of the task process log of first task management platform;
The source information of the task process log is identified, identifies goal task title and goal task call parameter;
By the goal task title and goal task call parameter with it is each in preset front and back sequence task implementation list Preamble task names and preceding sequence task call parameter in task record carry out uniformity comparison;
Preamble task names and preceding sequence task in the record of the first task present in the front and back sequence task implementation list must When wanting parameter with the goal task title and consistent goal task call parameter, by the postorder in first task record Corresponding execute instruction of task is sent to the second task management platform.
2. the task data management-control method based on hadoop as described in claim 1, which is characterized in that described to the task The step of source information of process log is identified, identifies goal task title and goal task call parameter include:
The source information of the task process log is converted into log text information;
Sentence segmentation is carried out to the log text information, obtains at least one experience table sentence;
Text identification is carried out to each experience table sentence, identifies that the goal task title and goal task are necessary Parameter.
3. the task data management-control method based on hadoop as claimed in claim 2, which is characterized in that described to appoint to each Business execution records the step of sentence carries out text identification
Each experience table sentence is identified according to preset goal task name keys, thus find out including The first task of the goal task title executes record sentence;
It executes record sentence to the first task according to preset call parameter format to identify, to identify the mesh Mark the corresponding goal task call parameter of task names.
4. the task data management-control method based on hadoop as claimed in claim 3, which is characterized in that the call parameter packet Time and operating status are included, the call parameter format includes time format and operating status format.
5. a kind of task data control device based on hadoop, which is characterized in that the task data pipe based on hadoop Controlling device includes:
Module is obtained, the source information of the task process log for obtaining first task management platform;
Identification module identifies goal task title and target for identifying to the source information of the task process log Task call parameter;
Judgment module, for executing the goal task title and goal task call parameter and preset front and back sequence task The preamble task names and preceding sequence task call parameter progress uniformity comparison in each task record in inventory;
Sending module, for working as the preamble task names in the record of first task present in the front and back sequence task implementation list When with preceding sequence task call parameter with the goal task title and consistent goal task call parameter, by the first task Corresponding execute instruction of rear sequence task in record is sent to the second task management platform.
6. the task data control device based on hadoop as claimed in claim 5, which is characterized in that the identification module is also For:
The source information of the task process log is converted into log text information;
Sentence segmentation is carried out to the log text information, obtains at least one experience table sentence;
Text identification is carried out to each experience table sentence, identifies that the goal task title and goal task are necessary Parameter.
7. the task data control device based on hadoop as claimed in claim 6, which is characterized in that the identification module is also For:
Each experience table sentence is identified according to preset goal task name keys, thus find out including The first task of the goal task title executes record sentence;
It executes record sentence to the first task according to preset call parameter format to identify, to identify the mesh Mark the corresponding goal task call parameter of task names.
8. the task data control device based on hadoop as claimed in claim 7, which is characterized in that the call parameter packet Time and operating status are included, the call parameter format includes time format and operating status format.
9. a kind of computer equipment, which is characterized in that the computer equipment includes memory, processor, on the memory It is stored with the computer program that can be run on the processor, is realized such as when the computer program is executed by the processor The step of claim 1-4 described in any item task data management-control methods based on hadoop.
10. a kind of computer readable storage medium, which is characterized in that the computer-readable recording medium storage has computer journey Sequence, the computer program can be executed by least one processor, so that at least one described processor executes such as claim The step of task data management-control method described in any one of 1-4 based on hadoop.
CN201910754460.6A 2019-08-15 2019-08-15 Task data management-control method, device and computer equipment based on hadoop Pending CN110490451A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910754460.6A CN110490451A (en) 2019-08-15 2019-08-15 Task data management-control method, device and computer equipment based on hadoop

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910754460.6A CN110490451A (en) 2019-08-15 2019-08-15 Task data management-control method, device and computer equipment based on hadoop

Publications (1)

Publication Number Publication Date
CN110490451A true CN110490451A (en) 2019-11-22

Family

ID=68551360

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910754460.6A Pending CN110490451A (en) 2019-08-15 2019-08-15 Task data management-control method, device and computer equipment based on hadoop

Country Status (1)

Country Link
CN (1) CN110490451A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105279213A (en) * 2015-03-13 2016-01-27 中国移动通信集团广东有限公司 Retrieval device and retrieval method for log database
CN108710532A (en) * 2018-05-21 2018-10-26 平安科技(深圳)有限公司 Across dependence implementation method, device, equipment and the storage medium of dispatching platform
CN110069572A (en) * 2019-03-19 2019-07-30 深圳壹账通智能科技有限公司 HIVE method for scheduling task, device, equipment and storage medium based on big data platform

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105279213A (en) * 2015-03-13 2016-01-27 中国移动通信集团广东有限公司 Retrieval device and retrieval method for log database
CN108710532A (en) * 2018-05-21 2018-10-26 平安科技(深圳)有限公司 Across dependence implementation method, device, equipment and the storage medium of dispatching platform
CN110069572A (en) * 2019-03-19 2019-07-30 深圳壹账通智能科技有限公司 HIVE method for scheduling task, device, equipment and storage medium based on big data platform

Similar Documents

Publication Publication Date Title
CN107844634B (en) Modeling method of multivariate general model platform, electronic equipment and computer readable storage medium
CN109684047A (en) Event-handling method, device, equipment and computer storage medium
CN109936621B (en) Information security multi-page message pushing method, device, equipment and storage medium
CN108768929A (en) The analytic method and storage medium of electronic device, reference feedback message
CN110046146A (en) The monitoring method and device of industrial equipment based on mobile edge calculations
CN111342992B (en) Method and system for processing equipment information change record
CN108681504A (en) Automated testing method, test server and computer readable storage medium
CN110069925B (en) Software monitoring method, system and computer readable storage medium
CN107844468A (en) The cross-page recognition methods of form data, electronic equipment and computer-readable recording medium
CN109446515A (en) Group information analysis method, electronic device and computer readable storage medium
CN107357721B (en) Method and device for testing system
CN111580948A (en) Task scheduling method and device and computer equipment
CN110363222A (en) Picture mask method, device, computer equipment and storage medium for model training
CN107766512B (en) Log data storage method and log data storage system
CN109284331A (en) Accreditation information acquisition method, terminal device and medium based on business datum resource
CN110490451A (en) Task data management-control method, device and computer equipment based on hadoop
CN112634025A (en) Wind control rule generation method, device, equipment and computer readable storage medium
CN110515792A (en) Monitoring method, device and computer equipment based on web edition task management platform
CN109409793B (en) Equipment full life cycle management method and related device
CN110502427A (en) Code readability inspection method, device and server
CN110502538A (en) Label of drawing a portrait generates method, system, equipment and the storage medium of logical mappings
CN114238507A (en) Data synchronization method and device based on multiple databases
CN113901093A (en) Service call log relation analysis method and system based on memory cache
CN113434281A (en) Equipment scheduling method and cloud platform
CN112817953A (en) Data verification method and device, computer equipment and computer-readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination