WO2023093200A1 - Method and apparatus for asynchronously processing tasks, and storage medium and electronic apparatus - Google Patents

Method and apparatus for asynchronously processing tasks, and storage medium and electronic apparatus Download PDF

Info

Publication number
WO2023093200A1
WO2023093200A1 PCT/CN2022/117071 CN2022117071W WO2023093200A1 WO 2023093200 A1 WO2023093200 A1 WO 2023093200A1 CN 2022117071 W CN2022117071 W CN 2022117071W WO 2023093200 A1 WO2023093200 A1 WO 2023093200A1
Authority
WO
WIPO (PCT)
Prior art keywords
task
etl
data
server
state
Prior art date
Application number
PCT/CN2022/117071
Other languages
French (fr)
Chinese (zh)
Inventor
韩林
Original Assignee
中兴通讯股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中兴通讯股份有限公司 filed Critical 中兴通讯股份有限公司
Publication of WO2023093200A1 publication Critical patent/WO2023093200A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • G06F16/254Extract, transform and load [ETL] procedures, e.g. ETL data flows in data warehouses
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/25Integrating or interfacing systems involving database management systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/27Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5005Allocation of resources, e.g. of the central processing unit [CPU] to service a request
    • G06F9/5027Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
    • G06F9/5038Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the execution order of a plurality of tasks, e.g. taking priority or time dependency constraints into consideration
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Definitions

  • Embodiments of the present disclosure relate to the communication field, and in particular, relate to a task asynchronous processing method, device, storage medium, and electronic device.
  • Data Extract-Transform-Load (ETL) task processing is the core functional module of the shared data exchange platform, which provides data support services for upper-level government and enterprise applications.
  • the data ETL task processing system is also based on the enterprise service bus technology and is distributed, but it only distinguishes the execution status of the task from the arrangement status, the ready status, the running status, and the ending status, among which the running status It is an indivisible state unit, and the resource management strategy of the ETL task in the data ETL task processing system in the data ETL task processing system is consistent: if a data ETL task is scheduled and executed as expected at a certain moment, and The scheduling module judges that the parameter of the number of tasks running concurrently has reached the set threshold, then the task can only be submitted to the execution engine module by the scheduling module after the resources are released after other tasks are executed.
  • the data ETL task being executed by the execution engine module of the system is in the running state, there are actually multiple processing steps in this state due to business characteristics, which is equivalent to multiple sub-states.
  • the data ETL tasks in different sub-states in the running state have different characteristics on the consumption of server resources.
  • the early sub-states such as the waiting state do not consume a lot of server resources but take more time.
  • the later sub-states such as the one after stripping other sub-states
  • the running state consumes a lot of computing resources such as memory and CPU. Therefore, in the original technical solution, due to the unreasonable division of the execution status of the data ETL task and the corresponding execution strategy, the computing resources of the system are not fully and rationally utilized, resulting in low processing efficiency.
  • Embodiments of the present disclosure provide a task asynchronous processing method, device, storage medium, and electronic device, so as to at least solve the problems of low efficiency of distributed data ETL task processing and one-size-fits-all and unreasonable resource management strategies in the related art.
  • a task asynchronous processing method including:
  • the start state and waiting state of the data ETL task are separated from the running state to obtain the processed running state;
  • the tasks executed on the slave ETL server are not limited by the threshold of the concurrent number of tasks running, and the tasks executed on the master ETL server are limited by the threshold of the number of concurrent tasks running.
  • an apparatus for task asynchronous processing including:
  • the stripping module is configured to strip the start state and the waiting state of the data ETL task from the running state to obtain the processed running state;
  • a conversion module configured to convert the data ETL task into a data ETL task a and a data ETL task b, wherein the data ETL task a corresponds to the data ETL task b one by one;
  • the control module is configured to control the data ETL task a in the starting state and the waiting state to be executed on the slave ETL server, and control the data ETL task b in the running state after the processing to be asynchronous back and forth on the master ETL server Executing, wherein, the tasks executed on the slave ETL server are not limited by the threshold of the number of concurrent tasks running, and the tasks executed on the master ETL server are limited by the threshold of the number of concurrent tasks running on the master ETL server.
  • a computer-readable storage medium where a computer program is stored in the storage medium, wherein the computer program is set to execute any one of the above method embodiments when running in the steps.
  • an electronic device including a memory and a processor, wherein a computer program is stored in the memory, and the processor is configured to run the computer program to perform any of the above Steps in the method examples.
  • FIG. 1 is a block diagram of a hardware structure of a mobile terminal of a task asynchronous processing method according to an embodiment of the present disclosure
  • FIG. 2 is a flowchart of a task asynchronous processing method according to an embodiment of the present disclosure
  • FIG. 3 is a schematic diagram of an execution state of a data ETL task according to an embodiment of the present disclosure
  • FIG. 4 is a schematic diagram of node composition of a distributed data ETL processing system according to an embodiment of the present disclosure
  • Fig. 5 is a block diagram of an apparatus for task asynchronous processing according to an embodiment of the present disclosure.
  • FIG. 1 is a block diagram of the hardware structure of the mobile terminal according to the task asynchronous processing method of the embodiment of the present disclosure.
  • the mobile terminal may include one or more (only shown in FIG. 1 1)
  • Processor 102 may include but not limited to a processing device such as a microprocessor MCU or a programmable logic device FPGA) and a memory 104 for storing data, wherein the above-mentioned mobile terminal may also include a communication function
  • the transmission device 106 and the input and output device 108 may be executed in mobile terminals, computer terminals or similar computing devices.
  • FIG. 1 is only for illustration, and it does not limit the structure of the above mobile terminal.
  • the mobile terminal may also include more or fewer components than those shown in FIG. 1 , or have a different configuration from that shown in FIG. 1 .
  • the memory 104 can be used to store computer programs, for example, software programs and modules of application software, such as the computer program corresponding to the task asynchronous processing method in the embodiment of the present disclosure, and the processor 102 executes the computer program stored in the memory 104 by running the computer program Various functional applications and service chain address pool slicing processing realize the above-mentioned method.
  • the memory 104 may include high-speed random access memory, and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory.
  • the memory 104 may further include a memory that is remotely located relative to the processor 102, and these remote memories may be connected to the mobile terminal through a network. Examples of the aforementioned networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
  • the transmission device 106 is used to receive or transmit data via a network.
  • the specific example of the above network may include a wireless network provided by the communication provider of the mobile terminal.
  • the transmission device 106 includes a network interface controller (NIC for short), which can be connected to other network devices through a base station so as to communicate with the Internet.
  • the transmission device 106 may be a radio frequency (Radio Frequency, referred to as RF) module, which is used to communicate with the Internet in a wireless manner.
  • RF Radio Frequency
  • FIG. 2 is a flow chart of the task asynchronous processing method according to an embodiment of the present disclosure. As shown in FIG. 2 , the process includes at least But not limited to the following steps:
  • Step S202 stripping the start state and waiting state of the data ETL task from the running state to obtain the processed running state
  • Fig. 3 is a schematic diagram of the execution state of a data ETL task according to an embodiment of the present disclosure.
  • the execution state in the related art includes an orchestration state, a ready state, a running state and an end state, as shown in the left process in Fig. 3, That is, the startup state and the waiting state belong to a part of the running state in related technologies.
  • the starting state and the waiting state are separated from the running state, which is equivalent to re-dividing the execution state. Redefine the execution state of the data ETL task, and improve the execution strategy and resource management strategy according to the re-division of the state.
  • the execution state of the redefined data ETL task includes orchestration state, ready state, start state, waiting state, running state, and end state, as shown in the flow on the right side of Figure 3.
  • Step S204 converting the data ETL task into data ETL task a and data ETL task b, wherein the data ETL task a corresponds to the data ETL task b one by one, and optionally, the data ETL task can be associated by sharing the same task ID a and data ETL task b.
  • Step S206 control the data ETL task a in the startup state and the waiting state to be executed on the slave ETL server, and control the processed data ETL task b in the running state to be executed asynchronously on the master ETL server, wherein, in the slave ETL server
  • the tasks executed on the server are not limited by the threshold of the number of tasks running concurrently, and the tasks executed on the main ETL server are limited by the threshold of the number of tasks running concurrently.
  • the task exception handling in the embodiments of the present disclosure can be applied to a distributed data ETL processing system.
  • the system includes a master ETL server and multiple slave ETL servers. Since tasks in the startup state and waiting state occupy resources for a long time, the startup state and Tasks in the waiting state are executed in the slave ETL server, and the execution of tasks in the slave server will not be limited by the concurrent number threshold of the main ETL server's tasks, so that during the concurrent execution of a large number of data ETL tasks, the computing resources are reduced. invalid occupancy.
  • the problems of low processing efficiency of distributed data ETL tasks and one-size-fits-all and unreasonable resource management strategies in related technologies can be solved. Since tasks in the startup state and waiting state occupy resources for a long time, it is difficult to control the startup state and waiting state.
  • the data ETL task a in the state is executed on the slave ETL server. The execution process does not need to limit computing resources.
  • the data ETL task b is executed on the master ETL server. The execution process needs to limit computing resources, which can greatly improve server resources. In the process of concurrent execution of a large number of data ETL tasks, it reduces the invalid occupation of computing resources and improves the processing capacity of data ETL tasks.
  • the above step S206 may specifically include: controlling the execution of the data ETL task a on the slave ETL server, and after the execution of the data ETL task a, controlling the execution of the corresponding data ETL task b on the master ETL server, configuring the data
  • the task information of the ETL task the task information at least includes task priority information, task running node information, data collection information source, data processing flow, data storage information; the task information of the data ETL task is compressed and packaged into task files A and Task file B; transfer task file A to the slave ETL server, and transfer task file B to the master ETL server, correspondingly, notify the slave ETL server corresponding to task file A to run the data ETL task a corresponding to task file A; in the ETL After the task a runs, notify the main ETL server corresponding to the task file B to run the data ETL task b corresponding to the task file B, and control the data ETL task b to enter the end state after the operation is completed, specifically, the running information of the statistical data ETL
  • the slave ETL server corresponding to the task file A in this embodiment is used to run the following operations: data query, data persistence, compression and packaging, and notification nodes;
  • the main ETL server corresponding to the above-mentioned task file B is used to run the following operations: data transmission , data decompression, data conversion, and data loading to the destination.
  • Fig. 4 is a schematic diagram of the node composition of the distributed data ETL processing system according to an embodiment of the present disclosure.
  • the change of the execution strategy of the corresponding scheduling module after re-dividing according to the state is: the execution control of the data ETL task requires According to the task file, there is a one-to-one correspondence between the data ETL task and the task file.
  • the original data ETL task and its corresponding task file are improved into two tasks (data ETL task a and data ETL task b) and their corresponding task file A and task file B respectively.
  • the change in the resource management strategy of the scheduling module is: for the data ETL task after the execution state is redefined, when its execution state is in the start state or waiting state, the scheduling module will not include it in the configuration parameter of the number of concurrent tasks running This expression is equivalent to the fact that data ETL task a is not limited by this parameter and data ETL task b is limited by this parameter.
  • the above improvement process is formed based on the execution characteristics of the data ETL task: the logical data ETL task a is executed on the execution module of the slave ETL server, and the logical data ETL task b is executed on the execution module of the master ETL server. Since there is a one-to-many correspondence between the master ETL server and the slave servers, the number of slave ETL servers is large and the business load is balanced, so the execution process of the data ETL task a does not need to limit the processing of computing resources, and the data The execution process of ETL task b needs to be processed with limited computing resources; therefore, after improvement, the utilization rate of server resources can be greatly improved and costs can be saved.
  • Data ETL task a and data ETL task b correspond one-to-one, and are executed asynchronously before and after.
  • the execution control process relies on the system service bus technology to complete the communication, so its process control has the characteristics of automation, which increases the flexibility of system scheduling management, and the system and ETL operation Human-computer friendly interaction of maintenance personnel.
  • the relevant information of the ETL task includes, but is not limited to: configuration task priority information, configuration task running node information, configuration data collection information source, design data processing flow, and configuration data storage information.
  • the distribution module of the main ETL server compresses and packages the relevant information of the ETL task in the orchestration state into two ETL task files, task file A and task file B respectively, and the two task files share the same A task ID, the execution process has a contextual relationship, and the compressed and packaged two ETL task files are transmitted to the corresponding ETL server using the SFTP protocol.
  • the task file A is submitted to the execution engine module by the scheduling module on the corresponding slave ETL server, and then decompressed, loaded, verified, parsed, and run respectively.
  • the main logic of its operation is to use the remote call technology Web
  • the asynchronous method of Service notifies the corresponding slave ETL server, and the data ETL task a represented by task file A can end after the notification is successfully sent. It should be noted that the data ETL task can only be executed after the task file is decompressed, loaded, verified, and parsed by the execution engine.
  • the data ETL task a represented by task file A enters the waiting state after running: task file A performs three processes of data query, data persistence, and compression and packaging after the corresponding slave ETL server receives the notification.
  • the above process Control relies on the Enterprise Service Bus technology ESB.
  • Task file B is submitted to the execution engine module by the scheduling module on the corresponding main ETL server after the corresponding main ETL server receives the notification from the enterprise service bus technology ESB, and then decompresses, loads, and verifies respectively , analysis, and operation.
  • the main logic of its operation is data transmission, data decompression, data conversion, and data loading to the destination. It should be noted that the data ETL task can only be executed after the task file is decompressed, loaded, verified, and parsed by the execution engine.
  • the data ETL task b represented by the task file B enters the end state after running: the process status monitoring module checks the verification result, notifies the log module to count the relevant running information of the ETL task, notifies the scheduling module whether exception handling is required, calculates the next execution time, and releases associated computing resources.
  • the embodiment of the present disclosure divides the execution state of the above-mentioned ETL task at a fine-grained level and improves the resource management strategy of the scheduling module, and separates the link of the ETL task's running state, the start state and the waiting state, which are more waiting for a long time.
  • Tasks in this state are not limited by the threshold of the concurrent number of tasks managed by the scheduling module; only other operations that consume more memory resources and computing resources, such as data transmission, data decompression, data conversion, and data loading to the destination, are counted.
  • Into the running state unified into the control and management of the task running concurrent number threshold of the scheduling module.
  • the ETL task asynchronous processing system reduces the invalid occupation of computing resources by data waiting behavior, and improves the data ETL task processing capability.
  • the switching between states depends on the enterprise service bus technology, and the process control has the characteristics of automation.
  • the execution process of the original task is actually split by two successively running data ETL tasks, which increases the flexibility of system scheduling management.
  • Through the visual display of the process status monitoring module the interaction between the system and ETL operation and maintenance personnel is human-computer friendly.
  • the distributed data ETL processing system includes a master ETL server and a slave ETL server.
  • the slave ETL server and the data source storage device are in the same local area network, and the main ETL server often cannot directly obtain the desired data from the data source storage device in the local area network of the commission office, so the slave ETL server acts as the agent of the main ETL server As a horizontal expansion of the main ETL server, it has the same execution engine module and realizes the ability of distributed data processing.
  • the control interaction between the master ETL server and the slave ETL server is through the remote call technology Web Service, and the data interaction is through the SFTP communication protocol.
  • FIG. 5 is a block diagram of a task asynchronous processing device according to an embodiment of the present disclosure. As shown in FIG. 5 , it includes:
  • the stripping module 52 is configured to strip the start state and the waiting state of the data ETL task from the running state to obtain the processed running state;
  • the conversion module 54 is configured to convert the data ETL task into a data ETL task a and a data ETL task b, wherein the data ETL task a corresponds to the data ETL task b;
  • the control module 56 is configured to control the data ETL task a in the startup state and the waiting state to be executed on the slave ETL server, and control the processed data ETL task b in the running state to be executed asynchronously on the master ETL server, wherein, in The tasks executed on the slave ETL server are not limited by the threshold of the number of tasks running concurrently, and the tasks executed on the master ETL server are limited by the threshold of the number of concurrent tasks running on the master ETL server.
  • control module 56 is also configured to:
  • the control data ETL task a is executed on the slave ETL server, and after the execution of the data ETL task a is completed, the corresponding data ETL task b is controlled to be executed on the master ETL server.
  • the above-mentioned device also includes:
  • the configuration module is configured to configure the task information of the data ETL task, wherein the task information at least includes task priority information, node information of task operation, data collection information source, data processing flow, and data storage information;
  • the compression and packaging module is configured to compress and package the task information of the data ETL task into task file A and task file B, the task file A corresponds to the data ETL task a one by one, and the task file B corresponds to the data ETL task b one by one correspond;
  • the transfer module is configured to transfer task file A to the slave ETL server, and transfer task file B to the master ETL server.
  • control module 56 includes:
  • the first notification submodule is configured to notify the corresponding data ETL task a of task file A from the ETL server running task file A;
  • the second notification submodule is configured to notify the corresponding main ETL server of the task file B to run the data ETL task b corresponding to the task file B after the ETL task a runs;
  • the control sub-module is set to enter the end state after the control data ETL task b finishes running.
  • the slave ETL server corresponding to the above-mentioned task file A is used to run the following operations: data query, data persistence, compression and packaging, and notification nodes;
  • the main ETL server corresponding to the above task file B is used to run the following operations: data transmission, data decompression, data conversion, and data loading to the destination.
  • control submodule is also set to:
  • Embodiments of the present disclosure also provide a computer-readable storage medium, in which a computer program is stored, wherein the computer program is set to execute the steps in any one of the above method embodiments when running.
  • the above-mentioned computer-readable storage medium may include but not limited to: U disk, read-only memory (Read-Only Memory, referred to as ROM), random access memory (Random Access Memory, referred to as RAM) , mobile hard disk, magnetic disk or optical disk and other media that can store computer programs.
  • ROM read-only memory
  • RAM random access memory
  • mobile hard disk magnetic disk or optical disk and other media that can store computer programs.
  • Embodiments of the present disclosure also provide an electronic device, including a memory and a processor, where a computer program is stored in the memory, and the processor is configured to run the computer program to execute the steps in any one of the above method embodiments.
  • the electronic device may further include a transmission device and an input and output device, wherein the transmission device is connected to the processor, and the input and output device is connected to the processor.
  • each module or each step of the above-mentioned disclosure can be realized by a general-purpose computing device, and they can be concentrated on a single computing device, or distributed in a network composed of multiple computing devices In fact, they can be implemented in program code executable by a computing device, and thus, they can be stored in a storage device to be executed by a computing device, and in some cases, can be executed in an order different from that shown here. Or described steps, or they are fabricated into individual integrated circuit modules, or multiple modules or steps among them are fabricated into a single integrated circuit module for implementation. As such, the present disclosure is not limited to any specific combination of hardware and software.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Software Systems (AREA)
  • Computing Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Provided in the embodiments of the present disclosure are a method and apparatus for asynchronously processing tasks, and a storage medium and an electronic apparatus. The method comprises: stripping a starting state and a waiting state of a data ETL task from a running state, so as to obtain a processed running state; transforming the data ETL task into data ETL tasks a and data ETL tasks b; and controlling the data ETL tasks a which are in the starting state and the waiting state to be executed on a slave ETL server, and controlling the data ETL tasks b which are in the processed running state to be asynchronously executed on a master ETL server, wherein the tasks executed on the slave ETL server are not limited by a threshold value of the number of concurrent running tasks, and the tasks executed on the master ETL server are limited by the threshold value of the number of concurrent running tasks. The problems in the related art of the efficiency of processing distributed data ETL tasks being low and a resource management policy being inflexible and irrational can be solved, thereby improving the processing capability of the data ETL tasks.

Description

任务异步处理方法、装置、存储介质及电子装置Task asynchronous processing method, device, storage medium and electronic device
相关申请的交叉引用Cross References to Related Applications
本公开基于2021年11月24日提交的发明名称为“任务异步处理方法、装置、存储介质及电子装置”的中国专利申请CN202111407069.2,并且要求该专利申请的优先权,通过引用将其所公开的内容全部并入本公开。This disclosure is based on the Chinese patent application CN202111407069.2 filed on November 24, 2021 with the title of "task asynchronous processing method, device, storage medium and electronic device", and claims the priority of this patent application, which is incorporated by reference The disclosed content is incorporated in this disclosure in its entirety.
技术领域technical field
本公开实施例涉及通信领域,具体而言,涉及一种任务异步处理方法、装置、存储介质及电子装置。Embodiments of the present disclosure relate to the communication field, and in particular, relate to a task asynchronous processing method, device, storage medium, and electronic device.
背景技术Background technique
数据抽取-转换-装载(Extract-Transform-Load,简称为ETL)任务处理是共享数据交换平台的核心功能模块,它为上层政企应用提供数据支撑服务。在原有的技术方案中,数据ETL任务处理系统也是基于企业服务总线技术的、分布式的,但是它对任务的执行状态的区分仅有编排状态、就绪状态、运行状态、结束状态,其中运行状态是一个不可分割的状态单元,数据ETL任务处理系统中的调度模块对该状态下的ETL任务的资源管理策略是一贯不变的:如果某一时刻一个数据ETL任务按照预期需要被调度执行,并且调度模块判断任务运行并发数量这一参数达到了设定的阈值,那么该任务只能等待其他任务执行结束后释放资源后才会由调度模块提交至执行引擎模块。然而被系统的执行引擎模块正在执行的数据ETL任务尽管处于运行状态,实际上该状态下由于业务特征还有多个处理步骤,相当于存在多个子状态。运行状态中不同的子状态下的数据ETL任务对服务器资源的消耗具有不同的特征,前期子状态如等待状态并不耗费很多服务器资源但耗时较多,后期子状态如剥离其他子状态后的运行状态会消耗大量内存和CPU等计算资源。因此在原有的技术方案中,由于对数据ETL任务的执行状态划分的不合理以及其对应形成的执行策略,系统的计算资源未被充分合理利用,导致处理效率低下。Data Extract-Transform-Load (ETL) task processing is the core functional module of the shared data exchange platform, which provides data support services for upper-level government and enterprise applications. In the original technical solution, the data ETL task processing system is also based on the enterprise service bus technology and is distributed, but it only distinguishes the execution status of the task from the arrangement status, the ready status, the running status, and the ending status, among which the running status It is an indivisible state unit, and the resource management strategy of the ETL task in the data ETL task processing system in the data ETL task processing system is consistent: if a data ETL task is scheduled and executed as expected at a certain moment, and The scheduling module judges that the parameter of the number of tasks running concurrently has reached the set threshold, then the task can only be submitted to the execution engine module by the scheduling module after the resources are released after other tasks are executed. However, although the data ETL task being executed by the execution engine module of the system is in the running state, there are actually multiple processing steps in this state due to business characteristics, which is equivalent to multiple sub-states. The data ETL tasks in different sub-states in the running state have different characteristics on the consumption of server resources. The early sub-states such as the waiting state do not consume a lot of server resources but take more time. The later sub-states such as the one after stripping other sub-states The running state consumes a lot of computing resources such as memory and CPU. Therefore, in the original technical solution, due to the unreasonable division of the execution status of the data ETL task and the corresponding execution strategy, the computing resources of the system are not fully and rationally utilized, resulting in low processing efficiency.
针对相关技术中分布式数据ETL任务处理效率低下、资源管理策略一刀切且不合理的问题,尚未提出解决方案。Aiming at the low efficiency of distributed data ETL task processing and one-size-fits-all and unreasonable resource management strategies in related technologies, no solution has been proposed yet.
发明内容Contents of the invention
本公开实施例提供了一种任务异步处理方法、装置、存储介质及电子装置,以至少解决相关技术中分布式数据ETL任务处理效率低下、资源管理策略一刀切且不合理的问题。Embodiments of the present disclosure provide a task asynchronous processing method, device, storage medium, and electronic device, so as to at least solve the problems of low efficiency of distributed data ETL task processing and one-size-fits-all and unreasonable resource management strategies in the related art.
根据本公开的一个实施例,提供了一种任务异步处理方法,包括:According to an embodiment of the present disclosure, a task asynchronous processing method is provided, including:
将数据ETL任务的启动状态和等待状态从运行状态中剥离,得到处理后的运行状态;The start state and waiting state of the data ETL task are separated from the running state to obtain the processed running state;
将数据ETL任务转换为数据ETL任务a与数据ETL任务b,其中,所述数据ETL任务a和所述数据ETL任务b一一对应;Converting the data ETL task into a data ETL task a and a data ETL task b, wherein the data ETL task a corresponds to the data ETL task b one by one;
控制所述启动状态和所述等待状态下的数据ETL任务a在从ETL服务器上执行,并控制所述处理后的运行状态下的数据ETL任务b在主ETL服务器上前后异步执行,其中,在所述 从ETL服务器上执行的任务不受任务运行并发数量阈值的限制,所述主ETL服务器上执行的任务受所述任务运行并发数量阈值的限制。Controlling the data ETL task a in the startup state and the waiting state to be executed on the slave ETL server, and controlling the data ETL task b in the running state after the processing to be executed asynchronously on the master ETL server, wherein The tasks executed on the slave ETL server are not limited by the threshold of the concurrent number of tasks running, and the tasks executed on the master ETL server are limited by the threshold of the number of concurrent tasks running.
根据本公开的另一个实施例,还提供了一种任务异步处理装置,包括:According to another embodiment of the present disclosure, there is also provided an apparatus for task asynchronous processing, including:
剥离模块,设置为将数据ETL任务的启动状态和等待状态从运行状态中剥离,得到处理后的运行状态;The stripping module is configured to strip the start state and the waiting state of the data ETL task from the running state to obtain the processed running state;
转换模块,设置为将数据ETL任务转换为数据ETL任务a与数据ETL任务b,其中,所述数据ETL任务a和所述数据ETL任务b一一对应;A conversion module configured to convert the data ETL task into a data ETL task a and a data ETL task b, wherein the data ETL task a corresponds to the data ETL task b one by one;
控制模块,设置为控制所述启动状态和所述等待状态下的数据ETL任务a在从ETL服务器上执行,并控制所述处理后的运行状态下的数据ETL任务b在主ETL服务器上前后异步执行,其中,在所述从ETL服务器上执行的任务不受任务运行并发数量阈值的限制,所述主ETL服务器上执行的任务受所述任务运行并发数量阈值的限制。The control module is configured to control the data ETL task a in the starting state and the waiting state to be executed on the slave ETL server, and control the data ETL task b in the running state after the processing to be asynchronous back and forth on the master ETL server Executing, wherein, the tasks executed on the slave ETL server are not limited by the threshold of the number of concurrent tasks running, and the tasks executed on the master ETL server are limited by the threshold of the number of concurrent tasks running on the master ETL server.
根据本公开的又一个实施例,还提供了一种计算机可读的存储介质,所述存储介质中存储有计算机程序,其中,所述计算机程序被设置为运行时执行上述任一项方法实施例中的步骤。According to yet another embodiment of the present disclosure, there is also provided a computer-readable storage medium, where a computer program is stored in the storage medium, wherein the computer program is set to execute any one of the above method embodiments when running in the steps.
根据本公开的又一个实施例,还提供了一种电子装置,包括存储器和处理器,所述存储器中存储有计算机程序,所述处理器被设置为运行所述计算机程序以执行上述任一项方法实施例中的步骤。According to yet another embodiment of the present disclosure, there is also provided an electronic device, including a memory and a processor, wherein a computer program is stored in the memory, and the processor is configured to run the computer program to perform any of the above Steps in the method examples.
附图说明Description of drawings
图1是本公开实施例的任务异步处理方法的移动终端的硬件结构框图;FIG. 1 is a block diagram of a hardware structure of a mobile terminal of a task asynchronous processing method according to an embodiment of the present disclosure;
图2是根据本公开实施例的任务异步处理方法的流程图;FIG. 2 is a flowchart of a task asynchronous processing method according to an embodiment of the present disclosure;
图3是根据本公开实施例的数据ETL任务的执行状态的示意图;3 is a schematic diagram of an execution state of a data ETL task according to an embodiment of the present disclosure;
图4是根据本公开实施例的分布式数据ETL处理系统的节点组成的示意图;FIG. 4 is a schematic diagram of node composition of a distributed data ETL processing system according to an embodiment of the present disclosure;
图5是根据本公开实施例的任务异步处理装置的框图。Fig. 5 is a block diagram of an apparatus for task asynchronous processing according to an embodiment of the present disclosure.
具体实施方式Detailed ways
下文中将参考附图并结合实施例来详细说明本公开的实施例。Hereinafter, embodiments of the present disclosure will be described in detail with reference to the drawings and in combination with the embodiments.
需要说明的是,本公开的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。It should be noted that the terms "first" and "second" in the specification and claims of the present disclosure and the above drawings are used to distinguish similar objects, but not necessarily used to describe a specific sequence or sequence.
本公开实施例中所提供的方法实施例可以在移动终端、计算机终端或者类似的运算装置中执行。以运行在移动终端上为例,图1是本公开实施例的任务异步处理方法的移动终端的硬件结构框图,如图1所示,移动终端可以包括一个或多个(图1中仅示出一个)处理器102(处理器102可以包括但不限于微处理器MCU或可编程逻辑器件FPGA等的处理装置)和用于存储数据的存储器104,其中,上述移动终端还可以包括用于通信功能的传输设备106以及输入输出设备108。本领域普通技术人员可以理解,图1所示的结构仅为示意,其并不对上述移动终端的结构造成限定。例如,移动终端还可包括比图1中所示更多或者更少的组件,或者具有与图1所示不同的配置。The method embodiments provided in the embodiments of the present disclosure may be executed in mobile terminals, computer terminals or similar computing devices. Taking running on a mobile terminal as an example, FIG. 1 is a block diagram of the hardware structure of the mobile terminal according to the task asynchronous processing method of the embodiment of the present disclosure. As shown in FIG. 1 , the mobile terminal may include one or more (only shown in FIG. 1 1) Processor 102 (processor 102 may include but not limited to a processing device such as a microprocessor MCU or a programmable logic device FPGA) and a memory 104 for storing data, wherein the above-mentioned mobile terminal may also include a communication function The transmission device 106 and the input and output device 108. Those skilled in the art can understand that the structure shown in FIG. 1 is only for illustration, and it does not limit the structure of the above mobile terminal. For example, the mobile terminal may also include more or fewer components than those shown in FIG. 1 , or have a different configuration from that shown in FIG. 1 .
存储器104可用于存储计算机程序,例如,应用软件的软件程序以及模块,如本公开实施例中的任务异步处理方法对应的计算机程序,处理器102通过运行存储在存储器104内的 计算机程序,从而执行各种功能应用以及业务链地址池切片处理,即实现上述的方法。存储器104可包括高速随机存储器,还可包括非易失性存储器,如一个或者多个磁性存储装置、闪存、或者其他非易失性固态存储器。在一些实例中,存储器104可进一步包括相对于处理器102远程设置的存储器,这些远程存储器可以通过网络连接至移动终端。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory 104 can be used to store computer programs, for example, software programs and modules of application software, such as the computer program corresponding to the task asynchronous processing method in the embodiment of the present disclosure, and the processor 102 executes the computer program stored in the memory 104 by running the computer program Various functional applications and service chain address pool slicing processing realize the above-mentioned method. The memory 104 may include high-speed random access memory, and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some examples, the memory 104 may further include a memory that is remotely located relative to the processor 102, and these remote memories may be connected to the mobile terminal through a network. Examples of the aforementioned networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.
传输装置106用于经由一个网络接收或者发送数据。上述的网络具体实例可包括移动终端的通信供应商提供的无线网络。在一个实例中,传输装置106包括一个网络适配器(Network Interface Controller,简称为NIC),其可通过基站与其他网络设备相连从而可与互联网进行通讯。在一个实例中,传输装置106可以为射频(Radio Frequency,简称为RF)模块,其用于通过无线方式与互联网进行通讯。The transmission device 106 is used to receive or transmit data via a network. The specific example of the above network may include a wireless network provided by the communication provider of the mobile terminal. In one example, the transmission device 106 includes a network interface controller (NIC for short), which can be connected to other network devices through a base station so as to communicate with the Internet. In one example, the transmission device 106 may be a radio frequency (Radio Frequency, referred to as RF) module, which is used to communicate with the Internet in a wireless manner.
在本实施例中提供了一种运行于上述移动终端或网络架构的任务异步处理方法,图2是根据本公开实施例的任务异步处理方法的流程图,如图2所示,该流程至少包括但不限于如下步骤:In this embodiment, a task asynchronous processing method running on the above-mentioned mobile terminal or network architecture is provided. FIG. 2 is a flow chart of the task asynchronous processing method according to an embodiment of the present disclosure. As shown in FIG. 2 , the process includes at least But not limited to the following steps:
步骤S202,将数据ETL任务的启动状态和等待状态从运行状态中剥离,得到处理后的运行状态;Step S202, stripping the start state and waiting state of the data ETL task from the running state to obtain the processed running state;
图3是根据本公开实施例的数据ETL任务的执行状态的示意图,如图3所示,相关技术中执行状态包括编排状态、就绪状态、运行状态以及结束状态,如图3中左侧流程,即启动状态与等待状态在相关技术中属于运行状态的一部分,本实施例将启动状态与等待状态从运行状态中剥离出来,即相当于重新划分了执行状态。重新定义数据ETL任务的执行状态,并根据状态的重新划分对应改进了执行策略和资源管理策略。重新定义后的数据ETL任务的执行状态包括编排状态、就绪状态、启动状态、等待状态、运行状态、结束状态,如图3中右侧流程。Fig. 3 is a schematic diagram of the execution state of a data ETL task according to an embodiment of the present disclosure. As shown in Fig. 3, the execution state in the related art includes an orchestration state, a ready state, a running state and an end state, as shown in the left process in Fig. 3, That is, the startup state and the waiting state belong to a part of the running state in related technologies. In this embodiment, the starting state and the waiting state are separated from the running state, which is equivalent to re-dividing the execution state. Redefine the execution state of the data ETL task, and improve the execution strategy and resource management strategy according to the re-division of the state. The execution state of the redefined data ETL task includes orchestration state, ready state, start state, waiting state, running state, and end state, as shown in the flow on the right side of Figure 3.
步骤S204,将数据ETL任务转换为数据ETL任务a与数据ETL任务b,其中,该数据ETL任务a和该数据ETL任务b一一对应,可选的,可以通过共享同一任务ID关联数据ETL任务a与数据ETL任务b。Step S204, converting the data ETL task into data ETL task a and data ETL task b, wherein the data ETL task a corresponds to the data ETL task b one by one, and optionally, the data ETL task can be associated by sharing the same task ID a and data ETL task b.
步骤S206,控制启动状态和等待状态下的数据ETL任务a在从ETL服务器上执行,并控制处理后的运行状态下的数据ETL任务b在主ETL服务器上前后异步执行,其中,在从ETL服务器上执行的任务不受任务运行并发数量阈值的限制,主ETL服务器上执行的任务受任务运行并发数量阈值的限制。Step S206, control the data ETL task a in the startup state and the waiting state to be executed on the slave ETL server, and control the processed data ETL task b in the running state to be executed asynchronously on the master ETL server, wherein, in the slave ETL server The tasks executed on the server are not limited by the threshold of the number of tasks running concurrently, and the tasks executed on the main ETL server are limited by the threshold of the number of tasks running concurrently.
本公开实施例的任务异常处理可以应用于分布式数据ETL处理系统,该系统包括一个主ETL服务器和多个从ETL服务器,由于启动状态和等待状态的任务占用资源时间较长,将启动状态和等待状态的任务放到从ETL服务器中执行,从服务器中执行任务,不会受主ETL服务器的任务运行并发数量阈值的限制,使得在大量数据ETL任务的并发执行过程中,减少对计算资源的无效占用。The task exception handling in the embodiments of the present disclosure can be applied to a distributed data ETL processing system. The system includes a master ETL server and multiple slave ETL servers. Since tasks in the startup state and waiting state occupy resources for a long time, the startup state and Tasks in the waiting state are executed in the slave ETL server, and the execution of tasks in the slave server will not be limited by the concurrent number threshold of the main ETL server's tasks, so that during the concurrent execution of a large number of data ETL tasks, the computing resources are reduced. invalid occupancy.
通过上述步骤S202至S206,可以解决相关技术中分布式数据ETL任务处理效率低下、资源管理策略一刀切且不合理的问题,由于启动状态和等待状态的任务占用资源时间较长,控制启动状态和等待状态下的数据ETL任务a在从ETL服务器上执行,该执行过程无需做限制计算资源处理,数据ETL任务b在主ETL服务器上执行,该执行过程需要做限制计算资源处理,能够大幅提升服务器资源的利用率,节约成本,在大量数据ETL任务的并发执行过程 中,减少对计算资源的无效占用,提高了数据ETL任务处理能力。Through the above steps S202 to S206, the problems of low processing efficiency of distributed data ETL tasks and one-size-fits-all and unreasonable resource management strategies in related technologies can be solved. Since tasks in the startup state and waiting state occupy resources for a long time, it is difficult to control the startup state and waiting state. The data ETL task a in the state is executed on the slave ETL server. The execution process does not need to limit computing resources. The data ETL task b is executed on the master ETL server. The execution process needs to limit computing resources, which can greatly improve server resources. In the process of concurrent execution of a large number of data ETL tasks, it reduces the invalid occupation of computing resources and improves the processing capacity of data ETL tasks.
在本实施例中,上述步骤S206具体可以包括:控制数据ETL任务a在从ETL服务器上执行,在数据ETL任务a执行结束之后,控制对应的数据ETL任务b在主ETL服务器上执行,配置数据ETL任务的任务信息,该任务信息至少包括任务优先级信息、任务运行的节点信息、数据采集信息源、数据处理流程、数据入库信息;将数据ETL任务的任务信息压缩打包成任务文件A和任务文件B;将任务文件A传输至从ETL服务器,并将任务文件B传输至主ETL服务器,对应的,通知任务文件A对应的从ETL服务器运行任务文件A对应的数据ETL任务a;在ETL任务a运行结束后,通知任务文件B对应的主ETL服务器运行任务文件B对应的数据ETL任务b,并控制数据ETL任务b运行结束后进入结束状态,具体的,统计数据ETL任务的运行信息;根据运行信息确定是否需要异常处理,在确定结果为是的情况下,执行异常处理;计算下次执行时间;释放计算资源,为执行下一数据ETL任务做准备。In this embodiment, the above step S206 may specifically include: controlling the execution of the data ETL task a on the slave ETL server, and after the execution of the data ETL task a, controlling the execution of the corresponding data ETL task b on the master ETL server, configuring the data The task information of the ETL task, the task information at least includes task priority information, task running node information, data collection information source, data processing flow, data storage information; the task information of the data ETL task is compressed and packaged into task files A and Task file B; transfer task file A to the slave ETL server, and transfer task file B to the master ETL server, correspondingly, notify the slave ETL server corresponding to task file A to run the data ETL task a corresponding to task file A; in the ETL After the task a runs, notify the main ETL server corresponding to the task file B to run the data ETL task b corresponding to the task file B, and control the data ETL task b to enter the end state after the operation is completed, specifically, the running information of the statistical data ETL task; Determine whether exception handling is required according to the running information, and if the determination result is yes, execute exception handling; calculate the next execution time; release computing resources to prepare for the execution of the next data ETL task.
本实施例中的任务文件A对应的从ETL服务器用于运行以下操作:数据查询、数据持久化、压缩打包、通知节点;上述的任务文件B对应的主ETL服务器用于运行以下操作:数据传输、数据解压、数据转换、数据加载至目的端。The slave ETL server corresponding to the task file A in this embodiment is used to run the following operations: data query, data persistence, compression and packaging, and notification nodes; the main ETL server corresponding to the above-mentioned task file B is used to run the following operations: data transmission , data decompression, data conversion, and data loading to the destination.
图4是根据本公开实施例的分布式数据ETL处理系统的节点组成的示意图,如图4所示,根据状态重新划分后对应的调度模块的执行策略的变化是:数据ETL任务的执行控制需要依照任务文件,数据ETL任务和任务文件是一一对应的关系。原先的一个数据ETL任务以及其对应的任务文件在改进后变成两个任务(数据ETL任务a和数据ETL任务b)以及它们分别对应的任务文件A和任务文件B。上述变化为执行逻辑的变化,但是从用户感知的角度来看,这仍然是一个个完整的ETL任务在执行:执行逻辑上的任务文件A和数据ETL任务a的执行过程(处理步骤:数据查询、数据持久化、压缩打包),在用户感知上是该数据ETL任务处于启动状态和等待状态;执行逻辑上的任务文件B和数据ETL任务b的执行过程(处理步骤:数据传输、数据解压、数据转换、数据加载至目的端),在用户感知上是该数据ETL任务处于运行状态。数据ETL任务a和数据ETL任务b一一对应,前后异步执行,执行控制过程依靠系统服务总线技术完成通信。Fig. 4 is a schematic diagram of the node composition of the distributed data ETL processing system according to an embodiment of the present disclosure. As shown in Fig. 4, the change of the execution strategy of the corresponding scheduling module after re-dividing according to the state is: the execution control of the data ETL task requires According to the task file, there is a one-to-one correspondence between the data ETL task and the task file. The original data ETL task and its corresponding task file are improved into two tasks (data ETL task a and data ETL task b) and their corresponding task file A and task file B respectively. The above changes are changes in execution logic, but from the perspective of user perception, this is still a complete ETL task being executed: the execution process of executing logical task file A and data ETL task a (processing steps: data query , data persistence, compression and packaging), the user perceives that the data ETL task is in the start state and waiting state; the execution process of the logical task file B and data ETL task b (processing steps: data transmission, data decompression, Data conversion, data loading to the destination), the user perceives that the data ETL task is running. Data ETL task a and data ETL task b correspond one-to-one, and are executed asynchronously. The execution control process relies on the system service bus technology to complete the communication.
调度模块在资源管理策略上的变化是:对于重新定义了执行状态后的数据ETL任务,当其执行状态处于启动状态或等待状态下,调度模块不会将其纳入任务运行并发数量这一配置参数的限制,该表述等价于数据ETL任务a不受该参数限制而数据ETL任务b受该参数限制。The change in the resource management strategy of the scheduling module is: for the data ETL task after the execution state is redefined, when its execution state is in the start state or waiting state, the scheduling module will not include it in the configuration parameter of the number of concurrent tasks running This expression is equivalent to the fact that data ETL task a is not limited by this parameter and data ETL task b is limited by this parameter.
上述改进过程是基于数据ETL任务的执行特点形成的:执行逻辑上的数据ETL任务a在从ETL服务器的执行模块上执行,执行逻辑上的数据ETL任务b在主ETL服务器的执行模块上执行。由于主ETL服务器和从服务器之间是一对多的对应关系,从ETL服务器的数量众多并且业务负载具有执行均衡的特点,所以数据ETL任务a的执行过程是无需做限制计算资源处理,而数据ETL任务b的执行过程需要做限制计算资源处理;因此改进后能够大幅提升服务器资源的利用率,节约成本。由于数据ETL任务a的执行过程包含了数据查询、数据持久化和压缩打包操作,改进后将这些耗时操作剥离出计算资源敏感的环节,因此在大量数据ETL任务的并发执行过程中,减少对计算资源的无效占用,提升数据ETL任务处理能力。数据ETL任务a和数据ETL任务b一一对应,前后异步执行,执行控制过程依靠系统服务总线技术完成通信,因此其流程控制具有自动化特征,增加了系统调度管理的灵活性,并且系统与ETL运维人员的交互人机友好。The above improvement process is formed based on the execution characteristics of the data ETL task: the logical data ETL task a is executed on the execution module of the slave ETL server, and the logical data ETL task b is executed on the execution module of the master ETL server. Since there is a one-to-many correspondence between the master ETL server and the slave servers, the number of slave ETL servers is large and the business load is balanced, so the execution process of the data ETL task a does not need to limit the processing of computing resources, and the data The execution process of ETL task b needs to be processed with limited computing resources; therefore, after improvement, the utilization rate of server resources can be greatly improved and costs can be saved. Since the execution process of the data ETL task a includes data query, data persistence, and compression and packaging operations, these time-consuming operations are stripped out of the links sensitive to computing resources after the improvement, so during the concurrent execution of a large number of data ETL tasks, the need for Ineffective occupancy of computing resources improves data ETL task processing capabilities. Data ETL task a and data ETL task b correspond one-to-one, and are executed asynchronously before and after. The execution control process relies on the system service bus technology to complete the communication, so its process control has the characteristics of automation, which increases the flexibility of system scheduling management, and the system and ETL operation Human-computer friendly interaction of maintenance personnel.
编排任务操作后进入编排状态:相关人员通过分发模块和可视化编排模块,设计和配置ETL任务的相关信息。ETL任务的相关信息其包括但不限于:配置任务优先级信息、配置任务运行的节点信息、配置数据采集信息源、设计数据处理流程、配置数据入库信息。Enter the orchestration state after the orchestration task operation: relevant personnel design and configure the relevant information of the ETL task through the distribution module and the visual orchestration module. The relevant information of the ETL task includes, but is not limited to: configuration task priority information, configuration task running node information, configuration data collection information source, design data processing flow, and configuration data storage information.
保存并同步任务后进入就绪状态:主ETL服务器的分发模块将编排状态下的ETL任务的相关信息压缩打包成两个ETL任务文件,分别为任务文件A和任务文件B,两个任务文件共享同一个任务ID,其被执行过程具有前后关系,将压缩打包后的两个ETL任务文件用SFTP协议传输至对应的ETL服务器。Enter the ready state after saving and synchronizing the task: the distribution module of the main ETL server compresses and packages the relevant information of the ETL task in the orchestration state into two ETL task files, task file A and task file B respectively, and the two task files share the same A task ID, the execution process has a contextual relationship, and the compressed and packaged two ETL task files are transmitted to the corresponding ETL server using the SFTP protocol.
启动任务后进图启动状态:任务文件A在对应的从ETL服务器上被调度模块提交至执行引擎模块,再分别进行解压、加载、验证、解析、运行,其运行的主要逻辑为通过远程调用技术Web Service的异步方式通知对应的从ETL服务器,通知成功发送后任务文件A所代表的数据ETL任务a运行即可结束。需要注意的是,任务文件被执行引擎解压、加载、验证、解析才能进行数据ETL任务的运行。After starting the task, enter the starting state of the figure: the task file A is submitted to the execution engine module by the scheduling module on the corresponding slave ETL server, and then decompressed, loaded, verified, parsed, and run respectively. The main logic of its operation is to use the remote call technology Web The asynchronous method of Service notifies the corresponding slave ETL server, and the data ETL task a represented by task file A can end after the notification is successfully sent. It should be noted that the data ETL task can only be executed after the task file is decompressed, loaded, verified, and parsed by the execution engine.
任务文件A所代表的数据ETL任务a运行结束后进入等待状态:任务文件A在对应的从ETL服务器在收到通知后,进行数据查询、数据持久化、压缩打包共三个流程,上述流程的控制依赖于企业服务总线技术ESB。当数据依次完成数据查询、数据持久化、压缩打包共三个流程后,系统服务总线技术ESB会通知任务文件B在对应的主ETL服务器,通知消息中包含数据的资源路径SFTP,这之后进入运行状态:任务文件B在对应的主ETL服务器在收到企业服务总线技术ESB的通知后,任务文件B在对应的主ETL服务器上被调度模块提交至执行引擎模块,再分别进行解压、加载、验证、解析、运行,其运行的主要逻辑是数据传输、数据解压、数据转换、数据加载至目的端。需要注意的是,任务文件被执行引擎解压、加载、验证、解析才能进行数据ETL任务的运行。The data ETL task a represented by task file A enters the waiting state after running: task file A performs three processes of data query, data persistence, and compression and packaging after the corresponding slave ETL server receives the notification. The above process Control relies on the Enterprise Service Bus technology ESB. After the data completes the three processes of data query, data persistence, and compression and packaging in sequence, the system service bus technology ESB will notify the task file B to be in the corresponding main ETL server, and the notification message includes the resource path SFTP of the data, and then enter the operation Status: Task file B is submitted to the execution engine module by the scheduling module on the corresponding main ETL server after the corresponding main ETL server receives the notification from the enterprise service bus technology ESB, and then decompresses, loads, and verifies respectively , analysis, and operation. The main logic of its operation is data transmission, data decompression, data conversion, and data loading to the destination. It should be noted that the data ETL task can only be executed after the task file is decompressed, loaded, verified, and parsed by the execution engine.
任务文件B所代表的数据ETL任务b运行结束后进入结束状态:流程状态监控模块检查验证结果、通知日志模块统计ETL任务相关运行信息、通知调度模块是否需要异常处理以及计算下次执行时间,释放相关计算资源。The data ETL task b represented by the task file B enters the end state after running: the process status monitoring module checks the verification result, notifies the log module to count the relevant running information of the ETL task, notifies the scheduling module whether exception handling is required, calculates the next execution time, and releases associated computing resources.
本公开实施例通过对上述ETL任务的执行状态的细粒度划分和改进了调度模块的资源管理策略,将ETL任务的运行状态中较多等待时长的环节一启动状态和等待状态剥离出来,对该状态下的任务不受调度模块管理的任务运行并发数量阈值的限制;仅将其他的耗费更多内存资源和计算资源的操作如数据传输、数据解压、数据转换、数据加载至目的端等操作计入运行状态,统一纳入调度模块的任务运行并发数量阈值的控制管理。该ETL任务异步处理系统减少了数据等待行为对计算资源的无效占用,提升数据ETL任务处理能力。各状态之间的切换依赖于企业服务总线技术,流程控制具有自动化特征。原先的一个任务的执行过程实际上是由两个先后运行的数据ETL任务进行拆分,增加了系统调度管理的灵活性。通过流程状态监控模块的可视化展示,系统与ETL运维人员的交互人机友好。The embodiment of the present disclosure divides the execution state of the above-mentioned ETL task at a fine-grained level and improves the resource management strategy of the scheduling module, and separates the link of the ETL task's running state, the start state and the waiting state, which are more waiting for a long time. Tasks in this state are not limited by the threshold of the concurrent number of tasks managed by the scheduling module; only other operations that consume more memory resources and computing resources, such as data transmission, data decompression, data conversion, and data loading to the destination, are counted. Into the running state, unified into the control and management of the task running concurrent number threshold of the scheduling module. The ETL task asynchronous processing system reduces the invalid occupation of computing resources by data waiting behavior, and improves the data ETL task processing capability. The switching between states depends on the enterprise service bus technology, and the process control has the characteristics of automation. The execution process of the original task is actually split by two successively running data ETL tasks, which increases the flexibility of system scheduling management. Through the visual display of the process status monitoring module, the interaction between the system and ETL operation and maintenance personnel is human-computer friendly.
本实施例中,分布式数据ETL处理系统包括主ETL服务器和从ETL服务器。从ETL服务器和数据源存储设备处于同一局域网内,主ETL服务器往往不能直接从委办局的局域网内的数据源存储设备上获取想要的数据,因此从ETL服务器起到了主ETL服务器的代理机的作用,并且作为主ETL服务器的横向扩展,具有同样的执行引擎模块,并实现分布式数据处理的能力。主ETL服务器和从ETL服务器之间的控制交互通过远程调用技术Web Service,数据交互通过SFTP通信协议。In this embodiment, the distributed data ETL processing system includes a master ETL server and a slave ETL server. The slave ETL server and the data source storage device are in the same local area network, and the main ETL server often cannot directly obtain the desired data from the data source storage device in the local area network of the commission office, so the slave ETL server acts as the agent of the main ETL server As a horizontal expansion of the main ETL server, it has the same execution engine module and realizes the ability of distributed data processing. The control interaction between the master ETL server and the slave ETL server is through the remote call technology Web Service, and the data interaction is through the SFTP communication protocol.
根据本公开的另一个实施例,还提供了一种任务异步处理装置,图5是根据本公开实施例的任务异步处理装置的框图,如图5所示,包括:According to another embodiment of the present disclosure, a task asynchronous processing device is also provided. FIG. 5 is a block diagram of a task asynchronous processing device according to an embodiment of the present disclosure. As shown in FIG. 5 , it includes:
剥离模块52,设置为将数据ETL任务的启动状态和等待状态从运行状态中剥离,得到处理后的运行状态;The stripping module 52 is configured to strip the start state and the waiting state of the data ETL task from the running state to obtain the processed running state;
转换模块54,设置为将数据ETL任务转换为数据ETL任务a与数据ETL任务b,其中,该数据ETL任务a和该数据ETL任务b一一对应;The conversion module 54 is configured to convert the data ETL task into a data ETL task a and a data ETL task b, wherein the data ETL task a corresponds to the data ETL task b;
控制模块56,设置为控制启动状态和等待状态下的数据ETL任务a在从ETL服务器上执行,并控制处理后的运行状态下的数据ETL任务b在主ETL服务器上前后异步执行,其中,在从ETL服务器上执行的任务不受任务运行并发数量阈值的限制,主ETL服务器上执行的任务受任务运行并发数量阈值的限制。The control module 56 is configured to control the data ETL task a in the startup state and the waiting state to be executed on the slave ETL server, and control the processed data ETL task b in the running state to be executed asynchronously on the master ETL server, wherein, in The tasks executed on the slave ETL server are not limited by the threshold of the number of tasks running concurrently, and the tasks executed on the master ETL server are limited by the threshold of the number of concurrent tasks running on the master ETL server.
在一示例性实施例中,控制模块56,还设置为:In an exemplary embodiment, the control module 56 is also configured to:
控制数据ETL任务a在从ETL服务器上执行,在数据ETL任务a执行结束之后,控制对应的数据ETL任务b在主ETL服务器上执行。The control data ETL task a is executed on the slave ETL server, and after the execution of the data ETL task a is completed, the corresponding data ETL task b is controlled to be executed on the master ETL server.
在一示例性实施例中,上述的装置还包括:In an exemplary embodiment, the above-mentioned device also includes:
配置模块,设置为配置数据ETL任务的任务信息,其中,该任务信息至少包括任务优先级信息、任务运行的节点信息、数据采集信息源、数据处理流程、数据入库信息;The configuration module is configured to configure the task information of the data ETL task, wherein the task information at least includes task priority information, node information of task operation, data collection information source, data processing flow, and data storage information;
压缩打包模块,设置为将数据ETL任务的任务信息压缩打包成任务文件A和任务文件B,该任务文件A与该数据ETL任务a一一对应,该任务文件B与该数据ETL任务b一一对应;The compression and packaging module is configured to compress and package the task information of the data ETL task into task file A and task file B, the task file A corresponds to the data ETL task a one by one, and the task file B corresponds to the data ETL task b one by one correspond;
传输模块,设置为将任务文件A传输至从ETL服务器,并将任务文件B传输至主ETL服务器。The transfer module is configured to transfer task file A to the slave ETL server, and transfer task file B to the master ETL server.
在一示例性实施例中,控制模块56包括:In an exemplary embodiment, the control module 56 includes:
第一通知子模块,设置为通知任务文件A对应的从ETL服务器运行任务文件A对应的数据ETL任务a;The first notification submodule is configured to notify the corresponding data ETL task a of task file A from the ETL server running task file A;
第二通知子模块,设置为在ETL任务a运行结束之后,通知任务文件B对应的主ETL服务器运行任务文件B对应的数据ETL任务b;The second notification submodule is configured to notify the corresponding main ETL server of the task file B to run the data ETL task b corresponding to the task file B after the ETL task a runs;
控制子模块,设置为控制数据ETL任务b运行结束后进入结束状态。The control sub-module is set to enter the end state after the control data ETL task b finishes running.
在一示例性实施例中,上述的任务文件A对应的从ETL服务器用于运行以下操作:数据查询、数据持久化、压缩打包、通知节点;In an exemplary embodiment, the slave ETL server corresponding to the above-mentioned task file A is used to run the following operations: data query, data persistence, compression and packaging, and notification nodes;
上述的任务文件B对应的主ETL服务器用于运行以下操作:数据传输、数据解压、数据转换、数据加载至目的端。The main ETL server corresponding to the above task file B is used to run the following operations: data transmission, data decompression, data conversion, and data loading to the destination.
在一示例性实施例中,上述的控制子模块,还设置为:In an exemplary embodiment, the above-mentioned control submodule is also set to:
统计数据ETL任务的运行信息;Statistical data ETL task running information;
根据运行信息确定是否需要异常处理,在确定结果为是的情况下,执行异常处理;Determine whether exception handling is required according to the running information, and execute exception handling if the determination result is yes;
计算下次执行时间;Calculate the next execution time;
释放计算资源。Free up computing resources.
本公开的实施例还提供了一种计算机可读存储介质,该计算机可读存储介质中存储有计算机程序,其中,该计算机程序被设置为运行时执行上述任一项方法实施例中的步骤。Embodiments of the present disclosure also provide a computer-readable storage medium, in which a computer program is stored, wherein the computer program is set to execute the steps in any one of the above method embodiments when running.
在一个示例性实施例中,上述计算机可读存储介质可以包括但不限于:U盘、只读存储器(Read-Only Memory,简称为ROM)、随机存取存储器(Random Access Memory,简称为RAM)、移动硬盘、磁碟或者光盘等各种可以存储计算机程序的介质。In an exemplary embodiment, the above-mentioned computer-readable storage medium may include but not limited to: U disk, read-only memory (Read-Only Memory, referred to as ROM), random access memory (Random Access Memory, referred to as RAM) , mobile hard disk, magnetic disk or optical disk and other media that can store computer programs.
本公开的实施例还提供了一种电子装置,包括存储器和处理器,该存储器中存储有计算机程序,该处理器被设置为运行计算机程序以执行上述任一项方法实施例中的步骤。Embodiments of the present disclosure also provide an electronic device, including a memory and a processor, where a computer program is stored in the memory, and the processor is configured to run the computer program to execute the steps in any one of the above method embodiments.
在一个示例性实施例中,上述电子装置还可以包括传输设备以及输入输出设备,其中,该传输设备和上述处理器连接,该输入输出设备和上述处理器连接。In an exemplary embodiment, the electronic device may further include a transmission device and an input and output device, wherein the transmission device is connected to the processor, and the input and output device is connected to the processor.
本实施例中的具体示例可以参考上述实施例及示例性实施方式中所描述的示例,本实施例在此不再赘述。For specific examples in this embodiment, reference may be made to the examples described in the foregoing embodiments and exemplary implementation manners, and details will not be repeated here in this embodiment.
显然,本领域的技术人员应该明白,上述的本公开的各模块或各步骤可以用通用的计算装置来实现,它们可以集中在单个的计算装置上,或者分布在多个计算装置所组成的网络上,它们可以用计算装置可执行的程序代码来实现,从而,可以将它们存储在存储装置中由计算装置来执行,并且在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤,或者将它们分别制作成各个集成电路模块,或者将它们中的多个模块或步骤制作成单个集成电路模块来实现。这样,本公开不限制于任何特定的硬件和软件结合。Obviously, those skilled in the art should understand that each module or each step of the above-mentioned disclosure can be realized by a general-purpose computing device, and they can be concentrated on a single computing device, or distributed in a network composed of multiple computing devices In fact, they can be implemented in program code executable by a computing device, and thus, they can be stored in a storage device to be executed by a computing device, and in some cases, can be executed in an order different from that shown here. Or described steps, or they are fabricated into individual integrated circuit modules, or multiple modules or steps among them are fabricated into a single integrated circuit module for implementation. As such, the present disclosure is not limited to any specific combination of hardware and software.
以上所述仅为本公开的优选实施例而已,并不用于限制本公开,对于本领域的技术人员来说,本公开可以有各种更改和变化。凡在本公开的原则之内,所作的任何修改、等同替换、改进等,均应包含在本公开的保护范围之内。The above descriptions are only preferred embodiments of the present disclosure, and are not intended to limit the present disclosure. For those skilled in the art, the present disclosure may have various modifications and changes. Any modification, equivalent replacement, improvement, etc. made within the principle of the present disclosure shall be included in the protection scope of the present disclosure.

Claims (10)

  1. 一种任务异步处理方法,包括:A task asynchronous processing method, comprising:
    将数据ETL任务的启动状态和等待状态从运行状态中剥离,得到处理后的运行状态;The start state and waiting state of the data ETL task are separated from the running state to obtain the processed running state;
    将数据ETL任务转换为数据ETL任务a与数据ETL任务b,其中,所述数据ETL任务a和所述数据ETL任务b一一对应;Converting the data ETL task into a data ETL task a and a data ETL task b, wherein the data ETL task a corresponds to the data ETL task b one by one;
    控制所述启动状态和所述等待状态下的数据ETL任务a在从ETL服务器上执行,并控制所述处理后的运行状态下的数据ETL任务b在主ETL服务器上前后异步执行,其中,在所述从ETL服务器上执行的任务不受任务运行并发数量阈值的限制,所述主ETL服务器上执行的任务受所述任务运行并发数量阈值的限制。Controlling the data ETL task a in the startup state and the waiting state to be executed on the slave ETL server, and controlling the data ETL task b in the running state after the processing to be executed asynchronously on the master ETL server, wherein The tasks executed on the slave ETL server are not limited by the threshold of the concurrent number of tasks running, and the tasks executed on the master ETL server are limited by the threshold of the number of concurrent tasks running.
  2. 根据权利要求1所述的方法,其中,所述控制所述启动状态和所述等待状态下的数据ETL任务a在从ETL服务器上执行,并控制所述处理后的运行状态下的数据ETL任务b在主ETL服务器上前后异步执行,包括:The method according to claim 1, wherein the data ETL task a under the control of the starting state and the waiting state is executed on the slave ETL server, and controls the data ETL task a under the running state after the processing b is executed asynchronously back and forth on the main ETL server, including:
    控制所述数据ETL任务a在所述从ETL服务器上执行,在所述数据ETL任务a执行结束之后,控制对应的所述数据ETL任务b在所述主ETL服务器上执行。The data ETL task a is controlled to be executed on the slave ETL server, and after the execution of the data ETL task a is completed, the corresponding data ETL task b is controlled to be executed on the master ETL server.
  3. 根据权利要求2所述的方法,其中,在将数据ETL任务转换为共享同一任务ID的数据ETL任务a与数据ETL任务b之前,所述方法还包括:The method according to claim 2, wherein, before converting the data ETL task into data ETL task a and data ETL task b sharing the same task ID, the method further comprises:
    配置所述数据ETL任务的任务信息,其中,所述任务信息至少包括任务优先级信息、任务运行的节点信息、数据采集信息源、数据处理流程、数据入库信息;Configure the task information of the data ETL task, wherein the task information at least includes task priority information, node information for task operation, data collection information source, data processing flow, and data storage information;
    将所述数据ETL任务的任务信息压缩打包成任务文件A和任务文件B,所述任务文件A与所述数据ETL任务a一一对应,所述任务文件B与所述数据ETL任务b一一对应;The task information of the data ETL task is compressed and packaged into a task file A and a task file B, the task file A corresponds to the data ETL task a one by one, and the task file B corresponds to the data ETL task b one by one correspond;
    将所述任务文件A传输至所述从ETL服务器,并将所述任务文件B传输至所述主ETL服务器。The task file A is transferred to the slave ETL server, and the task file B is transferred to the master ETL server.
  4. 根据权利要求3所述的方法,其中,控制所述数据ETL任务a在所述从ETL服务器上执行,在所述数据ETL任务a执行结束之后,控制所述数据ETL任务b在所述主ETL服务器上执行包括:The method according to claim 3, wherein the data ETL task a is controlled to be executed on the slave ETL server, and after the execution of the data ETL task a is completed, the data ETL task b is controlled to be executed on the master ETL server. Execution on the server includes:
    通知所述任务文件A对应的所述从ETL服务器运行所述任务文件A对应的所述数据ETL任务a;Notifying the slave ETL server corresponding to the task file A to run the data ETL task a corresponding to the task file A;
    在所述ETL任务a运行结束之后,通知所述任务文件B对应的所述主ETL服务器运行所述任务文件B对应的数据ETL任务b;After the ETL task a finishes running, notify the main ETL server corresponding to the task file B to run the data ETL task b corresponding to the task file B;
    控制所述数据ETL任务b运行结束后进入结束状态。Control the data ETL task b to enter the end state after running.
  5. 根据权利要求4所述的方法,其中,The method according to claim 4, wherein,
    所述任务文件A对应的所述从ETL服务器用于运行以下操作:数据查询、数据持久化、压缩打包、通知节点;The slave ETL server corresponding to the task file A is used to run the following operations: data query, data persistence, compression and packaging, and notification nodes;
    所述任务文件B对应的所述主ETL服务器用于运行以下操作:数据传输、数据解压、数据转换、数据加载至目的端。The main ETL server corresponding to the task file B is used to run the following operations: data transmission, data decompression, data conversion, and data loading to the destination.
  6. 根据权利要求4所述的方法,其中,控制所述数据ETL任务b运行结束后进入结束状态包括:The method according to claim 4, wherein controlling the data ETL task b to enter the end state after running comprises:
    统计所述数据ETL任务的运行信息;Statize the running information of the data ETL task;
    根据运行信息确定是否需要异常处理,在确定结果为是的情况下,执行异常处理;Determine whether exception handling is required according to the running information, and execute exception handling if the determination result is yes;
    计算下次执行时间;Calculate the next execution time;
    释放计算资源。Free up computing resources.
  7. 一种任务异步处理装置,包括:A task asynchronous processing device, comprising:
    剥离模块,设置为将数据ETL任务的启动状态和等待状态从运行状态中剥离,得到处理后的运行状态;The stripping module is configured to strip the start state and the waiting state of the data ETL task from the running state to obtain the processed running state;
    转换模块,设置为将数据ETL任务转换为数据ETL任务a与数据ETL任务b,其中,所述数据ETL任务a和所述数据ETL任务b一一对应;A conversion module configured to convert the data ETL task into a data ETL task a and a data ETL task b, wherein the data ETL task a corresponds to the data ETL task b one by one;
    控制模块,设置为控制所述启动状态和所述等待状态下的数据ETL任务a在从ETL服务器上执行,并控制所述处理后的运行状态下的数据ETL任务b在主ETL服务器上前后异步执行,其中,在所述从ETL服务器上执行的任务不受任务运行并发数量阈值的限制,所述主ETL服务器上执行的任务受所述任务运行并发数量阈值的限制。The control module is configured to control the data ETL task a in the starting state and the waiting state to be executed on the slave ETL server, and control the data ETL task b in the running state after the processing to be asynchronous back and forth on the master ETL server Executing, wherein, the tasks executed on the slave ETL server are not limited by the threshold of the number of concurrent tasks running, and the tasks executed on the master ETL server are limited by the threshold of the number of concurrent tasks running on the master ETL server.
  8. 根据权利要求7所述的装置,其中,所述控制模块,还设置为:The device according to claim 7, wherein the control module is further configured to:
    控制所述数据ETL任务a在所述从ETL服务器上执行,在所述数据ETL任务a执行结束之后,控制所述数据ETL任务b在所述主ETL服务器上执行。The data ETL task a is controlled to be executed on the slave ETL server, and after the execution of the data ETL task a is completed, the data ETL task b is controlled to be executed on the master ETL server.
  9. 一种计算机可读的存储介质,所述存储介质中存储有计算机程序,其中,所述计算机程序被设置为运行时执行所述权利要求1至6任一项中所述的方法。A computer-readable storage medium, in which a computer program is stored, wherein the computer program is configured to perform the method described in any one of claims 1 to 6 when running.
  10. 一种电子装置,包括存储器和处理器,所述存储器中存储有计算机程序,所述处理器被设置为运行所述计算机程序以执行所述权利要求1至6任一项中所述的方法。An electronic device, comprising a memory and a processor, wherein a computer program is stored in the memory, and the processor is configured to run the computer program to perform the method described in any one of claims 1 to 6.
PCT/CN2022/117071 2021-11-24 2022-09-05 Method and apparatus for asynchronously processing tasks, and storage medium and electronic apparatus WO2023093200A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111407069.2A CN116166722A (en) 2021-11-24 2021-11-24 Task asynchronous processing method and device, storage medium and electronic device
CN202111407069.2 2021-11-24

Publications (1)

Publication Number Publication Date
WO2023093200A1 true WO2023093200A1 (en) 2023-06-01

Family

ID=86420565

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/117071 WO2023093200A1 (en) 2021-11-24 2022-09-05 Method and apparatus for asynchronously processing tasks, and storage medium and electronic apparatus

Country Status (2)

Country Link
CN (1) CN116166722A (en)
WO (1) WO2023093200A1 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110295795A1 (en) * 2010-05-28 2011-12-01 Oracle International Corporation System and method for enabling extract transform and load processes in a business intelligence server
CN102541959A (en) * 2010-12-31 2012-07-04 中国移动通信集团安徽有限公司 Method, device and system for scheduling electron transport layer (ETL)
CN108647093A (en) * 2018-05-09 2018-10-12 厦门南讯软件科技有限公司 A kind of distributed task scheduling processing system and its application method
CN111078396A (en) * 2019-11-22 2020-04-28 厦门安胜网络科技有限公司 Distributed data access method and system based on multitask instances

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110295795A1 (en) * 2010-05-28 2011-12-01 Oracle International Corporation System and method for enabling extract transform and load processes in a business intelligence server
CN102541959A (en) * 2010-12-31 2012-07-04 中国移动通信集团安徽有限公司 Method, device and system for scheduling electron transport layer (ETL)
CN108647093A (en) * 2018-05-09 2018-10-12 厦门南讯软件科技有限公司 A kind of distributed task scheduling processing system and its application method
CN111078396A (en) * 2019-11-22 2020-04-28 厦门安胜网络科技有限公司 Distributed data access method and system based on multitask instances

Also Published As

Publication number Publication date
CN116166722A (en) 2023-05-26

Similar Documents

Publication Publication Date Title
CN107087019B (en) Task scheduling method and device based on end cloud cooperative computing architecture
CN115328663B (en) Method, device, equipment and storage medium for scheduling resources based on PaaS platform
US8239868B2 (en) Computer system, servers constituting the same, and job execution control method and program
WO2018072687A1 (en) Resource scheduling method and apparatus, and filtered scheduler
US20160378570A1 (en) Techniques for Offloading Computational Tasks between Nodes
CN111897633A (en) Task processing method and device
CN111935017B (en) Cross-network application calling method and device and routing equipment
KR101558289B1 (en) Composing message processing pipelines
CN113641410A (en) Netty-based high-performance gateway system processing method and system
CN111586140A (en) Data interaction method and server
CN111371848A (en) Request processing method, device, equipment and storage medium
US11700189B2 (en) Method for performing task processing on common service entity, common service entity, apparatus and medium for task processing
CN109388501B (en) Communication matching method, device, equipment and medium based on face recognition request
CN112104679A (en) Method, apparatus, device and medium for processing hypertext transfer protocol request
WO2023093200A1 (en) Method and apparatus for asynchronously processing tasks, and storage medium and electronic apparatus
CN109783202A (en) Event-handling method, system, equipment and storage medium
CN113691466A (en) Data transmission method, intelligent network card, computing device and storage medium
CN101909283B (en) M2M (Machine-to-Machine) platform and method for sharing load downloaded with in high capacity
US11797342B2 (en) Method and supporting node for supporting process scheduling in a cloud system
CN112165529A (en) Method, device, equipment and medium for low-cost cross-network data exchange
CN117573603B (en) Data processing method and computer equipment for remote direct memory access
CN112995704B (en) Cache management method and device, electronic equipment and storage medium
CN117076057B (en) AI service request scheduling method, device, equipment and medium
WO2023040412A1 (en) Message processing method, network card and server
CN117851403A (en) Data dump system, method and device based on time sequence segmentation

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22897291

Country of ref document: EP

Kind code of ref document: A1